Posts by Shubham Agrawal
Computer Vision / Video Analytics
Mar 11, 2025
Build Real-Time Multimodal XR Apps with NVIDIA AI Blueprint for Video Search and Summarization
With the recent advancements in generative AI and vision foundational models, VLMs present a new wave of visual computing wherein the models are capable of...
9 MIN READ
Computer Vision / Video Analytics
Feb 26, 2025
Vision Language Model Prompt Engineering Guide for Image and Video Understanding
Vision language models (VLMs) are evolving at breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Generative AI
Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Computer Vision / Video Analytics
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ