Computer Vision / Video Analytics
Dec 09, 2024
Just Released: NVIDIA VILA VLM
Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
1 MIN READ
Dec 05, 2024
Celebrating Open Science and Enterprise AI Innovation on MONAI’s 5th Anniversary
As MONAI celebrates its fifth anniversary, we're witnessing the convergence of our vision for open medical AI with production-ready enterprise solutions. ...
7 MIN READ
Dec 03, 2024
Scaling Action Recognition Models with Synthetic Data
Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Nov 25, 2024
Just Released: NVIDIA DeepStream 7.1
The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Nov 21, 2024
AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans
Your eyes could hold the key to unlocking early detection of Alzheimer’s and dementia, with a groundbreaking AI study. Called Eye-AD, the deep learning...
3 MIN READ
Nov 04, 2024
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Oct 31, 2024
Deep Learning AI Model Identifies Breast Cancer Spread without Surgery
A new deep learning model could reduce the need for surgery when diagnosing whether cancer cells are spreading, including to nearby lymph nodes—also known as...
4 MIN READ
Oct 29, 2024
AI-Powered Devices Track Howls to Save Wolves
A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
Oct 24, 2024
Federated Learning in Autonomous Vehicles Using Cross-Border Training
Federated learning is revolutionizing the development of autonomous vehicles (AVs), particularly in cross-country scenarios where diverse data sources and...
10 MIN READ
Oct 23, 2024
Optimizing the CV Pipeline in Automotive Vehicle Development Using the PVA Engine
In the field of automotive vehicle software development, more large-scale AI models are being integrated into autonomous vehicles. The models range from vision...
16 MIN READ
Oct 07, 2024
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
10 MIN READ
Oct 07, 2024
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
11 MIN READ
Oct 07, 2024
Generate Image and Text Embeddings with NV-CLIP
NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
1 MIN READ
Oct 07, 2024
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
7 MIN READ
Sep 27, 2024
AI Chatbot Delivers Multilingual Support to African Farmers
Some of Africa’s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot that gives detailed...
4 MIN READ