Home DEVELOPER
  • Home
  • Blog
  • Forums
  • Docs
  • Downloads
  • Training
  • Join
Generative AI

Build Your First Human-in-the-Loop AI Agent with NVIDIA NIM

Read now
Build Your First Human-in-the-Loop AI Agent with NVIDIA NIM
Models / Libraries / Frameworks

AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans

Read now
AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans
Simulation / Modeling / Design

AI Research Delivers Rapid, Accurate Prostate Cancer Predictions

Read now
AI Research Delivers Rapid, Accurate Prostate Cancer Predictions
Top Stories

Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs

Read now
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Simulation / Modeling / Design

Rapidly Create Real-Time Physics Digital Twins with NVIDIA Omniverse Blueprints

Read now
Rapidly Create Real-Time Physics Digital Twins with NVIDIA Omniverse Blueprints
  • Generative AI
    Build Your First Human-in-the-Loop AI Agent with NVIDIA NIM
  • Models / Libraries / Frameworks
    AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans
  • Simulation / Modeling / Design
    AI Research Delivers Rapid, Accurate Prostate Cancer Predictions
  • Top Stories
    Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
  • Simulation / Modeling / Design
    Rapidly Create Real-Time Physics Digital Twins with NVIDIA Omniverse Blueprints

Recent

See all
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data
Dec 03, 2024

How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI

Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
6 MIN READ
How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI
An avatar sitting at a computer, which is linked to multiple action icons through the NVIDIA NIM icon.
Dec 03, 2024

Build an Agentic Video Workflow with Video Search and Summarization

Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Build an Agentic Video Workflow with Video Search and Summarization
Dec 03, 2024

Automate Early Security Patching in CI Pipelines on AWS Using NVIDIA AI Blueprints

The evolution of modern application development has led to a significant shift toward microservice-based architectures. This approach offers great flexibility...
10 MIN READ
Automate Early Security Patching in CI Pipelines on AWS Using NVIDIA AI Blueprints
Dec 03, 2024

Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography

In the past decade, quantum computers have progressed significantly and could one day be used to undermine current cybersecurity practices. If run on a quantum...
6 MIN READ
Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography
Image shows a 3D molecular structure of a protein, most likely an antibody, visualized using a ribbon diagram, with the classic Y-shaped configuration characteristic of antibodies.
Dec 03, 2024

In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics

Antibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...
6 MIN READ
In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics
Image of the TensorRT-LLM icon next to multiple other icons of computer activities.
Dec 02, 2024

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
The NVIDIA and AWS logos in white and green on a black background.
Dec 02, 2024

Accelerated Quantum Supercomputing with the NVIDIA CUDA-Q and Amazon Braket Integration

As quantum computers scale, tasks such as controlling quantum hardware and performing quantum error correction become increasingly complex. Overcoming these...
6 MIN READ
Accelerated Quantum Supercomputing with the NVIDIA CUDA-Q and Amazon Braket Integration
Dec 02, 2024

Unified Whole-Body Control for Physically Simulated Humanoids

Creating interactive simulated humanoids that move naturally and respond intelligently to diverse control inputs remains one of the most challenging problems in...
7 MIN READ
Unified Whole-Body Control for Physically Simulated Humanoids
Nov 28, 2024

Supercharging Deduplication in pandas Using RAPIDS cuDF

A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
12 MIN READ
Supercharging Deduplication in pandas Using RAPIDS cuDF
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
Nov 22, 2024

Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI

Generative AI is transforming every aspect of the automotive industry, including software development, testing, user experience, personalization, and safety....
8 MIN READ
Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI

Inference Performance

See all
Image of the TensorRT-LLM icon next to multiple other icons of computer activities.
Dec 02, 2024

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
Image of an HGX H200
Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Nov 19, 2024

Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs

Meta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
6 MIN READ
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Nov 15, 2024

Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill

In this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
4 MIN READ
Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill
NVIDIA H100.
Nov 08, 2024

5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse

In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
Image of an HGX H200
Nov 01, 2024

3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot

Deploying generative AI workloads in production environments where user numbers can fluctuate from hundreds to hundreds of thousands – and where input...
5 MIN READ
3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot
Oct 28, 2024

NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models

Deploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
7 MIN READ
NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models
Oct 09, 2024

NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency

NVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU—one built to deliver breakthrough energy efficiency and optimized...
8 MIN READ
NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency
Oct 09, 2024

Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch

The continued growth of LLMs capability, fueled by increasing parameter counts and support for longer contexts, has led to their usage in a wide variety of...
8 MIN READ
Boosting Llama 3.1 405B Throughput by Another 1.5x on NVIDIA H200 Tensor Core GPUs and NVLink Switch
Sep 26, 2024

Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance

Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Sep 24, 2024

NVIDIA GH200 Grace Hopper Superchip Delivers Outstanding Performance in MLPerf Inference v4.1

In the latest round of MLPerf Inference – a suite of standardized, peer-reviewed inference benchmarks – the NVIDIA platform delivered outstanding...
7 MIN READ
NVIDIA GH200 Grace Hopper Superchip Delivers Outstanding Performance in MLPerf Inference v4.1
Image of an HGX H200
Sep 05, 2024

Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch

As large language models (LLMs) continue to grow in size and complexity, multi-GPU compute is a must-have to deliver the low latency and high throughput that...
5 MIN READ
Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch

Generative AI

See all
Dec 03, 2024

How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI

Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
6 MIN READ
How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI
An avatar sitting at a computer, which is linked to multiple action icons through the NVIDIA NIM icon.
Dec 03, 2024

Build an Agentic Video Workflow with Video Search and Summarization

Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Build an Agentic Video Workflow with Video Search and Summarization
Image of the TensorRT-LLM icon next to multiple other icons of computer activities.
Dec 02, 2024

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
Dec 02, 2024

Unified Whole-Body Control for Physically Simulated Humanoids

Creating interactive simulated humanoids that move naturally and respond intelligently to diverse control inputs remains one of the most challenging problems in...
7 MIN READ
Unified Whole-Body Control for Physically Simulated Humanoids
Nov 22, 2024

Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI

Generative AI is transforming every aspect of the automotive industry, including software development, testing, user experience, personalization, and safety....
8 MIN READ
Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Image of an HGX H200
Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Nov 21, 2024

Build Your First Human-in-the-Loop AI Agent with NVIDIA NIM

AI agents powered by large language models (LLMs) help organizations streamline and reduce manual workloads. These agents use multilevel, iterative reasoning to...
11 MIN READ
Build Your First Human-in-the-Loop AI Agent with NVIDIA NIM
Nov 21, 2024

Deploying Fine-Tuned AI Models with NVIDIA NIM

For organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently...
6 MIN READ
Deploying Fine-Tuned AI Models with NVIDIA NIM
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
A person looking at a computer monitor.
Nov 21, 2024

Powering AI-Augmented Workloads with NVIDIA and Windows 365

We are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
7 MIN READ
Powering AI-Augmented Workloads with NVIDIA and Windows 365
Nov 20, 2024

Advancing Neuroscience Research with Visual Question Answering and Multimodal Retrieval

Leading healthcare organizations are turning to generative AI to help build applications that can deliver life-saving impacts. These organizations include the...
8 MIN READ
Advancing Neuroscience Research with Visual Question Answering and Multimodal Retrieval

Data Science

See all
Image shows a 3D molecular structure of a protein, most likely an antibody, visualized using a ribbon diagram, with the classic Y-shaped configuration characteristic of antibodies.
Dec 03, 2024

In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics

Antibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...
6 MIN READ
In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics
Nov 28, 2024

Supercharging Deduplication in pandas Using RAPIDS cuDF

A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
12 MIN READ
Supercharging Deduplication in pandas Using RAPIDS cuDF
Nov 21, 2024

Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask

As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth—multi-gpu training and analysis...
5 MIN READ
Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask
A person with a hard hat looks at a computer monitor, which is displaying graphs.
Nov 21, 2024

Spotlight: Advancing Autonomous Operations with AVEVA Dynamic Simulation and NVIDIA Raptor

Industrial engineers are turning to AI to build advanced process simulation solutions and accelerate progress toward fully autonomous operations in the energy,...
6 MIN READ
Spotlight: Advancing Autonomous Operations with AVEVA Dynamic Simulation and NVIDIA Raptor
The process of data curation for LLMs.
Nov 19, 2024

Processing High-Quality Vietnamese Language Data with NVIDIA NeMo Curator

Open-source large language models (LLMs) excel in English but struggle with other languages, especially the languages of Southeast Asia. This is primarily due...
17 MIN READ
Processing High-Quality Vietnamese Language Data with NVIDIA NeMo Curator
Nov 18, 2024

Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance

AI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of...
8 MIN READ
Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance
Nov 18, 2024

Revolutionizing AI-Driven Material Discovery Using NVIDIA ALCHEMI

AI has proven to be a force multiplier, helping to create a future where scientists can design entirely new materials, while engineers seamlessly transform...
11 MIN READ
Revolutionizing AI-Driven Material Discovery Using NVIDIA ALCHEMI
A photo of two GPU clusters and another picture of four scientific computing workflows demonstrating computational fluid dynamics.
Nov 18, 2024

Effortlessly Scale NumPy from Laptops to Supercomputers with NVIDIA cuPyNumeric

Python is the most common programming language for data science, machine learning, and numerical computing. It continues to grow in popularity among scientists...
12 MIN READ
Effortlessly Scale NumPy from Laptops to Supercomputers with NVIDIA cuPyNumeric
A picture of a hurricane.
Nov 14, 2024

Deep Learning Model Boosts Accuracy in Long-Range Weather and Climate Forecasting

Dale Durran, a professor in the Atmospheric Sciences Department at the University of Washington, introduces a breakthrough deep learning model that combines...
2 MIN READ
Deep Learning Model Boosts Accuracy in Long-Range Weather and Climate Forecasting
Nov 14, 2024

Faster Causal Inference on Large Datasets with NVIDIA RAPIDS

As consumer applications generate more data than ever before, enterprises are turning to causal inference methods for observational data to help shed light on...
4 MIN READ
Faster Causal Inference on Large Datasets with NVIDIA RAPIDS
Nov 13, 2024

NVIDIA RAPIDS 24.10 Introduces Accelerated NetworkX with Zero Code Change, Updates for UMAP and cuDF-Pandas

The RAPIDS v24.10 release takes another step forward in bringing accelerated computing to data scientists and developers with a seamless user experience. This...
8 MIN READ
NVIDIA RAPIDS 24.10 Introduces Accelerated NetworkX with Zero Code Change, Updates for UMAP and cuDF-Pandas
Nov 13, 2024

Mastering LLM Techniques: Data Preprocessing

The advent of large language models (LLMs) marks a significant shift in how industries leverage AI to enhance operations and services. By automating routine...
14 MIN READ
Mastering LLM Techniques: Data Preprocessing

Robotics

See all
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data
Dec 02, 2024

Unified Whole-Body Control for Physically Simulated Humanoids

Creating interactive simulated humanoids that move naturally and respond intelligently to diverse control inputs remains one of the most challenging problems in...
7 MIN READ
Unified Whole-Body Control for Physically Simulated Humanoids
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
Nov 06, 2024

Advancing Humanoid Robot Sight and Skill Development with NVIDIA Project GR00T

Humanoid robots present a multifaceted challenge at the intersection of mechatronics, control theory, and AI. The dynamics and control of humanoid robots are...
10 MIN READ
Advancing Humanoid Robot Sight and Skill Development with NVIDIA Project GR00T
Nov 06, 2024

Spotlight: Galbot Builds a Large-Scale Dexterous Hand Dataset for Humanoid Robots Using NVIDIA Isaac Sim

Robotic dexterous grasping is a critical area of research and development, aimed at enabling robots to interact with and manipulate objects as flexibly as...
5 MIN READ
Spotlight: Galbot Builds a Large-Scale Dexterous Hand Dataset for Humanoid Robots Using NVIDIA Isaac Sim
Nov 06, 2024

Spotlight: Fourier Trains Humanoid Robots for Real-World Roles Using NVIDIA Isaac Gym

This post was written in partnership with the Fourier research team. Training humanoid robots to operate in fields that demand high levels of interaction and...
4 MIN READ
Spotlight: Fourier Trains Humanoid Robots for Real-World Roles Using NVIDIA Isaac Gym
Decorative image of icons and a molecular structure in green.
Nov 04, 2024

Build a Video Search and Summarization Agent with NVIDIA AI Blueprint

This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
A robot making toast.
Oct 30, 2024

Teaching Robots to Tackle Household Chores

Robotics could make everyday life easier by taking on repetitive or time-consuming tasks. At NVIDIA GTC 2024, researchers from Stanford University unveiled...
2 MIN READ
Teaching Robots to Tackle Household Chores
Oct 25, 2024

NVIDIA Showcases the Future of Intelligent Robots at CoRL 2024

From humanoids to policy, explore the work NVIDIA is bringing to the robotics community.
1 MIN READ
NVIDIA Showcases the Future of Intelligent Robots at CoRL 2024
Oct 24, 2024

Powering the Next Wave of AI Robotics with Three Computers 

NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ
Powering the Next Wave of AI Robotics with Three Computers 
Oct 22, 2024

A Beginner’s Guide to Simulating and Testing Robots with ROS 2 and NVIDIA Isaac Sim

Physical AI-powered robots need to autonomously sense, plan, and perform complex tasks in the physical world. These include transporting and manipulating...
10 MIN READ
A Beginner’s Guide to Simulating and Testing Robots with ROS 2 and NVIDIA Isaac Sim
Oct 22, 2024

How to Calibrate Sensors with MSA Calibration Anywhere for NVIDIA Isaac Perceptor

Multimodal sensor calibration is critical for achieving sensor fusion for robotics, autonomous vehicles, mapping, and other perception-driven applications....
9 MIN READ
How to Calibrate Sensors with MSA Calibration Anywhere for NVIDIA Isaac Perceptor

Simulation / Modeling / Design

See all
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data
Dec 03, 2024

How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI

Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
6 MIN READ
How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI
Dec 03, 2024

Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography

In the past decade, quantum computers have progressed significantly and could one day be used to undermine current cybersecurity practices. If run on a quantum...
6 MIN READ
Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography
The NVIDIA and AWS logos in white and green on a black background.
Dec 02, 2024

Accelerated Quantum Supercomputing with the NVIDIA CUDA-Q and Amazon Braket Integration

As quantum computers scale, tasks such as controlling quantum hardware and performing quantum error correction become increasingly complex. Overcoming these...
6 MIN READ
Accelerated Quantum Supercomputing with the NVIDIA CUDA-Q and Amazon Braket Integration
Dec 02, 2024

Unified Whole-Body Control for Physically Simulated Humanoids

Creating interactive simulated humanoids that move naturally and respond intelligently to diverse control inputs remains one of the most challenging problems in...
7 MIN READ
Unified Whole-Body Control for Physically Simulated Humanoids
A person with a hard hat looks at a computer monitor, which is displaying graphs.
Nov 21, 2024

Spotlight: Advancing Autonomous Operations with AVEVA Dynamic Simulation and NVIDIA Raptor

Industrial engineers are turning to AI to build advanced process simulation solutions and accelerate progress toward fully autonomous operations in the energy,...
6 MIN READ
Spotlight: Advancing Autonomous Operations with AVEVA Dynamic Simulation and NVIDIA Raptor
A person looking at a computer monitor.
Nov 21, 2024

Powering AI-Augmented Workloads with NVIDIA and Windows 365

We are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
7 MIN READ
Powering AI-Augmented Workloads with NVIDIA and Windows 365
An MRI of cancer tumors.
Nov 19, 2024

AI Research Delivers Rapid, Accurate Prostate Cancer Predictions

Prostate cancer researchers unveiled a new AI-powered model that can quickly analyze MRIs to accurately predict how prostate cancer tumors may develop and...
3 MIN READ
AI Research Delivers Rapid, Accurate Prostate Cancer Predictions
Photo of a power line against city lights at twilight.
Nov 19, 2024

NVIDIA cuDSS Library Removes Barriers to Optimizing the US Power Grid

In the wake of ever-growing power demands, power systems optimization (PSO) of power grids is crucial for ensuring efficient resource management,...
7 MIN READ
NVIDIA cuDSS Library Removes Barriers to Optimizing the US Power Grid
Nov 19, 2024

Connect Real-Time IoT Data to Digital Twins for 3D Remote Monitoring

As enterprises increasingly integrate AI into their industrial operations to deliver more automated and autonomous facilities, more operations teams are...
5 MIN READ
Connect Real-Time IoT Data to Digital Twins for 3D Remote Monitoring
Image of an espresso machine sitting on a kitchen counter.
Nov 19, 2024

Building a Generative AI OpenUSD App for Brand-Accurate Marketing Visuals

Today, brands and their creative agencies are under huge strain to create and deliver high-quality, accurate product images at scale, from campaign key visuals...
7 MIN READ
Building a Generative AI OpenUSD App for Brand-Accurate Marketing Visuals
Google QPU development enabling dynamics simulations
Nov 18, 2024

Accelerating Google’s QPU Development with New Quantum Dynamics Capabilities

Quantum dynamics describes how complex quantum systems evolve in time and interact with their surroundings. Simulating quantum dynamics is extremely difficult...
11 MIN READ
Accelerating Google’s QPU Development with New Quantum Dynamics Capabilities

Computer Vision / Video Analytics

See all
Dec 03, 2024

Scaling Action Recognition Models with Synthetic Data

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Scaling Action Recognition Models with Synthetic Data
An avatar sitting at a computer, which is linked to multiple action icons through the NVIDIA NIM icon.
Dec 03, 2024

Build an Agentic Video Workflow with Video Search and Summarization

Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Build an Agentic Video Workflow with Video Search and Summarization
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
A closeup of an eye.
Nov 21, 2024

AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans

Your eyes could hold the key to unlocking early detection of Alzheimer’s and dementia, with a groundbreaking AI study. Called Eye-AD, the deep learning...
3 MIN READ
AI Unlocks Early Clues to Alzheimer’s Through Retinal Scans
Decorative image of icons and a molecular structure in green.
Nov 04, 2024

Build a Video Search and Summarization Agent with NVIDIA AI Blueprint

This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
A slide of breast cancer cells.
Oct 31, 2024

Deep Learning AI Model Identifies Breast Cancer Spread without Surgery

A new deep learning model could reduce the need for surgery when diagnosing whether cancer cells are spreading, including to nearby lymph nodes—also known as...
4 MIN READ
Deep Learning AI Model Identifies Breast Cancer Spread without Surgery
Close-up shot of a wolf howling. Courtesy of Pexels/patrice schoefolt.
Oct 29, 2024

AI-Powered Devices Track Howls to Save Wolves

A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
AI-Powered Devices Track Howls to Save Wolves
Decorative image.
Oct 24, 2024

Federated Learning in Autonomous Vehicles Using Cross-Border Training

Federated learning is revolutionizing the development of autonomous vehicles (AVs), particularly in cross-country scenarios where diverse data sources and...
10 MIN READ
Federated Learning in Autonomous Vehicles Using Cross-Border Training
Image of a car speeding on a road in the sunshine.
Oct 23, 2024

Optimizing the CV Pipeline in Automotive Vehicle Development Using the PVA Engine

In the field of automotive vehicle software development, more large-scale AI models are being integrated into autonomous vehicles. The models range from vision...
16 MIN READ
Optimizing the CV Pipeline in Automotive Vehicle Development Using the PVA Engine
Oct 07, 2024

Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs

Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
10 MIN READ
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Oct 07, 2024

Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries

Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
11 MIN READ
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Oct 07, 2024

Generate Image and Text Embeddings with NV-CLIP

NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
1 MIN READ
Generate Image and Text Embeddings with NV-CLIP

Content Creation / Rendering

See all
A person looking at a computer monitor.
Nov 21, 2024

Powering AI-Augmented Workloads with NVIDIA and Windows 365

We are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
7 MIN READ
Powering AI-Augmented Workloads with NVIDIA and Windows 365
Collage of 12 different car and background images.
Oct 07, 2024

Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline

Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Oct 02, 2024

Accelerating LLMs with llama.cpp on NVIDIA RTX Systems

The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
5 MIN READ
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
Decorative image of GDN logo floating in a green cloud above a world map that has other gaming logos floating lower down.
Oct 01, 2024

Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN

Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
7 MIN READ
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Oct 01, 2024

Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5

At Unreal Fest 2024, NVIDIA released new Unreal Engine 5 on-device plugins for NVIDIA ACE, making it easier to build and deploy AI-powered MetaHuman characters...
4 MIN READ
Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5
Sep 23, 2024

Just Released: Free OpenUSD Training Courses

Accelerate your OpenUSD workflows with this free curriculum for developers and 3D practitioners.
1 MIN READ
Just Released: Free OpenUSD Training Courses
Two images of the same person, one looking away from the camera (before) and one looking directly at the camera (after). A label in the lower right says Texel.
Sep 16, 2024

Orchestrating Innovation at Scale with NVIDIA Maxine and Texel

The NVIDIA Maxine AI developer platform is a suite of NVIDIA NIM microservices, cloud-accelerated microservices, and SDKs that offer state-of-the-art features...
5 MIN READ
Orchestrating Innovation at Scale with NVIDIA Maxine and Texel
Sep 11, 2024

Enabling Customizable GPU-Accelerated Video Transcoding Pipelines

Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
GIF of live stream being modified by NVIDIA Holoscan for Media.
Sep 09, 2024

Transform Live Media Pipelines with NVIDIA Holoscan for Media

NVIDIA Holoscan for Media is now ready to be used in live production, taking advantage of the best of both networking and GPU technologies.  Holoscan for...
3 MIN READ
Transform Live Media Pipelines with NVIDIA Holoscan for Media
GIF of an image changing in response to the prompt.
Aug 30, 2024

Fast Inversion for Real-Time Image Editing with Text

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
8 MIN READ
Fast Inversion for Real-Time Image Editing with Text
Still from the MechaBREAK game.
Aug 20, 2024

Deploy the First On-Device Small Language Model for Improved Game Character Roleplay

At Gamescom 2024, NVIDIA announced our first on-device small language model (SLM) for improving the conversation abilities of game characters. We also announced...
4 MIN READ
Deploy the First On-Device Small Language Model for Improved Game Character Roleplay
Image showing side by side comparison of person on webcam. Left side has the input with the user gazing off screen, the right side has the user’s background replaced with a scene of mountains and the user’s eyes are focused on the camera.
Aug 12, 2024

Elevating Video Communication with the NVIDIA Maxine AI Developer Platform and VideoRequest

Effective video communication is important for everyone who communicates online. For businesses, educators, and content creators, it is vital. NVIDIA Maxine is...
5 MIN READ
Elevating Video Communication with the NVIDIA Maxine AI Developer Platform and VideoRequest

Conversational AI

See all
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Chatbot avatar in front of a stylized chat screen on a purple background.
Nov 19, 2024

Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain

In the dynamic world of modern business, where communication and efficient workflows are crucial for success, AI-powered solutions have become a competitive...
9 MIN READ
Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain
GIF shows chat app in use.
Oct 28, 2024

Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA

The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...
11 MIN READ
Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA
Oct 23, 2024

Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint

In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
Oct 22, 2024

Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes

Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs...
16 MIN READ
Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes
Oct 21, 2024

IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient

Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...
5 MIN READ
IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient
NCNS logo on a black background.
Oct 16, 2024

Simplify AI Application Development with NVIDIA Cloud Native Stack

In the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional...
5 MIN READ
Simplify AI Application Development with NVIDIA Cloud Native Stack
Avatars of a patient in a bed with a doctor sitting at a desk in another location, looking at a computer screen.
Oct 01, 2024

Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas

In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
11 MIN READ
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
Sep 26, 2024

Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance

Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Sep 25, 2024

Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint

Providing customers with quality service remains a top priority for businesses across industries, from answering questions and troubleshooting issues to...
5 MIN READ
Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint
Sep 25, 2024

Deploying Accelerated Llama 3.2 from the Edge to the Cloud

Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
6 MIN READ
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Sep 24, 2024

Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo

NVIDIA NeMo has consistently developed automatic speech recognition (ASR) models that set the benchmark in the industry, particularly those topping the Hugging...
13 MIN READ
Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo

Edge Computing

See all
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
Nov 14, 2024

NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features

NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
Close-up shot of a wolf howling. Courtesy of Pexels/patrice schoefolt.
Oct 29, 2024

AI-Powered Devices Track Howls to Save Wolves

A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
AI-Powered Devices Track Howls to Save Wolves
Oct 24, 2024

Powering the Next Wave of AI Robotics with Three Computers 

NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ
Powering the Next Wave of AI Robotics with Three Computers 
A GIF of a hurricane forecast.
Oct 21, 2024

AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead

New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...
3 MIN READ
AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead
Oct 16, 2024

Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs

As the demand for high-performance computing (HPC) and AI applications grows, so does the importance of energy efficiency. NVIDIA Principal Developer Technology...
2 MIN READ
Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs
Decorative image of a person looking at a monitor, which has multiple brain scans displayed.
Oct 16, 2024

Treating Brain Disease with Brain-Machine Interactive Neuromodulation and NVIDIA Jetson

Neuromodulation is a technique that enhances or restores brain function by directly intervening in neural activity. It is commonly used to treat conditions like...
4 MIN READ
Treating Brain Disease with Brain-Machine Interactive Neuromodulation and NVIDIA Jetson
Image of the GB200 NVL2 superchip.
Oct 08, 2024

Bringing AI-RAN to a Telco Near You

Inferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that...
14 MIN READ
Bringing AI-RAN to a Telco Near You
Photo of an image scanner. Source: ORSI Academy.
Oct 07, 2024

Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan

Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
7 MIN READ
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
An image of Antarctica with moss growing. Featured image credit Dr. Krystal Randall
Oct 03, 2024

AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues

Antarctica plays a crucial role in regulating ‌Earth’s climate. Most climate research into the world’s coldest, most windswept continent focuses on the...
5 MIN READ
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues

Data Center / Cloud

See all
Image of an HGX H200
Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Nov 21, 2024

Deploying Fine-Tuned AI Models with NVIDIA NIM

For organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently...
6 MIN READ
Deploying Fine-Tuned AI Models with NVIDIA NIM
Nov 21, 2024

Advancing Ansys Workloads with NVIDIA Grace and NVIDIA Grace Hopper

Accelerated computing is enabling giant leaps in performance and energy efficiency compared to traditional CPU computing. Delivering these advancements requires...
10 MIN READ
Advancing Ansys Workloads with NVIDIA Grace and NVIDIA Grace Hopper
A person looking at a computer monitor.
Nov 21, 2024

Powering AI-Augmented Workloads with NVIDIA and Windows 365

We are entering a new era of AI-powered digital workflow, where Windows 365 Cloud PCs are dynamic platforms that host AI technologies and reshape traditional...
7 MIN READ
Powering AI-Augmented Workloads with NVIDIA and Windows 365
Nov 19, 2024

Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs

Meta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
6 MIN READ
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Code showing how to use epilogs with matrix multiplication in nvmath-python.
Nov 18, 2024

Fusing Epilog Operations with Matrix Multiplication Using nvmath-python

nvmath-python (Beta) is an open-source Python library, providing Python programmers with access to high-performance mathematical operations from NVIDIA CUDA-X...
8 MIN READ
Fusing Epilog Operations with Matrix Multiplication Using nvmath-python
Nov 15, 2024

NVIDIA NIM 1.4 Ready to Deploy with 2.4x Faster Inference

The demand for ready-to-deploy high-performance inference is growing as generative AI reshapes industries. NVIDIA NIM provides production-ready microservice...
3 MIN READ
NVIDIA NIM 1.4 Ready to Deploy with 2.4x Faster Inference
Nov 15, 2024

Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill

In this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
4 MIN READ
Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill
A cloud with a cybersecurity lock icon, surrounded by a sphere of connected nodes.
Nov 14, 2024

Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing

Confidential and self-sovereign AI is a new approach to AI development, training, and inference where the user’s data is decentralized, private, and...
15 MIN READ
Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing
Nov 14, 2024

NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features

NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
Nov 13, 2024

NVIDIA Blackwell Doubles LLM Training Performance in MLPerf Training v4.1

As models grow larger and are trained on more data, they become more capable, making them more useful. To train these models quickly, more performance,...
8 MIN READ
NVIDIA Blackwell Doubles LLM Training Performance in MLPerf Training v4.1
Connected icons on a purple and gray background.
Nov 12, 2024

Spotlight: Accelerating into AI with VDI

The key to starting in AI may be right under your nose. It’s all about seeing the potential in the tools and resources that you already have. Adopt a crawl,...
5 MIN READ
Spotlight: Accelerating into AI with VDI