Recent

May 15, 2025

Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled

Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...
10 MIN READ
May 15, 2025

Predicting Performance on Apache Spark with GPUs

The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...
9 MIN READ
May 15, 2025

Path Tracing Optimization in Indiana Jones™: Shader Execution Reordering and Live State Reductions

This post is part of the Path Tracing Optimizations in Indiana Jones™ series. While adding a path-tracing mode to Indiana Jones and the Great Circle™...
13 MIN READ
May 15, 2025

Path Tracing Optimizations in Indiana Jones™: Opacity MicroMaps and Compaction of Dynamic BLASs

The first post in this series, Path Tracing Optimization in Indiana Jones™: Shader Execution Reordering and Live State Reductions, covered ray-gen shader...
13 MIN READ
May 15, 2025

AI Helps Uncover Potential Alzheimer’s Cause and Treatment

A gene that can be an early indicator for Alzheimer’s disease actually is a cause of the degenerative-brain disorder, said researchers at the University of...
3 MIN READ
May 15, 2025

Accelerating Embedding Lookups with cuEmbed

NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...
8 MIN READ
May 14, 2025

Build Custom Reasoning Models with Advanced, Open Post-Training Datasets

Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...
5 MIN READ
May 14, 2025

AI Fabric Resiliency and Why Network Convergence Matters

High-performance computing and deep learning workloads are extremely sensitive to latency. Packet loss forces retransmission or stalls in the communication...
7 MIN READ

Inference Performance

May 06, 2025

LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM

This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
11 MIN READ
Apr 21, 2025

Optimizing Transformer-Based Diffusion Models for Video Generation with NVIDIA TensorRT

State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant...
8 MIN READ
Apr 02, 2025

NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0

The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...
10 MIN READ
Apr 02, 2025

LLM Inference Benchmarking: Fundamental Concepts

This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM...
15 MIN READ
Mar 20, 2025

Boost Llama Model Performance on Microsoft Azure AI Foundry with NVIDIA TensorRT-LLM

Microsoft, in collaboration with NVIDIA, announced transformative performance improvements for the Meta Llama family of models on its Azure AI Foundry platform....
4 MIN READ
Mar 18, 2025

Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models

NVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for...
14 MIN READ
Mar 18, 2025

NVIDIA Blackwell Delivers World-Record DeepSeek-R1 Inference Performance

NVIDIA announced world-record DeepSeek-R1 inference performance at NVIDIA GTC 2025. A single NVIDIA DGX system with eight NVIDIA Blackwell GPUs can achieve over...
14 MIN READ
Feb 14, 2025

Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding

Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ

Generative AI

May 14, 2025

Get Trained and Certified at GTC Paris at VivaTech 2025

Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.
1 MIN READ
May 12, 2025

Accelerated AI Inference with NVIDIA NIM on Azure AI Foundry

The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...
8 MIN READ
May 12, 2025

Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework

As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...
6 MIN READ
May 09, 2025

Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research

Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...
11 MIN READ
May 08, 2025

Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks

NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...
12 MIN READ
May 08, 2025

Revolutionizing Neural Reconstruction and Rendering in gsplat with 3DGUT

Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....
5 MIN READ
May 07, 2025

Concept‑Driven AI Teaching Assistant Guides Students to Deeper Insights

In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information,...
8 MIN READ
May 07, 2025

Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator

Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...
7 MIN READ

Data Science

May 08, 2025

Accelerate Deep Learning and LLM Inference with Apache Spark in the Cloud

Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data—documents, emails,...
10 MIN READ
May 08, 2025

Spotlight: Accelerating the Discovery of New Battery Materials with SES AI's Molecular Universe

From the Stone Age to the digital era, materials have been the foundation of our civilization across all epochs. Today, finding new materials leads to progress...
7 MIN READ
May 07, 2025

Using Python to Automate 3D Workflows with OpenUSD 

Universal Scene Description (OpenUSD) offers a powerful, open, and extensible ecosystem for describing, composing, simulating, and collaborating within complex...
7 MIN READ
May 02, 2025

An Even Easier Introduction to CUDA (Updated)

Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...
16 MIN READ
May 01, 2025

Stacking Generalization with HPO: Maximize Accuracy in 15 Minutes with NVIDIA cuML

Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive...
7 MIN READ
Apr 29, 2025

Structuring Applications to Secure the KV Cache

When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...
11 MIN READ
Apr 29, 2025

Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers

Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year’s Google Cloud Next...
9 MIN READ
Apr 23, 2025

NVIDIA cuPyNumeric 25.03 Now Fully Open Source with PIP and HDF5 Support

NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...
4 MIN READ

Robotics

May 12, 2025

Just Released: NVIDIA Warp is Now Open-Source Under Apache 2.0

NVIDIA Warp, a simulation computing framework, is now accessible to all developers.
1 MIN READ
Apr 25, 2025

R²D²: Adapting Dexterous Robots with NVIDIA Research Workflows and Models

Robotic arms are used today for assembly, packaging, inspection, and many more applications. However, they are still preprogrammed to perform specific and often...
8 MIN READ
Mar 31, 2025

Simulating Robots in Industrial Facility Digital Twins

Industrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that...
6 MIN READ
Mar 27, 2025

R²D²: Advancing Robot Mobility and Whole-Body Control with Novel Workflows and AI Foundation Models from NVIDIA Research

Welcome to the first edition of the NVIDIA Robotics Research and Development Digest (R²D²). This technical blog series will give developers and researchers...
10 MIN READ
Mar 26, 2025

Just Released: Omniverse Kit 107.0

Kit SDK 107.0 is a major update release with primary updates for robotics development.
1 MIN READ
Mar 20, 2025

Easily Build Edge AI Apps with Dynamic Flow Control in NVIDIA Holoscan 3.0

NVIDIA announced at GTC 2025 the release of NVIDIA Holoscan 3.0, the real-time AI sensor processing platform. This latest version provides dynamic flow control,...
4 MIN READ
Mar 18, 2025

Introducing NVIDIA Isaac for Healthcare, an AI-Powered Medical Robotics Development Platform

The future of MedTech is robotic—hospitals will be fully automated, with AI-driven surgical systems, robotic assistants, and autonomous patient care...
10 MIN READ
Mar 18, 2025

Petabyte-Scale Video Processing with NVIDIA NeMo Curator on NVIDIA DGX Cloud

With the rise of physical AI, video content generation has surged exponentially. A single camera-equipped autonomous vehicle can generate more than 1 TB of...
9 MIN READ

Simulation / Modeling / Design

May 09, 2025

CUDA C++ Compiler Updates Impacting ELF Visibility and Linkage

In the next CUDA major release, CUDA 13.0, NVIDIA is introducing two significant changes to the NVIDIA CUDA Compiler Driver (NVCC) that will impact ELF...
11 MIN READ
May 06, 2025

Powering Next-Gen XR Design at Rivian with NVIDIA RTX PRO Blackwell Desktop GPUs

For professionals pushing the boundaries of XR, creating the most immersive and highest fidelity experiences is always challenging. Demanding XR workflows push...
6 MIN READ
May 01, 2025

NVIDIA Blackwell and NVIDIA CUDA 12.9 Introduce Family-Specific Architecture Features

One of the earliest architectural design decisions that went into the CUDA platform for NVIDIA GPUs was support for backward compatibility of GPU code. This...
14 MIN READ
Apr 28, 2025

How SETI Uses AI to Search for Intelligent Alien Life

A researcher from the SETI Institute described to a packed audience at GTC 2025 how SETI had successfully trialed a novel method to identify interstellar radio...
3 MIN READ
Apr 24, 2025

Fast Ray Tracing of Dynamic Scenes Using NVIDIA OptiX 9 and NVIDIA RTX Mega Geometry

Real-time ray tracing is a powerful rendering technique that can create incredibly realistic images. NVIDIA OptiX and RTX technology make this possible, even...
9 MIN READ
Apr 21, 2025

AI Inspires Artists and Industrialists to Reimagine their Crafts

AI has become nearly synonymous with innovation. As it rushes onto the world stage, AI is seeding inspiration in creators and problem-solvers of all...
4 MIN READ
Apr 16, 2025

AI-Generated Heat Maps Keep Seniors and their Privacy Safe

By 2030, more than one in five Americans will be 65 or older, becoming the United States’ largest group of seniors ever. Silicon Valley-based startup Butlr...
4 MIN READ
Apr 11, 2025

AI Advances Parkinson’s Detection Using Standard MRI Scans

A simple brain scan may soon be all that's needed to accurately diagnose Parkinson’s disease, thanks to a new AI-powered tool. The advancement could help...
3 MIN READ

Computer Vision / Video Analytics

Apr 24, 2025

Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
7 MIN READ
Apr 05, 2025

NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick

The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...
4 MIN READ
Mar 11, 2025

Build Real-Time Multimodal XR Apps with NVIDIA AI Blueprint for Video Search and Summarization

With the recent advancements in generative AI and vision foundational models, VLMs present a new wave of visual computing wherein the models are capable of...
9 MIN READ
Mar 03, 2025

AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale

In an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world’s...
5 MIN READ
Feb 26, 2025

Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs

Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
4 MIN READ
Feb 26, 2025

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Feb 24, 2025

NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell

The release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...
10 MIN READ
Feb 24, 2025

Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0

NVIDIA announces the implementation of Multi-View High Efficiency Video Coding (MV-HEVC) encoder in the latest NVIDIA Video Codec SDK release, version 13.0....
4 MIN READ

Content Creation / Rendering

May 14, 2025

NVIDIA TensorRT Unlocks FP4 Image Generation for NVIDIA Blackwell GeForce RTX 50 Series GPUs

The launch of the NVIDIA Blackwell platform ushered in a new era of improvements in generative AI technology. At its forefront is the newly launched GeForce RTX...
11 MIN READ
Apr 23, 2025

Real-Time GPU-Accelerated Gaussian Splatting with NVIDIA DesignWorks Sample vk_gaussian_splatting

Gaussian splatting is a novel approach to rendering complex 3D scenes by representing them as a collection of anisotropic Gaussians in 3D space. This technique...
3 MIN READ
Apr 17, 2025

Neural Rendering in NVIDIA OptiX Using Cooperative Vectors

The release of NVIDIA OptiX 9.0 introduces a new feature called cooperative vectors that enables AI workflows as part of ray tracing kernels. The feature...
13 MIN READ
Apr 09, 2025

Just Released: NVIDIA AI Workbench 2025.03.10

NVIDIA AI Workbench 2025.03.10 features streamlined onboarding and enhanced UX for multicontainer projects.
1 MIN READ
Mar 25, 2025

Kickstart Your AI Journey on RTX AI PCs and Workstations with NVIDIA NIM Microservices

With emerging use cases such as digital humans, agents, podcasts, images, and video generation, generative AI is changing the way we interact with PCs. This...
7 MIN READ
Mar 18, 2025

Scale Synthetic Data and Physical AI Reasoning with NVIDIA Cosmos World Foundation Models

The next generation of AI-driven robots like humanoids and autonomous vehicles depends on high-fidelity, physics-aware training data. Without diverse and...
7 MIN READ
Mar 17, 2025

NVIDIA RTX Advances with Neural Rendering and Digital Human Technologies at GDC 2025

AI is transforming how we experience our favorite games. It is unlocking new levels of visuals, performance, and gameplay possibilities with neural rendering...
10 MIN READ
Feb 06, 2025

Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs

Hardware support for ray tracing triangle meshes was introduced as part of NVIDIA RTX in 2018. But ray tracing for hair and fur has remained a compute-intensive...
9 MIN READ

Conversational AI

Apr 29, 2025

Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva

It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...
6 MIN READ
Apr 22, 2025

NVIDIA GTC Training Labs Now Available On Demand

Missed GTC? This year’s training labs are now available on demand to watch anywhere, anytime.
1 MIN READ
Apr 18, 2025

Upcoming Event: NVIDIA Agent Toolkit Hackathon

Build a high-performance agentic AI system using the open-source NVIDIA Agent Intelligence toolkit — contest runs May 12 to May 23.
1 MIN READ
Apr 10, 2025

Curating Biological Findings from Scientific Literature with NVIDIA NIM

Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...
7 MIN READ
Apr 09, 2025

Prevent LLM Hallucinations with the Cleanlab Trustworthy Language Model in NVIDIA NeMo Guardrails

As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as...
9 MIN READ
Mar 04, 2025

Top Conversational AI Sessions at NVIDIA GTC 2025

Learn how to accelerate the full pipeline, from multilingual speech recognition and translation to generative AI and speech synthesis.
1 MIN READ
Mar 03, 2025

Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications

Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...
12 MIN READ
Feb 28, 2025

Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM

NAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...
13 MIN READ

Edge Computing

Apr 16, 2025

Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming

Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....
8 MIN READ
Apr 15, 2025

Event: Data Filtering Challenge for Training Edge Language Models

You’re invited to join the challenge. Develop and apply innovative data filtering techniques to curate datasets that enhance edge LM performance.
1 MIN READ
Apr 11, 2025

Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch

NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...
12 MIN READ
Apr 08, 2025

Using AI to Better Understand the Ocean

Humans know more about deep space than we know about Earth’s deepest oceans. But scientists have plans to change that—with the help of AI.  “We have...
3 MIN READ
Mar 12, 2025

Lightweight, Multimodal, Multilingual Gemma 3 Models Are Streamlined for Performance

Building AI systems with foundation models requires a delicate balancing of resources such as memory, latency, storage, compute, and more. One size does not fit...
3 MIN READ
Mar 10, 2025

Streamline LLM Deployment for Autonomous Vehicle Applications with NVIDIA DriveOS LLM SDK

Large language models (LLMs) have shown remarkable generalization capabilities in natural language processing (NLP). They are used in a wide range of...
7 MIN READ
Feb 20, 2025

Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025

Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
1 MIN READ
Jan 06, 2025

Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release

At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ

Data Center / Cloud

May 13, 2025

Connect Simulations with the Real World Using NVIDIA Air Services

NVIDIA Air enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. With NVIDIA Air, you can spin up...
6 MIN READ
May 08, 2025

Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA NeMo Framework

Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...
6 MIN READ
May 06, 2025

New NVIDIA NV-Tesseract Time Series Models Advance Dataset Processing and Anomaly Detection

Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it’s streamlining...
5 MIN READ
Apr 29, 2025

NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support

The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...
5 MIN READ
Apr 23, 2025

Announcing NVIDIA Secure AI General Availability

As many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models...
3 MIN READ
Apr 14, 2025

Just Released: NVIDIA Run:ai 2.21

NVIDIA Run:ai 2.21 adds GB200 NVL72 support, rolling inference updates and smarter resource controls.
1 MIN READ
Apr 11, 2025

NVIDIA Helps Build AI Factories Faster Than Ever with NVIDIA DGX SuperPOD

In a cavernous room at an undisclosed location in Japan, a digital revolution is unfolding. Racks of servers stand like giants, their sleek frames linked by...
5 MIN READ
Apr 10, 2025

Just Released: NVIDIA HPC SDK v25.3

The HPC SDK v25.3 release includes support for NVIDIA Blackwell GPUs and an optimized allocator for Arm CPUs.
1 MIN READ