NVIDIA Technical Blog
-
Agentic AI / Generative AICreate Your Own Bash Computer Use Agent with NVIDIA Nemotron in One Hour
-
RoboticsUnlock Faster, Smarter Edge Models with 7x Gen AI Performance on NVIDIA Jetson AGX Thor
-
Agentic AI / Generative AINVIDIA Blackwell Leads on SemiAnalysis InferenceMAX v1 Benchmarks
-
Data Center / CloudBuilding the 800 VDC Ecosystem for Efficient, Scalable AI Factories
-
Agentic AI / Generative AIBuild a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron
Recent

Oct 23, 2025
Reconstruct a Scene in NVIDIA Isaac Sim Using Only a Smartphone
Building realistic 3D environments for robotics simulation can be a labor-intensive process. Now, with NVIDIA Omniverse NuRec, you can complete the entire...
10 MIN READ

Oct 23, 2025
Train an LLM on an NVIDIA Blackwell Desktop with Unsloth—and Scale It
Fine-tuning and reinforcement learning (RL) for large language models (LLMs) require advanced expertise and complex workflows, making them out of reach for...
5 MIN READ

Oct 23, 2025
Bring Your Circuits to CUDA-Q Using QGEAR
Download NERSC’s QGEAR project to easily import Qiskit circuits into GPU-accelerated CUDA-Q kernels.
1 MIN READ

Oct 22, 2025
Create Your Own Bash Computer Use Agent with NVIDIA Nemotron in One Hour
What if you could talk to your computer and have it perform tasks through the Bash terminal, without you writing a single command? With NVIDIA Nemotron Nano v2,...
14 MIN READ

Oct 21, 2025
Build Practical Deep-Learning Skills for Real-World AI Applications with the New NVIDIA Learning Path
Check out the learning path page and sign up for courses, workshops, and certifications to help develop your skills.
1 MIN READ

Oct 21, 2025
NVIDIA ACE Adds Open Source Qwen3 SLM for On-Device Deployment in PC Games
To help create real-time, dynamic NPC game characters, NVIDIA ACE now supports the open source Qwen3-8B small language model (SLM) for on-device...
4 MIN READ

Oct 20, 2025
Build an AI Agent to Analyze IT Tickets with NVIDIA Nemotron
Modern organizations generate a massive volume of operational data through ticketing systems, incident reports, service requests, support escalations, and more....
11 MIN READ

Oct 20, 2025
Enabling Scalable AI-Driven Molecular Dynamics Simulations
Molecular dynamics (MD) simulations are a powerful tool in computational chemistry and materials science, and they’re essential for studying chemical...
14 MIN READ
Inference Performance

Oct 20, 2025
Scaling Large MoE Models with Wide Expert Parallelism on NVL72 Rack Scale Systems
Modern AI workloads have moved well beyond single-GPU inference serving. Model parallelism, which efficiently splits computation across many GPUs, is now the...
10 MIN READ

Oct 13, 2025
NVIDIA Blackwell Leads on SemiAnalysis InferenceMAX v1 Benchmarks
SemiAnalysis recently launched InferenceMAX v1, a new open source initiative that provides a comprehensive methodology to evaluate inference hardware...
11 MIN READ

Sep 29, 2025
Smart Multi-Node Scheduling for Fast and Efficient LLM Inference with NVIDIA Run:ai and NVIDIA Dynamo
The exponential growth in large language model complexity has created challenges, such as models too large for single GPUs, workloads that demand high...
9 MIN READ

Sep 18, 2025
How to Reduce KV Cache Bottlenecks with NVIDIA Dynamo
As AI models grow larger and more sophisticated, inference, the process by which a model generates responses, is becoming a major challenge. Large language...
11 MIN READ

Sep 17, 2025
An Introduction to Speculative Decoding for Reducing Latency in AI Inference
Generating text with large language models (LLMs) often involves running into a fundamental bottleneck. GPUs offer massive compute, yet much of that power sits...
11 MIN READ

Sep 16, 2025
Reducing Cold Start Latency for LLM Inference with NVIDIA Run:ai Model Streamer
Deploying large language models (LLMs) poses a challenge in optimizing inference efficiency. In particular, cold start delays—where models take significant...
13 MIN READ

Sep 10, 2025
Accelerate Protein Structure Inference Over 100x with NVIDIA RTX PRO 6000 Blackwell Server Edition
The race to understand protein structures has never been more critical. From accelerating drug discovery to preparing for future pandemics, the ability to...
6 MIN READ

Sep 10, 2025
Deploy Scalable AI Inference with NVIDIA NIM Operator 3.0.0
AI models, inference engine backends, and distributed inference frameworks continue to evolve in architecture, complexity, and scale. With the rapid pace of...
7 MIN READ
Build AI Agents

Oct 10, 2025
Build a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron
Logs are the lifeblood of modern systems. But as applications scale, logs often grow into endless walls of text—noisy, repetitive, and overwhelming. Hunting...
5 MIN READ

Sep 23, 2025
Build a Retrieval-Augmented Generation (RAG) Agent with NVIDIA Nemotron
Unlike traditional LLM-based systems that are limited by their training data, retrieval-augmented generation (RAG) improves text generation by incorporating...
17 MIN READ

Sep 15, 2025
Build a Report Generator AI Agent with NVIDIA Nemotron on OpenRouter
Unlike traditional systems that follow predefined paths, AI agents are autonomous systems that use large language models (LLMs) to make decisions, adapt to...
14 MIN READ

Jul 29, 2025
Build More Accurate and Efficient AI Agents with the New NVIDIA Llama Nemotron Super v1.5
AI agents now solve multi-step problems, write production-level code, and act as general assistants across multiple domains. But to reach their full potential,...
6 MIN READ

Jul 22, 2025
Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo
Have you ever wanted to build your own reasoning models such as the NVIDIA Nemotron, but thought it was too complicated or required massive resources? Think...
18 MIN READ

Apr 08, 2025
Build Enterprise AI Agents with Advanced Open NVIDIA Llama Nemotron Reasoning Models
This updated post was originally published on March 18, 2025. Organizations are embracing AI agents to enhance productivity and streamline operations. To...
12 MIN READ
Agentic AI / Generative AI

Oct 15, 2025
Agentic AI Unleashed: Join the AWS & NVIDIA Hackathon
Build the next generation of intelligent, autonomous applications. This isn't just a hackathon—it's your chance to unleash the power of agentic AI and show...
1 MIN READ

Oct 15, 2025
Unlock Faster, Smarter Edge Models with 7x Gen AI Performance on NVIDIA Jetson AGX Thor
A defining strength of the NVIDIA software ecosystem is its commitment to continuous optimization. In August, NVIDIA Jetson AGX Thor launched, with up to a 5x...
8 MIN READ

Oct 13, 2025
Building the 800 VDC Ecosystem for Efficient, Scalable AI Factories
For decades, traditional data centers have been vast halls of servers with power and cooling as secondary considerations. The rise of generative AI has changed...
9 MIN READ

Oct 09, 2025
From Assistant to Adversary: Exploiting Agentic AI Developer Tools
Developers are increasingly turning to AI-enabled tools for coding, including Cursor, OpenAI Codex, Claude Code, and GitHub Copilot. While these automation...
10 MIN READ

Oct 03, 2025
Enable Gang Scheduling and Workload Prioritization in Ray with NVIDIA KAI Scheduler
NVIDIA KAI Scheduler is now natively integrated with KubeRay, bringing the same scheduling engine that powers high‑demand and high-scale environments in...
10 MIN READ

Oct 02, 2025
Practical LLM Security Advice from the NVIDIA AI Red Team
Over the last several years, the NVIDIA AI Red Team (AIRT) has evaluated numerous and diverse AI-enabled systems for potential vulnerabilities and security...
8 MIN READ

Sep 30, 2025
Advancing Anomaly Detection for Industry Applications with NVIDIA NV-Tesseract-AD
In a recent blog post, we introduced NVIDIA NV-Tesseract, a family of models designed to unify anomaly detection, classification, and forecasting within a...
10 MIN READ

Sep 25, 2025
How to Integrate Computer Vision Pipelines with Generative AI and Reasoning
Generative AI is opening new possibilities for analyzing existing video streams. Video analytics are evolving from counting objects to turning raw video content...
10 MIN READ
Robotics

Sep 29, 2025
Streamline Robot Learning with Whole-Body Control and Enhanced Teleoperation in NVIDIA Isaac Lab 2.3
Training robot policies from real-world demonstrations is costly, slow, and prone to overfitting, limiting generalization across tasks and environments. A...
10 MIN READ

Sep 29, 2025
Train a Quadruped Locomotion Policy and Simulate Cloth Manipulation with NVIDIA Isaac Lab and Newton
Physics plays a crucial role in robotic simulation, providing the foundation for accurate virtual representations of robot behavior and interactions within...
13 MIN READ

Sep 29, 2025
3 Easy Ways to Supercharge Your Robotics Development Using OpenUSD
The increasing demand for robotics is driving the need for physics-accurate simulation at an unprecedented scale. Universal Scene Description (OpenUSD) is key...
7 MIN READ

Sep 29, 2025
Advancing Robotics Development with Neural Dynamics in Newton
Modern robotics requires more than what classical analytic dynamics provides because of simplified contacts, omitted kinematic loops, and non-differentiable...
9 MIN READ

Sep 25, 2025
R²D²: Three Neural Breakthroughs Transforming Robot Learning from NVIDIA Research
While today's robots excel in controlled settings, they still struggle with the unpredictability, dexterity, and nuanced interactions required for real-world...
9 MIN READ

Sep 16, 2025
Just Released: Warp 1.9
The new release introduces CUDA 13.0 support and new functions for ahead-of-time compilation module.
1 MIN READ

Sep 03, 2025
Accelerate Autonomous Vehicle Development with the NVIDIA DRIVE AGX Thor Developer Kit
Autonomous vehicle (AV) technology is rapidly evolving, fueled by ever-larger and more complex AI models deployed at the edge. Modern vehicles now require not...
8 MIN READ

Sep 02, 2025
What’s New in CUDA Toolkit 13.0 for Jetson Thor: Unified Arm Ecosystem and More
The world of embedded and edge computing is about to get faster, more efficient, and more versatile with the upcoming CUDA 13.0 release for Jetson Thor SoC...
12 MIN READ
Data Science

Oct 14, 2025
Improve Variant Calling Accuracy with NVIDIA Parabricks
Built for data scientists and bioinformaticians, NVIDIA Parabricks is a scalable genomics software suite for secondary analysis. Providing GPU-accelerated...
7 MIN READ

Oct 08, 2025
Training Federated AI Models to Predict Protein Properties
Predicting where proteins are located inside a cell is critical in biology and drug discovery. This process is known as subcellular localization. The location...
5 MIN READ

Oct 06, 2025
Speeding Up Data Decompression with nvCOMP and the NVIDIA Blackwell Decompression Engine
Compression is a common technique to reduce storage costs and accelerate input/output transfer times across databases, data-center communications,...
7 MIN READ

Oct 06, 2025
Accelerating Large-Scale Data Analytics with GPU-Native Velox and NVIDIA cuDF
As workloads scale and demand for faster data processing grows, GPU-accelerated databases and query engines have been shown to deliver significant...
7 MIN READ

Sep 25, 2025
How to GPU-Accelerate Model Training with CUDA-X Data Science
In previous posts on AI in manufacturing and operations, we covered the unique data challenges in the supply chain and how smart feature engineering can...
8 MIN READ

Sep 23, 2025
Faster Training Throughput in FP8 Precision with NVIDIA NeMo
In previous posts on FP8 training, we explored the fundamentals of FP8 precision and took a deep dive into the various scaling recipes for practical large-scale...
12 MIN READ

Sep 23, 2025
How to Accelerate Community Detection in Python Using GPU-Powered Leiden
Community detection algorithms play an important role in understanding data by identifying hidden groups of related entities in networks. Social network...
9 MIN READ

Sep 18, 2025
The Kaggle Grandmasters Playbook: 7 Battle-Tested Modeling Techniques for Tabular Data
Over hundreds of Kaggle competitions, we've refined a playbook that consistently lands us near the top of the leaderboard—no matter if we’re working with...
13 MIN READ
Simulation / Modeling / Design

Sep 19, 2025
Predict Extreme Weather Events in Minutes Without a Supercomputer
Scientists from NVIDIA, in collaboration with Lawrence Berkeley National Laboratory (Berkeley Lab), released a machine learning tool called Huge Ensembles...
5 MIN READ

Sep 16, 2025
Autodesk Research Brings Warp Speed to Computational Fluid Dynamics on NVIDIA GH200
Computer-aided engineering (CAE) forms the backbone for modern product development across industries, from designing safer aircraft to optimizing renewable...
8 MIN READ

Sep 05, 2025
Just Released: NVIDIA PhysicsNeMo 25.08
NVIDIA PhysicsNeMo 25.08 is packed with powerful new workflows and recipes for CAE application developers.
1 MIN READ

Sep 03, 2025
How to Run AI-Powered CAE Simulations
In modern engineering, the pace of innovation is closely linked to the ability to perform accelerated simulations. Computer-aided engineering (CAE) plays a...
13 MIN READ

Aug 28, 2025
Getting Started with NVIDIA Isaac for Healthcare Using the Telesurgery Workflow
Telesurgery is no longer a futuristic idea—it’s quickly becoming essential to how care is delivered. With a global shortage of surgeons projected to reach...
8 MIN READ

Aug 27, 2025
How to Improve CUDA Kernel Performance with Shared Memory Register Spilling
When a CUDA kernel requires more hardware registers than are available, the compiler is forced to move the excess variables into local memory, a process known...
9 MIN READ

Aug 21, 2025
Less Coding, More Science: Simplify Ocean Modeling on GPUs With OpenACC and Unified Memory
NVIDIA HPC SDK v25.7 delivers a significant leap forward for developers working on high-performance computing (HPC) applications with GPU acceleration. This...
11 MIN READ

Aug 21, 2025
Improve Data Integrity and Security with Accelerated Hash Functions and Merkle Trees in cuPQC 0.4
As datasets get bigger, ensuring data security and integrity becomes increasingly important. Cryptographic techniques, such as inclusion proofs, data-integrity...
7 MIN READ
Computer Vision / Video Analytics

Sep 23, 2025
Build a Real-Time Visual Inspection Pipeline with NVIDIA TAO 6 and NVIDIA DeepStream 8
Building a robust visual inspection pipeline for defect detection and quality control is not easy. Manufacturers and developers often face challenges such as...
12 MIN READ

Sep 16, 2025
What’s New in PyNvVideoCodec 2.0 for Python GPU-Accelerated Video Processing
Powerful hardware-accelerated video processing in Python just got easier. PyNvVideoCodec is an NVIDIA Python-based library for GPU-accelerated video encoding,...
4 MIN READ

Sep 11, 2025
Build High-Performance Vision AI Pipelines with NVIDIA CUDA-Accelerated VC-6
The constantly increasing compute throughput of NVIDIA GPUs presents a new opportunity for optimizing vision AI workloads: keeping the hardware fed with data....
13 MIN READ

Aug 25, 2025
Introducing NVIDIA Jetson Thor, the Ultimate Platform for Physical AI
Robotics is undergoing a revolution, moving beyond the era of specialist machines to generalist robotics. This shift moves away from single-purpose,...
14 MIN READ

Aug 11, 2025
Maximize Robotics Performance by Post-Training NVIDIA Cosmos Reason
First unveiled at NVIDIA GTC 2025, NVIDIA Cosmos Reason is an open and fully customizable reasoning vision language model (VLM) for physical AI and robotics....
5 MIN READ

Jul 11, 2025
Improving Synthetic Data Augmentation and Human Action Recognition with SynthDa
Human action recognition is a capability in AI systems designed for safety-critical applications, such as surveillance, eldercare, and industrial monitoring....
10 MIN READ

Jun 24, 2025
Making Industrial Robots More Nimble With NVIDIA Isaac Manipulator and Vention MachineMotion AI
As industrial automation accelerates, factories are increasingly relying on advanced robotics to boost productivity and operational resilience. The successful...
7 MIN READ

Jun 18, 2025
Run Multimodal Extraction for More Efficient AI Pipelines Using One GPU
As enterprises generate and consume increasing volumes of diverse data, extracting insights from multimodal documents, like PDFs and presentations, has become a...
8 MIN READ
Content Creation / Rendering

Sep 30, 2025
How id Software Used Neural Rendering and Path Tracing in DOOM: The Dark Ages
DOOM: The Dark Ages pushes real-time graphics to new limits by integrating RTX neural rendering and path tracing, setting a new standard for how modern games...
6 MIN READ

Sep 24, 2025
NVIDIA Open Sources Audio2Face Animation Model
By leveraging large language and speech models, generative AI is creating intelligent 3D avatars that can engage users in natural conversation, from video games...
7 MIN READ

Aug 20, 2025
Deploying Your Omniverse Kit Apps at Scale
Running 3D applications that take advantage of advanced rendering and simulation technologies often requires users to navigate complex installs and have access...
12 MIN READ

Aug 18, 2025
Announcing the Latest NVIDIA Gaming AI and Neural Rendering Technologies
Today at Gamescom 2025, NVIDIA unveiled updates to NVIDIA RTX neural rendering and NVIDIA ACE generative AI technologies that enable developers to deliver...
9 MIN READ

Jul 29, 2025
Building CAD to USD Workflows with NVIDIA Omniverse
Transferring 3D data between applications has long been a challenge, especially with proprietary formats such as native computer-aided design (CAD) files. CAD...
16 MIN READ

Jul 10, 2025
Accelerating Video Production and Customization with GliaCloud and NVIDIA Omniverse Libraries
The proliferation of generative AI video models, along with the new workflows these models have introduced, has significantly accelerated production efficiency...
4 MIN READ

Jul 02, 2025
NVIDIA Omniverse: What Developers Need to Know About Migration Away From Launcher
As part of continued efforts to ensure NVIDIA Omniverse is a developer-first platform, NVIDIA will be deprecating the Omniverse Launcher on Oct. 1. Doing so...
2 MIN READ

Jun 17, 2025
Getting Started with Project G-Assist: Build a Twitch-Integrated Plug-in
Today, tweaking your PC to suit your workflows often involves digging through menus and settings across multiple control panels. Project G-Assist is an...
7 MIN READ
Edge Computing

Oct 15, 2025
Accelerated and Distributed UPF for the Era of Agentic AI and 6G
The telecommunications industry is innovating rapidly toward 6G for both AI-native Radio Access Networks (AI-RAN) and AI-Core. The distributed User Plane...
10 MIN READ

Jul 16, 2025
Driving AI-Powered Robotics Development with NVIDIA Isaac for Healthcare
By 2030, the World Health Organization projects a global shortage of over 15 million healthcare workers, including surgeons, radiologists, and nurses. In the...
6 MIN READ

Jun 27, 2025
AI Analyzes Nurses’ Observations to Reduce Patient Danger
Researchers have developed an AI-powered tool that can analyze nurses’ shift notes to identify—far earlier than traditional methods—when an admitted...
4 MIN READ

Jun 12, 2025
Run High-Performance AI Applications with NVIDIA TensorRT for RTX
NVIDIA TensorRT for RTX is now available for download as an SDK that can be integrated into C++ and Python applications for both Windows and Linux. At...
7 MIN READ

Jun 12, 2025
NVIDIA Holoscan Sensor Bridge Empowers Developers with Real-Time Data Processing
In the rapidly evolving robotics and edge AI landscape, the ability to efficiently process and transfer sensor data is crucial. Many edge applications are...
9 MIN READ

Jun 09, 2025
A Fine-tuning–Free Approach for Rapidly Recovering LLM Compression Errors with EoRA
Model compression techniques have been extensively explored to reduce the computational resource demands of serving large language models (LLMs) or other...
9 MIN READ

Jun 08, 2025
AI Helps Locate Dangerous Fishing Nets Lost at Sea
Conservationists have launched a new AI tool that can sift through petabytes of underwater imaging from anywhere in the world to identify signs of abandoned or...
4 MIN READ

May 30, 2025
AI Brings Coral Reefs Into Focus
Researchers have unveiled a new AI model that can transform hard-to-see underwater images into clear, highly accurate 3D scenes. It can help ecologists more...
4 MIN READ
Data Center / Cloud

Oct 14, 2025
Understanding Memory Management on Hardware-Coherent Platforms
If you're an application developer or a cluster administrator, you’ve likely seen how non-uniform memory access (NUMA) can impact system performance. When an...
6 MIN READ

Sep 19, 2025
NVIDIA HGX B200 Reduces Embodied Carbon Emissions Intensity
NVIDIA HGX B200 is revolutionizing accelerated computing by unlocking unprecedented performance and energy efficiency. This post shows how HGX B200 is...
5 MIN READ

Sep 10, 2025
Developers Can Now Get NVIDIA CUDA Directly from Their Favorite Third-Party Platforms
Building and deploying applications can be challenging for developers, requiring them to navigate the complex relationship between hardware and software...
3 MIN READ

Sep 10, 2025
Maximizing Low-Latency Networking Performance for Financial Services with NVIDIA Rivermax and NEIO FastSocket
Ultra-low latency and reliable packet delivery are critical requirements for modern applications in sectors such as the financial services industry (FSI), cloud...
10 MIN READ

Sep 09, 2025
How to Connect Distributed Data Centers Into Large AI Factories with Scale-Across Networking
AI scaling is incredibly complex, and new techniques in training and inference are continually demanding more out of the data center. While data center...
6 MIN READ

Sep 09, 2025
NVIDIA Blackwell Ultra Sets New Inference Records in MLPerf Debut
As large language models (LLMs) grow larger, they get smarter, with open models from leading developers now featuring hundreds of billions of parameters. At the...
10 MIN READ

Sep 09, 2025
NVIDIA Rubin CPX Accelerates Inference Performance and Efficiency for 1M+ Token Context Workloads
Inference has emerged as the new frontier of complexity in AI. Modern models are evolving into agentic systems capable of multi-step reasoning, persistent...
5 MIN READ

Sep 08, 2025
How to Build AI Systems In House with Outerbounds and DGX Cloud Lepton
It’s easy to underestimate how many moving parts a real-world, production-grade AI system involves. Whether you're building an agent that combines internal...
10 MIN READ
Networking / Communications

Sep 03, 2025
North–South Networks: The Key to Faster Enterprise AI Workloads
In AI infrastructure, data fuels the compute engine. With evolving agentic AI systems, where multiple models and services interact, fetch external context, and...
9 MIN READ

Aug 26, 2025
How Industry Collaboration Fosters NVIDIA Co-Packaged Optics
NVIDIA is reshaping the landscape of data-center connectivity by seamlessly integrating optical and electrical components. But it’s not doing it alone....
8 MIN READ

Aug 22, 2025
Inside NVIDIA Blackwell Ultra: The Chip Powering the AI Factory Era
As the latest member of the NVIDIA Blackwell architecture family, the NVIDIA Blackwell Ultra GPU builds on core innovations to accelerate training and AI...
14 MIN READ

Aug 21, 2025
Scaling AI Inference Performance and Flexibility with NVIDIA NVLink and NVLink Fusion
The exponential growth in AI model complexity has driven parameter counts from millions to trillions, requiring unprecedented computational resources that...
7 MIN READ

Aug 18, 2025
Scaling AI Factories with Co-Packaged Optics for Better Power Efficiency
As artificial intelligence redefines the computing landscape, the network has become the critical backbone shaping the data center of the future. Large language...
8 MIN READ

Jul 30, 2025
Using CI/CD to Automate Network Configuration and Deployment
Continuous integration and continuous delivery/deployment (CI/CD) is a set of modern software development practices used for delivering code changes more...
6 MIN READ

Jul 22, 2025
Understanding NCCL Tuning to Accelerate GPU-to-GPU Communication
The NVIDIA Collective Communications Library (NCCL) is essential for fast GPU-to-GPU communication in AI workloads, using various optimizations and tuning to...
14 MIN READ

Jul 18, 2025
Automating Network Design in NVIDIA Air with Ansible and Git
At its core, NVIDIA Air is built for automation. Every part of your network can be coded, versioned, and set to trigger automatically. This includes creating...
6 MIN READ