Recent posts
Mar 13, 2026
Scale Synthetic Data and Physical AI Reasoning with NVIDIA Cosmos World Foundation Models
The next generation of AI-driven robots like humanoids and autonomous vehicles depends on high-fidelity, physics-aware training data. Without diverse and...
8 MIN READ
Mar 12, 2026
Build Accelerated, Differentiable Computational Physics Code for AI with NVIDIA Warp
Computer-aided engineering (CAE) is shifting from human-driven workflows toward AI-driven ones, including physics foundation models that generalize across...
18 MIN READ
Mar 12, 2026
Validate Kubernetes for GPU Infrastructure with Layered, Reproducible Recipes
Every AI cluster running on Kubernetes requires a full software stack that works together, from low-level driver and kernel settings to high-level operator and...
5 MIN READ
Mar 12, 2026
Build Next-Gen Physical AI with Edge‑First LLMs for Autonomous Vehicles and Robotics
Physical AI is rapidly evolving, from next-generation software-defined autonomous vehicles (AVs) to humanoid robots. The challenge is no longer how to run a...
7 MIN READ
Mar 11, 2026
Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning
Agentic AI systems need models with the specialized depth to solve dense technical problems autonomously. They must excel at reasoning, coding, and long-context...
12 MIN READ
Mar 10, 2026
Reliable AI Coding for Unreal Engine: Improving Accuracy and Reducing Token Costs
Agentic code assistants are moving into daily game development as studios build larger worlds, ship more DLCs, and support distributed teams. These assistants...
6 MIN READ
Mar 10, 2026
NVIDIA RTX Innovations Are Powering the Next Era of Game Development
NVIDIA RTX ray tracing and AI-powered neural rendering technologies are redefining how games are made, enabling a new standard for visuals and performance. At...
13 MIN READ
Mar 09, 2026
CUDA 13.2 Introduces Enhanced CUDA Tile Support and New Python Features
CUDA 13.2 arrives with a major update: NVIDIA CUDA Tile is now supported on devices of compute capability 8.X architectures (NVIDIA Ampere and NVIDIA Ada), as...
15 MIN READ
Mar 09, 2026
Implementing Falcon-H1 Hybrid Architecture in NVIDIA Megatron Core
In the rapidly evolving landscape of large language model (LLM) development, NVIDIA Megatron Core has emerged as the foundational framework for training massive...
9 MIN READ
Mar 09, 2026
Enhancing Distributed Inference Performance with the NVIDIA Inference Transfer Library
Deploying large language models (LLMs) requires large-scale distributed inference, which spreads model computation and request handling across many GPUs and...
13 MIN READ
Mar 09, 2026
Removing the Guesswork from Disaggregated Serving
Deploying and optimizing large language models (LLMs) for high-performance, cost-effective serving can be an overwhelming engineering problem. The ideal...
10 MIN READ
Mar 05, 2026
Controlling Floating-Point Determinism in NVIDIA CCCL
A computation is considered deterministic if multiple runs with the same input data produce the same bitwise result. While this may seem like a simple property...
7 MIN READ
Mar 05, 2026
Tuning Flash Attention for Peak Performance in NVIDIA CUDA Tile
In this post, we dive into one of the most critical workloads in modern AI: Flash Attention. You’ll learn how to implement Flash Attention using NVIDIA...
20 MIN READ
Mar 03, 2026
How to Minimize Game Runtime Inference Costs with Coding Agents
NVIDIA ACE is a suite of technologies for building AI agents for gaming. ACE provides ready-to-integrate cloud and on-device AI models for every part of in-game...
10 MIN READ
Mar 03, 2026
cuTile.jl Brings NVIDIA CUDA Tile-Based Programming to Julia
NVIDIA CUDA Tile is one of the most significant additions to NVIDIA CUDA programming and unlocks automatic access to tensor cores and other specialized...
5 MIN READ
Feb 28, 2026
5 New Digital Twin Products Developers Can Use to Build 6G Networks
To make 6G a reality, the telecom industry must overcome a fundamental challenge: how to design, train, and validate AI-native networks that are too complex to...
6 MIN READ