Developer Tools & Techniques
Jan 14, 2026
How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile
This blog post is part of a series designed to help developers learn NVIDIA CUDA Tile programming for building high-performance GPU kernels, using matrix...
13 MIN READ
Jan 13, 2026
Learn How NVIDIA cuOpt Accelerates Mixed Integer Optimization using Primal Heuristics
NVIDIA cuOpt is a GPU-accelerated optimization engine designed to deliver fast, high-quality solutions for large, complex decision-making problems. Mixed...
7 MIN READ
Jan 09, 2026
Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time
We keep seeing LLMs with larger context windows in the news, along with promises that they can hold entire conversation histories, volumes of books, or multiple...
6 MIN READ
Jan 09, 2026
Multi-Agent Warehouse AI Command Layer Enables Operational Excellence and Supply Chain Intelligence
Warehouses have never been more automated, more data-rich, or more operationally demanding than they are now—yet they still rely on systems that can’t keep...
11 MIN READ
Jan 09, 2026
Build an AI Catalog System That Delivers Localized, Interactive Product Experiences
E-commerce catalogs often contain sparse product data, generic images, a basic title, and a short description. This limits discoverability, engagement, and...
10 MIN READ
Jan 08, 2026
Accelerating LLM and VLM Inference for Automotive and Robotics with NVIDIA TensorRT Edge-LLM
Large language models (LLMs) and multimodal reasoning systems are rapidly expanding beyond the data center. Automotive and robotics developers increasingly want...
6 MIN READ
Jan 05, 2026
Open Source AI Tool Upgrades Speed Up LLM and Diffusion Models on NVIDIA RTX PCs
AI developer activity on PCs is exploding, driven by the rising quality of small language models (SLMs) and diffusion models, such as FLUX.2, GPT-OSS-20B, and...
7 MIN READ
Dec 31, 2025
AI Factories, Physical AI, and Advances in Models, Agents, and Infrastructure That Shaped 2025
2025 was another milestone year for developers and researchers working with NVIDIA technologies. Progress in data center power and compute design, AI...
4 MIN READ
Dec 19, 2025
Accelerating AI-Powered Chemistry and Materials Science Simulations with NVIDIA ALCHEMI Toolkit-Ops
Machine learning interatomic potentials (MLIPs) are transforming the landscape of computational chemistry and materials science. MLIPs enable atomistic...
10 MIN READ
Dec 17, 2025
Real-Time Decoding, Algorithmic GPU Decoders, and AI Inference Enhancements in NVIDIA CUDA-Q QEC
Real-time decoding is crucial to fault-tolerant quantum computers. By enabling decoders to operate with low latency concurrently with a quantum processing unit...
6 MIN READ
Dec 17, 2025
Migrate Apache Spark Workloads to GPUs at Scale on Amazon EMR with Project Aether
Data is the fuel of modern business, but relying on older CPU-based Apache Spark pipelines takes a heavy toll. They’re inherently slow, require large...
7 MIN READ
Dec 17, 2025
Simulate an Accurate Radio Environment Using NVIDIA Aerial Omniverse Digital Twin
The development of 5G and 6G requires high-fidelity radio channel modeling, but the ecosystem is highly fragmented. Link-level simulators, network-level...
10 MIN READ
Dec 16, 2025
Advanced Large-Scale Quantum Simulation Techniques in cuQuantum SDK v25.11
Simulating large-scale quantum computers has become more difficult as the quality of quantum processing units (QPUs) improves. Validating the results is key to...
11 MIN READ
Dec 16, 2025
Boost GPU Memory Performance with No Code Changes Using NVIDIA CUDA MPS
NVIDIA CUDA developers have access to a wide range of tools and libraries that simplify development and deployment, enabling users to focus on the “what”...
14 MIN READ
Dec 12, 2025
How to Scale Fast Fourier Transforms to Exascale on Modern NVIDIA GPU Architectures
Fast Fourier Transforms (FFTs) are widely used across scientific computing, from molecular dynamics and signal processing to computational fluid dynamics (CFD),...
8 MIN READ
Dec 12, 2025
How to Build Privacy-Preserving Evaluation Benchmarks with Synthetic Data
Validating AI systems requires benchmarks—datasets and evaluation workflows that mimic real-world conditions—to measure accuracy, reliability, and safety...
11 MIN READ