General

Aug 13, 2025
Scaling LLM Reinforcement Learning with Prolonged Training Using ProRL v2
Currently, one of the most compelling questions in AI is whether large language models (LLMs) can continue to improve through sustained reinforcement learning...
8 MIN READ

Aug 07, 2025
How Hackers Exploit AI's Problem-Solving Instincts
As multimodal AI models advance from perception to reasoning, and even start acting autonomously, new attack surfaces emerge. These threats don’t just target...
10 MIN READ

Jul 31, 2025
Securing Agentic AI: How Semantic Prompt Injections Bypass AI Guardrails
Prompt injection, where adversaries manipulate inputs to make large language models behave in unintended ways, has long posed a threat to AI systems since the...
8 MIN READ

Jul 29, 2025
FourCastNet 3 Enables Fast and Accurate Large Ensemble Weather Forecasting with Scalable Geometric ML
FourCastNet3 (FCN3) is the latest AI global weather forecasting system from NVIDIA Earth-2. FCN3 offers an unprecedented combination of probabilistic skill,...
7 MIN READ

Jul 22, 2025
Building Robotic Mental Models with NVIDIA Warp and Gaussian Splatting
This post explores a promising direction for building dynamic digital representations of the physical world, a topic gaining increasing attention in recent...
4 MIN READ

Jul 17, 2025
New Learning Pathway: Deploy AI Models with NVIDIA NIM on GKE
Get hands-on with Google Kubernetes Engine (GKE) and NVIDIA NIM when you join the new Google Cloud and NVIDIA community.
1 MIN READ

Jul 11, 2025
Improving Synthetic Data Augmentation and Human Action Recognition with SynthDa
Human action recognition is a capability in AI systems designed for safety-critical applications, such as surveillance, eldercare, and industrial monitoring....
10 MIN READ

Jul 03, 2025
RAPIDS Adds GPU Polars Streaming, a Unified GNN API, and Zero-Code ML Speedups
RAPIDS, a suite of NVIDIA CUDA-X libraries for Python data science, released version 25.06, introducing exciting new features. These include a Polars GPU...
6 MIN READ

Jul 01, 2025
Per-Tensor and Per-Block Scaling Strategies for Effective FP8 Training
In this blog post, we’ll break down the main FP8 scaling strategies—per-tensor scaling, delayed and current scaling, and per-block scaling (including the...
10 MIN READ

Jun 25, 2025
Join Us at We Are Developers World Congress 2025
Join us at We Are Developers World Congress from July 9 to 11 to attend our workshops and connect with experts.
1 MIN READ

Jun 24, 2025
NVIDIA Run:ai and Amazon SageMaker HyperPod: Working Together to Manage Complex AI Training
NVIDIA Run:ai and Amazon Web Services have introduced an integration that lets developers seamlessly scale and manage complex AI training workloads. Combining...
5 MIN READ

Jun 18, 2025
LLM Inference Benchmarking: How Much Does Your LLM Inference Cost?
This is the fourth post in the large language model latency-throughput benchmarking series, which aims to instruct developers on how to determine the cost of...
10 MIN READ

Jun 13, 2025
Run High-Performance LLM Inference Kernels from NVIDIA Using FlashInfer​​
Best-in-class LLM Inference requires two key elements: speed and developer velocity. Speed refers to maximizing the efficiency of the underlying hardware by...
6 MIN READ

Jun 13, 2025
New Professional Certifications in Accelerated Data Science & AI Networking
Unlock your potential with the new NCP-Accelerated Data Science and AI Networking certifications. Validate your skills in GPU-accelerated tools, data science...
1 MIN READ

Jun 13, 2025
Live Webinar: What’s New With NVIDIA Certification
Join this multi-time zone webinar on learning more about the NVIDIA Certifications. Learn the practical prep tips from NVIDIA Certification experts, insights on...
1 MIN READ

Jun 06, 2025
Introducing the Nemotron-H Reasoning Model Family: Throughput Gains Without Compromise
As large language models increasingly take on reasoning-intensive tasks in areas like math and science, their output lengths are getting significantly...
7 MIN READ