Synthetic Data Generation
Dec 12, 2025
How to Build Privacy-Preserving Evaluation Benchmarks with Synthetic Data
Validating AI systems requires benchmarks—datasets and evaluation workflows that mimic real-world conditions—to measure accuracy, reliability, and safety...
11 MIN READ
Dec 05, 2025
NVIDIA Kaggle Grandmasters Win Artificial General Intelligence Competition
NVIDIA researchers on Friday won a key Kaggle competition many in the field treat as a real-time pulse check on humanity’s progress toward artificial general...
3 MIN READ
Dec 01, 2025
How to Scale Data Generation for Physical AI with the NVIDIA Cosmos Cookbook
Building powerful physical AI models requires diverse, controllable, and physically grounded data at scale. Collecting large-scale, diverse real-world datasets...
9 MIN READ
Oct 28, 2025
Accelerating AV Simulation with Neural Reconstruction and World Foundation Models
Autonomous vehicle (AV) stacks are evolving from a hierarchy of discrete building blocks to end-to-end architectures built on foundation models. This transition...
8 MIN READ
Oct 24, 2025
Build Synthetic Data Pipelines to Train Smarter Robots with NVIDIA Isaac Sim
As robots take on increasingly dynamic mobility tasks, developers need physics-accurate simulations that scale efficiently across environments and workloads....
9 MIN READ
Sep 29, 2025
Streamline Robot Learning with Whole-Body Control and Enhanced Teleoperation in NVIDIA Isaac Lab 2.3
Training robot policies from real-world demonstrations is costly, slow, and prone to overfitting, limiting generalization across tasks and environments. A...
11 MIN READ
Jul 11, 2025
Improving Synthetic Data Augmentation and Human Action Recognition with SynthDa
Human action recognition is a capability in AI systems designed for safety-critical applications, such as surveillance, eldercare, and industrial monitoring....
10 MIN READ
Jun 11, 2025
Advancing Agentic AI with NVIDIA Nemotron Open Reasoning Models
As AI progresses toward greater autonomy, the emergence of AI agents capable of independent decision-making marks a significant milestone. To function...
6 MIN READ
Jun 11, 2025
Simplify End-to-End Autonomous Vehicle Development with New NVIDIA Cosmos World Foundation Models
The shift to end-to-end planning models for powering autonomous vehicles (AVs) is increasing the demand for high-quality, physically based sensor data. These...
7 MIN READ
Jun 11, 2025
Develop Custom Physical AI Foundation Models with NVIDIA Cosmos Predict-2
Building smarter robots and autonomous vehicles (AVs) starts with physical AI models that understand real-world dynamics. These models serve two critical roles:...
8 MIN READ
May 14, 2025
Build Custom Reasoning Models with Advanced, Open Post-Training Datasets
Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...
5 MIN READ
May 07, 2025
Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator
Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...
7 MIN READ
Apr 07, 2025
Evaluating and Enhancing RAG Pipeline Performance Using Synthetic Data
As large language models (LLMs) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...
11 MIN READ
Jan 29, 2025
Mastering LLM Techniques: Evaluation
Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...
12 MIN READ
Jan 09, 2025
Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
As robotics and autonomous vehicles advance, accelerating development of physical AI—which enables autonomous machines to perceive, understand, and perform...
14 MIN READ
Jan 06, 2025
How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception-Based Physical AI
Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
7 MIN READ