NVIDIA Nemotron

Nemotron 3 Nano Omni 30B A3B

A single model for video, audio, image, and text understanding, simplifying agent workflows
Multimodal reasoning for sub-agents in agentic use cases such as computer-use agents, document intelligence, and video and audio understanding
Highest efficiency in its class, at low cost

Read Technical Report

Nemotron 3 Super 120B A12B

Highest efficiency in its class, with leading accuracy
Well suited to complex tasks in multi-agent environments
Suitable for single data center GPU deployments

Demo Model on OpenRouter

Llama Nemotron Ultra 253B

Ideal for multi-agent enterprise workflows that require the highest accuracy, such as customer service automation, supply chain management, and IT security
Suitable for data center-scale deployments

Demo Model on OpenRouter

Nemotron Parse

Understands document semantics and extracts text and table elements with spatial grounding
Overcomes traditional OCR limitations with support for multi-column layouts, LaTeX table extraction, markdown formatting, and reading-order reconstruction
Designed to accelerate document intelligence pipelines for RAG, LLM training data curation, and agentic document workflows

Experience Models as NVIDIA NIM APIs

Nemotron RAG

Industry-leading extraction, embedding, and reranking models
Best-in-class accuracy for multimodal document intelligence, question answering, and passage retrieval
Leading positions on the ViDoRe V1, ViDoRe V2, MTEB, and MMTEB leaderboards

Download the Models on Hugging Face
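
As a rough illustration of how embedding and reranking stages typically fit together in a retrieval pipeline, here is a minimal sketch. It does not call any Nemotron model or NVIDIA API: `embed`, `retrieve`, and `rerank` are hypothetical stand-ins (a bag-of-words vector and a term-overlap score) where an embedding model and a reranking model would normally go.

```python
import math
from collections import Counter


def embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, passages, k):
    """First stage: keep the k passages whose vectors are closest to the query."""
    q = embed(query)
    ranked = sorted(passages, key=lambda p: cosine(q, embed(p)), reverse=True)
    return ranked[:k]


def rerank(query, candidates):
    """Second stage, hypothetical stand-in for a reranking model:
    order candidates by the fraction of distinct query terms they contain."""
    q_terms = set(query.lower().split())
    if not q_terms:
        return list(candidates)

    def score(passage):
        return len(q_terms & set(passage.lower().split())) / len(q_terms)

    return sorted(candidates, key=score, reverse=True)


passages = [
    "speech recognition on gpu hardware",
    "text to speech synthesis",
    "document table extraction",
    "gpu memory",
]
query = "gpu speech recognition"

# Cheap retrieval narrows the pool; the reranker then reorders the survivors.
results = rerank(query, retrieve(query, passages, k=3))
```

The two-stage split is the design point: the first stage is cheap enough to run over every passage, while the second, usually more expensive, stage only sees the short list.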

Nemotron Speech

A family of open models optimized for high-throughput, ultra-low-latency automatic speech recognition (ASR), text-to-speech (TTS), speech-to-speech (S2S), full-duplex, and neural machine translation (NMT) for agentic AI applications
Paired with the NVIDIA Riva GPU-accelerated speech AI library, Nemotron Speech models deliver state-of-the-art ASR and TTS capabilities for seamless production deployment

Experience the Model as an NVIDIA NIM API

Download the Models on Hugging Face

Nemotron Safety

Advanced multilingual, multimodal safety models that deliver high-accuracy jailbreak detection, content moderation with cultural nuance, fine-grained PII detection, reasoning-based custom policy enforcement, and topic control for more secure and more compliant LLMs across global domains and use cases.
NeMo Guardrails is a flexible, open library for defining and enforcing enterprise AI policies in real time. It covers dialogue control, topic guidance, RAG grounding, tool-call governance, safety filtering, and more, with parallel, low-latency execution across custom, community, and NVIDIA safety rails.

Experience the Model as an NVIDIA NIM API

NVIDIA Nemotron Datasets

Improve the reasoning capabilities of large language models (LLMs) with one of the broadest commercially usable open data collections for agentic AI, spanning pre-training, post-training, personas, safety, RL, and RAG. The collection includes 10T+ tokens and 40M+ post-training samples, covering the full training lifecycle from foundation models to agent workflows.

The datasets are built with large-scale synthetic data generation, filtering, and curation, and are released under permissive licenses. Developers can train, fine-tune, and evaluate models with full visibility into the data, accelerating development and reducing reliance on opaque datasets.

Nemotron Pre- and Post-Training Datasets

NVIDIA provides over 10T tokens of multilingual reasoning, coding, and safety data to help the community build its own custom models.

Nemotron Personas Datasets

Fully synthetic, privacy-safe personas grounded in real-world demographic, geographic, and cultural distributions. They are part of NVIDIA’s growing global collection for Sovereign AI development, with datasets for the USA, Japan, India, Singapore, Brazil, France, and South Korea.

Nemotron Omni Datasets

Multimodal data that extends the Nemotron training pipeline beyond text to image, video, and speech. It includes ~127B tokens of cross-modal pretraining data and ~124M curated post-training examples for document reasoning, computer use, and long-horizon workflows.

Nemotron Safety Datasets

High-quality, curated datasets built to power multilingual content safety, advanced policy reasoning, and threat-aware AI—spanning moderation data and audio-based safety signals for modern AI assistants.

Nemotron RL Datasets

Train models with the same reinforcement learning (RL) data powering Nemotron, including multi-turn trajectories, tool calls, and preference signals across coding, math, reasoning, and agentic tasks to build adaptive, reliable real-world AI.

Nemotron RAG Datasets

Unlock the foundation behind our leaderboard-topping model with the release of 15 meticulously curated datasets—spanning instruction-following, reasoning, coding, and evaluation data—to accelerate open research and transparent model development.