Generative AI

Aug 19, 2025
New Nemotron Nano 2 Open Reasoning Model Tops Leaderboard and Delivers 6x Higher Throughput
There’s a new leaderboard-topping NVIDIA Nemotron Nano 2 model. It’s an open model with leading accuracy and up to 6x higher throughput compared to the next...
1 MIN READ

Aug 14, 2025
Upcoming Livestream: Building Cross-Framework Agent Ecosystems
Join us on Aug. 21 to see how NVIDIA NeMo Agent toolkit boosts multi-agent workflows with deep MCP integration.
1 MIN READ

Aug 13, 2025
Scaling LLM Reinforcement Learning with Prolonged Training Using ProRL v2
Currently, one of the most compelling questions in AI is whether large language models (LLMs) can continue to improve through sustained reinforcement learning...
8 MIN READ

Aug 13, 2025
Streamlining Quantum Error Correction and Application Development with CUDA-QX 0.4
As quantum processor unit (QPU) builders and algorithm developers work to create large-scale, commercially viable quantum supercomputers, they are increasingly...
7 MIN READ

Aug 13, 2025
Dynamo 0.4 Delivers 4x Faster Performance, SLO-Based Autoscaling, and Real-Time Observability
The emergence of several new-frontier, open source models in recent weeks, including OpenAI’s gpt-oss and Moonshot AI’s Kimi K2, signals a wave of rapid LLM...
9 MIN READ

Aug 07, 2025
How Hackers Exploit AI's Problem-Solving Instincts
As multimodal AI models advance from perception to reasoning, and even start acting autonomously, new attack surfaces emerge. These threats don’t just target...
10 MIN READ

Aug 04, 2025
How to Enhance RAG Pipelines with Reasoning Using NVIDIA Llama Nemotron Models
A key challenge for retrieval-augmented generation (RAG) systems is handling user queries that lack explicit clarity or carry implicit intent. Users often...
13 MIN READ

Aug 01, 2025
Optimizing LLMs for Performance and Accuracy with Post-Training Quantization
Quantization is a core tool for developers aiming to improve inference performance with minimal overhead. It delivers significant gains in latency, throughput,...
14 MIN READ

Jul 31, 2025
Securing Agentic AI: How Semantic Prompt Injections Bypass AI Guardrails
Prompt injection, where adversaries manipulate inputs to make large language models behave in unintended ways, has long posed a threat to AI systems since the...
8 MIN READ

Jul 29, 2025
Build More Accurate and Efficient AI Agents with the New NVIDIA Llama Nemotron Super v1.5
AI agents now solve multi-step problems, write production-level code, and act as general assistants across multiple domains. But to reach their full potential,...
5 MIN READ

Jul 29, 2025
FourCastNet 3 Enables Fast and Accurate Large Ensemble Weather Forecasting with Scalable Geometric ML
FourCastNet3 (FCN3) is the latest AI global weather forecasting system from NVIDIA Earth-2. FCN3 offers an unprecedented combination of probabilistic skill,...
7 MIN READ

Jul 29, 2025
Turn Complex Documents into Usable Data with VLM, NVIDIA NeMo Retriever Parse
Enterprises generate and store vast amounts of unstructured data in documents like research reports, business contracts, financial statements, and technical...
10 MIN READ

Jul 28, 2025
Bringing Verifiable Trust to AI Models: Model Signing in NGC
AI is entering a new era—one defined by agents that reason, plan, and take action. These agentic systems dynamically interact with APIs, tools, and even the...
7 MIN READ

Jul 28, 2025
How New GB300 NVL72 Features Provide Steady Power for AI
The electrical grid is designed to support loads that are relatively steady, such as lighting, household appliances, and industrial machines that operate at...
8 MIN READ

Jul 23, 2025
Approaches to PDF Data Extraction for Information Retrieval
The PDF is among the most common file formats for sharing information such as financial reports, research papers, technical documents, and marketing materials....
11 MIN READ

Jul 23, 2025
Serverless Distributed Data Processing with Apache Spark and NVIDIA AI on Azure
The process of converting vast libraries of text into numerical representations known as embeddings is essential for generative AI. Various technologies—from...
9 MIN READ