Consumer Internet

Jun 02, 2026

Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA

AI agents are changing how you interact with your PC. Creators, developers, and AI enthusiasts are already using these agents extensively to assist with...

9 MIN READ

Jan 28, 2026

Speeding Up Variable-Length Training with Dynamic Context Parallelism and NVIDIA Megatron Core

This post introduces Dynamic Context Parallelism (Dynamic-CP), a scheduling approach in NVIDIA Megatron Core used for LLM post-training or DiT pre-training. It...

12 MIN READ

Jan 26, 2026

Adaptive Inference in NVIDIA TensorRT for RTX Enables Automatic Optimization

Deploying AI applications across diverse consumer hardware has traditionally forced a trade-off. You can optimize for specific GPU configurations and achieve...

9 MIN READ

Sep 23, 2025

Deploy High-Performance AI Models in Windows Applications on NVIDIA RTX AI PCs

Today, Microsoft is making Windows ML available to developers. Windows ML enables C#, C++ and Python developers to optimally run AI models locally across PC...

8 MIN READ

Jun 26, 2025

Run Google DeepMind’s Gemma 3n on NVIDIA Jetson and RTX

As of today, NVIDIA now supports the general availability of Gemma 3n on NVIDIA RTX and Jetson. Gemma, previewed by Google DeepMind at Google I/O last month,...

4 MIN READ

Jun 12, 2025

Run High-Performance AI Applications with NVIDIA TensorRT for RTX

NVIDIA TensorRT for RTX is now available for download as an SDK that can be integrated into C++ and Python applications for both Windows and Linux. At...

7 MIN READ

May 19, 2025

NVIDIA TensorRT for RTX Introduces an Optimized Inference AI Library on Windows 11

AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...

9 MIN READ

Feb 28, 2025

Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM

NAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...

13 MIN READ

Jan 09, 2025

Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining

NVIDIA is excited to announce the release of Nemotron-CC, a 6.3-trillion-token English language Common Crawl dataset for pretraining highly accurate large...

4 MIN READ

Dec 05, 2024

Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack

The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...

7 MIN READ

An illustration showing recommender systems.

Nov 20, 2024

Boost Large-Scale Recommendation System Training Embedding Using EMBark

Recommendation systems are core to the Internet industry, and efficiently training them is a key issue for various companies. Most recommendation systems are...

6 MIN READ

Oct 28, 2024

Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA

The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...

11 MIN READ

Decorative image of stars in a geometric shape and colored pink, purple, blue, and green.

Oct 09, 2024

Develop Academic and Industrial Applications with a New Specialized Math Model

Mathstral, an advanced AI model developed from the ground up, can deliver superior performance for enhanced learning of math, engineering, and science.

1 MIN READ

Sep 16, 2024

Generate code with Abacus AI’s Dracarys Large Language Model

Dracarys, fine-tuned from Llama 3.1 70B and available from NVIDIA NIM microservice, supports a variety of applications, including data analysis, text...

1 MIN READ

Aug 13, 2024

New NIM Available: Mistral Large 2 Instruct LLM

The new model by Mistral excels at a variety of complex tasks including text summarization, multilingual translation and reasoning, programming, question and...

1 MIN READ

Image of a person standing in front of an AI kiosk in a retail location.

Aug 07, 2024

Building AI Agents with NVIDIA NIM Microservices and LangChain

NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...

3 MIN READ