featured
Sep 05, 2024
Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch
As large language models (LLMs) continue to grow in size and complexity, multi-GPU compute is a must-have to deliver the low latency and high throughput that...
5 MIN READ
Sep 05, 2024
AI-Powered Platform Advances Personalized Cancer Diagnostics and Treatments
A recent study introduced a cutting-edge AI-powered pathology platform that can help doctors diagnose and evaluate lung cancer in patients quickly and...
3 MIN READ
Sep 04, 2024
Accelerated, Production-Ready Graph Analytics for NetworkX Users
NetworkX is a popular, easy-to-use Python library for graph analytics. However, its performance and scalability may be unsatisfactory for medium-to-large-sized...
11 MIN READ
Sep 04, 2024
Hands-On Training at NVIDIA AI Summit in Washington, DC
Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington, D.C., on October 7, 2024.
1 MIN READ
Sep 04, 2024
NVIDIA Deep Learning Institute Releases New Generative AI Teaching Kit
Generative AI, powered by advanced machine learning models and deep neural networks, is revolutionizing industries by generating novel content and driving...
5 MIN READ
Sep 03, 2024
Real-Time Neural Receivers Drive AI-RAN Innovation
Today’s 5G New Radio (5G NR) wireless communication systems rely on highly optimized signal processing algorithms to reconstruct transmitted messages from...
11 MIN READ
Aug 30, 2024
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
8 MIN READ
Aug 30, 2024
Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI
The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...
12 MIN READ
Aug 29, 2024
Spotlight: clicOH Accelerates Last-Mile Delivery 20x with NVIDIA cuOpt
Driven by shifts in consumer behavior and the pandemic, e-commerce continues its explosive growth and transformation. As a result, logistics and transportation...
3 MIN READ
Aug 29, 2024
Boosting CUDA Efficiency with Essential Techniques for New Developers
To fully harness the capabilities of NVIDIA GPUs, optimizing NVIDIA CUDA performance is essential, particularly for developers new to GPU programming. This talk...
2 MIN READ
Aug 29, 2024
Just Released: RAPIDS 24.08
RAPIDS 24.08 is now available with significant updates geared toward processing larger workloads and enabling seamless CPU/GPU interoperability.
1 MIN READ
Aug 28, 2024
Boosting Llama 3.1 405B Performance up to 1.44x with NVIDIA TensorRT Model Optimizer on NVIDIA H200 GPUs
The Llama 3.1 405B large language model (LLM), developed by Meta, is an open-source community model that delivers state-of-the-art performance and supports a...
7 MIN READ
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ
Aug 28, 2024
NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks
Six years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...
8 MIN READ
Aug 28, 2024
NVIDIA NIM Agent Blueprint: Build an Enterprise-Scale Multimodal PDF Data Extraction Pipeline
Trillions of PDF files are generated every year, each file likely consisting of multiple pages filled with various content types, including text, images,...
8 MIN READ
Aug 28, 2024
NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1
Large language model (LLM) inference is a full-stack challenge. Powerful GPUs, high-bandwidth GPU-to-GPU interconnects, efficient acceleration libraries, and a...
13 MIN READ