Tutorial
Apr 19, 2024
Measuring the GPU Occupancy of Multi-stream Workloads
NVIDIA GPUs are becoming increasingly powerful with each new generation. This increase generally comes in two forms. Each streaming multiprocessor (SM), the...
11 MIN READ
Apr 18, 2024
New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model
NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...
4 MIN READ
Apr 18, 2024
Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models
NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...
6 MIN READ
Apr 02, 2024
Tune and Deploy LoRA LLMs with NVIDIA TensorRT-LLM
Large language models (LLMs) have revolutionized natural language processing (NLP) with their ability to learn from massive amounts of text and generate fluent...
15 MIN READ
Mar 27, 2024
Efficient CUDA Debugging: Using NVIDIA Compute Sanitizer with NVIDIA Tools Extension and Creating Custom Tools
NVIDIA Compute Sanitizer is a powerful tool that can save you time and effort while improving the reliability and performance of your CUDA applications....
14 MIN READ
Mar 18, 2024
How to Take a RAG Application from Pilot to Production in Four Steps
Generative AI has the potential to transform every industry. Human workers are already using large language models (LLMs) to explain, reason about, and solve...
9 MIN READ
Mar 08, 2024
Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 1
Graph neural networks (GNNs) have revolutionized machine learning for graph-structured data. Unlike traditional neural networks, GNNs are good at capturing...
9 MIN READ
Mar 07, 2024
NVIDIA TensorRT Accelerates Stable Diffusion Nearly 2x Faster with 8-bit Post-Training Quantization
In the dynamic realm of generative AI, diffusion models stand out as the most powerful architecture for generating high-quality images with text prompts. Models...
7 MIN READ
Mar 07, 2024
Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform
Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by...
14 MIN READ
Mar 07, 2024
Simplifying Cumulus Linux Migrations
Migrating between major versions of software can present several challenges to infrastructure management teams: data format changes, feature deprecations...
5 MIN READ
Mar 06, 2024
How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism
Quantitative finance libraries are software packages that consist of mathematical, statistical, and, more recently, machine learning models designed for use in...
10 MIN READ
Feb 26, 2024
Detecting Real-Time Waste Contamination Using Edge Computing and Video Analytics
The past few decades have witnessed a surge in rates of waste generation, closely linked to economic development and urbanization. This escalation in waste...
8 MIN READ
Feb 26, 2024
Ray Tracing Validation for DirectX 12 and Vulkan
This post was updated on April 17, 2024. For developers working on ray tracing applications for both DirectX 12 and Vulkan, ray tracing validation is here to...
7 MIN READ
Feb 21, 2024
Build an LLM-Powered API Agent for Task Execution
Developers have long been building interfaces like web apps to enable users to leverage the core products being built. To learn how to work with data in your...
10 MIN READ
Feb 20, 2024
Build an LLM-Powered Data Agent for Data Analysis
An AI agent is a system consisting of planning capabilities, memory, and tools to perform tasks requested by a user. For complex tasks such as data analytics or...
11 MIN READ
Feb 19, 2024
Experience NVIDIA cuOpt Accelerated Optimization to Boost Operational Efficiency
This week’s model release features NVIDIA cuOpt, a world-record-breaking accelerated optimization engine that helps teams solve complex routing problems and...
6 MIN READ