TensorRT

Oct 17, 2023
Unlock Faster Image Generation in Stable Diffusion Web UI with NVIDIA TensorRT
Stable Diffusion is an open-source generative AI image model that lets users generate images from simple text descriptions. Gaining traction among...
4 MIN READ

Aug 31, 2023
Deploying YOLOv5 on NVIDIA Jetson Orin with cuDLA: Quantization-Aware Training to Inference
NVIDIA Jetson Orin is the best-in-class embedded platform for AI workloads. One of the key components of the Orin platform is the second-generation Deep...
11 MIN READ

May 04, 2023
Increasing Throughput and Reducing Costs for AI-Based Computer Vision with CV-CUDA
Real-time cloud-scale applications that involve AI-based computer vision are growing rapidly. The use cases include image understanding, content creation,...
11 MIN READ

Apr 05, 2023
Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI
The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...
15 MIN READ

Apr 03, 2023
NVIDIA Jetson Project of the Month: Recognizing Birds by Sound
It is one thing to identify a bird in the wild based on how it appears. It is quite another to identify that same bird based solely on how it sounds. Unless you...
7 MIN READ

Mar 23, 2023
Power Your AI Inference with New NVIDIA Triton and NVIDIA TensorRT Features
NVIDIA AI inference software consists of NVIDIA Triton Inference Server, open-source inference serving software, and NVIDIA TensorRT, an SDK for...
5 MIN READ

Mar 22, 2023
SDKs Accelerating Industry 5.0, Data Pipelines, Computational Science, and More Featured at NVIDIA GTC 2023
At NVIDIA GTC 2023, NVIDIA unveiled notable updates to its suite of NVIDIA AI software for developers to accelerate computing. The updates reduce costs in...
10 MIN READ

Mar 15, 2023
End-to-End AI for NVIDIA-Based PCs: NVIDIA TensorRT Deployment
This post is the fifth in a series about optimizing end-to-end AI. NVIDIA TensorRT is a solution for speed-of-light inference deployment on NVIDIA hardware....
10 MIN READ

Mar 14, 2023
Top AI for Creative Applications Sessions at NVIDIA GTC 2023
Learn how AI is boosting creative applications at NVIDIA GTC 2023, March 20-23.
1 MIN READ

Mar 07, 2023
Developing an End-to-End Auto Labeling Pipeline for Autonomous Vehicle Perception
Accurately annotated datasets are crucial for camera-based deep learning algorithms to perform autonomous vehicle perception. However, manually labeling data is...
6 MIN READ

Feb 08, 2023
End-to-End AI for NVIDIA-Based PCs: CUDA and TensorRT Execution Providers in ONNX Runtime
This post is the fourth in a series about optimizing end-to-end AI. As explained in the previous post in the End-to-End AI for NVIDIA-Based PCs series, there...
9 MIN READ

Dec 15, 2022
End-to-End AI for NVIDIA-Based PCs: An Introduction to Optimization
This post is the first in a series about optimizing end-to-end AI. The great thing about the GPU is that it offers tremendous parallelism; it allows you to...
9 MIN READ

Dec 08, 2022
Introducing NVIDIA Riva: A GPU-Accelerated SDK for Developing Speech AI Applications
This post was updated in March 2023. Speech AI is used in a variety of applications, including contact...
8 MIN READ

Aug 30, 2022
AI in Endoscopy: Improving Detection Rates and Visibility with Real-Time Sensing
Clinical applications for AI are improving digital surgery, helping to reduce errors, provide consistency, and enable surgeon augmentations that were previously...
4 MIN READ

Aug 30, 2022
Beating SOTA Inference Performance on NVIDIA GPUs with GPUNet
Crafted by AI for AI, GPUNet is a class of convolutional neural networks designed to maximize the performance of NVIDIA GPUs using NVIDIA TensorRT. Built using...
6 MIN READ

Jul 29, 2022
Getting Started with the Deep Learning Accelerator on NVIDIA Jetson Orin
If you’re an active Jetson developer, you know that one of the key benefits of NVIDIA Jetson is that it combines a CPU and GPU into a single module, giving...
3 MIN READ