At this week’s Computer Vision and Pattern Recognition (CVPR) conference, NVIDIA demonstrated how a single Tesla V100 GPU running NVIDIA TensorRT can perform a common inference task 100X faster than a CPU-only system.
In the video below, the CPU-only Intel Skylake-based system (on the left) classifies five flower images per second with a trained ResNet-152 classification network. That speed comfortably outpaces human capability.
By contrast, a single V100 GPU (on the right) can classify a dizzying 527 flower images per second, returning results with less than 7 milliseconds of latency — a superhuman feat.
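The article itself includes no code, but a minimal sketch of how such a throughput and latency measurement might look with TensorRT’s Python API is shown below. Everything in it is an assumption rather than the demo’s actual setup: it targets a recent TensorRT 8.x release (the original demo would have used an earlier version of TensorRT), it expects a pretrained ResNet-152 already exported to ONNX with a static input shape as `resnet152.onnx`, and it feeds synthetic data, so the numbers it prints are illustrative rather than a reproduction of the figures above.

```python
# Sketch: build an FP16 TensorRT engine from a ResNet-152 ONNX export and
# time synchronous inference. Assumes TensorRT 8.x, pycuda, and a
# static-shape "resnet152.onnx" file -- none of which come from the article.
import time

import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context on import)
import pycuda.driver as cuda
import tensorrt as trt

LOGGER = trt.Logger(trt.Logger.WARNING)


def build_engine(onnx_path="resnet152.onnx"):
    """Parse the ONNX model and build an FP16 TensorRT engine."""
    builder = trt.Builder(LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, LOGGER)
    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            raise RuntimeError(parser.get_error(0))
    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.FP16)  # use V100 Tensor Cores via FP16
    plan = builder.build_serialized_network(network, config)
    return trt.Runtime(LOGGER).deserialize_cuda_engine(plan)


def benchmark(engine, batch=1, iters=500):
    """Run repeated inferences on synthetic data; report images/sec and latency."""
    context = engine.create_execution_context()

    # Allocate one device buffer per binding (input batch and output logits).
    bindings = []
    for i in range(engine.num_bindings):
        shape = tuple(engine.get_binding_shape(i))  # static shapes assumed
        dtype = trt.nptype(engine.get_binding_dtype(i))
        host = np.random.random_sample(shape).astype(dtype)
        dev = cuda.mem_alloc(host.nbytes)
        cuda.memcpy_htod(dev, host)
        bindings.append(int(dev))

    # Warm up, then time synchronous execution.
    for _ in range(10):
        context.execute_v2(bindings)
    start = time.perf_counter()
    for _ in range(iters):
        context.execute_v2(bindings)
    elapsed = time.perf_counter() - start

    print(f"{iters * batch / elapsed:.1f} images/sec, "
          f"{1000 * elapsed / iters:.2f} ms per batch")


if __name__ == "__main__":
    benchmark(build_engine())
```

Building the engine with the FP16 flag is the notable choice here: on a V100, FP16 math runs on the Tensor Cores, which is one reason a single GPU can sustain this kind of throughput even at small batch sizes.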
While a 100X speedup in performance is impressive, that’s only half the equation. What are the costs associated with moving as fast as possible — what we here at NVIDIA call “speed of light”?
Remarkably, moving faster also means lower costs. One NVIDIA GPU-enabled system doing the same work as 100 CPU-only systems means 100 times fewer cloud servers to rent or buy.
NVIDIA TensorRT is available to members of the NVIDIA Developer Program as a free download to speed up AI inference on NVIDIA GPUs in the data center, in automobiles and in robots, drones and other devices at the edge.