Computer Vision / Video Analytics

TensorRT 5 RC Now Available

Sep 20, 2018

By Nefi Alarcon

Discuss (0)

AI-Generated Summary

Dislike

The latest version of NVIDIA's TensorRT, version 5 Release Candidate, has been released, offering improved performance for deep learning inference.
TensorRT 5 achieves up to 40x faster inference over CPU-only platforms by utilizing mixed precision on Turing Tensor Cores for models such as translation.
TensorRT 5 supports Xavier-based NVIDIA DRIVE platforms and the NVIDIA DLA accelerator for FP16, and provides new optimizations and INT8 APIs.

AI-generated content may summarize information incompletely. Verify important information. Learn more

AT GTC Japan, NVIDIA announced the latest version of the TensorRT’s high-performance deep learning inference optimizer and runtime. Today we are releasing the TensorRT 5 Release Candidate. TensorRT 5 supports the new Turing architecture, provides new optimizations, and INT8 APIs achieving up to 40x faster inference over CPU-only platforms. This latest version also dramatically speeds up inference of recommenders, neural machine translation, speech, and natural language processing apps.
TensorRT 5 Highlights:

Speeds up inference by 40x over CPUs for models such as translation using mixed precision on Turing Tensor Cores
Optimizes inference models with new INT8 APIs
Supports Xavier-based NVIDIA Drive platforms and the NVIDIA DLA accelerator for FP16

TensorRT 5 RC is available now to all members of the NVIDIA Developer Program.
Learn more>

Discuss (0)

About the Authors

About Nefi Alarcon
Nefi Alarcon is a senior executive communications manager on NVIDIA's leadership team. He has years of media relations and communication experience, and has previously worked at Google, Mozilla, and CNN. He received his bachelor's degree in Journalism from George Washington University.

View all posts by Nefi Alarcon

TensorRT 5 RC Now Available

Tags

About the Authors

Comments