NVIDIA Releases TensorRT 4

Jun 19, 2018

By Nefi Alarcon

Today we are releasing TensorRT 4 with capabilities for accelerating popular inference applications such as neural machine translation, recommender systems, and speech. You also get an easy way to import models from popular deep learning frameworks such as Caffe2, Chainer, MXNet, Microsoft Cognitive Toolkit, and PyTorch through the ONNX format.
TensorRT 4 delivers:

Up to 45x higher throughput vs. CPU with new layers for Multilayer Perceptrons (MLP) and Recurrent Neural Networks (RNN)
50x faster inference performance on V100 vs. CPU-only for ONNX models imported with the ONNX parser in TensorRT
Support for NVIDIA DRIVE Xavier, the AI computer for autonomous vehicles
3x inference speedup for FP16 custom layers with APIs for running on Volta Tensor Cores

Download TensorRT 4 today and try out these exciting new features!
About the Authors

About Nefi Alarcon
Nefi Alarcon is a senior executive communications manager on NVIDIA's leadership team. He has years of media relations and communication experience, and has previously worked at Google, Mozilla, and CNN. He received his bachelor's degree in Journalism from George Washington University.
