GTC 2020: TensorRT inference with TensorFlow 2.0
After clicking “Watch Now” you will be prompted to login or join.
Click “Watch Now” to login or join the NVIDIA Developer Program.
TensorRT inference with TensorFlow 2.0
Jonathan Dekhtiar, NVIDIA | Tamas Bela Feher, NVIDIA | Xuan Vinh Nguyen, NVIDIA
NVIDIA TensorRT is a platform for high-performance deep learning inference. We'll describe how TensorRT is integrated with TensorFlow and show how combining the two improves the efficiency of machine-learning models while retaining the convenience and ease-of-use of a TF Python development environment. We'll provide updates for the TF 2.0 TRT interface, C++ API, dynamic shape support, and latest performance benchmarking.