Posts by Dheeraj Peri
AI Platforms / Deployment
Jul 24, 2025
Double PyTorch Inference Speed for Diffusion Models Using Torch-TensorRT
NVIDIA TensorRT is an AI inference library built to optimize machine learning models for deployment on NVIDIA GPUs. TensorRT targets dedicated hardware in...
8 MIN READ
Robotics
Jun 16, 2022
Accelerating Quantized Networks with the NVIDIA QAT Toolkit for TensorFlow and NVIDIA TensorRT
We’re excited to announce the NVIDIA Quantization-Aware Training (QAT) Toolkit for TensorFlow 2 with the goal of accelerating the quantized networks with...
9 MIN READ
Data Science
Sep 24, 2020
Estimating Depth with ONNX Models and Custom Layers Using NVIDIA TensorRT
TensorRT is an SDK for high performance, deep learning inference. It includes a deep learning inference optimizer and a runtime that delivers low latency and...
10 MIN READ