Posts by Dheeraj Peri
        
                    Developer Tools & Techniques
        
        
        Jul 24, 2025
      
      Double PyTorch Inference Speed for Diffusion Models Using Torch-TensorRT
                                                NVIDIA TensorRT is an AI inference library built to optimize machine learning models for deployment on NVIDIA GPUs. TensorRT targets dedicated hardware in...
                          
          
            8 MIN READ
          
        
      
    
        
                    Robotics
        
        
        Jun 16, 2022
      
      Accelerating Quantized Networks with the NVIDIA QAT Toolkit for TensorFlow and NVIDIA TensorRT
                                                We’re excited to announce the NVIDIA Quantization-Aware Training (QAT) Toolkit for TensorFlow 2 with the goal of accelerating the quantized networks with...
                          
          
            9 MIN READ
          
        
      
    
        
                    Data Science
        
        
        Sep 24, 2020
      
      Estimating Depth with ONNX Models and Custom Layers Using NVIDIA TensorRT
                                                TensorRT is an SDK for high-performance deep learning inference. It includes a deep learning inference optimizer and a runtime that delivers low latency and...
                          
          
            10 MIN READ
          
        
      
     