DEVELOPER BLOG

Jay Rodge

Jay Rodge is a product marketing manager for deep learning and inference products at NVIDIA, where he drives launches and product marketing initiatives. Jay received his master’s degree in computer science from Illinois Tech, Chicago, with a focus on computer vision and NLP. Before NVIDIA, Jay was an AI research intern at BMW Group, where he used computer vision to solve problems at BMW’s largest manufacturing plant.

Posts by Jay Rodge

AI / Deep Learning

NVIDIA Announces TensorRT 8 Slashing BERT-Large Inference Down to 1 Millisecond

NVIDIA announced TensorRT 8.0, which brings BERT-Large inference latency down to 1.2 ms with new optimizations. 3 MIN READ

AI / Deep Learning

Achieving FP32 Accuracy for INT8 Inference Using Quantization Aware Training with NVIDIA TensorRT

TensorRT is an SDK for high-performance deep learning inference, and with TensorRT 8.0 you can import models trained using Quantization Aware Training (QAT)… 17 MIN READ

AI / Deep Learning

Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT

TensorRT is an SDK for high-performance deep learning inference, and TensorRT 8.0 introduces support for sparsity that uses sparse tensor cores on NVIDIA… 8 MIN READ