Technical Walkthrough

Deploying NVIDIA Triton at Scale with MIG and Kubernetes

NVIDIA Triton can manage any number and mix of models, support multiple deep-learning frameworks, and integrate easily with Kubernetes for large-scale…
24 MIN READ
Technical Walkthrough

Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)

Today, NVIDIA is releasing TensorRT 8.0, which introduces many transformer optimizations. With this post update, we present the latest TensorRT optimized BERT…
18 MIN READ