Posts by Davide Onofrio
Technical Walkthrough
May 11, 2022
Accelerating AI Inference Workloads with NVIDIA A30 GPU
Researchers, engineers, and data scientists can use A30 to deliver real-world results and deploy solutions into production at scale.
5 MIN READ
Technical Walkthrough
Nov 09, 2021
Introducing NVIDIA Riva: A GPU-Accelerated SDK for Developing Speech AI Applications
Learn about the Riva SDK and its use in developing speech AI applications. We also discuss pretrained models in NGC, TAO Toolkit for transfer learning.
7 MIN READ
Technical Walkthrough
Aug 25, 2021
Deploying NVIDIA Triton at Scale with MIG and Kubernetes
NVIDIA Triton can manage any number and mix of models, support multiple deep-learning frameworks, and integrate easily with Kubernetes for large-scale deployment.
24 MIN READ
Technical Walkthrough
Jul 20, 2021
Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)
Today, NVIDIA is releasing TensorRT 8.0, which introduces many transformer optimizations. With this post update, we present the latest TensorRT optimized BERT sample and its inference latency benchmar...
18 MIN READ
Technical Walkthrough
Jun 30, 2021
Continuously Improving Recommender Systems for Competitive Advantage Using NVIDIA Merlin and MLOps
This posts shares how NVIDIA Merlin components fit into a complete MLOps pipeline to operationalize a recommendation system, and continuously deliver improvements in production.
12 MIN READ
Technical Walkthrough
Jun 30, 2021
MLPerf v1.0 Training Benchmarks: Insights into a Record-Setting NVIDIA Performance
Learn about some of the major optimizations made to the NVIDIA platform that contributed to the nearly 7x increase in performance since the first MLPerf training benchmark.
31 MIN READ