Technical Blog
Tag: TensorRT
Subscribe
Technical Walkthrough
Jul 20, 2021
Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT
○ TensorRT is an SDK for high-performance deep learning inference, and TensorRT 8.0 introduces support for sparsity that uses sparse tensor cores on NVIDIA Ampere GPUs. It can accelerate networks by...
8 MIN READ
Technical Walkthrough
Jul 20, 2021
Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)
Today, NVIDIA is releasing TensorRT 8.0, which introduces many transformer optimizations. With this post update, we present the latest TensorRT optimized BERT sample and its inference latency benchmar...
18 MIN READ
Technical Walkthrough
Jul 20, 2021
Speeding Up Deep Learning Inference Using NVIDIA TensorRT (Updated)
This post was updated July 20, 2021 to reflect NVIDIA TensorRT 8.0 updates. NVIDIA TensorRT is an SDK for deep learning inference. TensorRT provides APIs and…
22 MIN READ
News
May 06, 2021
NVIDIA Releases Updates to CUDA-X AI Software
NVIDIA CUDA-X AI is a deep learning software stack for researchers and software developers to build high performance GPU-accelerated applications for conversational AI, recommendation systems and comp...
6 MIN READ
News
Apr 05, 2021
Jetson Project of the Month: DeepWay, AI-based navigation aid for the visually impaired
Satinder Singh won the Jetson Project of the Month for DeepWay, an AI-based navigation assistance system for the visually impaired. The project…
2 MIN READ
News
Feb 25, 2021
NVIDIA Releases Riva 1.0 Beta for Building Real-Time Conversational AI Services
Jarvis is a flexible application framework for multimodal conversational AI services that delivers real-time performance on NVIDIA GPUs.
4 MIN READ