Posts by Young-Jun Ko
Technical Walkthrough
Jul 20, 2021
Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)
Today, NVIDIA is releasing TensorRT 8.0, which introduces many transformer optimizations. With this post update, we present the latest TensorRT optimized BERT sample and its inference latency benchmar...
18 MIN READ
Technical Walkthrough
Aug 13, 2019
Real-Time Natural Language Understanding with BERT Using TensorRT
Large scale language models (LSLMs) such as BERT, GPT-2, and XL-Net have brought about exciting leaps in state-of-the-art accuracy for many natural language…
21 MIN READ