Young-Jun Ko

Young-Jun is an AI DevTech Engineer at NVIDIA, currently working on accelerating NLP inference workloads on GPUs. Previously, he worked on HPC and AI and contributed to the RAPIDS open-source project. Before joining NVIDIA, Young-Jun received a PhD in computer science from EPFL and worked as a machine learning engineer at an adtech startup.

Posts by Young-Jun Ko

Conversational AI

Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)

This post was originally published in August 2019 and has been updated for NVIDIA TensorRT 8.0. Large-scale language models (LSLMs) such as BERT, GPT-2, and... 18 MIN READ
Conversational AI

Real-Time Natural Language Understanding with BERT Using TensorRT

Large-scale language models (LSLMs) such as BERT, GPT-2, and XLNet have brought about exciting leaps in state-of-the-art accuracy for many natural language... 21 MIN READ