Technical Blog
Tag: BERT
Subscribe
Technical Walkthrough
Nov 09, 2021
Developing a Question Answering Application Quickly Using NVIDIA Riva
Learn how you can use NVIDIA Riva to develop a QA system.
5 MIN READ
Technical Walkthrough
Jul 20, 2021
Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)
Today, NVIDIA is releasing TensorRT 8.0, which introduces many transformer optimizations. With this post update, we present the latest TensorRT optimized BERT sample and its inference latency benchmark on A30 GPUs. Using the optimized sample, you can execute different batch sizes for BERT-base or BERT-large within the 10 ms latency budget for conversational AI applications.
18 MIN READ
Technical Walkthrough
May 10, 2021
Enabling Predictive Maintenance Using Root Cause Analysis, NLP, and NVIDIA Morpheus
The RAPIDS CLX team collaborated with the NVIDIA Enterprise Experience (NVEX) team to test and run a proof-of-concept (POC) to evaluate this NLP-based solution.
6 MIN READ
Technical Walkthrough
Feb 04, 2021
Achieving High-Quality Search and Recommendation Results with DeepNLP
Speech and natural language processing (NLP) have become the foundation for most of the AI development in the enterprise today, as textual data represents a…
12 MIN READ
Technical Walkthrough
Nov 11, 2020
Deploying a Natural Language Processing Service on a Kubernetes Cluster with Helm Charts from NVIDIA NGC
Conversational AI solutions such as chatbots are now deployed in the data center, on the cloud, and at the edge to deliver lower latency and high quality of…
12 MIN READ
Technical Walkthrough
Oct 06, 2020
Adding External Knowledge and Controllability to Language Models with Megatron-CNTRL
Large language models such as Megatron and GPT-3 are transforming AI. We are excited about applications that can take advantage of these models to create better…
8 MIN READ