BERT
Apr 26, 2023
An Introduction to Large Language Models: Prompt Engineering and P-Tuning
ChatGPT has made quite an impression. Users are excited to use the AI chatbot to ask questions, write poems, imbue a persona for interaction, act as a personal...
10 MIN READ
Apr 05, 2023
Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI
The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...
15 MIN READ
Apr 04, 2023
Topic Modeling and Image Classification with Dataiku and NVIDIA Data Science
The Dataiku platform for everyday AI simplifies deep learning. Use cases are far-reaching, from image classification to object detection and natural language...
11 MIN READ
Jun 30, 2022
The Full Stack Optimization Powering NVIDIA MLPerf Training v2.0 Performance
MLPerf benchmarks are developed by a consortium of AI leaders across industry, academia, and research labs, with the aim of providing standardized, fair, and...
14 MIN READ
Nov 09, 2021
Developing a Question Answering Application Quickly Using NVIDIA Riva
Sign up for the latest Speech AI news from NVIDIA. There is a high chance that you have asked your smart speaker a question like, “How tall is Mount...
6 MIN READ
Jul 20, 2021
Real-Time Natural Language Processing with BERT Using NVIDIA TensorRT (Updated)
This post was originally published in August 2019 and has been updated for NVIDIA TensorRT 8.0. Large-scale language models (LSLMs) such as BERT, GPT-2, and...
18 MIN READ
May 10, 2021
Enabling Predictive Maintenance Using Root Cause Analysis, NLP, and NVIDIA Morpheus
Background Predictive maintenance is used for early fault detection, diagnosis, and prediction when maintenance is needed in various industries including oil...
6 MIN READ
Feb 04, 2021
Achieving High-Quality Search and Recommendation Results with DeepNLP
Speech and natural language processing (NLP) have become the foundation for most of the AI development in the enterprise today, as textual data represents a...
12 MIN READ
Nov 11, 2020
Deploying a Natural Language Processing Service on a Kubernetes Cluster with Helm Charts from NVIDIA NGC
Conversational AI solutions such as chatbots are now deployed in the data center, on the cloud, and at the edge to deliver lower latency and high quality of...
12 MIN READ
Oct 06, 2020
Adding External Knowledge and Controllability to Language Models with Megatron-CNTRL
Large language models such as Megatron and GPT-3 are transforming AI. We are excited about applications that can take advantage of these models to create better...
8 MIN READ
Aug 18, 2020
Efficient BERT: Finding Your Optimal Model with Multimetric Bayesian Optimization, Part 3
This is the third post in this series about distilling BERT with multimetric Bayesian optimization. Part 1 discusses the background for the experiment and Part...
11 MIN READ
Aug 18, 2020
Efficient BERT: Finding Your Optimal Model with Multimetric Bayesian Optimization, Part 2
This is the second post in this series about distilling BERT with multimetric Bayesian optimization. Part 1 discusses the background for the experiment and Part...
9 MIN READ
Aug 18, 2020
Efficient BERT: Finding Your Optimal Model with Multimetric Bayesian Optimization, Part 1
This is the first post in a series about distilling BERT with multimetric Bayesian optimization. Part 2 discusses the set up for the Bayesian experiment, and...
8 MIN READ
Aug 07, 2020
Accelerating AI and ML Workflows with Amazon SageMaker and NVIDIA NGC
AI is going mainstream and is quickly becoming pervasive in every industry—from autonomous vehicles to drug discovery. However, developing and deploying AI...
13 MIN READ
Jul 29, 2020
Accelerating AI Training with MLPerf Containers and Models from NVIDIA NGC
The MLPerf consortium mission is to “build fair and useful benchmarks” to provide an unbiased training and inference performance reference for ML hardware,...
13 MIN READ
Jul 29, 2020
Optimizing NVIDIA AI Performance for MLPerf v0.7 Training
MLPerf is an industry-wide AI consortium that has developed a suite of performance benchmarks covering a range of leading AI workloads that are widely in use...
16 MIN READ