Fast-Track Production AI with Pretrained Models and NVIDIA TAO Toolkit 3.0

Today, NVIDIA announced new pretrained models and the general availability of TAO Toolkit 3.0, a core component of the NVIDIA Train, Adapt, and Optimize (TAO) platform-guided workflow for creating AI. The new release includes a variety of highly accurate and performant pretrained models in computer vision and conversational AI, as well as a set of powerful productivity features that boost AI development by up to 10x.

As enterprises race to bring AI-enabled solutions to market, your competitiveness relies on access to the best development tools. The development journey to deploy custom, high-accuracy, and performant AI models in production can be treacherous for many engineering and research teams attempting to train with open-source models for AI product creation.

NVIDIA offers high-quality, pretrained models and TAO Toolkit to help reduce costs with large-scale data collection and labeling. It also eliminates the burden of training AI/ML models from scratch. New entrants to the computer vision and speech-enabled service market can now deploy production-class AI without a massive AI development team.

Highlights of the new release:

A pose-estimation model that supports real-time inference on edge with 9x faster inference performance than the OpenPose model.
PeopleSemSegNet, a semantic segmentation network for people detection.
A variety of computer vision pretrained models in various industry use cases, such as license plate detection and recognition, heart rate monitoring, emotion recognition, facial landmarks, and more.
CitriNet, a new speech-recognition model that is trained on various proprietary domain-specific and open-source datasets.
A new Megatron Uncased model for Question Answering, plus many other pretrained models that support speech-to-text, named-entity recognition, punctuation, and text classification.
Training support on AWS, GCP, and Azure.
Out-of-the-box deployment on NVIDIA Triton and DeepStream SDK for vision AI and NVIDIA Riva for conversational AI.

Get started fast

Download TAO Toolkit and access to developer resources: Get started.
Download models from NGC: Computer vision | Conversational AI
Check out the latest developer tutorial: Training and Optimizing a 2D Pose-Estimation Model with NVIDIA TAO Toolkit, Part 1 and Part 2

Integration with data-generation and labeling tools for faster and more accurate AI

TAO Toolkit 3.0 is also now integrated with platforms from several leading partners who provide large, diverse, and high-quality labeled data—enabling faster end-to-end AI/ML workflows. You can now use these partners’ services to generate and annotate data, seamlessly integrate with the toolkit for model training and optimization, and deploy the model using DeepStream SDK or Riva to create reliable applications in computer vision and conversational AI.

Check out more partner posts and tutorials about synthetic data and data annotation with TAO Toolkit:

Learn more about NVIDIA pretrained models and TAO Toolkit > >