Pre-Trained / Foundation Models
Oct 09, 2024
Develop Academic and Industrial Applications with a New Specialized Math Model
Mathstral, an advanced AI model developed from the ground up, can deliver superior performance for enhanced learning of math, engineering, and science.
1 MIN READ
Sep 10, 2024
Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator
Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
16 MIN READ
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ
Apr 18, 2024
New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model
NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...
4 MIN READ
Apr 18, 2024
Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT
NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...
6 MIN READ
Mar 27, 2024
Develop Custom Enterprise Generative AI with NVIDIA NeMo
Generative AI is transforming computing, paving new avenues for humans to interact with computers in natural, intuitive ways. For enterprises, the prospect of...
14 MIN READ
Mar 19, 2024
NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy
Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...
8 MIN READ
Jan 16, 2024
New Support for Dutch and Persian Released by NVIDIA NeMo ASR
Breaking barriers in speech recognition, NVIDIA NeMo proudly presents pretrained models tailored for Dutch and Persian—languages often overlooked in the AI...
2 MIN READ
Aug 17, 2023
Designing Deep Networks to Process Other Deep Networks
Deep neural networks (DNNs) are the go-to model for learning functions from data, such as image classifiers or language models. In recent years, deep models...
15 MIN READ
Aug 15, 2023
Customizing AI Models: Train Character Detection and Recognition Models with NVIDIA TAO
Optical Character Detection (OCD) and Optical Character Recognition (OCR) are computer vision techniques used to extract text from images. Use cases vary across...
14 MIN READ
Aug 15, 2023
Customizing AI Models: Deploy a Character Detection and Recognition Model with NVIDIA Triton
NVIDIA Triton Inference Server streamlines and standardizes AI inference by enabling teams to deploy, run, and scale trained ML or DL models from any framework...
4 MIN READ
Aug 08, 2023
Unlocking the Power of Enterprise-Ready LLMs with NVIDIA NeMo
For more information about NVIDIA NeMo, see Develop Custom Enterprise Generative AI with NVIDIA NeMo. Generative AI has introduced a new era in computing, one...
10 MIN READ
Jul 25, 2023
Access the Latest in Vision AI Model Development Workflows with NVIDIA TAO Toolkit 5.0
NVIDIA TAO Toolkit provides a low-code AI framework to accelerate vision AI model development suitable for all skill levels, from novice beginners to expert...
14 MIN READ
Jun 20, 2023
Visual Foundation Models for Medical Image Analysis
The analysis of 3D medical images is crucial for advancing clinical responses, disease tracking, and overall patient survival. Deep learning models form the...
6 MIN READ
Jun 06, 2023
Develop Physics-Informed Machine Learning Models with Graph Neural Networks
NVIDIA Modulus is a framework for building, training, and fine-tuning deep learning models for physical systems, otherwise known as physics-informed machine...
6 MIN READ
Mar 29, 2023
Bootstrapping Object Detection Model Training with 3D Synthetic Data
Training AI models requires mountains of data. Acquiring large sets of training data can be difficult, time-consuming, and expensive. Also, the data collected...
12 MIN READ