AI Foundation Models
Nov 21, 2024
Deploying Fine-Tuned AI Models with NVIDIA NIM
For organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently...
5 MIN READ
Nov 21, 2024
Spotlight: Advancing Autonomous Operations with AVEVA Dynamic Simulation and NVIDIA Raptor
Industrial engineers are turning to AI to build advanced process simulation solutions and accelerate progress toward fully autonomous operations in the energy,...
6 MIN READ
Oct 09, 2024
Develop Academic and Industrial Applications with a New Specialized Math Model
Mathstral, an advanced AI model developed from the ground up, can deliver superior performance for enhanced learning of math, engineering, and science.
1 MIN READ
Oct 08, 2024
Mistral-NeMo-Minitron 8B Model Delivers Unparalleled Accuracy
This post was originally published August 21, 2024, but has been revised with current data. Recently, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading...
7 MIN READ
Oct 04, 2024
Just Released: NVIDIA TensorRT-LLM 0.13.0
Updates include tensor parallel support for Mamba2, sparse mixer normalization for MoE models, and more.
1 MIN READ
Oct 03, 2024
New Reward Model Helps Improve LLM Alignment with Human Preferences
Reinforcement learning from human feedback (RLHF) is essential for developing AI systems that are aligned with human values and preferences. RLHF enables the...
4 MIN READ
Sep 30, 2024
Improve Reinforcement Learning from Human Feedback with Leaderboard-Topping Reward Model
Llama 3.1 Nemotron 70B Reward model helps generate high-quality training data that aligns with human preferences for finance, retail, healthcare, scientific...
1 MIN READ
Sep 16, 2024
Generate Code with Abacus AI’s Dracarys Large Language Model
Dracarys, fine-tuned from Llama 3.1 70B and available as an NVIDIA NIM microservice, supports a variety of applications, including data analysis, text...
1 MIN READ
Sep 05, 2024
Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch
As large language models (LLMs) continue to grow in size and complexity, multi-GPU compute is a must-have to deliver the low latency and high throughput that...
5 MIN READ
Aug 13, 2024
New NIM Available: Mistral Large 2 Instruct LLM
The new model by Mistral excels at a variety of complex tasks including text summarization, multilingual translation and reasoning, programming, question and...
1 MIN READ
Jul 26, 2024
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Jul 25, 2024
Revolutionizing Code Completion with Codestral Mamba, the Next-Gen Coding LLM
In the rapidly evolving field of generative AI, coding models have become indispensable tools for developers, enhancing productivity and precision in software...
5 MIN READ
Jul 23, 2024
Supercharging Llama 3.1 across NVIDIA Platforms
The large language models in Meta's Llama collection are the most popular foundation models in the open-source community today, supporting a variety of use cases....
8 MIN READ
Jul 02, 2024
Phi-3-Medium: Now Available on the NVIDIA API Catalog
Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context variants.
1 MIN READ
Jul 01, 2024
StarCoder2-15B: A Powerful LLM for Code Generation, Summarization, and Documentation
Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog.
1 MIN READ
Jul 01, 2024
Google's New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as an NVIDIA NIM inference microservice.
1 MIN READ