AI Foundation Models
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Apr 28, 2024
Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...
9 MIN READ
Apr 26, 2024
New LLM: Snowflake Arctic Model for SQL and Code Generation
Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
Apr 22, 2024
Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D
Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...
7 MIN READ
Apr 22, 2024
Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API
This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...
4 MIN READ
Mar 18, 2024
Scale AI-Enabled Robotics Development Workloads with NVIDIA OSMO
Autonomous machine development is an iterative process of data generation and gathering, model training, and deployment characterized by complex multi-stage,...
4 MIN READ
Mar 14, 2024
Applying Mixture of Experts in LLM Architectures
Mixture of experts (MoE) large language model (LLM) architectures have recently emerged, both in proprietary LLMs such as GPT-4 and in community models...
12 MIN READ
Mar 07, 2024
Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform
Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by...
14 MIN READ
Mar 04, 2024
Solve Complex AI Tasks with Leaderboard-Topping Smaug 72B from NVIDIA AI Foundation Models
This week’s model release features the NVIDIA-optimized language model Smaug 72B, which you can experience directly from your browser. NVIDIA AI Foundation...
2 MIN READ
Feb 28, 2024
Unlock Your LLM Coding Potential with StarCoder2
Coding is essential in the digital age, but it can also be tedious and time-consuming. That's why many developers are looking for ways to automate and...
7 MIN READ
Feb 27, 2024
Unlock the Power of Small Language Model Phi-2 for Chat, Research, Coding, and More
This week’s model release features the NVIDIA-optimized language model Phi-2, which can be used for a wide range of natural language processing (NLP) tasks....
2 MIN READ
Feb 19, 2024
Experience NVIDIA cuOpt Accelerated Optimization to Boost Operational Efficiency
This week’s model release features NVIDIA cuOpt, a world-record-breaking accelerated optimization engine that helps teams solve complex routing problems and...
6 MIN READ
Feb 12, 2024
Performance-Efficient Mamba-Chat from NVIDIA AI Foundation Models
This week’s release features the NVIDIA-optimized Mamba-Chat model, which you can experience directly from your browser. This post is part of Model Mondays, a...
3 MIN READ
Feb 05, 2024
Generate Code, Answer Queries, and Translate Text with New NVIDIA AI Foundation Models
This week’s Model Monday release features the NVIDIA-optimized Code Llama, Kosmos-2, and SeamlessM4T, which you can experience directly from your browser....
10 MIN READ
Jan 22, 2024
Query Graphs with Optimized DePlot Model
NVIDIA AI Foundation Models and Endpoints provides access to a curated set of community and NVIDIA-built generative AI models to experience, customize, and...
6 MIN READ