TensorRT-LLM
Apr 28, 2024
Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...
9 MIN READ
Apr 22, 2024
Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API
This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...
4 MIN READ
Apr 02, 2024
Tune and Deploy LoRA LLMs with NVIDIA TensorRT-LLM
Large language models (LLMs) have revolutionized natural language processing (NLP) with their ability to learn from massive amounts of text and generate fluent...
15 MIN READ
Mar 27, 2024
NVIDIA H200 Tensor Core GPUs and NVIDIA TensorRT-LLM Set MLPerf LLM Inference Records
Generative AI is unlocking new computing applications that greatly augment human capability, enabled by continued model innovation. Generative AI...
11 MIN READ
Feb 21, 2024
NVIDIA TensorRT-LLM Revs Up Inference for Google Gemma
NVIDIA is collaborating as a launch partner with Google in delivering Gemma, a newly optimized family of open models built from the same research and technology...
4 MIN READ
Feb 05, 2024
Generate Code, Answer Queries, and Translate Text with New NVIDIA AI Foundation Models
This week’s Model Monday release features the NVIDIA-optimized code Llama, Kosmos-2, and SeamlessM4T, which you can experience directly from your browser....
10 MIN READ
Feb 01, 2024
Deploy an AI Coding Assistant with NVIDIA TensorRT-LLM and NVIDIA Triton
Large language models (LLMs) have revolutionized the field of AI, creating entirely new ways of interacting with the digital world. While they provide a good...
12 MIN READ
Jan 08, 2024
Get Started with Generative AI Development for Windows PCs with NVIDIA RTX
Generative AI and large language models (LLMs) are changing human-computer interaction as we know it. Many use cases would benefit from running LLMs locally on...
4 MIN READ
Dec 19, 2023
Most Popular NVIDIA Technical Blog Posts of 2023: Generative AI, LLMs, Robotics, and Virtual Worlds Breakthroughs
As we approach the end of another exciting year at NVIDIA, it's time to look back at the most popular stories from the NVIDIA Technical Blog in 2023....
5 MIN READ