Posts by Anjali Shah
Generative AI
Apr 28, 2024
Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...
9 MIN READ
Data Center / Cloud
Mar 07, 2024
Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform
Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by...
14 MIN READ
Generative AI
Feb 21, 2024
NVIDIA TensorRT-LLM Revs Up Inference for Google Gemma
NVIDIA is collaborating as a launch partner with Google in delivering Gemma, a newly optimized family of open models built from the same research and technology...
4 MIN READ
Generative AI
Nov 16, 2023
Mastering LLM Techniques: Training
Large language models (LLMs) are a class of generative AI models built using transformer networks that can recognize, summarize, translate, predict, and...
15 MIN READ
Conversational AI
Aug 10, 2023
Mastering LLM Techniques: Customization
Large language models (LLMs) are becoming an integral tool for businesses to improve their operations, customer interactions, and decision-making processes....
12 MIN READ