Posts by Wenwen Gao
Generative AI
May 12, 2025
Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework
As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...
6 MIN READ
Data Center / Cloud
May 08, 2025
Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA Nemo Framework
Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...
6 MIN READ
Generative AI
May 06, 2025
LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM
This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
11 MIN READ
Generative AI
Apr 02, 2025
LLM Inference Benchmarking: Fundamental Concepts
This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM...
15 MIN READ
Generative AI
Jan 07, 2025
Accelerate Custom Video Foundation Model Pipelines with New NVIDIA NeMo Framework Capabilities
Generative AI has evolved from text-based models to multimodal models, with a recent expansion into video, opening up new potential uses across various...
10 MIN READ
Generative AI
Mar 27, 2024
Fine-Tune and Align LLMs Easily with NVIDIA NeMo Customizer
As large language models (LLMs) continue to gain traction in enterprise AI applications, the demand for custom models that can understand and integrate specific...
5 MIN READ