Posts by Sergio Perez
Agentic AI / Generative AI
Jun 18, 2025
LLM Inference Benchmarking: How Much Does Your LLM Inference Cost?
Learn how to calculate LLM inference costs using NVIDIA GenAI-Perf benchmarking tools and TCO formulas. This guide covers performance metrics (TTFT,...
11 MIN READ
Agentic AI / Generative AI
Jun 04, 2025
Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training
With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision...
11 MIN READ
Data Center / Cloud
Jan 16, 2025
Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with Domyn and NVIDIA DGX Cloud
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
17 MIN READ