Posts by Sergio Perez
Generative AI
Jun 18, 2025
Benchmarking LLM Inference Costs for Smarter Scaling and Deployment
This is the third post in the large language model latency-throughput benchmarking series, which aims to instruct developers on how to determine the cost of LLM...
10 MIN READ
Generative AI
Jun 04, 2025
Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training
With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision...
11 MIN READ
Data Center / Cloud
Jan 16, 2025
Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with Domyn and NVIDIA DGX Cloud
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
17 MIN READ