Posts by Karin Sevegnani
        
                    Agentic AI / Generative AI
        
        
        Sep 23, 2025
      
      Faster Training Throughput in FP8 Precision with NVIDIA NeMo
                                                In previous posts on FP8 training, we explored the fundamentals of FP8 precision and took a deep dive into the various scaling recipes for practical large-scale...
                          
          
            12 MIN READ
          
        
      
    
        
                    Agentic AI / Generative AI
        
        
        Jul 01, 2025
      
      Per-Tensor and Per-Block Scaling Strategies for Effective FP8 Training
                                                In this blog post, we’ll break down the main FP8 scaling strategies—per-tensor scaling, delayed and current scaling, and per-block scaling (including the...
                          
          
            10 MIN READ
          
        
      
    
        
                    Agentic AI / Generative AI
        
        
        Jun 04, 2025
      
      Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training
                                                With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision...
                          
          
            11 MIN READ
          
        
      
    
        
                    Developer Tools & Techniques
        
        
        May 27, 2025
      
      Advanced Optimization Strategies for LLM Training on NVIDIA Grace Hopper
                                                In the previous post, Profiling LLM Training Workflows on NVIDIA Grace Hopper, we explored the importance of profiling large language model (LLM) training...
                          
          
            10 MIN READ
          
        
      
    
        
                    Developer Tools & Techniques
        
        
        May 27, 2025
      
      Profiling LLM Training Workflows on NVIDIA Grace Hopper
                                                The rapid advancements in AI have resulted in an era of exponential growth in model sizes, particularly in the domain of large language models (LLMs). These...
                          
          
            12 MIN READ
          
        
      
    
        
                    Agentic AI / Generative AI
        
        
        Apr 24, 2025
      
      Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM
                                                This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. ...
                          
          
            7 MIN READ