Posts by Asma Kuriparambil Thekkumpate
Agentic AI / Generative AI
Sep 11, 2025
How Quantization Aware Training Enables Low-Precision Accuracy Recovery
After training AI models, a variety of compression techniques can be used to optimize them for deployment. The most common is post-training quantization (PTQ),...
10 MIN READ
Agentic AI / Generative AI
Aug 29, 2025
Fine-Tuning gpt-oss for Accuracy and Performance with Quantization Aware Training
Major open-source foundational model releases are an exciting time for the AI community, bringing unique architectural innovations and capabilities. As the...
7 MIN READ
Agentic AI / Generative AI
Aug 15, 2024
NVIDIA TensorRT Model Optimizer v0.15 Boosts Inference Performance and Expands Model Support
NVIDIA has announced the latest v0.15 release of NVIDIA TensorRT Model Optimizer, a state-of-the-art quantization toolkit of model optimization techniques...
5 MIN READ