Asma Kuriparambil Thekkumpate

Asma Beevi K T is a senior engineer at NVIDIA, developing the NVIDIA TensorRT Model Optimizer toolkit. Her interests span training and inference optimizations for deep learning models, particularly LLMs.

Posts by Asma Kuriparambil Thekkumpate

Agentic AI / Generative AI Sep 11, 2025

How Quantization Aware Training Enables Low-Precision Accuracy Recovery

After training AI models, a variety of compression techniques can be used to optimize them for deployment. The most common is post-training quantization (PTQ),... 10 MIN READ

Agentic AI / Generative AI Aug 29, 2025

Fine-Tuning gpt-oss for Accuracy and Performance with Quantization Aware Training

Major open-source foundational model releases are an exciting time for the AI community, bringing unique architectural innovations and capabilities. As the... 7 MIN READ

Agentic AI / Generative AI Aug 15, 2024

NVIDIA TensorRT Model Optimizer v0.15 Boosts Inference Performance and Expands Model Support

NVIDIA has announced the latest v0.15 release of NVIDIA TensorRT Model Optimizer, a state-of-the-art quantization toolkit of model optimization techniques... 5 MIN READ