Asma Kuriparambil Thekkumpate

Asma Beevi K T is a senior engineer at NVIDIA, developing the NVIDIA TensorRT Model Optimizer toolkit. Her interests span training and inference optimizations for deep learning models, particularly LLMs.
Avatar photo

Posts by Asma Kuriparambil Thekkumpate

Generative AI

How Quantization Aware Training Enables Low-Precision Accuracy Recovery

After training AI models, a variety of compression techniques can be used to optimize them for deployment. The most common is post-training quantization (PTQ),... 10 MIN READ
Generative AI

Fine-Tuning gpt-oss for Accuracy and Performance with Quantization Aware Training

Major open-source foundational model releases are an exciting time for the AI community, bringing unique architectural innovations and capabilities. As the... 7 MIN READ
Generative AI

NVIDIA TensorRT Model Optimizer v0.15 Boosts Inference Performance and Expands Model Support

NVIDIA has announced the latest v0.15 release of NVIDIA TensorRT Model Optimizer, a state-of-the-art quantization toolkit of model optimization techniques... 5 MIN READ