Pavlo Molchanov

Pavlo Molchanov is a distinguished research scientist and manager at NVIDIA Research. He leads the Deep Learning Efficiency Research team. His main areas of interest include LLM and VLM efficiency, novel architecture design, post-training model compression, and adaptive/conditional inference.
Avatar photo

Posts by Pavlo Molchanov

Agentic AI / Generative AI

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,... 12 MIN READ
Decorative image.
Agentic AI / Generative AI

Visual Language Models on NVIDIA Hardware with VILA

Note: As of January 6, 2025 VILA is now part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently.... 11 MIN READ