Pavlo Molchanov

Pavlo Molchanov is a distinguished research scientist and manager at NVIDIA Research. He leads the Deep Learning Efficiency Research team. His main areas of interest include LLM and VLM efficiency, novel architecture design, post-training model compression, and adaptive/conditional inference.

Posts by Pavlo Molchanov

Agentic AI / Generative AI Dec 01, 2025

Train Small Orchestration Agents to Solve Big Problems

Using the right tool and model for a task is a challenging and ever-present engineering problem in agent design. At NVIDIA Research, we're making fast progress... 7 MIN READ

Agentic AI / Generative AI Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,... 12 MIN READ

Agentic AI / Generative AI May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Note: As of January 6, 2025 VILA is now part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently.... 11 MIN READ