Vishal Mehta

Vishal works as a senior developer technology engineer at NVIDIA, with focus on performance optimization for GPU applications. He has been working in the field of GPU computing for over 10 years. He is keen on teaching CUDA and GPU computing to users and drives the content for the CUDA programming guide. His day-to-day activities involve collaborations with domain scientists and industry experts to improve their workloads on GPUs.
Avatar photo

Posts by Vishal Mehta

Decorative image.
Data Center / Cloud

Demystifying AI Inference Deployments for Trillion Parameter Large Language Models

AI is transforming every industry, addressing grand human scientific challenges such as precision drug discovery and the development of autonomous vehicles, as... 14 MIN READ
Grace CPU Superchip illustration.
Simulation / Modeling / Design

NVIDIA Grace CPU Superchip Architecture In Depth

The NVIDIA Grace CPU is the first data center CPU developed by NVIDIA. Combining NVIDIA expertise with Arm processors, on-chip fabrics, system-on-chip (SoC)... 9 MIN READ
Simulation / Modeling / Design

NVIDIA Grace Hopper Superchip Architecture In-Depth

The NVIDIA Grace Hopper Superchip Architecture is the first true heterogeneous accelerated platform for high-performance computing (HPC) and AI workloads. It... 17 MIN READ
Simulation / Modeling / Design

NVIDIA Hopper Architecture In-Depth

Today during the 2022 NVIDIA GTC Keynote address, NVIDIA CEO Jensen Huang introduced the new NVIDIA H100 Tensor Core GPU based on the new NVIDIA Hopper GPU... 36 MIN READ
Data Science

Accelerating Random Forests Up to 45x Using cuML

Random forests are a popular machine learning technique for classification and regression problems. By building multiple independent decision trees, they reduce... 13 MIN READ