Ian Tramble

Ian is a senior deep learning libraries engineer at NVIDIA, where he works on CUTLASS and compilers. Previously, he worked on TensorRT, MLPerf Inference, and real-time systems for autonomous vehicles. Ian graduated from the University of Toronto’s Engineering Science program with a major in Electrical and Computer Engineering.

Posts by Ian Tramble

Models / Libraries / Frameworks

Improving GEMM Kernel Auto-Tuning Efficiency on NVIDIA GPUs with Heuristics and CUTLASS 4.2

Selecting the best possible General Matrix Multiplication (GEMM) kernel for a specific problem and hardware is a significant challenge. The performance of a... 8 MIN READ
Simulation / Modeling / Design

Getting the Best Performance on MLPerf Inference 2.0

Models like Megatron 530B are expanding the range of problems AI can address. However, as models continue to grow in complexity, they pose a twofold challenge for... 11 MIN READ