Posts by Fred Oh
Generative AI
Sep 09, 2023
NVIDIA TensorRT-LLM Supercharges Large Language Model Inference on NVIDIA H100 GPUs
Large language models offer incredible new capabilities, expanding the frontier of what is possible with AI. But their large size and unique execution...
10 MIN READ
Simulation / Modeling / Design
Aug 22, 2023
Simplifying GPU Application Development with Heterogeneous Memory Management
Heterogeneous Memory Management (HMM) is a CUDA memory management feature that extends the simplicity and productivity of the CUDA Unified Memory programming...
16 MIN READ
Simulation / Modeling / Design
Jul 06, 2023
NVIDIA CUDA Toolkit 12.2 Unleashes Powerful Features for Boosting Applications
The latest release of NVIDIA CUDA Toolkit 12.2 introduces a range of essential new features, modifications to the programming model, and enhanced support for...
8 MIN READ
Data Center / Cloud
Jul 03, 2023
Event: CUDA 12.2 YouTube Premiere
Watch on demand as experts take a deep dive into CUDA 12.2, including its support for confidential computing.
1 MIN READ
Simulation / Modeling / Design
Jun 28, 2023
Improving GPU Performance by Reducing Instruction Cache Misses
GPUs are specially designed to crunch through massive amounts of data at high speed. They have a large number of compute resources, called streaming...
11 MIN READ
Data Center / Cloud
Jun 05, 2023
CUDA 12.1 Supports Large Kernel Parameters
CUDA kernel function parameters are passed to the device through constant memory and have been limited to 4,096 bytes. CUDA 12.1 increases this parameter limit...
5 MIN READ