Posts by Hao Wu
Recommenders / Personalization
Aug 31, 2022
Fast, Terabyte-Scale Recommender Training Made Easy with NVIDIA Merlin Distributed-Embeddings
Embeddings play a key role in deep learning recommender models. They are used to map encoded categorical inputs in data to numerical values that can be...
8 MIN READ
Computer Vision / Video Analytics
Jul 20, 2021
Achieving FP32 Accuracy for INT8 Inference Using Quantization Aware Training with NVIDIA TensorRT
Deep learning is revolutionizing the way that industries are delivering products and services. These services include object detection, classification, and...
17 MIN READ
Simulation / Modeling / Design
Nov 06, 2019
Int4 Precision for AI Inference
INT4 Precision Can Bring an Additional 59% Speedup Compared to INT8 If there’s one constant in AI and deep learning, it’s never-ending optimization to wring...
5 MIN READ