Ruixiang Wang

Ruixiang Wang is a senior developer technology engineer for LLMs and generative AI at NVIDIA. His current focus is on optimizing AI workloads, including both training and inference, to achieve speed-of-light performance on NVIDIA accelerators. He has a strong background in machine learning, deep learning, NLP, and LLMs. He also helps partners make the most of NVIDIA's technologies for their AI workloads. He holds an MSc degree in Computer Science from RWTH Aachen University.

Posts by Ruixiang Wang

Edge Computing

Model Quantization: Concepts, Methods, and Why It Matters

AI models are becoming increasingly complex, often exceeding the capabilities of available hardware. Quantization has emerged as a crucial technique to address... 12 MIN READ