This post was written by Nathan Whitehead
A few days ago, a friend came to me with a question about floating point. Let me start by saying that my friend knows his stuff; he doesn’t ask stupid questions. So he had my attention. He was working on some biosciences simulation code, was getting answers of a different precision than he expected on the GPU, and wanted to know what was up.
Even expert CUDA programmers don’t always know all the intricacies of floating point. It’s a tricky topic. Even my friend, who is so cool that he wears sunglasses indoors, needed some help. If you look at the NVIDIA CUDA forums, questions and concerns about floating point come up regularly.
Getting a handle on how to effectively use floating point is obviously very important if you are doing numeric computations in CUDA.
In an attempt to help out, Alex Fit-Florea and I have written a short paper about floating point on NVIDIA GPUs, *Floating Point and IEEE 754 Compliance for NVIDIA GPUs*. In the paper, we talk about various issues related to floating point in CUDA:
- How the IEEE 754 standard fits in with NVIDIA GPUs
- How fused multiply-add improves accuracy (see the sketch after this list)
- Why there’s more than one way to compute a dot product (we present three)
- How to make sense of different numerical results between CPU and GPU
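To make the fused multiply-add point concrete, here is a minimal sketch. It computes x² − 1 on the GPU two ways: once with a fused multiply-add, which rounds the exact result only once, and once with a separate rounded multiply followed by a rounded add. The kernel name, the test value x = 1 + 2⁻¹², and the expected digits are my illustration, not taken from the paper; the device functions `fmaf`, `__fmul_rn`, and `__fadd_rn` are standard CUDA.

```cuda
#include <cstdio>
#include <math.h>

// Compare a fused multiply-add (one rounding) with a separate multiply
// followed by an add (two roundings) when computing x*x - 1.
__global__ void fmaVsMulAdd(float x, float *out)
{
    // fmaf computes x*x - 1 exactly, then rounds once.
    out[0] = fmaf(x, x, -1.0f);
    // __fmul_rn/__fadd_rn force a rounded multiply and a rounded add;
    // the compiler does not contract these intrinsics into an FMA.
    out[1] = __fadd_rn(__fmul_rn(x, x), -1.0f);
}

int main()
{
    // x = 1 + 2^-12, so x*x = 1 + 2^-11 + 2^-24 does not fit in a single
    // precision significand and the rounded product drops the 2^-24 term.
    float x = 1.0f + ldexpf(1.0f, -12);

    float *d_out;
    cudaMalloc((void**)&d_out, 2 * sizeof(float));
    fmaVsMulAdd<<<1, 1>>>(x, d_out);

    float h_out[2];
    cudaMemcpy(h_out, d_out, 2 * sizeof(float), cudaMemcpyDeviceToHost);
    printf("fused multiply-add:  %.10e\n", h_out[0]);
    printf("multiply then add:   %.10e\n", h_out[1]);
    cudaFree(d_out);
    return 0;
}
```

If I have the rounding right, the fused result keeps the 2⁻²⁴ term (about 4.8834e-04) while the twice-rounded version loses it (4.8828e-04), so the two answers differ in the last bits even though both are "correct" IEEE 754 arithmetic. The paper goes into when the compiler generates FMAs on its own and how to control that.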