The performance of many math functions has improved with the release of the CUDA 4.0 Toolkit.

This presentation includes performance results for many of the key functions.

Results include performance measurements for:

  • cuFFT – Fast Fourier Transforms Library
  • cuBLAS – Complete BLAS Library
  • cuSPARSE – Sparse Matrix Library
  • cuRAND – Random Number Generation (RNG) Library
  • NPP – Performance Primitives for Image & Video Processing
  • Thrust – Templated Parallel Algorithms & Data Structures
  • math.h – C99 Floating-Point Library

File | Description | Size
CUDA_4_0_Math_Libraries_Performance_6_14.pdf | A review of the performance of the CUDA 4.0 math libraries, including cuFFT, cuBLAS, cuSPARSE, cuRAND, NPP, Thrust, and others | 1.44 MB