Get The  Latest CUDA Download

The CUDA 4.1 release candidate2 (RC2) 

Jump to downloads for : [Windows][ Linux ] [ MacOS 

This release includes a new LLVM-based CUDA compiler, 1000+ new image processing functions, and a redesigned Visual Profiler with automated performance analysis and integrated expert guidance.

We’re looking forward to hearing about your experience with this release (good and bad) through the Registered Developer feedback form. Give feedback via the CUDA Registered Developer Program .

Release Highlights

Try the new compiler!

  • New LLVM-based compiler delivers up to 10% faster performance for many applications

New & Improved “drop-in” acceleration with GPU-Accelerated Libraries

  • Over 1000 new image processing functions in the NPP library
  • New cuSPARSE tri-diagonal solver up to 10x faster than MKL on a 6 core CPU
  • New support in cuRAND for MRG32k3a and Mersenne Twister (MTGP11213) RNG algorithms
  • Bessel functions now supported in the CUDA standard Math library
  • Up to 2x faster sparse matrix vector multiply using ELL hybrid format
  • Learn more about all the great GPU-Accelerated Libraries

Enhanced & Redesigned Developer Tools

  • Redesigned Visual Profiler with automated performance analysis and expert guidance
  • CUDA_GDB support for multi-context debugging and assert() in device code
  • CUDA-MEMCHECK now detects out of bounds access for memory allocated in device code
  • Parallel Nsight 2.1 CUDA warp watch visualizes variables and expressions across an entire CUDA warp
  • Parallel Nsight 2.1 CUDA profiler now analyzes kernel memory activities, execution stalls and instruction throughput
  • Learn more about debugging and performance analysis tools for GPU developers on our CUDA Tools and Ecosystem Summary Page

Advanced Programming Features

  • Access to 3D surfaces and cube maps from device code
  • Enhanced no-copy pinning of system memory, cudaHostRegister() alignment and size restrictions removed
  • Peer-to-peer communication between processes
  • Support for resetting a GPU without rebooting the system in nvidia-smi

New & Improved SDK Code Samples

  • simpleP2P sample now supports peer-to-peer communication with any Fermi GPU
  • New grabcutNPP sample demonstrates interactive foreground extraction using iterated graph cuts
  • New samples showing how to implement the Horn-Schunck Method for optical flow, perform volume filtering, and read cube map texture

Watch the CUDA Toolkit 4.1 Feature and Overview Webinar  for an overview of some of the exciting new features of this release.
 
Find all the latest versions of other Libraries and Tools on our Tools & EcoSystem Page

Get yourself fully trained- check out the latest CUDA Webinars
Become a CUDA Registered Developer, report bugs, engage with NVIDIA engineering
Jump to: [Windows][ Linux ] [ MacOS 

Windows 7, VISTA, Windows XP Downloads
Developer Drivers for WinXP (285.86)
Support for XP on notebooks is being phased out and is not available for this release. See Release Notes and Getting Started Guides for more information.
32-bit 64-bit  
Developer Drivers for WinVista and Win7 (285.86) 32-bit 64-bit  
Notebook Developer Drivers for WinVista and Win7 (285.86) 32-bit 64-bit
CUDA Toolkit
  • C/C++ compiler
  • Visual Profiler
  • GPU-accelerated BLAS library
  • GPU-accelerated FFT library
  • GPU-accelerated Sparse Matrix library
  • GPU-accelerated RNG library
  • Additional tools and documentation

32-bit 64-bit

Documentation Installed with Toolkit

GPU Computing SDK - complete package including all code samples 32-bit 64-bit
browse online
Parallel Nsight 2.1RC2  download
Learn about additional tools, libraries, and more… CUDA Ecosystem

Linux Downloads
Developer Drivers for Linux (285.05.23)  32-bit64-bit
CUDA Toolkit
  • C/C++ compiler
  • CUDA-GDB debugger
  • Visual Profiler
  • GPU-accelerated BLAS library
  • GPU-accelerated FFT library
  • GPU-accelerated Sparse Matrix library
  • GPU-accelerated RNG library
  • Additional tools and documentation

Documentation Installed with Toolkit

CUDA Toolkit for Fedora 14 32-bit64-bit
CUDA Toolkit for RedHat Enterprise Linux 6.0 64-bit
CUDA Toolkit for RedHat Enterprise Linux 5.5 32-bit, 64-bit,  
CUDA Toolkit for Ubuntu Linux 11.04 32-bit64-bit
CUDA Toolkit for Ubuntu Linux 10.04 32-bit64-bit
CUDA Toolkit for OpenSUSE 11.2 32-bit64-bit
CUDA Toolkit for SUSE Linux Enterprise Server 11 SP1 32-bit,64-bit,
GPU Computing SDK - complete package including all code samples download 
browse online (production release)
Learn about additional tools, libraries, and more… CUDA Ecosystem

Mac OS X Downloads
Developer Drivers (4.1.21) for MacOS (requires OS ver. 10.6.8 or higher) download
CUDA Toolkit (requires OS version 10.6.8 or higher)
  • C/C++ compiler
  • CUDA-GDB debugger
  • Visual Profiler
  • GPU-accelerated BLAS library
  • GPU-accelerated FFT library
  • GPU-accelerated Sparse Matrix library
  • GPU-accelerated RNG library
  • Additional tools and documentation

download

Documentation Installed with Toolkit

GPU Computing SDK - complete package including all code samples download 
Browse Online (production)
Learn about additional tools, libraries, and more… CUDA Ecosystem