
CUDA 4.1 Release Candidate 2 (RC2)
Jump to downloads for: [Windows] [Linux] [MacOS]
This release includes a new LLVM-based CUDA compiler, 1000+ new image processing functions, and a redesigned Visual Profiler with automated performance analysis and integrated expert guidance.
We’re looking forward to hearing about your experience with this release, good and bad. Please give feedback through the feedback form of the CUDA Registered Developer Program.
Release Highlights
Try the new compiler!
- New LLVM-based compiler delivers up to 10% faster performance for many applications
New & Improved “drop-in” acceleration with GPU-Accelerated Libraries
- Over 1000 new image processing functions in the NPP library
- New cuSPARSE tri-diagonal solver up to 10x faster than MKL on a 6-core CPU
- New support in cuRAND for MRG32k3a and Mersenne Twister (MTGP11213) RNG algorithms
- Bessel functions now supported in the CUDA standard math library
- Up to 2x faster sparse matrix-vector multiply using the ELL hybrid format
- Learn more about all the great GPU-Accelerated Libraries
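
The ELL hybrid format mentioned above pairs a fixed-width ELLPACK block with a coordinate (COO) overflow list for rows that have more nonzeros than the block can hold. The following is a minimal CPU sketch of that storage idea in C++, not cuSPARSE code and not a statement of how cuSPARSE lays out its data. The names `HybMatrix`, `to_hyb`, and `spmv`, the column-major ELL layout, and the `-1` padding marker are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical container for a hybrid (ELL + COO) sparse matrix.
struct HybMatrix {
    std::size_t rows = 0;
    std::size_t cols = 0;
    std::size_t ell_width = 0;            // entries kept per row in the ELL part
    std::vector<int> ell_cols;            // rows * ell_width, column-major; -1 = padding
    std::vector<double> ell_vals;         // same layout as ell_cols
    std::vector<int> coo_rows, coo_cols;  // overflow entries that did not fit in ELL
    std::vector<double> coo_vals;
};

// Convert a dense row-major matrix into the hybrid layout (assumed helper).
HybMatrix to_hyb(const std::vector<std::vector<double>>& dense,
                 std::size_t ell_width) {
    HybMatrix m;
    m.rows = dense.size();
    m.cols = dense.empty() ? 0 : dense[0].size();
    m.ell_width = ell_width;
    m.ell_cols.assign(m.rows * ell_width, -1);
    m.ell_vals.assign(m.rows * ell_width, 0.0);
    for (std::size_t r = 0; r < m.rows; ++r) {
        assert(dense[r].size() == m.cols);  // all rows must have equal length
        std::size_t k = 0;                  // nonzeros placed in ELL for this row
        for (std::size_t c = 0; c < m.cols; ++c) {
            if (dense[r][c] == 0.0) continue;
            if (k < ell_width) {
                m.ell_cols[k * m.rows + r] = static_cast<int>(c);
                m.ell_vals[k * m.rows + r] = dense[r][c];
                ++k;
            } else {
                m.coo_rows.push_back(static_cast<int>(r));
                m.coo_cols.push_back(static_cast<int>(c));
                m.coo_vals.push_back(dense[r][c]);
            }
        }
    }
    return m;
}

// Compute y = A * x: regular ELL part first, then the COO overflow.
std::vector<double> spmv(const HybMatrix& m, const std::vector<double>& x) {
    assert(x.size() == m.cols);
    std::vector<double> y(m.rows, 0.0);
    for (std::size_t k = 0; k < m.ell_width; ++k) {
        for (std::size_t r = 0; r < m.rows; ++r) {
            const int c = m.ell_cols[k * m.rows + r];
            if (c >= 0) {
                y[r] += m.ell_vals[k * m.rows + r] * x[static_cast<std::size_t>(c)];
            }
        }
    }
    for (std::size_t i = 0; i < m.coo_vals.size(); ++i) {
        y[static_cast<std::size_t>(m.coo_rows[i])] +=
            m.coo_vals[i] * x[static_cast<std::size_t>(m.coo_cols[i])];
    }
    return y;
}
```

In this sketch, calling `to_hyb(dense, 2)` keeps at most two nonzeros per row in the regular ELL block and sends the rest to the COO list. The uniform row width is what makes the ELL part well suited to data-parallel hardware, while the overflow list keeps a few long rows from inflating the padding.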
Enhanced & Redesigned Developer Tools
- Redesigned Visual Profiler with automated performance analysis and expert guidance
- CUDA-GDB support for multi-context debugging and assert() in device code
- CUDA-MEMCHECK now detects out-of-bounds accesses for memory allocated in device code
- Parallel Nsight 2.1 CUDA warp watch visualizes variables and expressions across an entire CUDA warp
- Parallel Nsight 2.1 CUDA profiler now analyzes kernel memory activities, execution stalls and instruction throughput
- Learn more about debugging and performance analysis tools for GPU developers on our CUDA Tools and Ecosystem Summary Page
Advanced Programming Features
- Access to 3D surfaces and cube maps from device code
- Enhanced no-copy pinning of system memory: cudaHostRegister() alignment and size restrictions removed
- Peer-to-peer communication between processes
- Support for resetting a GPU without rebooting the system in nvidia-smi
New & Improved SDK Code Samples
- simpleP2P sample now supports peer-to-peer communication with any Fermi GPU
- New grabcutNPP sample demonstrates interactive foreground extraction using iterated graph cuts
- New samples showing how to implement the Horn-Schunck method for optical flow, perform volume filtering, and read cube map textures
Watch the CUDA Toolkit 4.1 Feature and Overview Webinar for an overview of some of the exciting new features of this release.
Find all the latest versions of other Libraries and Tools on our Tools & EcoSystem Page
Get yourself fully trained: check out the latest CUDA Webinars
Become a CUDA Registered Developer, report bugs, engage with NVIDIA engineering

| Windows 7, Windows Vista, Windows XP | Downloads |
|---|---|
| Developer Drivers for WinXP (285.86). Support for XP on notebooks is being phased out and is not available for this release. See Release Notes and Getting Started Guides for more information. | 32-bit, 64-bit |
| Developer Drivers for WinVista and Win7 (285.86) | 32-bit, 64-bit |
| Notebook Developer Drivers for WinVista and Win7 (285.86) | 32-bit, 64-bit |
| CUDA Toolkit | |
| Documentation | Installed with Toolkit |
| GPU Computing SDK - complete package including all code samples | 32-bit, 64-bit, browse online |
| Parallel Nsight 2.1 RC2 | download |
| Learn about additional tools, libraries, and more… | CUDA Ecosystem |

| Linux | Downloads |
|---|---|
| Developer Drivers for Linux (285.05.23) | 32-bit, 64-bit |
| CUDA Toolkit | |
| Documentation | Installed with Toolkit |
| CUDA Toolkit for Fedora 14 | 32-bit, 64-bit |
| CUDA Toolkit for RedHat Enterprise Linux 6.0 | 64-bit |
| CUDA Toolkit for RedHat Enterprise Linux 5.5 | 32-bit, 64-bit |
| CUDA Toolkit for Ubuntu Linux 11.04 | 32-bit, 64-bit |
| CUDA Toolkit for Ubuntu Linux 10.04 | 32-bit, 64-bit |
| CUDA Toolkit for OpenSUSE 11.2 | 32-bit, 64-bit |
| CUDA Toolkit for SUSE Linux Enterprise Server 11 SP1 | 32-bit, 64-bit |
| GPU Computing SDK - complete package including all code samples | download, browse online (production release) |
| Learn about additional tools, libraries, and more… | CUDA Ecosystem |

| Mac OS X | Downloads |
|---|---|
| Developer Drivers (4.1.21) for MacOS (requires OS version 10.6.8 or higher) | download |
| CUDA Toolkit (requires OS version 10.6.8 or higher) | |
| Documentation | Installed with Toolkit |
| GPU Computing SDK - complete package including all code samples | download, browse online (production) |
| Learn about additional tools, libraries, and more… | CUDA Ecosystem |



