cuSPARSE
Sep 10, 2024
Accelerating the HPCG Benchmark with NVIDIA Math Sparse Libraries
In the realm of high-performance computing (HPC), NVIDIA has continually advanced HPC by offering its highly optimized NVIDIA High-Performance Conjugate...
9 MIN READ
Aug 01, 2024
Just Released: CUDA Toolkit 12.6
The release supports GB100 capabilities and new library enhancements to cuBLAS, cuFFT, cuSOLVER, cuSPARSE, as well as the release of Nsight Compute 2024.3.
1 MIN READ
Mar 14, 2024
Just Released: NVIDIA cuSPARSELt 0.6
NVIDIA cuSPARSELt harnesses Sparse Tensor Cores to accelerate general matrix multiplications. Version 0.6. adds support for the NVIDIA Hopper architecture.
1 MIN READ
Mar 22, 2023
Reusable Computational Patterns for Machine Learning and Information Retrieval with RAPIDS RAFT
RAPIDS is a suite of accelerated libraries for data science and machine learning on GPUs: cuDF for pandas-like data structures cuGraph for graph data cuML for...
11 MIN READ
Dec 12, 2022
CUDA Toolkit 12.0 Released for General Availability
NVIDIA announces the newest CUDA Toolkit software release, 12.0. This release is the first major release in many years and it focuses on new programming models...
12 MIN READ
Aug 03, 2022
Accelerated Inference for Large Transformer Models Using NVIDIA Triton Inference Server
This is the first part of a two-part series discussing the NVIDIA Triton Inference Server’s FasterTransformer (FT) library, one of the fastest libraries for...
10 MIN READ
Jul 26, 2022
Accelerating GPU Applications with NVIDIA Math Libraries
There are three main ways to accelerate GPU applications: compiler directives, programming languages, and preprogrammed libraries. Compiler directives such as...
12 MIN READ
Jun 20, 2022
Just Released: cuSPARSELt v0.3
The NVIDIA cuSPARSELt update expands the high-performance CUDA library support for vectors of alpha and beta scalars, GeLu scaling, Split-K Mode, and more.
1 MIN READ
Nov 15, 2021
Accelerating ReLu and GeLu Activation Functions, and Batched Sparse GEMM in cuSPARSELt v0.2.0
Today, NVIDIA is announcing the availability of cuSPARSELt, version 0.2.0, which increases performance on activation functions, bias vectors, and Batched Sparse...
2 MIN READ
Mar 19, 2021
Accelerating Matrix Multiplication with Block Sparse Format and NVIDIA Tensor Cores
Sparse-matrix dense-matrix multiplication (SpMM) is a fundamental linear algebra operation and a building block for more complex algorithms such as finding the...
7 MIN READ
Jul 22, 2015
Using GPUs to Accelerate Epidemic Forecasting
Originally trained as a veterinary surgeon, Chris Jewell, a Senior Lecturer in Epidemiology at Lancaster Medical School in the UK became interested in epidemics...
12 MIN READ
Jul 08, 2015
New Features in CUDA 7.5
Today I'm happy to announce that the CUDA Toolkit 7.5 Release Candidate is now available. The CUDA Toolkit 7.5 adds support for FP16 storage for up to 2x larger...
12 MIN READ