Technical Blog
Tag: Sparsity
Subscribe
Technical Walkthrough
Jul 20, 2021
Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT
○ TensorRT is an SDK for high-performance deep learning inference, and TensorRT 8.0 introduces support for sparsity that uses sparse tensor cores on NVIDIA Ampere GPUs. It can accelerate networks by...
8 MIN READ
Technical Walkthrough
Dec 08, 2020
Exploiting NVIDIA Ampere Structured Sparsity with cuSPARSELt
Deep neural networks achieve outstanding performance in a variety of fields, such as computer vision, speech recognition, and natural language processing.
9 MIN READ
Technical Walkthrough
May 14, 2020
Defining AI Innovation with NVIDIA DGX A100
Built on the brand new NVIDIA A100 Tensor Core GPU, DGX A100 is the third generation of DGX systems and is the universal system for AI infrastructure.
15 MIN READ
Technical Walkthrough
May 14, 2020
State-of-the-Art Language Modeling Using Megatron on the NVIDIA A100 GPU
Recent work has demonstrated that larger language models dramatically advance the state of the art in natural language processing (NLP) applications such as…
9 MIN READ
Technical Walkthrough
May 14, 2020
NVIDIA Ampere Architecture In-Depth
Today, during the 2020 NVIDIA GTC keynote address, NVIDIA founder and CEO Jensen Huang introduced the new NVIDIA A100 GPU based on the new NVIDIA Ampere GPU…
30 MIN READ
Technical Walkthrough
Sep 11, 2017
Gradient Boosting, Decision Trees and XGBoost with CUDA
Gradient boosting is a powerful machine learning algorithm used to achieve state-of-the-art accuracy on a variety of tasks such as regression…
17 MIN READ