Open Source at NVIDIA

Engaging with open-source communities accelerates innovation, making it easier for developers to collaborate and build. NVIDIA contributes to important open-source projects—including Docker, JAX, Kubernetes, Linux kernel, PyTorch, TensorFlow, and Universal Scene Description (USD)—and leads innovative open-source projects across many domains. NVIDIA also actively contributes to and collaborates with open-standards bodies worldwide.

Explore the innovative open-source projects below to learn more about accelerating your applications.

All

Apache

Apache Spark

GPU-accelerated Apache Spark for data analytics, machine learning, and deep learning pipelines

Home|    Github

NVIDIA

Apex

A PyTorch extension: Tools for easy mixed precision and distributed training in PyTorch

Home|    Github

NVIDIA

AmgX

Distributed multi-grid linear solver library on GPU

Home|    Github

NVIDIA

Clara Holoscan SDK

A hybrid computing platform for medical devices, both hardware and software

Home|    Github

NVIDIA

client

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Home|    Github

NVIDIA

Cloud Native Stack

A reference architecture for managing GPU workloads with Kubernetes

Home|    Github

NVIDIA

Collective Communication Library (NCCL)

Optimized primitives for collective multi-GPU communication

Home|    Github

NVIDIA

Container Toolkit

Allows users to build and run GPU accelerated containers.

Home|    Github

NVIDIA

CUB

A library of collective primitives and utilities

Home|    Github

NVIDIA

CUDA-PointPillars

A project demonstrating how to use CUDA-PointPillars to deal with cloud points data from lidar.

Home|    Github

NVIDIA

CUDA Templates for Linear Algebra Subroutines (CUTLASS)

A collection of templates for high-performance general matrix multiplication (GEMM) computations within CUDA kernels

Home|    Github

RAPIDS

cudf

cuDF - GPU DataFrame Library

Home|    Github

RAPIDS

cuml

cuML - RAPIDS Machine Learning Library

Home|    Github

RAPIDS

cugraph

cuGraph - RAPIDS Graph Analytics Library

Home|    Github

NVIDIA

CuPy

An open-source array library for GPU-accelerated computing with Python and CUDA

Home|    Github

RAPIDS

cuspatial

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Home|    Github

NVIDIA

CV-CUDA

CV-CUDA‚ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Home|    Github

RAPIDS

cuxfilter

GPU accelerated cross filtering with cuDF

Home|    Github

NVIDIA

Data Loading Library (DALI)

Data pre-processing in deep learning applications

Home|    Github

NVIDIA

Data Plane Development Kit (DPDK)

DPDK is a set of libraries and optimized network interface card (NIC) drivers for fast packet processing in a user space. Also provides a framework and common API for high-speed networking applications.

Home|    Github

NVIDIA

Deep Learning Inference Accelerator (NVDLA)

Open-source accelerator for deep learning inference

Home|    Github

NVIDIA

deepops

Tools for building GPU clusters

Home|    Github

NVIDIA

DIGITS

Deep Learning GPU Training System

Home|    Github

NVIDIA

Falcor

A real-time rendering framework

Home|    Github

NVIDIA

FasterTransformer

Highly optimized transformer-based encoder and decoder component for GPT and BERT models

NVIDIA

Federated Learning Active Runtime Environment (FLARE)

An open-source, extensible SDK that allows researchers and data scientists to adapt existing machine learning and deep learning workflow to a privacy-preserving federated paradigm

Home|    Github

Flang

Flang

A Fortran compiler targeting low-level virtual machine (LLVM)

NVIDIA

flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Home|    Github

NVIDIA

GenerativeModels

MONAI Generative Models makes it easy to train, evaluate, and deploy generative models and related applications

Home|    Github

NVIDIA

gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

Home|    Github

NVIDIA

GPU Operator

Automates the lifecycle management of the software required to expose GPUs on Kubernetes.

Home|    Github

NVIDIA

HugeCTR

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

Home|    Github

NVIDIA

isaac_ros_visual_slam

Visual odometry package based on hardware-accelerated NVIDIA Elbrus library with world class quality and performance.

Home|    Github

NVIDIA

jetbot

An educational AI robot based on NVIDIA Jetson Nano

Home|    Github

NVIDIA

K8s Device Plug-in

An NVIDIA plug-in that enables GPU support in Kubernetes

NVIDIA

Kaolin Library

PyTorch-based library with modular optimized functionality for 3D deep learning research

Home|    Github

NVIDIA

libcu++

C++ standard library for an entire CPU+GPU system supported by CUDA

NVIDIA

MAXINE-AR-SDK

NVIDIA AR SDK - API headers and sample applications

Home|    Github

NVIDIA

libnvidia-container

NVIDIA container runtime library

Home|    Github

NVIDIA

Material Definition Language (MDL)

Physically accurate material standard and SDK

Home|    Github

NVIDIA

MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

Home|    Github

NVIDIA

Megatron-LM

Ongoing research for training transformer models at scale

NVIDIA

Merlin

Open-source framework for building high-performing recommender systems at scale

Home|    Github

NVIDIA

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Home|    Github

NVIDIA

Milano

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice

NVIDIA

Modulus

A neural network framework for developing Physics-ML models

Home|    Github

NVIDIA

MONAI

PyTorch-based framework for deep learning in healthcare imaging

Home|    Github

NVIDIA

MONAILabel

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Home|    Github

NVIDIA

Morpheus

Open-source framework that enables cybersecurity developers to create optimized AI pipelines for filtering, processing, and classifying large volumes of real-time data

Home|    Github

NVIDIA

NeMo

An end-to-end, open and interoperable framework for building, customizing, and deploying generative AI models at-scale anywhere.

Home |    Github

NVIDIA

NeMo Guardrails

A toolkit for easily developing trustworthy, safe, and secure LLM conversational systems.

NVIDIA

Network Operator

Automates the lifecycle management of the software for accelerated networking on Kubernetes.

Home|    Github

NVIDIA

nccl-tests

NCCL Tests

Home|    Github

NVIDIA

nglod

Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)

Home|    Github

NVIDIA

nvcomp

High-performance GPU data compression library

Home|    Github

NVIDIA

NVTabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Home|    Github

NVIDIA

ONNX-TensorRT

TensorRT backend for ONNX

Home|    Github

NVIDIA

Open GPU Kernel Modules

Open-source release of CUDA GPU kernel modules for Linux

Home|    Github

NVIDIA

Open Seq2Seq

Toolkit for efficient experimentation with various sequence-to-sequence models

Home|    Github

NVIDIA

PhysX

Advanced, true-to-reality physics simulation engine

Home|    Github

PyTorch

PyTorch

An open source machine learning framework that accelerates the path from research prototyping to production deployment.

Home|    Github

RAPIDS

raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

Home|    Github

RAPIDS

RAPIDS

A suite of open-source software libraries and APIs for executing data science pipelines entirely on GPUs.

Home|    Github

NVIDIA Community

rl_games

High-performance reinforcement learning library

Home|    Github

RAPIDS

rmm

RAPIDS Memory Manager

Home|    Github

NVIDIA

ROS GEMs

Hardware-accelerated packages for the robot operating system (ROS)—deep neural networks (DNNs), perception pipelines, and deep learning software

Home|    Github

NVIDIA

Sionna

An Open-Source Library for 6G Physical-Layer Research

Home|    Github

NVIDIA

slang

Making it easier to work with shaders

Home|    Github

NVIDIA

spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Home|    Github

NVIDIA

Streamline

Streamline Integration Framework

Home|    Github

NVIDIA

Tacotron 2

PyTorch implementation of Natural TTS Synthesis by conditioning WaveNet on Mel Spectrogram predictions

TensorFlow

TensorFlow

An Open Source Machine Learning Framework for Everyone

Home|    Github

NVIDIA

TensorFlow-TensorRT

Integration for TensorFlow with TensorRT to get up to 6X faster inference in TensorFlow with few lines of code

Home|    Github

NVIDIA

Thrust

Thrust is a parallel algorithms library that resembles the C++ STL.

Home|    Github

NVIDIA

Torch-TensorRT

Integration to PyTorch with TensorRT to get 4X faster inference in PyTorch with one line of code

Home|    Github

NVIDIA

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Home|    Github

NVIDIA

Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Home|    Github

NVIDIA

Triton Inference server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Home|    Github

Pixar

Universal Scene Description (OpenUSD)

Open and extensible ecosystem for describing, composing, simulating, and collaborating within 3D worlds.

Home  |    Github  |

NVIDIA

vid2vid

High-resolution photorealistic video-to-video translation

NVIDIA

VideoProcessingFramework

Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions

Home|    Github

NVIDIA

waveglow

A Flow-based Generative Network for Speech Synthesis

Home|    Github

NVIDIA

yolo_deepstream

yolo model qat and deploy with deepstream&tensorrt

Home|    Github

AI and Deep Learning

Apache

Apache Spark

GPU-accelerated Apache Spark for data analytics, machine learning, and deep learning pipelines

Home|    Github

NVIDIA

Apex

A PyTorch extension: Tools for easy mixed precision and distributed training in PyTorch

Home|    Github

NVIDIA

Clara Holoscan SDK

A hybrid computing platform for medical devices, both hardware and software

Home|    Github

NVIDIA

client

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Home|    Github

NVIDIA

Collective Communication Library (NCCL)

Optimized primitives for collective multi-GPU communication

Home|    Github

NVIDIA

CV-CUDA

CV-CUDA‚ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Home|    Github

RAPIDS

cuml

cuML - RAPIDS Machine Learning Library

Home|    Github

NVIDIA

Data Loading Library (DALI)

Data pre-processing in deep learning applications

Home|    Github

NVIDIA

Deep Learning Examples

Tensor Cores-optimized code samples

Home|    Github

NVIDIA

Deep Learning Inference Accelerator (NVDLA)

Open-source accelerator for deep learning inference

Home|    Github

NVIDIA

DIGITS

Deep Learning GPU Training System

Home|    Github

NVIDIA

Federated Learning Active Runtime Environment (FLARE)

An open-source, extensible SDK that allows researchers and data scientists to adapt existing machine learning and deep learning workflow to a privacy-preserving federated paradigm

Home|    Github

NVIDIA

GenerativeModels

MONAI Generative Models makes it easy to train, evaluate, and deploy generative models and related applications

Home|    Github

NVIDIA

HugeCTR

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

Home|    Github

RAPIDS

raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

Home|    Github

NVIDIA

Kaolin Library

PyTorch-based library with modular optimized functionality for 3D deep learning research

Home|    Github

NVIDIA

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Home|    Github

NVIDIA

Merlin

Open-source framework for building high-performing recommender systems at scale

Home|    Github

NVIDIA

MONAI

PyTorch-based framework for deep learning in healthcare imaging

Home|    Github

NVIDIA

Morpheus

Open-source framework that enables cybersecurity developers to create optimized AI pipelines for filtering, processing, and classifying large volumes of real-time data

Home|    Github

NVIDIA

NeMo

An end-to-end, open and interoperable framework for building, customizing, and deploying generative AI models at-scale anywhere.

Home  |    Github

NVIDIA

NeMo Guardrails

A toolkit for easily developing trustworthy, safe, and secure LLM conversational systems.

NVIDIA

NVTabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Home|    Github

NVIDIA

ONNX-TensorRT

TensorRT backend for ONNX

Home|    Github

PyTorch

PyTorch

An open source machine learning framework that accelerates the path from research prototyping to production deployment.

Home|    Github

RAPIDS

RAPIDS

A suite of open-source software libraries and APIs for executing data science pipelines entirely on GPUs.

Home|    Github

NVIDIA

ROS GEMs

Hardware-accelerated packages for the robot operating system (ROS)—deep neural networks (DNNs), perception pipelines, and deep learning software

Home|    Github

NVIDIA

spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Home|    Github

TensorFlow

TensorFlow

An Open Source Machine Learning Framework for Everyone

Home|    Github

NVIDIA

TensorFlow-TensorRT

Integration for TensorFlow with TensorRT to get up to 6X faster inference in TensorFlow with few lines of code

Home|    Github

NVIDIA

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Home|    Github

NVIDIA

Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Home|    Github

NVIDIA

Triton Inference server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Home|    Github

NVIDIA

Torch-TensorRT

Integration to PyTorch with TensorRT to get 4X faster inference in PyTorch with one line of code

Home|    Github

NVIDIA

waveglow

A Flow-based Generative Network for Speech Synthesis

Home|    Github

NVIDIA

yolo_deepstream

yolo model qat and deploy with deepstream&tensorrt

Home|    Github

Edge AI and Robotics

NVIDIA

isaac_ros_visual_slam

Visual odometry package based on hardware-accelerated NVIDIA Elbrus library with world class quality and performance.

Home|    Github

NVIDIA

PhysX

Advanced, true-to-reality physics simulation engine

Home|    Github

NVIDIA Community

rl_games

High-performance reinforcement learning library

Home|    Github

Cloud Native Technology

NVIDIA

Cloud Native Stack

A reference architecture for managing GPU workloads with Kubernetes

Home|    Github

NVIDIA

Container Toolkit

Allows users to build and run GPU accelerated containers.

Home|    Github

NVIDIA

deepops

Tools for building GPU clusters

Home|    Github

NVIDIA

GPU Operator

Automates the lifecycle management of the software required to expose GPUs on Kubernetes.

Home|     Github

NVIDIA

K8s Device Plug-in

An NVIDIA plug-in that enables GPU support in Kubernetes

NVIDIA

libnvidia-container

NVIDIA container runtime library

Home|    Github

NVIDIA

Network Operator

Automates the lifecycle management of the software for accelerated networking on Kubernetes.

Home|    Github

Design, Visualization, and Simulation

NVIDIA

CUDA-PointPillars

A project demonstrating how to use CUDA-PointPillars to deal with cloud points data from lidar.

Home|    Github

NVIDIA

Falcor

A real-time rendering framework

Home|    Github

NVIDIA

MAXINE-AR-SDK

NVIDIA AR SDK - API headers and sample applications

Home|    Github

NVIDIA

Material Definition Language (MDL)

Physically accurate material standard and SDK

Home|    Github

NVIDIA

slang

Making it easier to work with shaders

Home|    Github

NVIDIA

Streamline

Streamline Integration Framework

Home|    Github

Pixar

Universal Scene Description (OpenUSD)

Open and extensible ecosystem for describing, composing, simulating, and collaborating within 3D worlds.

Home|    Github  |

High-Performance Computing (HPC)

NVIDIA

AmgX

Distributed multi-grid linear solver library on GPU

Home|    Github

NVIDIA

CUB

A library of collective primitives and utilities

Home|    Github

RAPIDS

cuxfilter

GPU accelerated cross filtering with cuDF

Home|    Github

NVIDIA

CUDA Templates for Linear Algebra Subroutines (CUTLASS)

A collection of templates for high-performance general matrix multiplication (GEMM) computations within CUDA kernels

Home|    Github

RAPIDS

cugraph

cuGraph - RAPIDS Graph Analytics Library

Home|    Github

RAPIDS

cudf

cuDF - GPU DataFrame Library

Home|    Github

NVIDIA

CuPy

An open-source array library for GPU-accelerated computing with Python and CUDA

Home|    Github

RAPIDS

cuspatial

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Home|    Github

NVIDIA

Data Plane Development Kit (DPDK)

DPDK is a set of libraries and optimized network interface card (NIC) drivers for fast packet processing in a user space. Also provides a framework and common API for high-speed networking applications.

Home|    Github

Flang

Flang

A Fortran compiler targeting low-level virtual machine (LLVM)

NVIDIA

gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

Home|    Github

NVIDIA

jetbot

An educational AI robot based on NVIDIA Jetson Nano

Home|    Github

NVIDIA

libcu++

C++ standard library for an entire CPU+GPU system supported by CUDA

NVIDIA

MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

Home|    Github

NVIDIA

Modulus

A neural network framework for developing Physics-ML models

Home|    Github

NVIDIA

nccl-tests

NCCL Tests

Home|    Github

NVIDIA

nvcomp

High-performance GPU data compression library

Home|    Github

NVIDIA

Open GPU Kernel Modules

Open-source release of CUDA GPU kernel modules for Linux

Home|    Github

RAPIDS

rmm

RAPIDS Memory Manager

Home|    Github

NVIDIA

Thrust

Thrust is a parallel algorithms library that resembles the C++ STL.

Home|    Github

NVIDIA

VideoProcessingFramework

Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions

Home|    Github

Research

NVIDIA

FasterTransformer

Highly optimized transformer-based encoder and decoder component for GPT and BERT models

NVIDIA

flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Home|    Github

NVIDIA

Kaolin Library

PyTorch-based library with modular optimized functionality for 3D deep learning research

Home|    Github

NVIDIA

Megatron-LM

Ongoing research for training transformer models at scale

NVIDIA

Milano

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice

NVIDIA

nglod

Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)

Home|    Github

NVIDIA

Open Seq2Seq

Toolkit for efficient experimentation with various sequence-to-sequence models

Home|    Github

NVIDIA

PhysX

Advanced, true-to-reality physics simulation engine

Home|    Github

NVIDIA

Sionna

An Open-Source Library for 6G Physical-Layer Research

Home|    Github

NVIDIA

Tacotron 2

PyTorch implementation of Natural TTS Synthesis by conditioning WaveNet on Mel Spectrogram predictions

NVIDIA

vid2vid

High-resolution photorealistic video-to-video translation

Source code for many NVIDIA Research projects is freely available for research purposes. Also, visit our SDK Glossary page for a more comprehensive list of our SDK and Library offerings.