Overview

NVIDIA Data Center GPU Manager is a suite of tools for managing and monitoring Tesla™ GPUs in cluster environments. It includes active health monitoring, comprehensive diagnostics, system alerts and governance policies including power and clock management. It can be used standalone by system administrators and easily integrates into cluster management, resource scheduling and monitoring products from NVIDIA partners.

DCGM supports Linux operating systems on x86 and POWER platforms. DCGM packages below include the libraries, binaries, NVIDIA Validation Suite (NVVS) and source examples for using the API (C and Python).

If you would like to download the DCGM installation packages and document please register for this program using the "Join Now" button below. This is a free to join and everyone will be accepted, if you are signed up and logged in there will not be a button and you can proceed to download the packages.

Join now

What's New in DCGM 1.4.6

  • Added support for Tesla M10, and bandwidth test on Tesla P4
  • Integration with open-source tools such as Prometheus, collectd to report GPU metrics
  • Additional GPU metrics reported by DCGM (e.g. PCIe stats, Memory, Performance states, Video encoder/decoder clocks)
  • Supports the NVIDIA® Tesla® V100 (32GB) GPU accelerator
  • Many other improvements and bug fixes -- see release notes for details

DCGM 1.4.6 Downloads (Sept 2018)

Please download the DCGM 1.4.6 Package for your distribution below. Note that this version of DCGM requires at least an R384 Tesla driver that can be downloaded from NVIDIA Driver Downloads page

Downloads
User Guide & Install Instructions PDF
DCGM API Reference Guide PDF
NVIDIA Validation Suite User Guide PDF
DCGM Release Notes PDF
RPM Packages (x86_64) RPM
RPM Packages (Power8) RPM
DEB Packages (x86_64) DEB
DEB Packages (Power8) DEB
EULA PDF

Learn More

Legacy DCGM Downloads

What's New in DCGM 1.4.2

  • Integration with open-source tools such as Prometheus, collectd to report GPU metrics
  • Additional GPU metrics reported by DCGM (e.g. PCIe stats, Memory, Performance states, Video encoder/decoder clocks)
  • Supports the NVIDIA® Tesla® V100 (32GB) GPU accelerator
  • Many other improvements and bug fixes -- see release notes for details

DCGM 1.4.2 Downloads (May 2018)

Please download the DCGM 1.4.2 Package for your distribution below. Note that this version of DCGM requires at least an R384 Tesla driver that can be downloaded from NVIDIA Driver Downloads page

Downloads
User Guide & Install Instructions PDF
DCGM API Reference Guide PDF
NVIDIA Validation Suite User Guide PDF
DCGM Release Notes PDF
RPM Packages (x86_64) RPM
RPM Packages (Power8) RPM
DEB Packages (x86_64) DEB
DEB Packages (Power8) DEB
EULA PDF

What's New in DCGM 1.3.3

  • DCGM features are now available on non-Tesla GPUs
  • Includes additional GPU diagnostics to stress GPU hardware
  • All functionality of NVVS is now accessible via the DCGM command line interface
  • Supports the NVIDIA® Tesla® V100 Hyperscale PCIe GPU accelerator
  • Many other improvements and bug fixes -- see release notes for details

DCGM 1.3.3 Downloads

Please download the DCGM 1.3.3 Package for your distribution below. Note that this version of DCGM requires at least an R384 Tesla driver that can be downloaded from NVIDIA Driver Downloads page

Downloads
User Guide & Install Instructions PDF
DCGM API Reference Guide PDF
NVIDIA Validation Suite User Guide PDF
DCGM Release Notes PDF
RPM Packages (x86_64) RPM
RPM Packages (Power8) RPM
DEB Packages (x86_64) DEB
DEB Packages (Power8) DEB
EULA PDF

What's New in DCGM 1.2.3

  • Added support for the NVIDIA® Tesla® V100 GPU accelerator
  • Performance improvements - up to 40x speedups for reporting of GPU metrics
  • Added new policy triggers for XID events
  • Bug fixes

DCGM 1.2.3 Downloads

Please download the DCGM 1.2.3 Package for your distribution below. Note that this version of DCGM requires at least an R384 Tesla driver that can be downloaded from NVIDIA Driver Downloads page

Downloads
User Guide & Install Instructions PDF
DCGM API Reference Guide PDF
NVVS User Guide PDF
DCGM Release Notes PDF
RPM Packages (x86_64) RPM
RPM Packages (Power8) RPM
DEB Packages (x86_64) DEB
DEB Packages (Power8) DEB
EULA PDF