NVIDIA Nsight® Perf SDK

The NVIDIA® Nsight Perf SDK is a graphics profiling toolbox for DirectX, Vulkan, and OpenGL, enabling you to collect GPU performance metrics directly from your application.

Only a few lines of code are needed to set up GPU performance metric collection with Nsight Perf SDK API calls in your application.

Profile In-Application

Integrate GPU performance metric collection into your application or graphics developer tool of choice. Activate profiling from your own custom programmatic triggers. Choose the list of GPU metrics to collect, customize your output, and keep control over your workflow.

Upgrade Your CI/CD

Generate detailed profiler reports on every developer and artist change. Add dedicated perf regression criteria by inspecting GPU metric values.
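
A regression gate of this kind can be sketched in a few lines. The example below assumes your pipeline has already extracted metric values into a plain name-to-value mapping. The metric names, the baseline values, and the `find_regressions` helper are all hypothetical and are not part of the Nsight Perf SDK.

```python
from typing import Dict, List


def find_regressions(
    baseline: Dict[str, float],
    current: Dict[str, float],
    tolerance: float = 0.05,
) -> List[str]:
    """Return one message per metric that is missing from `current` or that
    exceeds its baseline by more than `tolerance` (a fraction: 0.05 is 5%).

    Assumes higher values are worse, as with GPU time.
    """
    failures = []
    for name, base in sorted(baseline.items()):
        if name not in current:
            failures.append(f"{name}: missing from current run")
            continue
        limit = base * (1.0 + tolerance)
        if current[name] > limit:
            failures.append(
                f"{name}: {current[name]:.3f} exceeds limit {limit:.3f}"
            )
    return failures


# Hypothetical metric names and values, for illustration only.
baseline = {"frame_gpu_time_ms": 8.0, "shadow_pass_gpu_time_ms": 1.5}
current = {"frame_gpu_time_ms": 8.2, "shadow_pass_gpu_time_ms": 1.9}

problems = find_regressions(baseline, current)
for line in problems:
    print(line)
```

In a real CI job, the script would exit with a non-zero status whenever `problems` is not empty, so that the offending change is flagged. A relative tolerance is used here because run-to-run noise usually scales with the size of the measurement; a per-metric tolerance would be a natural extension.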

Be One with the GPU



HTML Profiler Report Generator

Generate detailed profiler reports with minimal effort. Insert a few calls at graphics API device initialization, at Present/SwapBuffers, and in a keypress handler or an automated trigger.

Insert annotations (PushRange/PopRange) around GPU workloads to collect additional reports per region of execution. The report generator automatically collects hundreds of GPU metrics of interest, so you do not need to study the individual metrics before first use.
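
The integration pattern described above (initialize once, bracket each frame, trigger a capture on demand, and annotate regions) can be modeled with a small stand-in. This sketch is a conceptual model only: `FakeProfiler` and every method on it are hypothetical names invented for illustration, not the Nsight Perf SDK API. Consult the SDK headers and samples for the real calls.

```python
from contextlib import contextmanager
from typing import Iterator, List


class FakeProfiler:
    """Hypothetical stand-in that records which ranges a capture would cover."""

    def __init__(self) -> None:
        self._stack: List[str] = []    # ranges that are currently open
        self._armed = False            # capture requested for the next frame
        self._collecting = False       # capture active during this frame
        self.captured: List[str] = []  # full paths of the captured ranges

    def request_capture(self) -> None:
        """Call from a keypress handler or an automated trigger."""
        self._armed = True

    def on_frame_start(self) -> None:
        self._collecting = self._armed
        self._armed = False

    def on_frame_end(self) -> None:
        if self._stack:
            raise RuntimeError("unbalanced push/pop: " + "/".join(self._stack))
        self._collecting = False

    def push_range(self, name: str) -> None:
        self._stack.append(name)
        if self._collecting:
            self.captured.append("/".join(self._stack))

    def pop_range(self) -> None:
        self._stack.pop()

    @contextmanager
    def scoped_range(self, name: str) -> Iterator[None]:
        """Pair push and pop so a range cannot be left open."""
        self.push_range(name)
        try:
            yield
        finally:
            self.pop_range()


def render_frame(profiler: FakeProfiler) -> None:
    profiler.on_frame_start()
    with profiler.scoped_range("Frame"):
        with profiler.scoped_range("ShadowPass"):
            pass  # GPU work for shadows would be submitted here
        with profiler.scoped_range("Lighting"):
            pass  # GPU work for lighting would be submitted here
    profiler.on_frame_end()  # the point where Present/SwapBuffers happens


profiler = FakeProfiler()   # created once, at device initialization
render_frame(profiler)      # first frame: no capture requested
profiler.request_capture()  # for example, a keypress arrives
render_frame(profiler)      # second frame: ranges are recorded
print(profiler.captured)    # ['Frame', 'Frame/ShadowPass', 'Frame/Lighting']
```

Nested ranges produce hierarchical paths, which is what lets a report be broken down per region of execution. Wrapping push and pop in a context manager is a design choice in this sketch that keeps the two calls balanced even when an exception is raised.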

The reports provide a top-down representation of GPU performance, with fast navigation to the top performance limiters. Quickly determine the workload type, pipeline activity and utilization, shader latency reasons, and 3D data flow.



NVIDIA Nsight News


Announcing Latest Nsight Graphics 2021.4 – Download Now

Learn about the latest release of Nsight Graphics, an all-in-one graphics debugger and profiler to help game developers get the most out of NVIDIA hardware.

Optimizing DX12 Resource Uploads to the GPU Using CPU-Visible VRAM

This walk-through shows how moving cherry-picked DX12 UPLOAD heaps to CPU-Visible VRAM (CVV) using NVAPI can be a simple way to speed up PCIe-limited workloads.

Announcing Nsight Deep Learning Designer 2021.1 – A Tool for Efficient Deep Learning Model Design and Development

NVIDIA announces Nsight DL Designer – the first in-class integrated development environment to support efficient design of deep neural networks for in-app inference. 

Discovering New Features in CUDA 11.4

This post shares an overview of the key capabilities released in CUDA 11.4.

