NVIDIA Nsight Perf SDK

The NVIDIA® Nsight™ Perf SDK is a graphics profiling toolbox for DirectX, Vulkan, and OpenGL that lets you collect GPU performance metrics directly from your application.

Get Started
Just a few lines of code are needed to set up GPU performance metrics collection with the Nsight Perf SDK.
Realtime Perf Triage

Enable high-level performance triage via realtime collection and on-screen visualization of GPU performance metrics. The new GPU Periodic Sampler collects device-level metrics at high sampling rates with low overhead.

Profile In-Application

Figure: Microsoft’s PIX on Windows showing NVIDIA GPU performance metrics.

Integrate GPU performance metric collection into your application or graphics developer tool of choice. Activate profiling from your own custom programmatic triggers. Choose the list of GPU metrics to collect, customize your output, and keep control over your workflow.

Upgrade Your CI/CD

Generate detailed profiler reports on every developer and artist change. Add dedicated perf regression criteria by inspecting GPU metric values.
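
As a rough illustration of a perf regression criterion, the sketch below compares metric values from a current run against a stored baseline and flags any metric that grew beyond an allowed fraction. It is not part of the Nsight Perf SDK: the `Criterion` class, the `find_regressions` function, the metric names, and the numbers are all hypothetical, and it assumes the metric values have already been extracted from a report into plain dictionaries.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Criterion:
    """Hypothetical regression rule: a metric may grow by at most this fraction."""
    metric: str
    max_regression: float  # e.g. 0.05 allows a 5% increase over the baseline


def find_regressions(baseline, current, criteria):
    """Return (metric, baseline, current, limit) for every criterion that fails."""
    failures = []
    for criterion in criteria:
        base = baseline[criterion.metric]
        cur = current[criterion.metric]
        limit = base * (1.0 + criterion.max_regression)
        if cur > limit:
            failures.append((criterion.metric, base, cur, limit))
    return failures


# Made-up metric names and values, standing in for numbers taken from two reports.
baseline = {"gpu_time_ms": 8.0, "vram_read_bytes": 1.0e9}
current = {"gpu_time_ms": 9.0, "vram_read_bytes": 1.02e9}
criteria = [
    Criterion("gpu_time_ms", 0.05),
    Criterion("vram_read_bytes", 0.05),
]

failures = find_regressions(baseline, current, criteria)
for metric, base, cur, limit in failures:
    print(f"REGRESSION {metric}: baseline={base} current={cur} limit={limit:.3f}")
```

In a CI job, a non-empty `failures` list would typically be turned into a non-zero exit code so that the change is blocked or flagged for review.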



Realtime Performance HUD


Add continuous performance metrics collection to your code, and leverage the built-in HUD renderer to effortlessly enable real-time, high-level performance triage.

Explore panels with metrics on SM, L2 cache, ROP, VRAM and various other subunits to gain an early understanding of the performance characteristics and potential bottlenecks of the scene as you move through it.

The HUD and Periodic Sampler utility classes also serve as examples for building your own powerful, low-overhead, real-time workflows on top of the low-level Nsight Perf SDK API.




HTML Profiler Report Generator

Generate detailed profiler reports with minimal effort. Simply insert a few calls at graphics API device initialization, Present/SwapBuffers, a keypress handler, or an automated trigger.

Insert annotations (PushRange/PopRange) around GPU workloads to collect additional reports per region of execution. The report generator automatically collects hundreds of GPU metrics of interest, so there is no need to study these complex topics before first use.
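
To make the idea of balanced range annotations concrete, here is a minimal sketch of push/pop bookkeeping. It does not call the Nsight Perf SDK: `RangeRecorder` and its methods are hypothetical stand-ins, and the slash-separated naming of nested regions is an assumption made purely for illustration. The point is only that every push is paired with a pop, and that nesting produces one region per annotated workload.

```python
from contextlib import contextmanager


class RangeRecorder:
    """Hypothetical stand-in that tracks nested push/pop range annotations."""

    def __init__(self):
        self._stack = []   # names of the currently open ranges
        self.regions = []  # every region seen, as a nested path

    def push_range(self, name):
        self._stack.append(name)
        self.regions.append("/".join(self._stack))

    def pop_range(self):
        if not self._stack:
            raise RuntimeError("pop_range called without a matching push_range")
        self._stack.pop()

    @contextmanager
    def range(self, name):
        # Guarantees the pop happens even if the annotated work raises.
        self.push_range(name)
        try:
            yield
        finally:
            self.pop_range()

    @property
    def depth(self):
        return len(self._stack)


recorder = RangeRecorder()
with recorder.range("Frame"):
    with recorder.range("ShadowPass"):
        pass  # the shadow-map draw calls would be issued here
    with recorder.range("LightingPass"):
        pass  # the lighting draw calls would be issued here

print(recorder.regions)
```

Wrapping the push/pop pair in a scoped helper is a design choice rather than a requirement; it simply makes unbalanced annotations harder to write.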

The reports provide a top-down representation of GPU performance, with fast navigation to the top performance limiters. Quickly determine the workload type, pipeline activity and utilization, shader latency reasons, and 3D data flow.





NVIDIA Nsight Tools News


GPU-Accelerated Video Processing with NVIDIA In-Depth Support for Vulkan Video

Vulkan Video extensions for video decoding get a finalized release and support from Vulkan SDK, bringing highly tunable video processing to cross-platform applications.

CUDA Toolkit 12.0 Released for General Availability

NVIDIA announces the newest CUDA Toolkit software release, 12.0. This release is the first major release in many years, and it focuses on new programming models and CUDA application acceleration through new hardware capabilities. For more information, watch the YouTube Premiere webinar, CUDA 12.0: New Features and Beyond. You can now target architecture-specific features and …

Just Released: CUDA Toolkit 12.0

CUDA Toolkit 12.0 supports NVIDIA Hopper architecture and many new features to help developers maximize performance on NVIDIA GPU-based products.

Upcoming Workshop: Fundamentals of Accelerated Computing with CUDA C/C++

Learn the fundamental tools and techniques for accelerating C/C++ applications to run on massively parallel GPUs with CUDA in this instructor-led workshop.

