Using Nsight Compute to Inspect your Kernels

HPC, CUDA, Development Tools and Libraries, Nsight, Nsight compute, NSight Systems

Nadeem Mohammad, posted Sep 16 2019

By now, hopefully you read the first two blogs in this series “Migrating to NVIDIA Nsight Tools from NVVP and Nvprof” and “Transitioning to Nsight Systems from NVIDIA Visual Profiler / nvprof,” and you’ve discovered NVIDIA added a few new tools, b

Read more

Developer Spotlight: Visualizing High-Resolution Atomic Structures to Simulate Molecular Dynamics

HPC, Computer Graphics & Visualization, CUDA, Machine Learning & Artificial Intelligence, Tesla

Nadeem Mohammad, posted Sep 09 2019

Experimental sciences deliver high-resolution atomic structures for biological complexes, but researchers need to refine those structures, prove their accuracy, and simulate their dynamics while retaining all of the information that makes simulati

Read more

Getting Started with CUDA Graphs

AI / Deep Learning, HPC, CUDA, CUDA graph, Featured

Nadeem Mohammad, posted Sep 05 2019

The performance of GPU architectures continue to increase with every new generation. Modern GPUs are so fast that, in many cases of interest, the time taken by each GPU operation (e.g. kernel or memory copy) is now measured in microseconds.

Read more

Researchers at VideoGorillas Use AI to Remaster Archived Content to 4K Resolution and Above

AI / Deep Learning, Graphics / Simulation, CUDA, cuDNN, Machine Learning & Artificial Intelligence

Nadeem Mohammad, posted Aug 23 2019

To meet the growing pace of innovation, one company is developing a new AI-enhanced solution to exceed visual expectations at lower costs.

Read more

Deep Learning Helps UCLA Scientists Identify Cancer Cells in the Blood Instantaneously

AI / Deep Learning, Features, CUDA, cuDNN, Featured, Healthcare & Life Sciences, Machine Learning & Artificial Intelligence, Tesla

Nadeem Mohammad, posted Aug 22 2019

UCLA researchers have just developed a deep learning, GPU-powered device that can detect cancer cells in a few milliseconds, hundreds of times faster than previous methods.

Read more

CUDA Pro Tip: The Fast Way to Query Device Properties

AI / Deep Learning, Data Science, HPC, CUDA, Pro Tip

Nadeem Mohammad, posted Aug 20 2019

CUDA applications often need to know the maximum available shared memory per block or to query the number of multiprocessors in the active GPU. One way to do this is by calling cudaGetDeviceProperties().

Read more

CUDA 10.1 Update 2 Now Available

AI / Deep Learning, HPC, CUDA, Development Tools & Libraries

Nadeem Mohammad, posted Aug 16 2019

CUDA 10.1 Update 2 is now available for download. This version is a compatible update to CUDA 10.1 and includes updates to libraries, developer tools and bug fixes.

Read more

Accelerate Genome Assembly and Analysis with Clara Genomics SDK 0.2

AI / Deep Learning, CUDA, Genomics, Healthcare & Life Sciences, Machine Learning & Artificial Intelligence

Nadeem Mohammad, posted Aug 16 2019

The Clara Genomics SDK has been upgraded with high performance analysis algorithms for long read sequencing and early access to deep learning-based processing of short read ATAC sequencing

Read more

NVIDIA announces Nsight Systems 2019.4

HPC, CUDA, Nsight

Nadeem Mohammad, posted Aug 14 2019

NVIDIA Nsight Systems 2019.4 is now available for download. This release aims to provide a more detailed data collection, exploration, and collection control for all markets ranging from high performance computing to visual effects

Read more

Transitioning to Nsight Systems from NVIDIA Visual Profiler / nvprof

AI / Deep Learning, Data Science, HPC, CUDA, Debugging, machine learning and AI, NSight Systems, nsys, nvprof, Profiling, Software Tools and Libraries, Visual Profiler

Nadeem Mohammad, posted Aug 02 2019

The Nsight suite of profiling tools now supersedes the NVIDIA Visual Profiler (NVVP) and nvprof. Let’s look at what this means for NVIDIA Visual Profiler or nvprof users. Before diving in, let’s first review what is not changing.

Read more