Improving GPU Performance by Reducing Instruction Cache Misses
Instruction cache misses can degrade performance for kernels with a large instruction footprint, which often results from substantial loop unrolling.
Azure recently announced support for NVIDIA’s T4 Tensor Core Graphics Processing Units (GPUs), which are optimized for deploying machine learning inference and analytics workloads cost-effectively. With Apache Spark™ deployments tuned for NVIDIA GPUs, plus pre-installed libraries, Azure Synapse Analytics offers a simple way to leverage GPUs to power a variety of data …
Using NVIDIA pretrained models and the Jetson edge AI platform, a computer vision innovator accelerates game-changing traffic management in Denver.
This guide walks through how to train cuML models on multi-node, multi-GPU (MNMG) clusters managed by Google’s Kubernetes Engine (GKE) platform.
Expansion comes with today’s public beta of NVIDIA T4 GPUs on Google Cloud Platform. With its public beta launch of NVIDIA Tesla T4 GPUs across eight regions worldwide, Google Cloud announced the broadest availability yet of NVIDIA GPUs on Google Cloud Platform. Starting today, NVIDIA T4 GPU instances are available in public beta on GCP in …
Magnum IO is the collection of IO technologies from NVIDIA and Mellanox that make up the IO subsystem of the modern data center and enable applications at scale. If you are trying to scale your application up to multiple GPUs, or scale it out across multiple nodes, you are probably using some of the libraries …
The Society of Motion Picture and Television Engineers (SMPTE) named NVIDIA employee Thomas Kernen as one of their new Fellows for 2020.
At GTC 2019 in Silicon Valley, NVIDIA engineers will present a proof of concept designed to help hardware, systems, applications, and framework developers accelerate their work.
NVIDIA GTC21 featured many great and engaging sessions, especially around RAPIDS, so it would have been easy to miss our debut presentation, “Using RAPIDS to Accelerate Node.js JavaScript for Visualization and Beyond.” Yep – we are bringing the power of GPU-accelerated data science to the JavaScript Node.js community with the Node-RAPIDS project. Node-RAPIDS is an …
In this post, we demonstrate the benefits of running multiple simulations per GPU for GROMACS.
The recent Taiwan Computing Cloud GPU Hackathon helped 12 teams advance their HPC and AI projects, using innovative technologies to address pressing global challenges.
Use the high-level nvCOMP API for easy compression and decompression and the low-level API for more advanced workflows.
TensorRT 8.2 optimizes HuggingFace T5 and GPT-2 models. You can build real-time translation, summarization, and other online NLP apps.
To help accelerate the development and testing of new deep reinforcement learning algorithms, NVIDIA researchers have just published a new research paper and corresponding code that introduces an open source CUDA-based Learning Environment (CuLE) for Atari 2600 games.
During a large earthquake, energy rips through the ground in the form of seismic waves that can cause serious harm in densely populated areas. The effects of earthquakes can be difficult to predict, and even the best modeling and simulation techniques to date have been unable to capture some of these earthquakes’ more complex characteristics. To …