Technical Walkthrough

Accelerating Embedding with the HugeCTR TensorFlow Embedding Plugin

The NVIDIA Merlin recommendation system framework introduces an optimized embedding implementation that is up to 8x more performant and is available as a… 12 MIN READ
Technical Walkthrough

Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 2

In part 1 of this series, we introduced new API functions that enable memory allocation and deallocation to be stream-ordered operations. In this post… 9 MIN READ
Technical Walkthrough

Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1

This post introduces new API functions that enable memory allocation and deallocation to be stream-ordered operations. 14 MIN READ
Technical Walkthrough

Speeding Up Deep Learning Inference Using TensorFlow, ONNX, and NVIDIA TensorRT

This post was updated July 20, 2021 to reflect NVIDIA TensorRT 8.0 updates. In this post, you learn how to deploy TensorFlow trained deep learning models using… 15 MIN READ
News

Transforming Brain Waves into Words with AI

New research out of the University of California, San Francisco has given a paralyzed man the ability to communicate by translating his brain signals into… 3 MIN READ
Technical Walkthrough

Accelerating the Wide & Deep Model Workflow from 25 Hours to 10 Minutes Using NVIDIA GPUs

In this post, we detail the new TensorFlow 2 implementation of the Wide & Deep model that was recently added to the NVIDIA Deep Learning Examples repository. 14 MIN READ