Recommenders / Personalization

Nov 28, 2023
One Giant Superchip for LLMs, Recommenders, and GNNs: Introducing NVIDIA GH200 NVL32
At AWS re:Invent 2023, AWS and NVIDIA announced that AWS will be the first cloud provider to offer NVIDIA GH200 Grace Hopper Superchips interconnected with...
9 MIN READ

Nov 09, 2023
Accelerating Neurosymbolic AI with RAPIDS and Prometheux Vadalog Parallel
As the scale of available data continues to grow, so does the need for scalable and intelligent data processing systems to swiftly harness useful knowledge....
11 MIN READ

Oct 17, 2023
Unlock Faster Image Generation in Stable Diffusion Web UI with NVIDIA TensorRT
Stable Diffusion is an open-source generative AI image-based model that enables users to generate images with simple text descriptions. Gaining traction among...
4 MIN READ

Oct 13, 2023
Supercharge Graph Analytics at Scale with GPU-CPU Fusion for 100x Performance
Graphs form the foundation of many modern data and analytics capabilities to find relationships between people, places, things, events, and locations across...
11 MIN READ

Oct 02, 2023
Accelerated Vector Search: Approximating with RAPIDS RAFT IVF-Flat
Performing an exhaustive exact k-nearest neighbor (kNN) search, also known as brute-force search, is expensive, and it doesn’t scale particularly well to...
15 MIN READ

Sep 12, 2023
Event: RecSys at Work: Best Practices and Insights
On Sept. 27, join us to learn recommender systems best practices for building, training, and deploying at any scale.
1 MIN READ

Sep 11, 2023
Accelerating Vector Search: Fine-Tuning GPU Index Algorithms
In this post, we dive deeper into each of the GPU-accelerated indexes mentioned in part 1 and give a brief explanation of how the algorithms work, along with a...
12 MIN READ

Sep 11, 2023
Accelerating Vector Search: Using GPU-Powered Indexes with RAPIDS RAFT
In the AI landscape of 2023, vector search is one of the hottest topics due to its applications in large language models (LLM) and generative AI. Semantic...
11 MIN READ

Sep 09, 2023
NVIDIA TensorRT-LLM Supercharges Large Language Model Inference on NVIDIA H100 GPUs
Large language models (LLMs) offer incredible new capabilities, expanding the frontier of what is possible with AI. However, their large size and unique...
9 MIN READ

Sep 07, 2023
Ask Me Anything: Winning Formula for the Best Multilingual Recommender Systems
On Sept. 13, connect with the winning multilingual recommender systems Kaggle Grandmaster team of KDD’23.
1 MIN READ

Aug 18, 2023
Take a Free NVIDIA Technical Training Course
Join the free NVIDIA Developer Program and enroll in a course from the NVIDIA Deep Learning Institute.
1 MIN READ

Aug 10, 2023
Pro Tips for Building Multilingual Recommender Systems
Picture this: You're browsing through an online store, looking for the perfect pair of running shoes. But with thousands of options available, where do you even...
12 MIN READ

Jul 03, 2023
Structured Sparsity in the NVIDIA Ampere Architecture and Applications in Search Engines
Deep learning is achieving significant success in various fields and areas, as it has revolutionized the way we analyze, understand, and manipulate data. There...
13 MIN READ

Jun 27, 2023
Breaking MLPerf Training Records with NVIDIA H100 GPUs
At the heart of the rapidly expanding set of AI-powered applications are powerful AI models. Before these models can be deployed, they must be trained through a...
15 MIN READ

May 28, 2023
Announcing NVIDIA DGX GH200: The First 100 Terabyte GPU Memory System
At COMPUTEX 2023, NVIDIA announced the NVIDIA DGX GH200, which marks another breakthrough in GPU-accelerated computing to power the most demanding giant AI...
6 MIN READ

May 15, 2023
Efficiently Scale LLM Training Across a Large GPU Cluster with Alpa and Ray
Recent years have seen a proliferation of large language models (LLMs) that extend beyond traditional language tasks to generative AI. This includes models like...
16 MIN READ