Recommenders / Personalization

May 28, 2023
Announcing NVIDIA DGX GH200: The First 100 Terabyte GPU Memory System
At COMPUTEX 2023, NVIDIA announced NVIDIA DGX GH200, which marks another breakthrough in GPU-accelerated computing to power the most demanding giant AI...
6 MIN READ

May 15, 2023
Efficiently Scale LLM Training Across a Large GPU Cluster with Alpa and Ray
Recent years have seen a proliferation of large language models (LLMs) that extend beyond traditional language tasks to generative AI. This includes models like...
16 MIN READ

May 04, 2023
Increasing Throughput and Reducing Costs for AI-Based Computer Vision with CV-CUDA
Real-time cloud-scale applications that involve AI-based computer vision are growing rapidly. The use cases include image understanding, content creation,...
11 MIN READ

Apr 19, 2023
Model Parallelism Virtual Workshop
Learn to build and deploy large neural networks to production with this virtual workshop on May 3 from the NVIDIA Deep Learning Institute.
1 MIN READ

Apr 18, 2023
New GPU Library Lowers Compute Costs for Apache Spark ML
Spark MLlib is a key component of Apache Spark for large-scale machine learning and provides built-in implementations of many popular machine learning...
6 MIN READ

Mar 22, 2023
ICYMI: New and Updated AI Workflows Announced at NVIDIA GTC 2023
NVIDIA showed how AI workflows can be leveraged to help you accelerate the development of AI solutions to address a range of use cases at NVIDIA GTC 2023. AI...
7 MIN READ

Mar 21, 2023
Supercharging AI Video and AI Inference Performance with NVIDIA L4 GPUs
NVIDIA T4 was introduced 4 years ago as a universal GPU for use in mainstream servers. T4 GPUs achieved widespread adoption and are now the highest-volume...
10 MIN READ

Mar 21, 2023
Catapulting Enterprises to the Leading Edge of AI with NVIDIA AI Enterprise 3.1
Generative AI has marked an important milestone in the AI revolution journey. We are at a fundamental breaking point where enterprises are not only getting...
4 MIN READ

Mar 15, 2023
End-to-End AI for NVIDIA-Based PCs: NVIDIA TensorRT Deployment
This post is the fifth in a series about optimizing end-to-end AI. NVIDIA TensorRT is a solution for speed-of-light inference deployment on NVIDIA hardware....
10 MIN READ

Mar 06, 2023
Top Deep Learning Sessions at NVIDIA GTC 2023
Explore the latest tools, optimizations, and best practices for deep learning training and inference.
1 MIN READ

Mar 02, 2023
Top Recommender System Sessions at NVIDIA GTC 2023
Get training, insights, and access to experts for the latest in recommender systems.
1 MIN READ

Mar 01, 2023
Offline to Online: Feature Storage for Real-time Recommendation Systems with NVIDIA Merlin
Recommendation models have progressed rapidly in recent years due to advances in deep learning and the use of vector embeddings. The growing complexity of these...
14 MIN READ

Feb 24, 2023
Top Data Science Sessions at NVIDIA GTC 2023
Learn about the latest AI and data science breakthroughs from leading data science teams at NVIDIA GTC 2023.
1 MIN READ

Feb 14, 2023
Top Speech AI Developer Day Sessions at NVIDIA GTC 2023
Explore the latest advances in accurate and customizable automatic speech recognition, multi-language translation, and text-to-speech.
1 MIN READ

Feb 01, 2023
New cuBLAS 12.0 Features and Matrix Multiplication Performance on NVIDIA Hopper GPUs
The NVIDIA H100 Tensor Core GPU, based on the NVIDIA Hopper architecture with the fourth generation of NVIDIA Tensor Cores, recently debuted delivering...
10 MIN READ

Jan 20, 2023
New Hands-on Lab: Build Session-Based Recommenders
Learn how to streamline building state-of-the-art session-based recommender pipelines with this free hands-on lab.
1 MIN READ