After clicking “Watch Now” you will be prompted to login or join.


WATCH NOW



 
Click “Watch Now” to login or join the NVIDIA Developer Program.

WATCH NOW

NVTabular: GPU Accelerated ETL for Recommender Systems

Julio Perez, NVIDIA | Even Oldridge, NVIDIA

GTC 2020

Recommender systems require massive datasets to train, particularly for deep learning based solutions. The transformation of these datasets in order to prepare them for model training is particularly challenging. Often the time taken to do steps such as feature engineering, categorical encoding and normalization of continuous variables exceeds the time it takes to train a model. NVTabular is an open source feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems. It provides a high level abstraction to simplify code, making development faster, and accelerates computation on the GPU using the RAPIDS cuDF library. It is available for download and contributions at http://www.github.com/nvidia/NVTabular. A part of the Merlin Recommenders Framework, it pairs perfectly with HugeCTR to provide a straightforward method to train huge deep learning based recommender systems on GPU.




View More GTC 2020 Content