GTC 21: Top 5 Data Science Technical Sessions

Join thousands of other practitioners, leaders, and innovators to learn data science from the world’s most advanced data teams.

The following are some highlighted data science sessions planned for GTC:

1. GPU-Accelerated Model Evaluation: How we took our offline evaluation process from hours to minutes with RAPIDS

In this session, we’ll describe how we used cuDF and Dask-cuDF to build an interactive model evaluation system that drastically reduced the time it took to evaluate our recommender systems in an offline setting. As a result, model evaluations that previously took hours to complete as CPU workloads now run in minutes, allowing us to increase our overall iteration speed and thus build better models.

Joseph Cauteruccio, Machine Learning Engineer, Spotify
Marc Romeyn, Machine Learning Engineer, Spotify

2. Accelerated ETL, Training and Inference of Recommender Systems on the GPU with Merlin, HugeCTR, NVTabular, and Triton

In this talk, we’ll share the Merlin framework, consisting of NVTabular for ETL, HugeCTR for training, and Triton for inference serving. Merlin accelerates recommender systems on GPU, speeding up common ETL tasks, training of models, and inference serving by ~10x over commonly used methods. Beyond providing better performance, these libraries are also designed to be easy to use and integrate with existing recommendation pipelines.

Speaker: Even Oldridge, Senior Manager, Recommender Systems Framework Team, NVIDIA

3. How Walmart improves computationally intensive business processes with NVIDIA GPU Computing

Over the last several years, Walmart has been developing and implementing a wide range of applications that require GPU computing to be computationally feasible at Walmart scale. We will present CPU vs. GPU performance comparisons on a number of real-world problems from different areas of the business, highlighting not just the performance gains from GPU computing but also the capabilities it has enabled that would simply not be possible on CPU-only architectures.

Richard Ulrich, Senior Director, Walmart
John Bowman, Director, Data Science, Walmart

4. How Cloudera Data Platform uses a single pane of glass to deploy GPU-accelerated applications across hybrid and multi-clouds

Learn how Cloudera Data Platform uses a single pane of glass to deploy GPU-accelerated applications across hybrid and multi-clouds.

Karthikeyan Rajendran, Product Manager, NVIDIA
Scott McClellan, General Manager of Data Science, NVIDIA

5. GPU-Accelerated, High-Performance Machine Learning Pipeline

The Adobe team is currently working with NVIDIA to build an unprecedented GPU-based, high-performance machine learning pipeline.

Speaker: Lei Zhang, Senior Machine Learning Engineer, Adobe

Visit the GTC website to register for GTC (free) and to learn more about our Data Science track.