DEVELOPER BLOG

Tag: Apache Spark

Data Science

An End-to-End Blueprint for Accelerating Customer Churn Modeling and Prediction-Part 1

If you want to solve a particular kind of business problem with machine learning, you’ll likely have no trouble finding a tutorial showing you how to extract… 9 MIN READ

AI / Deep Learning

Accelerating Deep Learning with Apache Spark and NVIDIA GPUs on AWS

With the growing interest in deep learning (DL), more and more users are using DL in production environments. Because DL requires intensive computational power… 7 MIN READ

Data Science

Making Apache Spark More Concurrent

Apache Spark provides capabilities to program entire clusters with implicit data parallelism. With Spark 3.0 and the open source RAPIDS Accelerator for Spark… 7 MIN READ

Data Science

Monitoring High-Performance Machine Learning Models with RAPIDS and whylogs

Machine learning (ML) data is big and messy. Organizations have increasingly adopted RAPIDS and cuML to help their teams run experiments faster and achieve… 7 MIN READ

Data Science

Accelerating Spark 3.0 and XGBoost End-to-End Training and Hyperparameter Tuning

At GTC Spring 2020, Adobe, Verizon Media, and Uber each discussed how they used Spark 3.0 with GPUs to accelerate and scale ML big data pre-processing, training… 17 MIN READ

AI / Deep Learning

Optimizing and Improving Spark 3.0 Performance with GPUs

Apache Spark continued the effort to analyze big data that Apache Hadoop started over 15 years ago and has become the leading framework for large-scale… 11 MIN READ