Technical Blog
Tag: DASK
Subscribe
Technical Walkthrough
Aug 30, 2022
Accelerating ETL on KubeFlow with RAPIDS
In the machine learning and MLOps world, GPUs are widely used to speed up model training and inference, but what about the other stages of the workflow like ETL...
13 MIN READ
Technical Walkthrough
May 05, 2022
Optimizing Access to Parquet Data with fsspec
As datasets continue to grow in size, the adoption of cloud-storage platforms like Amazon S3 and Google Cloud Storage (GCS) are becoming more popular. Although...
12 MIN READ
Technical Walkthrough
Oct 21, 2021
Accelerated Portfolio Construction with Numba and Dask in Python
Python is no stranger to data scientists. It ranks as the most popular computer language and is widely used for all kinds of tasks. Though Python is notoriously...
8 MIN READ
Technical Walkthrough
Aug 31, 2021
Zero to RAPIDS in Minutes with NVIDIA GPUs + Saturn Cloud
GPU-accelerated computing is a game-changer for data practitioners and enterprises, but leveraging GPUs can be challenging for data professionals. RAPIDS...
9 MIN READ
Technical Walkthrough
Jun 17, 2021
Accelerating XGBoost on GPU Clusters with Dask
In XGBoost 1.0, we introduced a new official Dask interface to support efficient distributed training. Fast-forwarding to XGBoost 1.4, the interface is...
11 MIN READ
Technical Walkthrough
Mar 18, 2021
Dask Tutorial - Beginner's Guide to Distributed Computing with GPUs in Python
This is the third installment of the series of introductions to the RAPIDS ecosystem. The series explores and discusses various aspects of RAPIDS that allow its...
8 MIN READ