DASK
Aug 30, 2022
Accelerating ETL on KubeFlow with RAPIDS
In the machine learning and MLOps world, GPUs are widely used to speed up model training and inference, but what about the other stages of the workflow like ETL...
13 MIN READ
May 05, 2022
Optimizing Access to Parquet Data with fsspec
As datasets continue to grow in size, the adoption of cloud-storage platforms like Amazon S3 and Google Cloud Storage (GCS) are becoming more popular. Although...
12 MIN READ
Oct 21, 2021
Accelerated Portfolio Construction with Numba and Dask in Python
Python is no stranger to data scientists. It ranks as the most popular computer language and is widely used for all kinds of tasks. Though Python is notoriously...
8 MIN READ
Aug 31, 2021
Zero to RAPIDS in Minutes with NVIDIA GPUs + Saturn Cloud
GPU-accelerated computing is a game-changer for data practitioners and enterprises, but leveraging GPUs can be challenging for data professionals. RAPIDS...
9 MIN READ
Jun 17, 2021
Accelerating XGBoost on GPU Clusters with Dask
In XGBoost 1.0, we introduced a new official Dask interface to support efficient distributed training. Fast-forwarding to XGBoost 1.4, the interface is...
11 MIN READ
Mar 18, 2021
Dask Tutorial - Beginner's Guide to Distributed Computing with GPUs in Python
This is the third installment of the series of introductions to the RAPIDS ecosystem. The series explores and discusses various aspects of RAPIDS that allow its...
8 MIN READ
Oct 05, 2020
Making Python Data Science Enterprise-Ready with Dask
At NVIDIA, we are driving change in data science, machine learning, and artificial intelligence. Some of the key trends that drive us are as follows: The rise...
10 MIN READ
Mar 19, 2020
Accelerating Python for Exotic Option Pricing
In finance, computation efficiency can be directly converted to trading profits sometimes. Quants are facing the challenges of trading off research efficiency...
25 MIN READ