RAPIDS

Apr 10, 2025
Efficiently Scaling Polars GPU Parquet Reader
When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...
4 MIN READ

Apr 03, 2025
Accelerating Apache Parquet Scans on Apache Spark with GPUs
As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...
8 MIN READ

Mar 11, 2025
Efficient ETL with Polars and Apache Spark on NVIDIA Grace CPU
The NVIDIA Grace CPU Superchip delivers outstanding performance and best-in-class energy efficiency for CPU workloads in the data center and in the cloud. The...
7 MIN READ

Mar 06, 2025
Accelerate Apache Spark ML on NVIDIA GPUs with Zero Code Change
The NVIDIA RAPIDS Accelerator for Apache Spark software plug-in pioneered a zero code change user experience (UX) for GPU-accelerated data processing. It...
5 MIN READ

Mar 04, 2025
GPU-Accelerate Algorithmic Trading Simulations by over 100x with Numba
Quantitative developers need to run back-testing simulations to see how financial algorithms perform from a profit and loss (P&L) standpoint. Statistical...
12 MIN READ

Feb 27, 2025
High-Performance Remote IO With NVIDIA KvikIO
Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
9 MIN READ

Feb 20, 2025
JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF
JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models...
10 MIN READ

Feb 13, 2025
Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie
As the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....
9 MIN READ

Feb 06, 2025
Get Started with GPU Acceleration for Data Science
In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...
8 MIN READ

Feb 05, 2025
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ

Jan 30, 2025
Mastering the cudf.pandas Profiler for GPU Acceleration
In the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...
6 MIN READ

Jan 29, 2025
Accelerating JSON Processing on Apache Spark with GPUs
JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
9 MIN READ

Jan 16, 2025
Accelerating Time Series Forecasting with RAPIDS cuML
Time series forecasting is a powerful data science technique used to predict future values based on data points from the past Open source Python libraries like...
4 MIN READ

Jan 13, 2025
Upcoming Webinar: Inside the RAPIDS-Accelerated Polars GPU Engine
In the webinar on January 28th, you'll get an inside look of the new GPU engine to learn how Polars' declarative API and query optimizer enable seamless GPU...
1 MIN READ

Dec 20, 2024
Accelerating GPU Analytics Using RAPIDS and Ray
RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...
4 MIN READ

Dec 20, 2024
NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows
Approximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...
8 MIN READ