cuDF

Feb 06, 2025
Get Started with GPU Acceleration for Data Science
In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...
8 MIN READ

Jan 29, 2025
Accelerating JSON Processing on Apache Spark with GPUs
JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
9 MIN READ

Jan 13, 2025
Upcoming Webinar: Inside the RAPIDS-Accelerated Polars GPU Engine
In the webinar on January 28th, you'll get an inside look of the new GPU engine to learn how Polars' declarative API and query optimizer enable seamless GPU...
1 MIN READ

Dec 19, 2024
Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models
Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...
11 MIN READ

Dec 19, 2024
RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs
RAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...
8 MIN READ

Dec 05, 2024
Unified Virtual Memory Supercharges pandas with RAPIDS cuDF
cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements—up to 50x...
5 MIN READ

Nov 28, 2024
Supercharging Deduplication in pandas Using RAPIDS cuDF
A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
12 MIN READ

Nov 21, 2024
Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask
As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth—multi-gpu training and analysis...
5 MIN READ

Oct 08, 2024
NVIDIA CUDA-X Now Accelerates the Polars Data Processing Library
Polars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently...
3 MIN READ

Sep 17, 2024
Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
4 MIN READ

Sep 11, 2024
Scaling Up to One Billion Rows of Data in pandas using RAPIDS cuDF
The One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...
11 MIN READ

Sep 04, 2024
Hands-On Training at NVIDIA AI Summit in Washington, DC
Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.
1 MIN READ

Aug 30, 2024
Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI
The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...
12 MIN READ

Aug 29, 2024
Just Released: RAPIDS 24.08
RAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.
1 MIN READ

Aug 21, 2024
Build Efficient Recommender Systems with Co-Visitation Matrices and RAPIDS cuDF
Recommender systems play a crucial role in personalizing user experiences across various platforms. These systems are designed to predict and suggest items that...
12 MIN READ

Aug 09, 2024
RAPIDS cuDF Unified Memory Accelerates pandas up to 30x on Large Datasets
NVIDIA has released RAPIDS cuDF unified memory and text data processing features that help data scientists continue to use pandas when working with larger and...
6 MIN READ