RAPIDS

Sep 25, 2025
How to GPU-Accelerate Model Training with CUDA-X Data Science
In previous posts on AI in manufacturing and operations, we covered the unique data challenges in the supply chain and how smart feature engineering can...
8 MIN READ

Sep 23, 2025
How to Accelerate Community Detection in Python Using GPU-Powered Leiden
Community detection algorithms play an important role in understanding data by identifying hidden groups of related entities in networks. Social network...
9 MIN READ

Sep 18, 2025
The Kaggle Grandmasters Playbook: 7 Battle-Tested Modeling Techniques for Tabular Data
Over hundreds of Kaggle competitions, we've refined a playbook that consistently lands us near the top of the leaderboard—no matter if we’re working with...
13 MIN READ

Sep 17, 2025
NVIDIA RAPIDS 25.08 Adds New Profiler for cuML, Updates to the Polars GPU Engine, Additional Algorithm Support, and More
The 25.08 release of RAPIDS continues to push the boundaries toward making accelerated data science more accessible and scalable with the addition of several...
9 MIN READ

Aug 22, 2025
How to Spot (and Fix) 5 Common Performance Bottlenecks in pandas Workflows
Slow data loads, memory-intensive joins, and long-running operations—these are problems every Python practitioner has faced. They waste valuable time and make...
7 MIN READ

Aug 22, 2025
NVIDIA Hardware Innovations and Open Source Contributions Are Shaping AI
Open source AI models such as Cosmos, DeepSeek, Gemma, GPT-OSS, Llama, Nemotron, Phi, Qwen, and many more are the foundation of AI innovation. These models are...
8 MIN READ

Aug 07, 2025
Efficient Transforms in cuDF Using JIT Compilation
RAPIDS cuDF offers a broad set of ETL algorithms for processing data with GPUs. For pandas users, cuDF accelerated algorithms are available with the zero code...
9 MIN READ

Aug 07, 2025
Train with Terabyte-Scale Datasets on a Single NVIDIA Grace Hopper Superchip Using XGBoost 3.0
Gradient-boosted decision trees (GBDTs) power everything from real-time fraud filters to petabyte-scale demand forecasts. XGBoost open source library has long...
7 MIN READ

Aug 01, 2025
7 Drop-In Replacements to Instantly Speed Up Your Python Data Science Workflows
You've been there. You wrote the perfect Python script, tested it on a sample CSV, and everything worked flawlessly. But when you unleashed it on the full 10...
8 MIN READ

Jul 23, 2025
Serverless Distributed Data Processing with Apache Spark and NVIDIA AI on Azure
The process of converting vast libraries of text into numerical representations known as embeddings is essential for generative AI. Various technologies—from...
9 MIN READ

Jul 18, 2025
3 pandas Workflows That Slowed to a Crawl on Large Datasets—Until We Turned on GPUs
If you work with pandas, you’ve probably hit the wall. It’s that moment when your trusty workflow, so elegant on smaller datasets, grinds to a halt on a...
4 MIN READ

Jul 17, 2025
Feature Engineering at Scale: Optimizing ML Models in Semiconductor Manufacturing with NVIDIA CUDA‑X Data Science
In our previous post, we introduced the setup of predictive modeling in chip manufacturing and operations, highlighting common challenges such as imbalanced...
6 MIN READ

Jul 03, 2025
RAPIDS Adds GPU Polars Streaming, a Unified GNN API, and Zero-Code ML Speedups
RAPIDS, a suite of NVIDIA CUDA-X libraries for Python data science, released version 25.06, introducing exciting new features. These include a Polars GPU...
6 MIN READ

Jun 27, 2025
How to Work with Data Exceeding VRAM in the Polars GPU Engine
In high-stakes fields such as quant finance, algorithmic trading, and fraud detection, data practitioners frequently need to process hundreds of gigabytes (GB)...
4 MIN READ

Jun 18, 2025
AI in Manufacturing and Operations at NVIDIA: Accelerating ML Models with NVIDIA CUDA-X Data Science
NVIDIA leverages data science and machine learning to optimize chip manufacturing and operations workflows—from wafer fabrication and circuit probing to...
8 MIN READ

Jun 12, 2025
Driving Toward Billion-Cell Analysis and Biological Breakthroughs with RAPIDS-singlecell
The future of cell biology and virtual cell models is dependent on measuring and analyzing data at scale. Single-cell experiments have been growing at an...
7 MIN READ