Data Analytics / Processing

Jan 29, 2025
Accelerating JSON Processing on Apache Spark with GPUs
JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
9 MIN READ

Jan 16, 2025
Accelerating Time Series Forecasting with RAPIDS cuML
Time series forecasting is a powerful data science technique used to predict future values based on data points from the past Open source Python libraries like...
4 MIN READ

Jan 13, 2025
Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator
In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...
5 MIN READ

Dec 12, 2024
Harnessing GPU Acceleration for Multi-Label Classification with RAPIDS cuML
Modern classification workflows often require classifying individual records and data points into multiple categories instead of just assigning a single label....
4 MIN READ

Dec 05, 2024
Unified Virtual Memory Supercharges pandas with RAPIDS cuDF
cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements—up to 50x...
5 MIN READ

Nov 14, 2024
Faster Causal Inference on Large Datasets with NVIDIA RAPIDS
As consumer applications generate more data than ever before, enterprises are turning to causal inference methods for observational data to help shed light on...
4 MIN READ

Oct 15, 2024
Train Highly Accurate LLMs with the Zyda-2 Open 5T-Token Dataset Processed with NVIDIA NeMo Curator
Open-source datasets have significantly democratized access to high-quality data, lowering the barriers of entry for developers and researchers to train...
5 MIN READ

Oct 04, 2024
Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
1 MIN READ

Sep 17, 2024
Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
4 MIN READ

Aug 30, 2024
Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI
The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...
12 MIN READ

Aug 29, 2024
Just Released: RAPIDS 24.08
RAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.
1 MIN READ

Aug 09, 2024
RAPIDS cuDF Unified Memory Accelerates pandas up to 30x on Large Datasets
NVIDIA has released RAPIDS cuDF unified memory and text data processing features that help data scientists continue to use pandas when working with larger and...
6 MIN READ

Mar 18, 2024
NVIDIA GB200 NVL72 Delivers Trillion-Parameter LLM Training and Real-Time Inference
What is the interest in trillion-parameter models? We know many of the use cases today and interest is growing due to the promise of an increased capacity for:...
9 MIN READ

Dec 15, 2023
Streamline ETL Workflows with Nested Data Types in RAPIDS libcudf
Nested data types are a convenient way to represent hierarchical relationships within columnar data. They are frequently used as part of extract, transform,...
10 MIN READ

Nov 06, 2023
ICYMI: Leveraging the Power of GPUs with CuPy in Python
See how KDNuggets achieved 500x speedup using CuPy and NVIDIA CUDA on 3D arrays.
1 MIN READ

Oct 10, 2023
Event: AI and Data Science Virtual Summit
Meta, NetworkX, Fast.ai, and other industry leaders share how to gain new insights from your data with emerging tools.
1 MIN READ