Data Analytics / Processing
Dec 05, 2024
Unified Virtual Memory Supercharges pandas with RAPIDS cuDF
cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements—up to 50x...
5 MIN READ
Nov 14, 2024
Faster Causal Inference on Large Datasets with NVIDIA RAPIDS
As consumer applications generate more data than ever before, enterprises are turning to causal inference methods for observational data to help shed light on...
4 MIN READ
Oct 15, 2024
Train Highly Accurate LLMs with the Zyda-2 Open 5T-Token Dataset Processed with NVIDIA NeMo Curator
Open-source datasets have significantly democratized access to high-quality data, lowering the barriers of entry for developers and researchers to train...
5 MIN READ
Oct 04, 2024
Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
1 MIN READ
Sep 17, 2024
Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
4 MIN READ
Aug 30, 2024
Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI
The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...
12 MIN READ
Aug 29, 2024
Just Released: RAPIDS 24.08
RAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.
1 MIN READ
Aug 09, 2024
RAPIDS cuDF Unified Memory Accelerates pandas up to 30x on Large Datasets
NVIDIA has released RAPIDS cuDF unified memory and text data processing features that help data scientists continue to use pandas when working with larger and...
6 MIN READ
Mar 18, 2024
NVIDIA GB200 NVL72 Delivers Trillion-Parameter LLM Training and Real-Time Inference
What is the interest in trillion-parameter models? We know many of the use cases today and interest is growing due to the promise of an increased capacity for:...
9 MIN READ
Dec 15, 2023
Streamline ETL Workflows with Nested Data Types in RAPIDS libcudf
Nested data types are a convenient way to represent hierarchical relationships within columnar data. They are frequently used as part of extract, transform,...
10 MIN READ
Nov 06, 2023
ICYMI: Leveraging the Power of GPUs with CuPy in Python
See how KDNuggets achieved 500x speedup using CuPy and NVIDIA CUDA on 3D arrays.
1 MIN READ
Oct 10, 2023
Event: AI and Data Science Virtual Summit
Meta, NetworkX, Fast.ai, and other industry leaders share how to gain new insights from your data with emerging tools.
1 MIN READ
Sep 06, 2023
GPUs for ETL? Optimizing ETL Architecture for Apache Spark SQL Operations
Extract-transform-load (ETL) operations with GPUs using the NVIDIA RAPIDS Accelerator for Apache Spark running on large-scale data can produce both cost savings...
8 MIN READ
Jul 17, 2023
New Video: Visualizing Census Data with RAPIDS cuDF and Plotly Dash
Gathering business insights can be a pain, especially when you're dealing with countless data points. It’s no secret that GPUs can be a time-saver for...
2 MIN READ
Jul 17, 2023
GPUs for ETL? Run Faster, Less Costly Workloads with NVIDIA RAPIDS Accelerator for Apache Spark and Databricks
We were stuck. Really stuck. With a hard delivery deadline looming, our team needed to figure out how to process a complex extract-transform-load (ETL) job on...
7 MIN READ
Jul 13, 2023
Whole Slide Image Analysis in Real Time with MONAI and RAPIDS
Digital pathology slide scanners generate massive images. Glass slides are routinely scanned at 40x magnification, resulting in gigapixel images. Compression...
11 MIN READ