Posts by Ayush Dattagupta
Data Science
Dec 20, 2024
Accelerating GPU Analytics Using RAPIDS and Ray
RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...
4 MIN READ
Conversational AI
Aug 08, 2023
Curating Trillion-Token Datasets: Introducing NVIDIA NeMo Data Curator
The latest developments in large language model (LLM) scaling laws have shown that when scaling the number of model parameters, the number of tokens used for...
8 MIN READ
Data Science
Jun 07, 2021
Accelerating Sequential Python User-Defined Functions with RAPIDS on GPUs for 100X Speedups
Motivation Custom “row-by-row” processing logic (sometimes called sequential User-Defined Functions) is prevalent in ETL workflows. The sequential nature of...
3 MIN READ