DEVELOPER BLOG

Tag: Python

AI / Deep Learning

Accelerating Blender Python Using CUDA

This post describes two approaches to accelerating matrix multiplication. The first approach uses the Numba compiler to decrease the overhead… 9 MIN READ

Data Science

Accelerating Sequential Python User-Defined Functions with RAPIDS on GPUs for 100X Speedups

Custom “row-by-row” processing logic (sometimes called sequential User-Defined Functions) is prevalent in ETL workflows. The sequential nature of UDFs makes… 3 MIN READ

Data Science

High-Performance Python Communication with UCX-Py

UCX/UCX-Py is an accelerated networking library designed for low-latency, high-bandwidth transfers for both host and GPU device memory objects. 9 MIN READ

Data Science

NLP and Text Processing with RAPIDS: Now Simpler and Faster

In this post, we will showcase performance improvements for string processing across cuDF and cuML, which enables acceleration across diverse text processing… 3 MIN READ

Data Science

Run State of the Art NLP Workloads at Scale with RAPIDS, HuggingFace, and Dask

This post explains how to leverage RAPIDS for feature engineering and string processing, HuggingFace for deep learning inference, and Dask for scaling out for… 6 MIN READ

Data Science

How to Build a Winning Deep Learning Powered Recommender System-Part 3

Recommender systems (RecSys) have become a key component in many online services, such as e-commerce, social media, news services, and online video streaming. 21 MIN READ