Algorithms / Numerical Techniques
Mar 14, 2024
Applying Mixture of Experts in LLM Architectures
Mixture of experts (MoE) large language model (LLM) architectures have recently emerged, both in proprietary LLMs such as GPT-4, as well as in community models...
12 MIN READ
Mar 08, 2024
cuTENSOR 2.0: Applications and Performance
While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...
9 MIN READ
Mar 08, 2024
cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations
NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...
17 MIN READ
Oct 10, 2023
Event: AI and Data Science Virtual Summit
Meta, NetworkX, Fast.ai, and other industry leaders share how to gain new insights from your data with emerging tools.
1 MIN READ
Oct 02, 2023
Accelerated Vector Search: Approximating with RAPIDS RAFT IVF-Flat
Performing an exhaustive exact k-nearest neighbor (kNN) search, also known as brute-force search, is expensive, and it doesn’t scale particularly well to...
15 MIN READ
Sep 11, 2023
Accelerating Vector Search: Fine-Tuning GPU Index Algorithms
In this post, we dive deeper into each of the GPU-accelerated indexes mentioned in part 1 and give a brief explanation of how the algorithms work, along with a...
12 MIN READ
Sep 11, 2023
Accelerating Vector Search: Using GPU-Powered Indexes with RAPIDS RAFT
In the AI landscape of 2023, vector search is one of the hottest topics due to its applications in large language models (LLM) and generative AI. Semantic...
11 MIN READ
Aug 04, 2023
ICYMI: Unlocking the Power of GPU-Accelerated DataFrames in Python
Read this tutorial on how to tap into GPUs by importing cuDF instead of pandas–with only a few code changes.
1 MIN READ
Jul 20, 2023
A Comprehensive Guide on Interaction Terms in Time Series Forecasting
Modeling time series data can be challenging (and fascinating) due to its inherent complexity and unpredictability. Long-term trends in time series can change...
8 MIN READ
Jul 19, 2023
Programming the Quantum-Classical Supercomputer
Heterogeneous computing architectures—those that incorporate a variety of processor types working in tandem—have proven extremely valuable in the continued...
9 MIN READ
Jul 11, 2023
Accelerated Data Analytics: Machine Learning with GPU-Accelerated Pandas and Scikit-learn
If you are looking to take your machine learning (ML) projects to new levels of speed and scalability, GPU-accelerated data analytics can help you deliver...
14 MIN READ
Jun 28, 2023
ICYMI: Exploring Challenges Posed by Biased Datasets Using RAPIDS cuDF
Read about an innovative GPU solution that solves limitations using small biased datasets with RAPIDS cuDF.
1 MIN READ
Jun 27, 2023
GPU-Accelerated Single-Cell RNA Analysis with RAPIDS-singlecell
Single-cell sequencing has become one of the most prominent technologies used in biomedical research. Its ability to decipher changes in the transcriptome and...
13 MIN READ
Jun 09, 2023
Recreate High-Fidelity Digital Twins with Neural Kernel Surface Reconstruction
Reconstructing a smooth surface from a point cloud is a fundamental step in creating digital twins of real-world objects and scenes. Algorithms for surface...
5 MIN READ
May 19, 2023
Limit Order Book Dataset Generation for Accelerated Short-Term Price Prediction with RAPIDS
In the high-frequency trading world, thousands of market participants interact daily. In fact, high-frequency trading accounts for more than half of the US...
9 MIN READ
May 18, 2023
QHack Results Highlight Quantum Computing Applications and Tools on GPUs
QHack is an educational conference and the world’s largest quantum machine learning (QML) hackathon. This year at QHack 2023, 2,850 individuals from 105...
9 MIN READ