Sep 18, 2024
Quickly Voice Your Apps with NVIDIA NIM Microservices for Speech and Translation
NVIDIA NIM, part of NVIDIA AI Enterprise, provides containers to self-host GPU-accelerated inferencing microservices for pretrained and customized AI models...
11 MIN READ
Sep 18, 2024
NVIDIA Presents AI Security Expertise at Leading Cybersecurity Conferences
Each August, tens of thousands of security professionals attend the cutting-edge security conferences Black Hat USA and DEF CON. This year, NVIDIA AI security...
9 MIN READ
Sep 17, 2024
Accelerating Oracle Database Generative AI Workloads with NVIDIA NIM and NVIDIA cuVS
The vast majority of the world's data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI...
6 MIN READ
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Sep 16, 2024
Orchestrating Innovation at Scale with NVIDIA Maxine and Texel
The NVIDIA Maxine AI developer platform is a suite of NVIDIA NIM microservices, cloud-accelerated microservices, and SDKs that offer state-of-the-art features...
5 MIN READ
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Sep 11, 2024
Spotlight: xpander AI Equips NVIDIA NIM Applications with Agentic Tools
Equipping agentic AI applications with tools will usher in the next phase of AI. By enabling autonomous agents and other AI applications to fetch real-time...
11 MIN READ
Sep 11, 2024
Processing One Billion Rows of Data with RAPIDS cuDF pandas Accelerator Mode
The One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...
11 MIN READ
Sep 11, 2024
Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA
Stephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...
2 MIN READ
Sep 11, 2024
Constant Time Launch for Straight-Line CUDA Graphs and Other Performance Enhancements
CUDA Graphs are a way to define and batch GPU operations as a graph rather than a sequence of stream launches. A CUDA Graph groups a set of CUDA kernels and...
8 MIN READ
Sep 10, 2024
Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator
Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
16 MIN READ
Sep 10, 2024
Accelerating the HPCG Benchmark with NVIDIA Math Sparse Libraries
In the realm of high-performance computing (HPC), NVIDIA has continually advanced HPC by offering its highly optimized NVIDIA High-Performance Conjugate...
9 MIN READ
Sep 10, 2024
Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer
As large language models (LLMs) are becoming even bigger, it is increasingly important to provide easy-to-use and efficient deployment paths because the cost of...
10 MIN READ
Sep 09, 2024
Spotlight: Shell Accelerates CO2 Storage Modeling 100,000x Using NVIDIA Modulus
As the world faces the urgent need to combat climate change, carbon capture and storage (CCS) has emerged as a crucial technology for achieving net-zero...
7 MIN READ
Sep 06, 2024
Enhancing Application Portability and Compatibility across New Platforms Using NVIDIA Magnum IO NVSHMEM 3.0
NVSHMEM is a parallel programming interface that provides efficient and scalable communication for NVIDIA GPU clusters. Part of NVIDIA Magnum IO and based on...
7 MIN READ