Intermediate Technical
Dec 06, 2024
Content Moderation and Safety Checks with NVIDIA NeMo Guardrails
Content moderation has become essential in retrieval-augmented generation (RAG) applications powered by generative AI, given the extensive volume of...
10 MIN READ
Dec 05, 2024
Unified Virtual Memory Supercharges pandas with RAPIDS cuDF
cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements—up to 50x...
5 MIN READ
Dec 05, 2024
Optimize GPU Workloads for Graphics Applications with NVIDIA Nsight Graphics
One of the great pastimes of graphics developers and enthusiasts is comparing specifications of GPUs and marveling at the ever-increasing counts of shader...
11 MIN READ
Dec 04, 2024
How AI is Making Climate Modeling Faster, Greener, and More Accurate
Christopher Bretherton, Senior Director of Climate Modeling at the Allen Institute for AI (AI2), highlights how AI is revolutionizing climate science. In this...
2 MIN READ
Dec 03, 2024
Scaling Action Recognition Models with Synthetic Data
Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ
Dec 03, 2024
How to Build a Generative AI-Enabled Synthetic Data Pipeline for Perception AI
Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
6 MIN READ
Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Dec 03, 2024
Automate Early Security Patching in CI Pipelines on AWS Using NVIDIA AI Blueprints
The evolution of modern application development has led to a significant shift toward microservice-based architectures. This approach offers great flexibility...
10 MIN READ
Dec 03, 2024
Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography
In the past decade, quantum computers have progressed significantly and could one day be used to undermine current cybersecurity practices. If run on a quantum...
6 MIN READ
Dec 03, 2024
In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics
Antibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...
6 MIN READ
Dec 02, 2024
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
Dec 02, 2024
Unified Whole-Body Control for Physically Simulated Humanoids
Creating interactive simulated humanoids that move naturally and respond intelligently to diverse control inputs remains one of the most challenging problems in...
7 MIN READ
Nov 28, 2024
Supercharging Deduplication in pandas Using RAPIDS cuDF
A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
12 MIN READ
Nov 22, 2024
Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI
Generative AI is transforming every aspect of the automotive industry, including software development, testing, user experience, personalization, and safety....
8 MIN READ
Nov 22, 2024
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Nov 21, 2024
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ