H100

Apr 16, 2025
AI-Generated Heat Maps Keep Seniors and their Privacy Safe
By 2030, more than one in five Americans will be 65 or older, becoming the United States’ largest group of seniors ever. Silicon Valley-based startup Butlr...
4 MIN READ

Apr 02, 2025
NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0
The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...
9 MIN READ

Mar 19, 2025
NVIDIA Blackwell Ultra for the Era of AI Reasoning
For years, advancements in AI have followed a clear trajectory through pretraining scaling: larger models, more data, and greater computational resources lead...
5 MIN READ

Mar 03, 2025
AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale
In an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world’s...
5 MIN READ

Feb 25, 2025
NVIDIA cuDSS Advances Solver Technologies for Engineering and Scientific Computing
NVIDIA cuDSS is a first-generation sparse direct solver library designed to accelerate engineering and scientific computing. cuDSS is increasingly adopted in...
12 MIN READ

Feb 14, 2025
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ

Feb 04, 2025
New AI Model Offers Cellular-Level View of Cancerous Tumors
Researchers studying cancer unveiled a new AI model that provides cellular-level mapping and visualizations of cancer cells, which scientists hope can shed...
3 MIN READ

Jan 29, 2025
Advancing Rare Disease Detection with AI-Powered Cellular Profiling
Rare diseases are difficult to diagnose due to limitations in traditional genomic sequencing. Wolfgang Pernice, assistant professor at Columbia University, is...
3 MIN READ

Jan 24, 2025
Dynamic Memory Compression
Despite the success of large language models (LLMs) as general-purpose AI tools, their high demand for computational resources make their deployment challenging...
9 MIN READ

Dec 20, 2024
Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU
Computational fluid dynamics (CFD) is used in industry and academia to address a wide range of use cases, including external aerodynamics, internal flows, heat...
5 MIN READ

Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ

Dec 05, 2024
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ

Dec 03, 2024
Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography
In the past decade, quantum computers have progressed significantly and could one day be used to undermine current cybersecurity practices. If run on a quantum...
6 MIN READ

Nov 14, 2024
Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing
Confidential and self-sovereign AI is a new approach to AI development, training, and inference where the user’s data is decentralized, private, and...
15 MIN READ

Nov 14, 2024
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ

Nov 08, 2024
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ