H100
Dec 20, 2024
Taking Computational Fluid Dynamics to the Next Level with the NVIDIA H200 Tensor Core GPU
Computational fluid dynamics (CFD) is used in industry and academia to address a wide range of use cases, including external aerodynamics, internal flows, heat...
5 MIN READ
Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
Dec 05, 2024
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ
Dec 03, 2024
Introducing NVIDIA cuPQC for GPU-Accelerated Post-Quantum Cryptography
In the past decade, quantum computers have progressed significantly and could one day be used to undermine current cybersecurity practices. If run on a quantum...
6 MIN READ
Nov 14, 2024
Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing
Confidential and self-sovereign AI is a new approach to AI development, training, and inference where the user’s data is decentralized, private, and...
15 MIN READ
Nov 14, 2024
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
Nov 08, 2024
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ
Nov 04, 2024
Discover New Biological Insights with Accelerated Pangenome Alignment in NVIDIA Parabricks
NVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...
8 MIN READ
Oct 16, 2024
Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs
As the demand for high-performance computing (HPC) and AI applications grows, so does the importance of energy efficiency. NVIDIA Principal Developer Technology...
2 MIN READ
Oct 08, 2024
Mistral-NeMo-Minitron 8B Model Delivers Unparalleled Accuracy
This post was originally published August 21, 2024 but has been revised with current data. Recently, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading...
7 MIN READ
Oct 02, 2024
AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases
A groundbreaking drug-repurposing AI model could bring new hope to doctors and patients trying to treat diseases with limited or no existing treatment options....
3 MIN READ
Sep 25, 2024
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
6 MIN READ
Sep 24, 2024
NVIDIA GH200 Grace Hopper Superchip Delivers Outstanding Performance in MLPerf Inference v4.1
In the latest round of MLPerf Inference – a suite of standardized, peer-reviewed inference benchmarks – the NVIDIA platform delivered outstanding...
7 MIN READ
Aug 28, 2024
NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1
Large language model (LLM) inference is a full-stack challenge. Powerful GPUs, high-bandwidth GPU-to-GPU interconnects, efficient acceleration libraries, and a...
13 MIN READ
Aug 22, 2024
Jamba 1.5 LLMs Leverage Hybrid Architecture to Deliver Superior Reasoning and Long Context Handling
AI21 Labs has unveiled their latest and most advanced Jamba 1.5 model family, a cutting-edge collection of large language models (LLMs) designed to excel in a...
4 MIN READ
Jul 12, 2024
Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU
Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...
4 MIN READ