InfiniBand
Nov 21, 2024
Advancing Ansys Workloads with NVIDIA Grace and NVIDIA Grace Hopper
Accelerated computing is enabling giant leaps in performance and energy efficiency compared to traditional CPU computing. Delivering these advancements requires...
10 MIN READ
Nov 13, 2024
NVIDIA Blackwell Doubles LLM Training Performance in MLPerf Training v4.1
As models grow larger and are trained on more data, they become more capable, making them more useful. To train these models quickly, more performance,...
8 MIN READ
Oct 25, 2024
Advancing Performance with NVIDIA SHARP In-Network Computing
AI and scientific computing applications are great examples of distributed computing problems. The problems are too large and the computations too intensive to...
7 MIN READ
Oct 15, 2024
Powering Next-Generation AI Networking with NVIDIA SuperNICs
In the era of generative AI, accelerated networking is essential to build high-performance computing fabrics for massively distributed AI workloads. NVIDIA...
6 MIN READ
Sep 06, 2024
Enhancing Application Portability and Compatibility across New Platforms Using NVIDIA Magnum IO NVSHMEM 3.0
NVSHMEM is a parallel programming interface that provides efficient and scalable communication for NVIDIA GPU clusters. Part of NVIDIA Magnum IO and based on...
7 MIN READ
Jan 23, 2024
Simplifying Network Operations for AI with NVIDIA Quantum InfiniBand
A common technological misconception is that performance and complexity are directly linked. That is, the highest-performance implementation is also the most...
4 MIN READ
Nov 14, 2023
Energy Efficiency in High-Performance Computing: Balancing Speed and Sustainability
The world of computing is on the precipice of a seismic shift. The demand for computing power, particularly in high-performance computing (HPC), is...
17 MIN READ
Nov 08, 2023
Setting New Records at Data Center Scale Using NVIDIA H100 GPUs and NVIDIA Quantum-2 InfiniBand
Generative AI is rapidly transforming computing, unlocking new use cases and turbocharging existing ones. Large language models (LLMs), such as OpenAI’s GPT...
19 MIN READ
Oct 12, 2023
Networking for Data Centers and the Era of AI
Traditional cloud data centers have served as the bedrock of computing infrastructure for over a decade, catering to a diverse range of users and applications....
6 MIN READ
Jul 06, 2023
New MLPerf Inference Network Division Showcases NVIDIA InfiniBand and GPUDirect RDMA Capabilities
In MLPerf Inference v3.0, NVIDIA made its first submissions to the newly introduced Network division, which is now part of the MLPerf Inference Datacenter...
9 MIN READ
May 25, 2023
Navigating Generative AI for Network Admins
We all know that AI is changing the world. For network admins, AI can improve day-to-day operations in some amazing ways: Automation of repetitive tasks: This...
6 MIN READ
May 15, 2023
Efficiently Scale LLM Training Across a Large GPU Cluster with Alpa and Ray
Recent years have seen a proliferation of large language models (LLMs) that extend beyond traditional language tasks to generative AI. This includes models like...
16 MIN READ
Apr 05, 2023
Setting New Records in MLPerf Inference v3.0 with Full-Stack Optimizations for AI
The most exciting computing applications currently rely on training and running inference on complex AI models, often in demanding, real-time deployment...
15 MIN READ
May 24, 2022
Optimizing Your Data Center Network
Data centers can be optimized by updating key network architectures in two ways: through networking technologies or operational efficiency in NetDevOps. In this...
5 MIN READ
Nov 10, 2021
Announcing NVIDIA Nsight Systems 2021.5
The latest update to NVIDIA Nsight Systems—a performance analysis tool—is now available for download. Designed to help you tune and scale software across...
3 MIN READ
Nov 09, 2021
Accelerating Cloud-Native Supercomputing with Magnum IO
Supercomputers are significant investments. However they are extremely valuable tools for researchers and scientists. To effectively and securely share the...
4 MIN READ