Hardware / Semiconductor
Dec 12, 2024
An Introduction to NVIDIA Air
The advent of AI has introduced a new type of data center, the AI factory, purpose-built from the ground up to handle AI workloads. AI workloads can...
6 MIN READ
Dec 11, 2024
Deploying NVIDIA H200 NVL at Scale with New Enterprise Reference Architecture
Last month at the Supercomputing 2024 conference, NVIDIA announced the availability of NVIDIA H200 NVL, the latest NVIDIA Hopper platform. Optimized for...
8 MIN READ
Nov 14, 2024
Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing
Confidential and self-sovereign AI is a new approach to AI development, training, and inference where the user’s data is decentralized, private, and...
15 MIN READ
Oct 29, 2024
Protect Your Network with Secure Boot in SONiC
NVIDIA technology helps organizations build and maintain secure, scalable, and high-performance network infrastructure. Advances in AI, with NVIDIA at the...
4 MIN READ
Oct 25, 2024
Advancing Performance with NVIDIA SHARP In-Network Computing
AI and scientific computing applications are great examples of distributed computing problems. The problems are too large and the computations too intensive to...
7 MIN READ
Oct 24, 2024
Building AI Agents to Automate Software Test Case Creation
In software development, testing is crucial for ensuring the quality and reliability of the final product. However, creating test plans and specifications can...
15 MIN READ
Oct 09, 2024
NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency
NVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU—one built to deliver breakthrough energy efficiency and optimized...
8 MIN READ
Sep 06, 2024
Using Generative AI Models in Circuit Design
Generative models have been making big waves in the past few years, from intelligent text-generating large language models (LLMs) to creative image and...
7 MIN READ
Aug 28, 2024
NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1
Large language model (LLM) inference is a full-stack challenge. Powerful GPUs, high-bandwidth GPU-to-GPU interconnects, efficient acceleration libraries, and a...
13 MIN READ
Aug 27, 2024
Optimize Large-Scale AI Workloads with NVIDIA Spectrum-X
In today’s rapidly evolving technological landscape, staying ahead of the curve is not just a goal; it’s a necessity. The surge of innovations, particularly...
5 MIN READ
Aug 12, 2024
NVIDIA NVLink and NVIDIA NVSwitch Supercharge Large Language Model Inference
Large language models (LLMs) are getting larger, increasing the amount of compute required to process inference requests. To meet real-time latency requirements...
8 MIN READ
Jul 11, 2024
Next Generation of FlashAttention
NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...
1 MIN READ
Jun 24, 2024
Exploring SONiC on NVIDIA Air
Testing out networking infrastructure and building working PoCs for a new environment can be tricky at best and downright dreadful at worst. You may run into...
6 MIN READ
Jun 17, 2024
Video: Talk to Your Supply Chain Data Using NVIDIA NIM
NVIDIA operates one of the largest and most complex supply chains in the world. The supercomputers we build connect tens of thousands of NVIDIA GPUs with...
2 MIN READ
Jun 12, 2024
Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates
The latest release of the NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...
7 MIN READ
Jun 10, 2024
Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs
As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution...
7 MIN READ