Data Center / Cloud

Mar 20, 2025
NVIDIA Demonstrates GeForce NOW for Game AI Inference and Streamlined Hands-on Opportunities
NVIDIA cloud gaming service GeForce NOW is providing developers and publishers with new tools to bring their games to more gamers—and offer new experiences...
5 MIN READ

Mar 20, 2025
Boost Llama Model Performance on Microsoft Azure AI Foundry with NVIDIA TensorRT-LLM
Microsoft, in collaboration with NVIDIA, announced transformative performance improvements for the Meta Llama family of models on its Azure AI Foundry platform....
4 MIN READ

Mar 19, 2025
NVIDIA Virtual GPU 18.0 Enables VDI for AI on Every Virtualized Platform
NVIDIA Virtual GPU (vGPU) technology unlocks AI capabilities within Virtual Desktop Infrastructure (VDI), making it more powerful and versatile than ever...
4 MIN READ

Mar 19, 2025
NVIDIA Blackwell Ultra for the Era of AI Reasoning
For years, advancements in AI have followed a clear trajectory through pretraining scaling: larger models, more data, and greater computational resources lead...
5 MIN READ

Mar 19, 2025
NVIDIA Earth-2 Powers Regional AI Weather Forecasting in the United Arab Emirates
In the United Arab Emirates (UAE), extreme weather events disrupt daily life, delaying flights, endangering transportation, and complicating urban planning....
13 MIN READ

Mar 19, 2025
Shrink Genomics and Single-Cell Analysis Time to Minutes with NVIDIA Parabricks and NVIDIA AI Blueprints
NVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...
8 MIN READ

Mar 18, 2025
Seamlessly Scale AI Across Cloud Environments with NVIDIA DGX Cloud Serverless Inference
NVIDIA DGX Cloud Serverless Inference is an auto-scaling AI inference solution that enables application deployment with speed and reliability. Powered by NVIDIA...
9 MIN READ

Mar 18, 2025
Measure and Improve AI Workload Performance with NVIDIA DGX Cloud Benchmarking
As AI capabilities advance, understanding the impact of hardware and software infrastructure choices on workload performance is crucial for both technical...
7 MIN READ

Mar 18, 2025
Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models
NVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for...
14 MIN READ

Mar 18, 2025
NVIDIA Blackwell Delivers World-Record DeepSeek-R1 Inference Performance
NVIDIA announced world-record DeepSeek-R1 inference performance at NVIDIA GTC 2025. A single NVIDIA DGX system with eight NVIDIA Blackwell GPUs can achieve over...
14 MIN READ

Mar 13, 2025
Networking Reliability and Observability at Scale with NCCL 2.24
The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode (MGMN) communication primitives optimized for NVIDIA GPUs and networking....
14 MIN READ

Mar 12, 2025
Lightweight, Multimodal, Multilingual Gemma 3 Models Are Streamlined for Performance
Building AI systems with foundation models requires a delicate balancing of resources such as memory, latency, storage, compute, and more. One size does not fit...
3 MIN READ

Mar 11, 2025
Efficient ETL with Polars and Apache Spark on NVIDIA Grace CPU
The NVIDIA Grace CPU Superchip delivers outstanding performance and best-in-class energy efficiency for CPU workloads in the data center and in the cloud. The...
7 MIN READ

Mar 10, 2025
Optimizing Compile Times for CUDA C++
In modern software development, time is an incredibly valuable resource, especially during the compilation process. For developers working with CUDA C++ on...
10 MIN READ

Mar 10, 2025
Ensuring Reliable Model Training on NVIDIA DGX Cloud
Training AI models on massive GPU clusters presents significant challenges for model builders. Because manual intervention becomes impractical as job scale...
8 MIN READ

Mar 07, 2025
Featured Data Center and Cloud Sessions at NVIDIA GTC 2025
Explore the latest innovations in data center and cloud with sessions showcasing the full capabilities of the NVIDIA accelerated computing platform.
1 MIN READ