Cloud Services

Jun 05, 2023
CUDA 12.1 Supports Large Kernel Parameters
CUDA kernel function parameters are passed to the device through constant memory and have been limited to 4,096 bytes. CUDA 12.1 increases this parameter limit...
5 MIN READ

Jun 02, 2023
Harnessing the Power of NVIDIA AI Enterprise on Azure Machine Learning
AI is transforming industries, automating processes, and opening new opportunities for innovation in the rapidly evolving technological landscape. As more...
7 MIN READ

May 31, 2023
Protecting Sensitive Data and AI Models with Confidential Computing
Rapid digital transformation has led to an explosion of sensitive data being generated across the enterprise. That data has to be stored and processed in data...
10 MIN READ

May 29, 2023
Turbocharging Generative AI Workloads with NVIDIA Spectrum-X Networking Platform
Large language models (LLMs) and AI applications such as ChatGPT and DALL-E have recently seen rapid growth. Thanks to GPUs, CPUs, DPUs, high-speed storage, and...
8 MIN READ

May 28, 2023
Announcing NVIDIA DGX GH200: The First 100 Terabyte GPU Memory System
At COMPUTEX 2023, NVIDIA announced NVIDIA DGX GH200, which marks another breakthrough in GPU-accelerated computing to power the most demanding giant AI...
6 MIN READ

May 28, 2023
NVIDIA AX800 Delivers High-Performance 5G vRAN and AI Services on One Common Cloud Infrastructure
The pace of 5G investment and adoption is accelerating. According to the GSMA Mobile Economy 2023 report, nearly $1.4 trillion will be spent on 5G CAPEX,...
11 MIN READ

May 25, 2023
Navigating Generative AI for Network Admins
We all know that AI is changing the world. For network admins, AI can improve day-to-day operations in some amazing ways: Automation of repetitive tasks: This...
6 MIN READ

May 11, 2023
Power the Next Wave of Applications with NVIDIA BlueField-3 DPUs
ChatGPT, Stable Diffusion, DALL-E, and similar applications have awakened the world to generative AI. ChatGPT is the fastest-growing application in history. The...
9 MIN READ

May 09, 2023
Automating Data Center Networks with NVIDIA Cumulus Linux
With evolving and ever-growing data centers, the days of simple networks that remained mostly unchanged are gone. Back then, when a configuration change was...
4 MIN READ

May 04, 2023
Increasing Throughput and Reducing Costs for AI-Based Computer Vision with CV-CUDA
Real-time cloud-scale applications that involve AI-based computer vision are growing rapidly. The use cases include image understanding, content creation,...
11 MIN READ

May 04, 2023
Accelerating the Suricata IDS/IPS with NVIDIA BlueField DPUs
Deep packet inspection (DPI) is a critical technology for network security that enables the inspection and analysis of data packets as they travel across a...
5 MIN READ

Apr 26, 2023
An Introduction to Large Language Models: Prompt Engineering and P-Tuning
ChatGPT has made quite an impression. Users are excited to use the AI chatbot to ask questions, write poems, imbue a persona for interaction, act as a personal...
10 MIN READ

Apr 25, 2023
Increasing Inference Acceleration of KoGPT with NVIDIA FasterTransformer
Transformers are one of the most influential AI model architectures today and are shaping the direction of future AI R&D. First invented as a tool for...
6 MIN READ

Apr 18, 2023
New GPU Library Lowers Compute Costs for Apache Spark ML
Spark MLlib is a key component of Apache Spark for large-scale machine learning and provides built-in implementations of many popular machine learning...
6 MIN READ

Mar 23, 2023
Power Your AI Inference with New NVIDIA Triton and NVIDIA TensorRT Features
NVIDIA AI inference software consists of NVIDIA Triton Inference Server, open-source inference serving software, and NVIDIA TensorRT, an SDK for...
5 MIN READ

Mar 22, 2023
NVIDIA Maxine Elevates Video Conferencing in the Cloud
Real-time remote communication has become the new normal, yet many office workers still experience poor video and audio quality, which impacts collaboration and...
6 MIN READ