DGX

Feb 14, 2025
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ

Feb 11, 2025
NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance
In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
7 MIN READ

Jan 16, 2025
Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with iGenius and NVIDIA DGX Cloud
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and...
17 MIN READ

Jan 09, 2025
NVIDIA Project DIGITS, A Grace Blackwell AI Supercomputer On Your Desk
Powered by the new GB10 Grace Blackwell Superchip, Project DIGITS can tackle large generative AI models of up to 200B parameters.
1 MIN READ

Dec 18, 2024
Five Takeaways from NVIDIA 6G Developer Day 2024
NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ

Nov 22, 2024
Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI
Generative AI is transforming every aspect of the automotive industry, including software development, testing, user experience, personalization, and safety....
8 MIN READ

Nov 20, 2024
Boost Large-Scale Recommendation System Training Embedding Using EMBark
Recommendation systems are core to the Internet industry, and efficiently training them is a key issue for various companies. Most recommendation systems are...
6 MIN READ

Oct 24, 2024
Powering the Next Wave of AI Robotics with Three ComputersÂ
NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ

Oct 22, 2024
Multi-Agent AI and GPU-Powered Innovation in Sound-to-Text Technology
The Automated Audio Captioning task centers around generating natural language descriptions from audio inputs. Given the distinct modalities between the input...
7 MIN READ

Oct 16, 2024
Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs
As the demand for high-performance computing (HPC) and AI applications grows, so does the importance of energy efficiency. NVIDIA Principal Developer Technology...
2 MIN READ

Apr 26, 2024
Perception Model Training for Autonomous Vehicles with Tensor Parallelism
Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...
10 MIN READ

Apr 23, 2024
Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud
GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...
7 MIN READ

Apr 03, 2024
Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2
Large-scale graph neural network (GNN) training presents formidable challenges, particularly concerning the scale and complexity of graph data. These challenges...
5 MIN READ

Mar 27, 2024
Scale and Curate High-Quality Datasets for LLM Training with NVIDIA NeMo Curator
Enterprises are using large language models (LLMs) as powerful tools to improve operational efficiency and drive innovation. NVIDIA NeMo microservices aim to...
6 MIN READ

Mar 19, 2024
Generative AI for Digital Human Technologies and New AI-powered NVIDIA RTX Lighting
At GDC 2024, NVIDIA announced that leading AI application developers such as Inworld AI are using NVIDIA digital human technologies to accelerate the deployment...
4 MIN READ

Mar 18, 2024
NVIDIA NIM Offers Optimized Inference Microservices for Deploying AI Models at Scale
The rise in generative AI adoption has been remarkable. Catalyzed by the launch of OpenAI’s ChatGPT in 2022, the new technology amassed over 100M users within...
6 MIN READ