Advanced Technical

Dec 04, 2023
NVIDIA TensorRT-LLM Enhancements Deliver Massive Large Language Model Speedups on NVIDIA H200
Large language models (LLMs) have seen dramatic growth over the last year, and the challenge of delivering great user experiences depends on both high-compute...
5 MIN READ

Nov 28, 2023
One Giant Superchip for LLMs, Recommenders, and GNNs: Introducing NVIDIA GH200 NVL32
At AWS re:Invent 2023, AWS and NVIDIA announced that AWS will be the first cloud provider to offer NVIDIA GH200 Grace Hopper Superchips interconnected with...
9 MIN READ

Nov 17, 2023
Mastering LLM Techniques: Inference Optimization
Stacking transformer layers to create large models results in better accuracies, few-shot learning capabilities, and even near-human emergent abilities on a...
25 MIN READ

Nov 15, 2023
Build Custom Enterprise-Grade Generative AI with NVIDIA AI Foundation ModelsĀ
In the realm of generative AI, building enterprise-grade large language models (LLMs) requires expertise collecting high-quality data, setting up the...
9 MIN READ

Nov 14, 2023
Accelerating Ptychography Workflows with NVIDIA Holoscan at Diamond Light Source
Diamond Light Source is a world-renowned synchrotron facility in the UK that provides scientists with access to intense beams of x-rays, infrared, and other...
10 MIN READ

Nov 14, 2023
Energy Efficiency in High-Performance Computing: Balancing Speed and Sustainability
The world of computing is on the precipice of a seismic shift. The demand for computing power, particularly in high-performance computing (HPC), is...
17 MIN READ

Nov 13, 2023
Using Synthetic Data to Address Novel Viewpoints for Autonomous Vehicle Perception
Autonomous vehicles (AV) come in all shapes and sizes, ranging from small passenger cars to multi-axle semi-trucks. However, a perception algorithm deployed on...
7 MIN READ

Nov 09, 2023
Enabling Greater Patient-Specific Cardiovascular Care with AI Surrogates
A Stanford University team is transforming heart healthcare with near real-time cardiovascular simulations driven by the power of AI. Harnessing...
8 MIN READ

Nov 08, 2023
New Workshop: Rapid Application Development Using Large Language Models
Interested in developing LLM-based applications? Get started with this exploration of the open-source ecosystem.
1 MIN READ

Oct 12, 2023
Workshop: Model Parallelism: Building and Deploying Large Neural Networks
Learn how to train the largest neural networks and deploy them to production.
1 MIN READ

Oct 10, 2023
Event: AI and Data Science Virtual Summit
Meta, NetworkX, Fast.ai, and other industry leaders share how to gain new insights from your data with emerging tools.
1 MIN READ

Sep 29, 2023
Comparing Solutions for Boosting Data Center Redundancy
In todayās data center, there are many ways to achieve system redundancy from a server connected to a fabric. Customers usually seek redundancy to increase...
7 MIN READ

Sep 26, 2023
Validating NVIDIA DRIVE Sim Radar Models
Sensor simulation is a critical tool to address the gaps in real-world data for autonomous vehicle (AV) development. However, it is only effective if sensor...
15 MIN READ

Sep 21, 2023
Just Released: NVIDIA Modulus 23.09
NVIDIA Modulus 23.09 is now available, providing ease-of-use updates, fixes, and other enhancements.
1 MIN READ

Sep 09, 2023
Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut
AI is transforming computing, and inference is how the capabilities of AI are deployed in the worldās applications. Intelligent chatbots, image and video...
13 MIN READ

Sep 07, 2023
Unlocking Multi-GPU Model Training with Dask XGBoost
As data scientists, we often face the challenging task of training large models on huge datasets. One commonly used tool, XGBoost, is a robust and efficient...
11 MIN READ