Deep dive

Nov 30, 2023
Introduction to LLM Agents
Consider a large language model (LLM) application that is designed to help financial analysts answer questions about the performance of a company. With a...
10 MIN READ

Nov 27, 2023
Bolstering Cybersecurity: How Large Language Models and Generative AI are Transforming Digital Security
Identity-based attacks are on the rise, with phishing remaining the most common and second-most expensive attack vector. Some attackers are using AI to craft...
9 MIN READ

Nov 17, 2023
Mastering LLM Techniques: Inference Optimization
Stacking transformer layers to create large models results in better accuracies, few-shot learning capabilities, and even near-human emergent abilities on a...
25 MIN READ

Nov 16, 2023
Unlock the Power of NVIDIA Grace and NVIDIA Hopper Architectures with Foundational HPC Software
High-performance computing (HPC) powers applications in simulation and modeling, healthcare and life sciences, industry and engineering, and more. In the modern...
7 MIN READ

Nov 16, 2023
Mastering LLM Techniques: Training
Large language models (LLMs) are a class of generative AI models built using transformer networks that can recognize, summarize, translate, predict, and...
15 MIN READ

Nov 15, 2023
Mastering LLM Techniques: LLMOps
Businesses rely more than ever on data and AI to innovate, offer value to customers, and stay competitive. The adoption of machine learning (ML), created a need...
14 MIN READ

Nov 14, 2023
Energy Efficiency in High-Performance Computing: Balancing Speed and Sustainability
The world of computing is on the precipice of a seismic shift. The demand for computing power, particularly in high-performance computing (HPC), is...
17 MIN READ

Nov 13, 2023
Simplifying GPU Programming for HPC with NVIDIA Grace Hopper Superchip
The new hardware developments in NVIDIA Grace Hopper Superchip systems enable some dramatic changes to the way developers approach GPU programming. Most...
17 MIN READ

Nov 13, 2023
Optimize Energy Efficiency of Multi-Node VASP Simulations with NVIDIA Magnum IO
Computational energy efficiency has become a primary decision criterion for most supercomputing centers. Data centers, once built, are capped in terms of the...
17 MIN READ

Nov 07, 2023
CUDA-Accelerated Robot Motion Generation in Milliseconds with NVIDIA cuRobo
Real-time autonomous robot navigation powered by a fast motion-generation algorithm can enable applications in several industries such as food and services,...
3 MIN READ

Oct 29, 2023
How to Train Autonomous Mobile Robots to Detect Warehouse Pallet Jacks Using Synthetic Data
Synthetic data can play a key role when training perception AI models that are deployed on autonomous mobile robots (AMRs). This process is becoming...
10 MIN READ

Oct 28, 2023
Accelerate Genomic Analysis for Any Sequencer with NVIDIA Parabricks v4.2
Parabricks version 4.2 has been released, furthering its mission to deliver unprecedented speed, cost-effectiveness, and accuracy in genomics sequencing...
7 MIN READ

Oct 24, 2023
Efficient CUDA Debugging: Memory Initialization and Thread Synchronization with NVIDIA Compute Sanitizer
NVIDIA Compute Sanitizer (NCS) is a powerful tool that can save you time and effort while improving the reliability and performance of your CUDA...
13 MIN READ

Oct 22, 2023
Differentiable Slang: Example Applications
Differentiable Slang easily integrates with existing codebases—from Python, PyTorch, and CUDA to HLSL—to aid multiple computer graphics tasks and enable...
6 MIN READ

Oct 22, 2023
Differentiable Slang: A Shading Language for Renderers That Learn
NVIDIA just released a SIGGRAPH Asia 2023 research paper, SLANG.D: Fast, Modular and Differentiable Shader Programming. The paper shows how a single language...
12 MIN READ

Oct 19, 2023
Bringing Generative AI to Life with NVIDIA Jetson
Recently, NVIDIA unveiled Jetson Generative AI Lab, which empowers developers to explore the limitless possibilities of generative AI in a real-world setting...
11 MIN READ