- Data Science: Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
- Edge Computing: AI Tool Helps Farmers Combat Crop Loss and Climate Change
- Data Science: Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA
- Computer Vision / Video Analytics: High-Tech AI Framework Transforms Global Marine Pollution Tracking
- Generative AI / LLMs: Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch
Recent
Sep 17, 2024
Accelerating Oracle Database Gen AI Workloads with NVIDIA NIM and NVIDIA cuVS
The vast majority of the world's data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI...
6 MIN READ
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 17, 2024
Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
4 MIN READ
Sep 16, 2024
Generate Code with Abacus AI’s Dracarys Large Language Model
Dracarys, fine-tuned from Llama 3.1 70B and available as an NVIDIA NIM microservice, supports a variety of applications, including data analysis, text...
1 MIN READ
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Sep 16, 2024
Orchestrating Innovation at Scale with NVIDIA Maxine and Texel
The NVIDIA Maxine AI developer platform is a suite of NVIDIA NIM microservices, cloud-accelerated microservices, and SDKs that offer state-of-the-art features...
5 MIN READ
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Sep 11, 2024
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Sep 11, 2024
Spotlight: xpander AI Equips NVIDIA NIM Applications with Agentic Tools
Equipping agentic AI applications with tools will usher in the next phase of AI. By enabling autonomous agents and other AI applications to fetch real-time...
11 MIN READ
Sep 11, 2024
Processing One Billion Rows of Data with RAPIDS cuDF pandas Accelerator Mode
The One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...
11 MIN READ
Sep 11, 2024
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
3 MIN READ
Sep 11, 2024
Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA
Stephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...
2 MIN READ
Generative AI / LLMs
Sep 17, 2024
Accelerating Oracle Database Gen AI Workloads with NVIDIA NIM and NVIDIA cuVS
The vast majority of the world's data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI...
6 MIN READ
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 16, 2024
Generate Code with Abacus AI’s Dracarys Large Language Model
Dracarys, fine-tuned from Llama 3.1 70B and available as an NVIDIA NIM microservice, supports a variety of applications, including data analysis, text...
1 MIN READ
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Sep 11, 2024
Spotlight: xpander AI Equips NVIDIA NIM Applications with Agentic Tools
Equipping agentic AI applications with tools will usher in the next phase of AI. By enabling autonomous agents and other AI applications to fetch real-time...
11 MIN READ
Sep 11, 2024
Constant Time Launch for Straight-Line CUDA Graphs and Other Performance Enhancements
CUDA Graphs are a way to define and batch GPU operations as a graph rather than a sequence of stream launches. A CUDA Graph groups a set of CUDA kernels and...
8 MIN READ
Sep 10, 2024
Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator
Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
16 MIN READ
Sep 10, 2024
Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer
As large language models (LLMs) are becoming even bigger, it is increasingly important to provide easy-to-use and efficient deployment paths because the cost of...
10 MIN READ
Sep 06, 2024
Using Generative AI Models in Circuit Design
Generative models have been making big waves in the past few years, from intelligent text-generating large language models (LLMs) to creative image and...
7 MIN READ
Sep 05, 2024
Achieving State-of-the-Art Zero-Shot Waveform Audio Generation across Audio Types
Stunning audio content is an essential component of virtual worlds. Audio generative AI plays a key role in creating this content, and NVIDIA is continuously...
6 MIN READ
Sep 05, 2024
Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch
As large language models (LLMs) continue to grow in size and complexity, multi-GPU compute is a must-have to deliver the low latency and high throughput that...
5 MIN READ
Sep 04, 2024
Hands-On Training at NVIDIA AI Summit in Washington, DC
Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington, DC on October 7, 2024.
1 MIN READ
AI Foundation Models
Aug 21, 2024
Mistral-NeMo-Minitron 8B Foundation Model Delivers Unparalleled Accuracy
Last month, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading state-of-the-art large language model (LLM). Mistral NeMo 12B consistently outperforms...
5 MIN READ
Jul 29, 2024
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Jul 26, 2024
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Jun 28, 2024
Transforming Financial Analysis with NVIDIA NIM
In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...
13 MIN READ
Jun 24, 2024
Addressing Medical Imaging Limitations with Synthetic Data Generation
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored for Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
Jun 03, 2024
Breeze-7B: LLM Specialized for Traditional Chinese
The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
Jun 03, 2024
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
May 30, 2024
Convert Natural Language to Code with CodeGemma
Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.
1 MIN READ
May 14, 2024
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model
With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.
1 MIN READ
May 13, 2024
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
3 MIN READ
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Simulation / Modeling / Design
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Sep 11, 2024
Constant Time Launch for Straight-Line CUDA Graphs and Other Performance Enhancements
CUDA Graphs are a way to define and batch GPU operations as a graph rather than a sequence of stream launches. A CUDA Graph groups a set of CUDA kernels and...
8 MIN READ
Sep 09, 2024
Spotlight: Shell Accelerates CO2 Storage Modeling 100,000x Using NVIDIA Modulus
As the world faces the urgent need to combat climate change, carbon capture and storage (CCS) has emerged as a crucial technology for achieving net-zero...
7 MIN READ
Sep 09, 2024
Transform Live Media Pipelines with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now ready to be used in live production, taking advantage of the best of both networking and GPU technologies. Holoscan for...
3 MIN READ
Sep 06, 2024
Enhancing Application Portability and Compatibility across New Platforms Using NVIDIA Magnum IO NVSHMEM 3.0
NVSHMEM is a parallel programming interface that provides efficient and scalable communication for NVIDIA GPU clusters. Part of NVIDIA Magnum IO and based on...
7 MIN READ
Sep 06, 2024
Using Generative AI Models in Circuit Design
Generative models have been making big waves in the past few years, from intelligent text-generating large language models (LLMs) to creative image and...
7 MIN READ
Sep 03, 2024
Real-Time Neural Receivers Drive AI-RAN Innovation
Today’s 5G New Radio (5G NR) wireless communication systems rely on highly optimized signal processing algorithms to reconstruct transmitted messages from...
11 MIN READ
Aug 29, 2024
Spotlight: clicOH Accelerates Last-Mile Delivery 20x with NVIDIA cuOpt
Driven by shifts in consumer behavior and the pandemic, e-commerce continues its explosive growth and transformation. As a result, logistics and transportation...
3 MIN READ
Aug 27, 2024
Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking
This post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...
12 MIN READ
Aug 26, 2024
CUDA-Q Enabled Resource Reduction for Quantum Clustering Algorithms
Quantum computers can use the quantum properties of superposition, entanglement, and interference to generalize learnings and insights from data. Such quantum...
7 MIN READ
Aug 20, 2024
Deploy the First On-Device Small Language Model for Improved Game Character Roleplay
At Gamescom 2024, NVIDIA announced our first on-device small language model (SLM) for improving the conversation abilities of game characters. We also announced...
4 MIN READ
Aug 14, 2024
Video: Build Live Media Applications for AI-Enabled Infrastructure with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is a software-defined, AI-enabled platform that enables live video pipelines to run on the same infrastructure as AI. This video...
1 MIN READ
Robotics
Aug 27, 2024
Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking
This post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...
12 MIN READ
Jul 29, 2024
Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices
Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and...
10 MIN READ
Jul 29, 2024
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Jul 18, 2024
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Jul 11, 2024
Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries
Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...
10 MIN READ
Jul 11, 2024
Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus
The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...
9 MIN READ
Jul 10, 2024
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Large-scale, use-case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
14 MIN READ
Jun 25, 2024
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
Jun 24, 2024
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
13 MIN READ
Jun 17, 2024
Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab
Developing effective locomotion policies for quadrupeds poses significant challenges in robotics due to the complex dynamics involved. Training quadrupeds to...
12 MIN READ
Jun 17, 2024
Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab
The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...
11 MIN READ
Jun 14, 2024
Level Up Your Skills with Five New NVIDIA Technical Courses
With AI introducing an unprecedented pace of technological innovation, staying ahead means keeping your skills up to date. The NVIDIA Developer Program gives...
4 MIN READ
Computer Vision / Video Analytics
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Sep 11, 2024
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Sep 11, 2024
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
3 MIN READ
Sep 09, 2024
High-Tech AI Framework Transforms Global Marine Pollution Tracking
An AI-powered remote sensing study offers a dynamic new tool for global ocean cleanup efforts. Detailed in the ISPRS Journal of Photogrammetry and Remote...
4 MIN READ
Sep 05, 2024
AI-Powered Platform Advances Personalized Cancer Diagnostics and Treatments
A recent study introduced a cutting-edge AI-powered pathology platform that can help doctors diagnose and evaluate lung cancer in patients quickly and...
3 MIN READ
Aug 30, 2024
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
8 MIN READ
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ
Aug 27, 2024
Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking
This post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...
12 MIN READ
Aug 19, 2024
Webinar: Build Visual AI Agents With Generative AI and NVIDIA NIM
Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.
1 MIN READ
Aug 12, 2024
Interactive AI Tool Delivers Immersive Video Content to Blind and Low-Vision Viewers
New research aims to revolutionize video accessibility for blind or low-vision (BLV) viewers with an AI-powered system that gives users the ability to explore...
4 MIN READ
Aug 06, 2024
Real-Time AI Shark Detection is Boosting Beach Safety
California beaches are becoming safer with a new AI-powered shark detection system. Known as SharkEye, the technology identifies sharks near shorelines in real...
2 MIN READ
Jul 29, 2024
Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices
Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and...
10 MIN READ
Data Science
Sep 17, 2024
Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
4 MIN READ
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Sep 11, 2024
Processing One Billion Rows of Data with RAPIDS cuDF pandas Accelerator Mode
The One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...
11 MIN READ
Sep 11, 2024
Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA
Stephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...
2 MIN READ
Sep 10, 2024
Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator
Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
16 MIN READ
Sep 04, 2024
Accelerated, Production-Ready Graph Analytics for NetworkX Users
NetworkX is a popular, easy-to-use Python library for graph analytics. However, its performance and scalability may be unsatisfactory for medium-to-large-sized...
11 MIN READ
Sep 04, 2024
Hands-On Training at NVIDIA AI Summit in Washington, DC
Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington, DC on October 7, 2024.
1 MIN READ
Aug 30, 2024
Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI
The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...
12 MIN READ
Aug 29, 2024
Spotlight: clicOH Accelerates Last-Mile Delivery 20x with NVIDIA cuOpt
Driven by shifts in consumer behavior and the pandemic, e-commerce continues its explosive growth and transformation. As a result, logistics and transportation...
3 MIN READ
Aug 29, 2024
Boosting CUDA Efficiency with Essential Techniques for New Developers
To fully harness the capabilities of NVIDIA GPUs, optimizing NVIDIA CUDA performance is essential, particularly for developers new to GPU programming. This talk...
2 MIN READ
Aug 29, 2024
Just Released: RAPIDS 24.08
RAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.
1 MIN READ
Aug 28, 2024
NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks
Six years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...
8 MIN READ
Content Creation / Rendering
Sep 16, 2024
Orchestrating Innovation at Scale with NVIDIA Maxine and Texel
The NVIDIA Maxine AI developer platform is a suite of NVIDIA NIM microservices, cloud-accelerated microservices, and SDKs that offer state-of-the-art features...
5 MIN READ
Sep 11, 2024
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Sep 09, 2024
Transform Live Media Pipelines with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now ready to be used in live production, taking advantage of the best of both networking and GPU technologies. Holoscan for...
3 MIN READ
Aug 30, 2024
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
8 MIN READ
Aug 20, 2024
Deploy the First On-Device Small Language Model for Improved Game Character Roleplay
At Gamescom 2024, NVIDIA announced our first on-device small language model (SLM) for improving the conversation abilities of game characters. We also announced...
4 MIN READ
Aug 12, 2024
Elevating Video Communication with the NVIDIA Maxine AI Developer Platform and VideoRequest
Effective video communication is important for everyone who communicates online. For businesses, educators, and content creators, it is vital. NVIDIA Maxine is...
5 MIN READ
Jul 31, 2024
Shader Debugging Made Easy with NVIDIA Nsight Graphics
Shaders are specialized programs that run on the GPU that manipulate rays, pixels, vertices, and textures to achieve unique visual effects. With shaders, you...
8 MIN READ
Jul 29, 2024
Building Spatial Intelligence from Real-World 3D Data Using Deep-Learning Framework fVDB
Generative physical AI models can understand and execute actions with fine or gross motor skills within the physical world. Understanding and navigating in the...
6 MIN READ
Jul 29, 2024
Advancing Telepresence and Next-Generation Digital Human Technology with NVIDIA Maxine
At SIGGRAPH 2024 this week, NVIDIA is showcasing the latest advancements in the NVIDIA Maxine AI developer platform, available through NVIDIA AI...
8 MIN READ
Jul 25, 2024
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Jul 24, 2024
Developing Product Configurators with OpenUSD
Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Jul 22, 2024
Get Hands-On Training at SIGGRAPH 2024
Complimentary trainings on OpenUSD, Digital Humans, LLMs and more with hands-on labs for Full Conference and Experience attendees.
1 MIN READ
Conversational AI
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 10, 2024
Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer
As large language models (LLMs) are becoming even bigger, it is increasingly important to provide easy-to-use and efficient deployment paths because the cost of...
10 MIN READ
Sep 05, 2024
Achieving State-of-the-Art Zero-Shot Waveform Audio Generation across Audio Types
Stunning audio content is an essential component of virtual worlds. Audio generative AI plays a key role in creating this content, and NVIDIA is continuously...
6 MIN READ
Aug 28, 2024
Deploy Diverse AI Apps with Multi-LoRA Support on RTX AI PCs and Workstations
Today’s large language models (LLMs) achieve unprecedented results across many use cases. Yet, application developers often need to customize and tune these...
10 MIN READ
Aug 27, 2024
Enhancing RAG Applications with NVIDIA NIM
The advent of large language models (LLMs) has significantly benefited the AI industry, offering versatile tools capable of generating human-like text and...
10 MIN READ
Aug 21, 2024
Practical Strategies for Optimizing LLM Inference Sizing and Performance
As the use of large language models (LLMs) grows across many applications, such as chatbots and content creation, it's important to understand the process of...
2 MIN READ
Aug 20, 2024
Hackathon: Build Groundbreaking Generative AI Projects Using NVIDIA AI Workbench
In this hackathon hosted by Dell and NVIDIA, demonstrate how AI Workbench can be used to build and deliver apps for a wide range of tasks and workflows.
1 MIN READ
Aug 20, 2024
Deploy the First On-Device Small Language Model for Improved Game Character Roleplay
At Gamescom 2024, NVIDIA announced our first on-device small language model (SLM) for improving the conversation abilities of game characters. We also announced...
4 MIN READ
Aug 15, 2024
NVIDIA TensorRT Model Optimizer v0.15 Boosts Inference Performance and Expands Model Support
NVIDIA has announced the latest v0.15 release of NVIDIA TensorRT Model Optimizer, a state-of-the-art quantization toolkit of model optimization techniques...
5 MIN READ
Aug 14, 2024
Video: Build Live Media Applications for AI-Enabled Infrastructure with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is a software-defined, AI-enabled platform that enables live video pipelines to run on the same infrastructure as AI. This video...
1 MIN READ
Aug 13, 2024
New NIM Available: Mistral Large 2 Instruct LLM
The new model by Mistral excels at a variety of complex tasks including text summarization, multilingual translation and reasoning, programming, question and...
1 MIN READ
Aug 07, 2024
Building AI Agents with NVIDIA NIM Microservices and LangChain
NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...
3 MIN READ
Edge Computing
Sep 11, 2024
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
3 MIN READ
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ
Aug 19, 2024
Webinar: Build Visual AI Agents With Generative AI and NVIDIA NIM
Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.
1 MIN READ
Aug 14, 2024
Just Released: DOCA 2.8 Software Framework
The new release includes support for Spectrum-X 1.1 RA and new features for AI Cloud Data Centers.
1 MIN READ
Aug 07, 2024
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
10 MIN READ
Jul 22, 2024
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
7 MIN READ
Jul 19, 2024
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Jul 18, 2024
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Jul 17, 2024
Develop Generative AI-Powered Visual AI Agents for the Edge
An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...
9 MIN READ
Jul 03, 2024
Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext
The demand for real-time insights and autonomous decision-making is growing across industries, and healthcare and medical devices are no exception. Relying on...
8 MIN READ
Jun 28, 2024
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Jun 25, 2024
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
Data Center / Cloud
Sep 17, 2024
Accelerating Oracle Database Gen AI Workloads with NVIDIA NIM and NVIDIA cuVS
The vast majority of the world's data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI...
6 MIN READ
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Sep 11, 2024
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Sep 10, 2024
Accelerating the HPCG Benchmark with NVIDIA Math Sparse Libraries
In the realm of high-performance computing (HPC), NVIDIA has continually advanced HPC by offering its highly optimized NVIDIA High-Performance Conjugate...
9 MIN READ
Sep 09, 2024
Spotlight: Shell Accelerates CO2 Storage Modeling 100,000x Using NVIDIA Modulus
As the world faces the urgent need to combat climate change, carbon capture and storage (CCS) has emerged as a crucial technology for achieving net-zero...
7 MIN READ
Sep 09, 2024
Transform Live Media Pipelines with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now ready to be used in live production, taking advantage of the best of both networking and GPU technologies. Holoscan for...
3 MIN READ
Sep 06, 2024
Enhancing Application Portability and Compatibility across New Platforms Using NVIDIA Magnum IO NVSHMEM 3.0
NVSHMEM is a parallel programming interface that provides efficient and scalable communication for NVIDIA GPU clusters. Part of NVIDIA Magnum IO and based on...
7 MIN READ
Sep 05, 2024
Low Latency Inference Chapter 1: Up to 1.9x Higher Llama 3.1 Performance with Medusa on NVIDIA HGX H200 with NVLink Switch
As large language models (LLMs) continue to grow in size and complexity, multi-GPU compute is a must-have to deliver the low latency and high throughput that...
5 MIN READ
Sep 03, 2024
Real-Time Neural Receivers Drive AI-RAN Innovation
Today’s 5G New Radio (5G NR) wireless communication systems rely on highly optimized signal processing algorithms to reconstruct transmitted messages from...
11 MIN READ
Aug 28, 2024
Boosting Llama 3.1 405B Performance up to 1.44x with NVIDIA TensorRT Model Optimizer on NVIDIA H200 GPUs
The Llama 3.1 405B large language model (LLM), developed by Meta, is an open-source community model that delivers state-of-the-art performance and supports a...
7 MIN READ
Aug 28, 2024
NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks
Six years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...
8 MIN READ