Home DEVELOPER
  • Home
  • Blog
  • Forums
  • Docs
  • Downloads
  • Training
  • Join
Computer Vision / Video Analytics

​​Real-Time AI Shark Detection is Boosting Beach Safety

Read now
​​Real-Time AI Shark Detection is Boosting Beach Safety
Generative AI

Access to NVIDIA NIM Now Available Free to Developer Program Members

Read now
Access to NVIDIA NIM Now Available Free to Developer Program Members
Generative AI

Supercharging Llama 3.1 across NVIDIA Platforms

Read now
Supercharging Llama 3.1 across NVIDIA Platforms
Generative AI

Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever 

Read now
Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever 
Generative AI

Customize Generative AI Models for Enterprise Applications with Llama 3.1

Read now
Customize Generative AI Models for Enterprise Applications with Llama 3.1
  • Computer Vision / Video Analytics
    ​​Real-Time AI Shark Detection is Boosting Beach Safety
  • Generative AI
    Access to NVIDIA NIM Now Available Free to Developer Program Members
  • Generative AI
    Supercharging Llama 3.1 across NVIDIA Platforms
  • Generative AI
    Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever 
  • Generative AI
    Customize Generative AI Models for Enterprise Applications with Llama 3.1

Recent

See all
Aug 07, 2024

Optimizing llama.cpp AI Inference with CUDA Graphs

The open-source llama.cpp code base was originally released in 2023 as a lightweight but efficient framework for performing inference on Meta Llama models....
8 MIN READ
Optimizing llama.cpp AI Inference with CUDA Graphs
Aug 07, 2024

Writer Releases Domain-Specific LLMs for Healthcare and Finance

Writer has released two new domain-specific AI models, Palmyra-Med 70B and Palmyra-Fin 70B, expanding the capabilities of NVIDIA NIM. These models bring...
6 MIN READ
Writer Releases Domain-Specific LLMs for Healthcare and Finance
Decorative image of a profit/loss graph.
Aug 07, 2024

Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism

The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
10 MIN READ
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
Image of a person standing in front of an AI kiosk in a retail location.
Aug 07, 2024

Building AI Agents with NVIDIA NIM Microservices and LangChain

NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...
3 MIN READ
Building AI Agents with NVIDIA NIM Microservices and LangChain
An aerial view of a shark swimming.
Aug 06, 2024

​​Real-Time AI Shark Detection is Boosting Beach Safety

California beaches are becoming safer with a new AI-powered shark detection system. Known as SharkEye, the technology identifies sharks near shorelines in real...
2 MIN READ
​​Real-Time AI Shark Detection is Boosting Beach Safety
Aug 06, 2024

Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization

As the demand for sophisticated AI capabilities escalates, VAST Data introduces the VAST Data Platform, now enhanced with NVIDIA BlueField DPUs. This innovation...
7 MIN READ
Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization
Aug 06, 2024

Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM

Developing a high-performing Hebrew large language model (LLM) presents distinct challenges stemming from the rich and complex nature of the Hebrew language...
8 MIN READ
Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM
Aug 06, 2024

A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM

Delivered as optimized containers, NVIDIA NIM microservices are designed to accelerate AI application development for businesses of all sizes, paving the way...
9 MIN READ
A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM
Aug 05, 2024

Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails

As enterprises adopt generative AI applications powered by large language models (LLMs), there is an increasing need to implement guardrails to ensure safety...
6 MIN READ
Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails
Image of two people sitting in their cubicles with speech recognition visualizations in the background.
Aug 05, 2024

Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE

Building an effective automatic speech recognition (ASR) model for underrepresented languages presents unique challenges due to limited data resources.  In...
9 MIN READ
Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE
Aug 02, 2024

Just Released: Nsight Compute 2024.3

Nsight Compute 2024.3 improves selectively exporting results into a new report, kernel name logging to debug empty reports, and profiling green contexts.
1 MIN READ
Just Released: Nsight Compute 2024.3
NVIDIA Hopper GPU and NVIDIA Grace CPUs on a black background.
Aug 02, 2024

Revolutionizing Data Center Efficiency with the NVIDIA Grace Family

The exponential growth in data processing demand is projected to reach 175 zettabytes by 2025. This contrasts sharply with the slowing pace of CPU performance...
16 MIN READ
Revolutionizing Data Center Efficiency with the NVIDIA Grace Family

Generative AI

See all
Aug 07, 2024

Optimizing llama.cpp AI Inference with CUDA Graphs

The open-source llama.cpp code base was originally released in 2023 as a lightweight but efficient framework for performing inference on Meta Llama models....
8 MIN READ
Optimizing llama.cpp AI Inference with CUDA Graphs
Aug 07, 2024

Writer Releases Domain-Specific LLMs for Healthcare and Finance

Writer has released two new domain-specific AI models, Palmyra-Med 70B and Palmyra-Fin 70B, expanding the capabilities of NVIDIA NIM. These models bring...
6 MIN READ
Writer Releases Domain-Specific LLMs for Healthcare and Finance
Image of a person standing in front of an AI kiosk in a retail location.
Aug 07, 2024

Building AI Agents with NVIDIA NIM Microservices and LangChain

NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...
3 MIN READ
Building AI Agents with NVIDIA NIM Microservices and LangChain
Aug 06, 2024

Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization

As the demand for sophisticated AI capabilities escalates, VAST Data introduces the VAST Data Platform, now enhanced with NVIDIA BlueField DPUs. This innovation...
7 MIN READ
Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization
Aug 06, 2024

Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM

Developing a high-performing Hebrew large language model (LLM) presents distinct challenges stemming from the rich and complex nature of the Hebrew language...
8 MIN READ
Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM
Aug 06, 2024

A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM

Delivered as optimized containers, NVIDIA NIM microservices are designed to accelerate AI application development for businesses of all sizes, paving the way...
9 MIN READ
A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM
Aug 05, 2024

Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails

As enterprises adopt generative AI applications powered by large language models (LLMs), there is an increasing need to implement guardrails to ensure safety...
6 MIN READ
Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails
Image of two people sitting in their cubicles with speech recognition visualizations in the background.
Aug 05, 2024

Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE

Building an effective automatic speech recognition (ASR) model for underrepresented languages presents unique challenges due to limited data resources.  In...
9 MIN READ
Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE
Aug 01, 2024

Deliver Personalized Retail Experiences with an AI-Powered Shopping Advisor

Imagine being able to put your best sales associate in front of every customer for every interaction. Your best sales associate offers product recommendations...
4 MIN READ
Deliver Personalized Retail Experiences with an AI-Powered Shopping Advisor
Decorative image.
Aug 01, 2024

Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API

NVIDIA offers tools like Perf Analyzer and Model Analyzer to assist machine learning engineers with measuring and balancing the trade-off between latency and...
6 MIN READ
Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API
A data curator designed for dataset preparation and enhanced LLM performance.
Jul 31, 2024

Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator

In a recent post, we discussed how to use NVIDIA NeMo Curator to curate custom datasets for pretraining or continuous training use cases of large language...
11 MIN READ
Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator
A connected grid of AI applications, optimizing RAG pipelines.
Jul 30, 2024

Enhancing RAG Pipelines with Re-Ranking

In the rapidly evolving landscape of AI-driven applications, re-ranking has emerged as a pivotal technique to enhance the precision and relevance of enterprise...
8 MIN READ
Enhancing RAG Pipelines with Re-Ranking

AI Foundation Models

See all
Jul 29, 2024

Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab

Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Decorative image of a model with multiple apps.
Jul 26, 2024

Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU

NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
Jun 28, 2024

Transforming Financial Analysis with NVIDIA NIM

In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...
13 MIN READ
Transforming Financial Analysis with NVIDIA NIM
Jun 24, 2024

Addressing Medical Imaging Limitations with Synthetic Data Generation

Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
Addressing Medical Imaging Limitations with Synthetic Data Generation
Jun 10, 2024

SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks

Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Jun 03, 2024

Breeze-7B: LLM Specialized for Traditional Chinese

The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
Breeze-7B: LLM Specialized for Traditional Chinese
Jun 03, 2024

BGE-M3: Advanced Multilingual Text Retrieval Model

Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
BGE-M3: Advanced Multilingual Text Retrieval Model
May 30, 2024

Convert Natural Language to Code with CodeGemma

Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.
1 MIN READ
Convert Natural Language to Code with CodeGemma
May 14, 2024

Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model

With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.
1 MIN READ
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model
May 13, 2024

Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia

At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
3 MIN READ
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
Decorative image of LLM workflow.
Apr 26, 2024

New LLM: Snowflake Arctic Model for SQL and Code Generation

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
New LLM: Snowflake Arctic Model for SQL and Code Generation

Simulation / Modeling / Design

See all
Decorative image of a profit/loss graph.
Aug 07, 2024

Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism

The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
10 MIN READ
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
Aug 01, 2024

Just Released: CUDA Toolkit 12.6

The release supports GB100 capabilities and new library enhancements to cuBLAS, cuFFT, cuSOLVER, cuSPARSE, as well as the release of Nsight Compute 2024.3.
1 MIN READ
Just Released: CUDA Toolkit 12.6
Aug 01, 2024

Just Released: NVIDIA HPC SDK v24.7

The new release delivers support for Ubuntu 24.04, new Fortran interfaces for CUDA Graphs, and a major version NVSHMEM API update. It is the last release to...
1 MIN READ
Just Released: NVIDIA HPC SDK v24.7
Jul 30, 2024

Just Released: NVIDIA Modulus v24.07

NVIDIA Modulus 24.07 brings new GNN enhancements and application samples for training with large meshes.
1 MIN READ
Just Released: NVIDIA Modulus v24.07
Weather forecasts running multiple simulations over the same forecast horizon.
Jul 30, 2024

Empowering Energy Trading with MetDesk and NVIDIA Earth-2

Despite the continuous improvement of weather forecasts over the last few decades, uncertainties due to meteorological measurements and models mean that...
13 MIN READ
Empowering Energy Trading with MetDesk and NVIDIA Earth-2
Jul 29, 2024

Building Spatial Intelligence from Real-World 3D Data Using Deep-Learning Framework fVDB

Generative physical AI models can understand and execute actions with fine or gross motor skills within the physical world. Understanding and navigating in the...
6 MIN READ
Building Spatial Intelligence from Real-World 3D Data Using Deep-Learning Framework fVDB
Jul 29, 2024

Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab

Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Jul 29, 2024

How to Build a Generative AI-Enabled Synthetic Data Pipeline with OpenUSD

Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of...
14 MIN READ
How to Build a Generative AI-Enabled Synthetic Data Pipeline with OpenUSD
Jul 29, 2024

Integrate Generative AI into OpenUSD Workflows Using New NVIDIA Omniverse Developer Tools

NVIDIA announced new USD-based generative AI and NVIDIA-accelerated development tools built on NVIDIA Omniverse at SIGGRAPH 2024. These advancements will expand...
6 MIN READ
Integrate Generative AI into OpenUSD Workflows Using New NVIDIA Omniverse Developer Tools
Computational Fluid Dynamics simulation of passenger car in motion, viewed in the Luminary Cloud interface.
Jul 26, 2024

Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs

Engineering simulation is used across industries to accelerate product development. Simulations are used to check the safety of aircraft, cars, and buildings,...
8 MIN READ
Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs
GIF of a tree, excavator, and purple spiky blog moving.
Jul 25, 2024

Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library

Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Collage of four photos of a car with different colors and roof storage.
Jul 24, 2024

Developing Product Configurators with OpenUSD

Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Developing Product Configurators with OpenUSD

Robotics

See all
Decorative image of icons and a molecular structure in green.
Jul 29, 2024

Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices

Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and...
10 MIN READ
Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices
Jul 29, 2024

Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab

Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Jul 18, 2024

Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS

Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
Jul 11, 2024

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...
10 MIN READ
Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries
Jul 11, 2024

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...
9 MIN READ
Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus
Decorative image of workflow steps.
Jul 10, 2024

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
14 MIN READ
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Jun 25, 2024

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Jun 24, 2024

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
13 MIN READ
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
Jun 17, 2024

Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab

Developing effective locomotion policies for quadrupeds poses significant challenges in robotics due to the complex dynamics involved. Training quadrupeds to...
12 MIN READ
Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab
Jun 17, 2024

Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab

The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...
11 MIN READ
Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab
Jun 14, 2024

Level Up Your Skills with Five New NVIDIA Technical Courses

With AI introducing an unprecedented pace of technological innovation, staying ahead means keeping your skills up to date. The NVIDIA Developer Program gives...
4 MIN READ
Level Up Your Skills with Five New NVIDIA Technical Courses
Image of a robotic arm lifting a package.
Jun 13, 2024

Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release

NVIDIA Omniverse is a platform that enables you to build applications for complex 3D and industrial digitalization workflows based on Universal Scene...
5 MIN READ
Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release

Computer Vision / Video Analytics

See all
An aerial view of a shark swimming.
Aug 06, 2024

​​Real-Time AI Shark Detection is Boosting Beach Safety

California beaches are becoming safer with a new AI-powered shark detection system. Known as SharkEye, the technology identifies sharks near shorelines in real...
2 MIN READ
​​Real-Time AI Shark Detection is Boosting Beach Safety
Decorative image of icons and a molecular structure in green.
Jul 29, 2024

Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices

Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and...
10 MIN READ
Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices
Three CT scan segments on a black background.
Jul 26, 2024

Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice

Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...
9 MIN READ
Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice
An illustration representing an AI model.
Jul 17, 2024

Develop Generative AI-Powered Visual AI Agents for the Edge

An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...
9 MIN READ
Develop Generative AI-Powered Visual AI Agents for the Edge
Decorative image of workflow steps.
Jul 10, 2024

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
14 MIN READ
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Jun 26, 2024

Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC

NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...
7 MIN READ
Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC
Image of a factory floor with loading equipment and a person with a clipboard.
Jun 26, 2024

Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD

SyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...
7 MIN READ
Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD
Jun 25, 2024

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Jun 24, 2024

Addressing Medical Imaging Limitations with Synthetic Data Generation

Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
Addressing Medical Imaging Limitations with Synthetic Data Generation
Jun 24, 2024

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
13 MIN READ
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
Jun 18, 2024

Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0

Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...
11 MIN READ
Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0

Data Science

See all
Aug 01, 2024

Just Released: CUDA Toolkit 12.6

The release supports GB100 capabilities and new library enhancements to cuBLAS, cuFFT, cuSOLVER, cuSPARSE, as well as the release of Nsight Compute 2024.3.
1 MIN READ
Just Released: CUDA Toolkit 12.6
A data curator designed for dataset preparation and enhanced LLM performance.
Jul 31, 2024

Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator

In a recent post, we discussed how to use NVIDIA NeMo Curator to curate custom datasets for pretraining or continuous training use cases of large language...
11 MIN READ
Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator
Live cell image showing cell segmentations.
Jul 24, 2024

Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics

VISTA-2D is a new foundational model from NVIDIA that can quickly and accurately perform cell segmentation, a fundamental task in cell imaging and spatial omics...
8 MIN READ
Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics
Jul 18, 2024

Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning

In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...
14 MIN READ
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning
Jul 18, 2024

Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive

In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...
14 MIN READ
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive
Jul 17, 2024

Encoding and Compression Guide for Parquet String Data Using RAPIDS

Parquet writers provide encoding and compression options that are turned off by default. Enabling these options may provide better lossless compression for your...
10 MIN READ
Encoding and Compression Guide for Parquet String Data Using RAPIDS
GIF of a factory floor with potential paths marked in green.
Jul 16, 2024

Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt

Enterprises face significant challenges in making supply chain decisions that maximize profits while adapting quickly to dynamic changes. Optimal supply chain...
8 MIN READ
Building an AI Agent for Supply Chain Optimization with NVIDIA NIM and cuOpt
Jul 15, 2024

Unlock Gene Networks Using Limited Data with AI Model Geneformer

Geneformer is a recently introduced and powerful AI model that learns gene network dynamics and interactions using transfer learning from vast single-cell...
6 MIN READ
Unlock Gene Networks Using Limited Data with AI Model Geneformer
Jul 12, 2024

Event: WeAreDevelopers World Congress 2024

Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.
1 MIN READ
Event: WeAreDevelopers World Congress 2024
Jul 12, 2024

Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU

Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...
4 MIN READ
Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU
An illustration showing a securit alert.
Jul 11, 2024

Defending AI Model Files from Unauthorized Access with Canaries

As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....
6 MIN READ
Defending AI Model Files from Unauthorized Access with Canaries
Jul 11, 2024

Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic...
7 MIN READ
Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

Content Creation / Rendering

See all
Jul 31, 2024

Shader Debugging Made Easy with NVIDIA Nsight Graphics

Shaders are specialized programs that run on the GPU that manipulate rays, pixels, vertices, and textures to achieve unique visual effects. With shaders, you...
8 MIN READ
Shader Debugging Made Easy with NVIDIA Nsight Graphics
Jul 29, 2024

Building Spatial Intelligence from Real-World 3D Data Using Deep-Learning Framework fVDB

Generative physical AI models can understand and execute actions with fine or gross motor skills within the physical world. Understanding and navigating in the...
6 MIN READ
Building Spatial Intelligence from Real-World 3D Data Using Deep-Learning Framework fVDB
Person looking at image of themself captured by a camera, and then the shot pans to a smaller display which shows their face in a 3D hologram.
Jul 29, 2024

Advancing Telepresence and Next-Generation Digital Humans with NVIDIA Maxine

At SIGGRAPH 2024 this week, NVIDIA is showcasing the latest advancements in the NVIDIA Maxine AI developer platform, available through NVIDIA AI...
8 MIN READ
Advancing Telepresence and Next-Generation Digital Humans with NVIDIA Maxine
GIF of a tree, excavator, and purple spiky blog moving.
Jul 25, 2024

Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library

Recent advancements in generative AI and multi-view reconstruction have introduced new ways to rapidly generate 3D content. However, to be useful for downstream...
2 MIN READ
Simulate Elastic Objects in Any Representation with NVIDIA Kaolin Library
Collage of four photos of a car with different colors and roof storage.
Jul 24, 2024

Developing Product Configurators with OpenUSD

Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual...
5 MIN READ
Developing Product Configurators with OpenUSD
Jul 22, 2024

Gets Hands-On Training at SIGGRAPH 2024

Complimentary trainings on OpenUSD, Digital Humans, LLMs and more with hands-on labs for Full Conference and Experience attendees.
1 MIN READ
Gets Hands-On Training at SIGGRAPH 2024
Jul 18, 2024

Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans

With the rise of chatbots and virtual assistants, customer interactions have evolved to embrace the versatility of voice and text inputs. However, integrating...
4 MIN READ
Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans
A GIF showing the creation of a building image with diffusion models.
Jul 10, 2024

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...
13 MIN READ
Understanding Diffusion Models: An Essential Guide for AEC Professionals
Jun 26, 2024

Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC

NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...
7 MIN READ
Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC
Image of a robotic arm lifting a package.
Jun 13, 2024

Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release

NVIDIA Omniverse is a platform that enables you to build applications for complex 3D and industrial digitalization workflows based on Universal Scene...
5 MIN READ
Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release
Jun 10, 2024

Reallusion Brings Digital Characters to Life with NVIDIA AI

In today's digital age, creating realistic animated characters is crucial for filmmakers, game developers, and content creators looking to bring their visions...
6 MIN READ
Reallusion Brings Digital Characters to Life with NVIDIA AI
Comparison of 1080p and 4K RTX VSR and HDR.
Jun 06, 2024

Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK

NVIDIA RTX Video is a collection of AI video enhancements that improve the visual quality of lower-quality video.  RTX Video Super Resolution was announced...
2 MIN READ
Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK

Conversational AI

See all
Image of a person standing in front of an AI kiosk in a retail location.
Aug 07, 2024

Building AI Agents with NVIDIA NIM Microservices and LangChain

NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...
3 MIN READ
Building AI Agents with NVIDIA NIM Microservices and LangChain
Aug 05, 2024

Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails

As enterprises adopt generative AI applications powered by large language models (LLMs), there is an increasing need to implement guardrails to ensure safety...
6 MIN READ
Securing Generative AI Deployments with NVIDIA NIM and NVIDIA NeMo Guardrails
Image of two people sitting in their cubicles with speech recognition visualizations in the background.
Aug 05, 2024

Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE

Building an effective automatic speech recognition (ASR) model for underrepresented languages presents unique challenges due to limited data resources.  In...
9 MIN READ
Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE
A connected grid of AI applications, optimizing RAG pipelines.
Jul 30, 2024

Enhancing RAG Pipelines with Re-Ranking

In the rapidly evolving landscape of AI-driven applications, re-ranking has emerged as a pivotal technique to enhance the precision and relevance of enterprise...
8 MIN READ
Enhancing RAG Pipelines with Re-Ranking
Jul 18, 2024

Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans

With the rise of chatbots and virtual assistants, customer interactions have evolved to embrace the versatility of voice and text inputs. However, integrating...
4 MIN READ
Spotlight: UneeQ Revolutionizes Customer Engagement with AI-Powered Digital Humans
Jul 18, 2024

Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning

In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...
14 MIN READ
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 2, Performance Tuning
Jul 18, 2024

Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive

In this blog post, we continue the series on accelerating vector search using cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...
14 MIN READ
Accelerating Vector Search: RAPIDS cuVS IVF-PQ Part 1, Deep Dive
Illustration showing models and NeMo.
Jul 17, 2024

NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support

Today’s large language models (LLMs) are based on the transformer model architecture introduced in 2017. Since then, rapid advances in AI compute performance...
7 MIN READ
NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support
Jul 16, 2024

New Workshops: Customize LLMs, Build and Deploy Large Neural Networks

Register now for an instructor-led public workshop in July, August or September. Space is limited.
1 MIN READ
New Workshops: Customize LLMs, Build and Deploy Large Neural Networks
Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...
11 MIN READ
Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities
Jul 02, 2024

Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model

NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...
4 MIN READ
Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model
Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Edge Computing

See all
Decorative image of a profit/loss graph.
Aug 07, 2024

Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism

The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
10 MIN READ
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
Jul 22, 2024

Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus

An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
7 MIN READ
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
Image of a city simulation with a 6G network.
Jul 19, 2024

Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN

The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
Jul 18, 2024

Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS

Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
An illustration representing an AI model.
Jul 17, 2024

Develop Generative AI-Powered Visual AI Agents for the Edge

An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to...
9 MIN READ
Develop Generative AI-Powered Visual AI Agents for the Edge
Jul 03, 2024

Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext

The demand for real-time insights and autonomous decision-making is growing across industries, and healthcare and medical devices are no exception. Relying on...
8 MIN READ
Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext
Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
6 MIN READ
Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning
Jun 25, 2024

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Jun 18, 2024

Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0

Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...
11 MIN READ
Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0
Decorative image of TensorRT workflow on a black background.
Jun 11, 2024

Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines

NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...
8 MIN READ
Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines
Decorative image.
Jun 06, 2024

MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development

MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
1 MIN READ
MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development
Jun 05, 2024

Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK

NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
6 MIN READ
Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK

Data Center / Cloud

See all
Decorative image of a profit/loss graph.
Aug 07, 2024

Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism

The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
10 MIN READ
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
Aug 06, 2024

Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization

As the demand for sophisticated AI capabilities escalates, VAST Data introduces the VAST Data Platform, now enhanced with NVIDIA BlueField DPUs. This innovation...
7 MIN READ
Spotlight: NVIDIA BlueField DPUs Power the VAST Data Platform for AI Workload Optimization
Aug 06, 2024

A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM

Delivered as optimized containers, NVIDIA NIM microservices are designed to accelerate AI application development for businesses of all sizes, paving the way...
9 MIN READ
A Deep Dive into the Latest AI Models Optimized with NVIDIA NIM
Aug 02, 2024

Just Released: Nsight Compute 2024.3

Nsight Compute 2024.3 improves selectively exporting results into a new report, kernel name logging to debug empty reports, and profiling green contexts.
1 MIN READ
Just Released: Nsight Compute 2024.3
NVIDIA Hopper GPU and NVIDIA Grace CPUs on a black background.
Aug 02, 2024

Revolutionizing Data Center Efficiency with the NVIDIA Grace Family

The exponential growth in data processing demand is projected to reach 175 zettabytes by 2025. This contrasts sharply with the slowing pace of CPU performance...
16 MIN READ
Revolutionizing Data Center Efficiency with the NVIDIA Grace Family
Aug 01, 2024

Just Released: CUDA Toolkit 12.6

The release supports GB100 capabilities and new library enhancements to cuBLAS, cuFFT, cuSOLVER, cuSPARSE, as well as the release of Nsight Compute 2024.3.
1 MIN READ
Just Released: CUDA Toolkit 12.6
Aug 01, 2024

Just Released: NVIDIA HPC SDK v24.7

The new release delivers support for Ubuntu 24.04, new Fortran interfaces for CUDA Graphs, and a major version NVSHMEM API update. It is the last release to...
1 MIN READ
Just Released: NVIDIA HPC SDK v24.7
Decorative image.
Aug 01, 2024

Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API

NVIDIA offers tools like Perf Analyzer and Model Analyzer to assist machine learning engineers with measuring and balancing the trade-off between latency and...
6 MIN READ
Measuring Generative AI Model Performance Using NVIDIA GenAI-Perf and an OpenAI-Compatible API
Weather forecasts running multiple simulations over the same forecast horizon.
Jul 30, 2024

Empowering Energy Trading with MetDesk and NVIDIA Earth-2

Despite the continuous improvement of weather forecasts over the last few decades, uncertainties due to meteorological measurements and models mean that...
13 MIN READ
Empowering Energy Trading with MetDesk and NVIDIA Earth-2
Computational Fluid Dynamics simulation of passenger car in motion, viewed in the Luminary Cloud interface.
Jul 26, 2024

Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs

Engineering simulation is used across industries to accelerate product development. Simulations are used to check the safety of aircraft, cars, and buildings,...
8 MIN READ
Faster Insights from Luminary Cloud's Engineering Simulations with NVIDIA GPUs
Three CT scan segments on a black background.
Jul 26, 2024

Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice

Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...
9 MIN READ
Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice
Live cell image showing cell segmentations.
Jul 24, 2024

Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics

VISTA-2D is a new foundational model from NVIDIA that can quickly and accurately perform cell segmentation, a fundamental task in cell imaging and spatial omics...
8 MIN READ
Cell Imaging Feature Extraction and Morphology Clustering for Spatial Omics