-
Data ScienceAI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases
-
Generative AI / LLMsAccelerating LLMs with llama.cpp on NVIDIA RTX Systems
-
Generative AI / LLMsAI Chatbot Delivers Multilingual Support to African Farmers
-
Generative AI / LLMsLow Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
-
Generative AI / LLMsDeploying Accelerated Llama 3.2 from the Edge to the Cloud
Recent
Oct 07, 2024
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
11 MIN READ
Oct 07, 2024
Generate Image and Text Embeddings with NV-CLIP
NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
1 MIN READ
Oct 07, 2024
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Oct 07, 2024
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
7 MIN READ
Oct 04, 2024
Just Released: NVIDIA TensorRT-LLM 0.13.0
Updates include tensor parallel support for Mamba2, sparse mixer normalization for MoE models, and more.
1 MIN READ
Oct 04, 2024
Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
1 MIN READ
Oct 03, 2024
Event: Community Over Code
Learn about accelerating vector search with NVIDIA cuVS and Apache Solr on October 10 at Community Over Code.
1 MIN READ
Oct 03, 2024
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues
Antarctica plays a crucial role in regulating ‌Earth’s climate. Most climate research into the world’s coldest, most windswept continent focuses on the...
5 MIN READ
Oct 03, 2024
New Reward Model Helps Improve LLM Alignment with Human Preferences
Reinforcement learning from human feedback (RLHF) is essential for developing AI systems that are aligned with human values and preferences. RLHF enables the...
3 MIN READ
Oct 03, 2024
Event: NVIDIA cuOpt at INFORMS 2024
Join NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.
1 MIN READ
Oct 02, 2024
Webinar: Accelerating Python with GPUs
Join us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.
1 MIN READ
Oct 02, 2024
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds
With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
15 MIN READ
Generative AI / LLMs
Oct 07, 2024
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Oct 04, 2024
Just Released: NVIDIA TensorRT-LLM 0.13.0
Updates include tensor parallel support for Mamba2, sparse mixer normalization for MoE models, and more.
1 MIN READ
Oct 04, 2024
Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
1 MIN READ
Oct 03, 2024
Event: Community Over Code
Learn about accelerating vector search with NVIDIA cuVS and Apache Solr on October 10 at Community Over Code.
1 MIN READ
Oct 03, 2024
New Reward Model Helps Improve LLM Alignment with Human Preferences
Reinforcement learning from human feedback (RLHF) is essential for developing AI systems that are aligned with human values and preferences. RLHF enables the...
3 MIN READ
Oct 02, 2024
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds
With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
15 MIN READ
Oct 02, 2024
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
5 MIN READ
Oct 01, 2024
Evolving AI-Powered Game Development with Retrieval-Augmented Generation
Game development is a complex and resource-intensive process, particularly when using advanced tools like Unreal Engine. Developers find themselves navigating...
6 MIN READ
Oct 01, 2024
Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5
At Unreal Fest 2024, NVIDIA released new Unreal Engine 5 on-device plugins for NVIDIA ACE, making it easier to build and deploy AI-powered MetaHuman characters...
4 MIN READ
Oct 01, 2024
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
11 MIN READ
Sep 30, 2024
Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator
Developers have shown a lot of excitement for NVIDIA NIM microservices, a set of easy-to-use cloud-native microservices that shortens the time-to-market and...
5 MIN READ
Sep 30, 2024
Improve Reinforcement Learning from Human Feedback with Leaderboard-Topping Reward Model
Llama 3.1 Nemotron 70B Reward model helps generate high-quality training data that aligns with human preferences for finance, retail, healthcare, scientific...
1 MIN READ
AI Foundation Models
Sep 25, 2024
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
6 MIN READ
Aug 21, 2024
Mistral-NeMo-Minitron 8B Foundation Model Delivers Unparalleled Accuracy
Last month, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading state-of-the-art large language model (LLM). Mistral NeMo 12B consistently outperforms...
5 MIN READ
Jul 29, 2024
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Jul 26, 2024
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
6 MIN READ
Jun 28, 2024
Transforming Financial Analysis with NVIDIA NIM
In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...
13 MIN READ
Jun 24, 2024
Addressing Medical Imaging Limitations with Synthetic Data Generation
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
9 MIN READ
Jun 10, 2024
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
1 MIN READ
Jun 03, 2024
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
1 MIN READ
Jun 03, 2024
Breeze-7B: LLM Specialized for Traditional Chinese
The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.
1 MIN READ
May 30, 2024
Convert Natural Language to Code with CodeGemma
Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.
1 MIN READ
May 14, 2024
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model
With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.
1 MIN READ
May 13, 2024
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
3 MIN READ
Simulation / Modeling / Design
Oct 07, 2024
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Oct 07, 2024
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
7 MIN READ
Sep 30, 2024
Advancing Quantum Algorithm Design with GPTs
AI techniques like large language models (LLMs) are rapidly transforming many scientific disciplines. Quantum computing is no exception. A collaboration between...
8 MIN READ
Sep 27, 2024
Just Released: NVIDIA HPC SDK v24.9
The new release includes several new features including improved stdpar programming and Arm processor support.
1 MIN READ
Sep 26, 2024
Spotlight: Montai Builds a Multimodal AI Platform for Drug Discovery Using NVIDIA NIM Microservices
Drug discovery aims to develop new therapeutic agents that effectively target diseases while minimizing side effects for patients. Using multimodal data—such...
4 MIN READ
Sep 24, 2024
Developing Next-Generation Wireless Networks with NVIDIA Aerial Omniverse Digital Twin
The journey to 6G has begun, offering opportunities to deliver a network infrastructure that is performant, efficient, resilient, and adaptable. 6G networks...
9 MIN READ
Sep 24, 2024
Spotlight: Petrobras Speeds Up Linear Solvers for Reservoir Simulation Using NVIDIA Grace CPU
Reservoir simulation helps reservoir engineers optimize their resource exploration approach by simulating complex scenarios and comparing with real-world field...
8 MIN READ
Sep 23, 2024
Just Released: Free OpenUSD Training Courses
Accelerate your OpenUSD workflows with this free curriculum for developers and 3D practitioners.
1 MIN READ
Sep 20, 2024
New AI-Powered 3D Printing Can Help Surgeons Rehearse Procedures
Researchers at Washington State University (WSU) unveiled a new AI-guided 3D printing technique that can help physicians print intricate replicas of human...
3 MIN READ
Sep 19, 2024
Spotlight: SLB and NVIDIA Collaborate on Generative AI Solutions for Energy
Global energy technology company SLB has announced the next milestone in its long-standing collaboration with NVIDIA to develop and scale generative AI...
3 MIN READ
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ
Sep 11, 2024
Constant Time Launch for Straight-Line CUDA Graphs and Other Performance Enhancements
CUDA Graphs are a way to define and batch GPU operations as a graph rather than a sequence of stream launches. A CUDA Graph groups a set of CUDA kernels and...
8 MIN READ
Robotics
Sep 25, 2024
How AI and Robotics are Driving Agricultural Productivity and Sustainability
By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
2 MIN READ
Sep 23, 2024
Using Generative AI to Enable Robots to Reason and Act with ReMEmbR
Vision-language models (VLMs) combine the powerful language understanding of foundational LLMs with the vision capabilities of vision transformers (ViTs) by...
10 MIN READ
Aug 27, 2024
Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking
This post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...
12 MIN READ
Jul 29, 2024
Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices
Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and...
10 MIN READ
Jul 29, 2024
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Robots need to be adaptable, readily learning new skills and adjusting to their surroundings. Yet traditional training methods can limit a robot’s ability to...
7 MIN READ
Jul 18, 2024
Webinar: Improving Robot Uptime Featuring Nav2 Autonomous Docking with NVIDIA Isaac ROS
Join Isaac ROS engineers and the founder of Open Navigation to explore the new Nav2 autonomous docking feature.
1 MIN READ
Jul 11, 2024
Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries
Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...
10 MIN READ
Jul 11, 2024
Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus
The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...
9 MIN READ
Jul 10, 2024
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
14 MIN READ
Jun 25, 2024
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
5 MIN READ
Jun 24, 2024
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
13 MIN READ
Jun 17, 2024
Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab
Developing effective locomotion policies for quadrupeds poses significant challenges in robotics due to the complex dynamics involved. Training quadrupeds to...
12 MIN READ
Computer Vision / Video Analytics
Oct 07, 2024
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
11 MIN READ
Oct 07, 2024
Generate Image and Text Embeddings with NV-CLIP
NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
1 MIN READ
Oct 07, 2024
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
7 MIN READ
Sep 27, 2024
AI Chatbot Delivers Multilingual Support to African Farmers
Some of Africa’s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot that gives detailed...
4 MIN READ
Sep 25, 2024
How AI and Robotics are Driving Agricultural Productivity and Sustainability
By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
2 MIN READ
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Sep 11, 2024
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Sep 11, 2024
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
3 MIN READ
Sep 09, 2024
High-Tech AI Framework Transforms Global Marine Pollution Tracking
An AI-powered remote sensing study offers a dynamic new tool for global ocean cleanup efforts. Detailed in the ISPRS Journal of Photogrammetry and Remote...
4 MIN READ
Sep 05, 2024
AI-Powered Platform Advances Personalized Cancer Diagnostics and Treatments
A recent study introduced a cutting-edge AI-powered pathology platform that can help doctors diagnose and evaluate lung cancer in patients quickly and...
3 MIN READ
Aug 30, 2024
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
8 MIN READ
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ
Data Science
Oct 04, 2024
Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
1 MIN READ
Oct 03, 2024
Event: Community Over Code
Learn about accelerating vector search with NVIDIA cuVS and Apache Solr on October 10 at Community Over Code.
1 MIN READ
Oct 03, 2024
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues
Antarctica plays a crucial role in regulating ‌Earth’s climate. Most climate research into the world’s coldest, most windswept continent focuses on the...
5 MIN READ
Oct 03, 2024
Event: NVIDIA cuOpt at INFORMS 2024
Join NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.
1 MIN READ
Oct 02, 2024
Webinar: Accelerating Python with GPUs
Join us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.
1 MIN READ
Oct 02, 2024
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds
With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
15 MIN READ
Oct 02, 2024
AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases
A groundbreaking drug-repurposing AI model could bring new hope to doctors and patients trying to treat diseases with limited or no existing treatment options....
3 MIN READ
Sep 27, 2024
AI Chatbot Delivers Multilingual Support to African Farmers
Some of Africa’s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot that gives detailed...
4 MIN READ
Sep 26, 2024
Harnessing Data with AI to Boost Zero Trust Cyber Defense
Modern cyber threats have grown increasingly sophisticated, posing significant risks to federal agencies and critical infrastructure. According to Deloitte,...
8 MIN READ
Sep 18, 2024
Event: Developer Day for Financial Services
Join this virtual developer day to learn how AI and Machine Learning can revolutionize fraud detection and financial crime prevention.
1 MIN READ
Sep 17, 2024
Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta
Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
4 MIN READ
Sep 13, 2024
Improved Data Loading with Threads
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
8 MIN READ
Content Creation / Rendering
Oct 07, 2024
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
7 MIN READ
Oct 02, 2024
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
5 MIN READ
Oct 01, 2024
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
7 MIN READ
Oct 01, 2024
Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5
At Unreal Fest 2024, NVIDIA released new Unreal Engine 5 on-device plugins for NVIDIA ACE, making it easier to build and deploy AI-powered MetaHuman characters...
4 MIN READ
Sep 23, 2024
Just Released: Free OpenUSD Training Courses
Accelerate your OpenUSD workflows with this free curriculum for developers and 3D practitioners.
1 MIN READ
Sep 16, 2024
Orchestrating Innovation at Scale with NVIDIA Maxine and Texel
The NVIDIA Maxine AI developer platform is a suite of NVIDIA NIM microservices, cloud-accelerated microservices, and SDKs that offer state-of-the-art features...
5 MIN READ
Sep 11, 2024
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
10 MIN READ
Sep 09, 2024
Transform Live Media Pipelines with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now ready to be used in live production, taking advantage of the best of both networking and GPU technologies. Holoscan for...
3 MIN READ
Aug 30, 2024
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
8 MIN READ
Aug 20, 2024
Deploy the First On-Device Small Language Model for Improved Game Character Roleplay
At Gamescom 2024, NVIDIA announced our first on-device small language model (SLM) for improving the conversation abilities of game characters. We also announced...
4 MIN READ
Aug 12, 2024
Elevating Video Communication with the NVIDIA Maxine AI Developer Platform and VideoRequest
Effective video communication is important for everyone who communicates online. For businesses, educators, and content creators, it is vital. NVIDIA Maxine is...
5 MIN READ
Jul 31, 2024
Shader Debugging Made Easy with NVIDIA Nsight Graphics
Shaders are specialized programs that run on the GPU that manipulate rays, pixels, vertices, and textures to achieve unique visual effects. With shaders, you...
8 MIN READ
Conversational AI
Oct 01, 2024
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
11 MIN READ
Sep 26, 2024
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ
Sep 25, 2024
Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint
Providing customers with quality service remains a top priority for businesses across industries, from answering questions and troubleshooting issues to...
5 MIN READ
Sep 25, 2024
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
6 MIN READ
Sep 24, 2024
Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo
NVIDIA NeMo has consistently developed automatic speech recognition (ASR) models that set the benchmark in the industry, particularly those topping the Hugging...
13 MIN READ
Sep 18, 2024
Quickly Voice Your Apps with NVIDIA NIM Microservices for Speech and Translation
NVIDIA NIM, part of NVIDIA AI Enterprise, provides containers to self-host GPU-accelerated inferencing microservices for pretrained and customized AI models...
11 MIN READ
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 10, 2024
Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer
As large language models (LLMs) are becoming even bigger, it is increasingly important to provide easy-to-use and efficient deployment paths because the cost of...
10 MIN READ
Sep 05, 2024
Achieving State-of-the-Art Zero-Shot Waveform Audio Generation across Audio Types
Stunning audio content is an essential component of virtual worlds. Audio generative AI plays a key role in creating this content, and NVIDIA is continuously...
6 MIN READ
Aug 28, 2024
Deploy Diverse AI Apps with Multi-LoRA Support on RTX AI PCs and Workstations
Today’s large language models (LLMs) achieve unprecedented results across many use cases. Yet, application developers often need to customize and tune these...
10 MIN READ
Aug 27, 2024
Enhancing RAG Applications with NVIDIA NIM
The advent of large language models (LLMs) has significantly benefited the AI industry, offering versatile tools capable of generating human-like text and...
10 MIN READ
Aug 21, 2024
Practical Strategies for Optimizing LLM Inference Sizing and Performance
As the use of large language models (LLMs) grows across many applications, such as chatbots and content creation, it's important to understand the process of...
2 MIN READ
Edge Computing
Oct 07, 2024
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
7 MIN READ
Oct 03, 2024
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues
Antarctica plays a crucial role in regulating ‌Earth’s climate. Most climate research into the world’s coldest, most windswept continent focuses on the...
5 MIN READ
Sep 25, 2024
How AI and Robotics are Driving Agricultural Productivity and Sustainability
By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
2 MIN READ
Sep 24, 2024
Developing Next-Generation Wireless Networks with NVIDIA Aerial Omniverse Digital Twin
The journey to 6G has begun, offering opportunities to deliver a network infrastructure that is performant, efficient, resilient, and adaptable. 6G networks...
9 MIN READ
Sep 23, 2024
Using Generative AI to Enable Robots to Reason and Act with ReMEmbR
Vision-language models (VLMs) combine the powerful language understanding of foundational LLMs with the vision capabilities of vision transformers (ViTs) by...
10 MIN READ
Sep 11, 2024
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
3 MIN READ
Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ
Aug 19, 2024
Webinar: Build Visual AI Agents With Generative AI and NVIDIA NIM
Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.
1 MIN READ
Aug 14, 2024
Just Released: DOCA 2.8 Software Framework
The new release includes support for Spectrum-X 1.1 RA and new features for AI Cloud Data Centers.
1 MIN READ
Aug 07, 2024
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
10 MIN READ
Jul 22, 2024
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
7 MIN READ
Jul 19, 2024
Boosting AI-Driven Innovation in 6G with the AI-RAN Alliance, 3GPP, and O-RAN
The pace of 6G research and development is picking up as the 5G era crosses the midpoint of the decade-long cellular generation time frame. In this blog post,...
13 MIN READ
Data Center / Cloud
Oct 07, 2024
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
11 MIN READ
Oct 02, 2024
Webinar: Accelerating Python with GPUs
Join us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.
1 MIN READ
Oct 01, 2024
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
7 MIN READ
Sep 30, 2024
Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator
Developers have shown a lot of excitement for NVIDIA NIM microservices, a set of easy-to-use cloud-native microservices that shortens the time-to-market and...
5 MIN READ
Sep 26, 2024
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ
Sep 24, 2024
Developing Next-Generation Wireless Networks with NVIDIA Aerial Omniverse Digital Twin
The journey to 6G has begun, offering opportunities to deliver a network infrastructure that is performant, efficient, resilient, and adaptable. 6G networks...
9 MIN READ
Sep 24, 2024
NVIDIA GH200 Grace Hopper Superchip Delivers Outstanding Performance in MLPerf Inference v4.1
In the latest round of MLPerf Inference – a suite of standardized, peer-reviewed inference benchmarks – the NVIDIA platform delivered outstanding...
7 MIN READ
Sep 24, 2024
Spotlight: Petrobras Speeds Up Linear Solvers for Reservoir Simulation Using NVIDIA Grace CPU
Reservoir simulation helps reservoir engineers optimize their resource exploration approach by simulating complex scenarios and comparing with real-world field...
8 MIN READ
Sep 19, 2024
Spotlight: SLB and NVIDIA Collaborate on Generative AI Solutions for Energy
Global energy technology company SLB has announced the next milestone in its long-standing collaboration with NVIDIA to develop and scale generative AI...
3 MIN READ
Sep 17, 2024
Accelerating Oracle Database Generative AI Workloads with NVIDIA NIM and NVIDIA cuVS
The vast majority of the world's data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI...
6 MIN READ
Sep 17, 2024
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
12 MIN READ
Sep 16, 2024
Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.22
For the past few months, the NVIDIA Collective Communications Library (NCCL) developers have been working hard on a set of new library features and bug fixes....
8 MIN READ