NIM
Jan 16, 2025
How to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails
AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...
15 MIN READ
Jan 13, 2025
Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design
Designing a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...
4 MIN READ
Jan 06, 2025
One-Click Deployments for the Best of NVIDIA AI with NVIDIA Launchables
AI development has become a core part of modern software engineering, and NVIDIA is committed to finding ways to bring optimized accelerated computing to every...
6 MIN READ
Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Dec 20, 2024
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Dec 18, 2024
A Guide to Retrieval-Augmented Generation for AEC
Large language models (LLMs) are rapidly changing the business landscape, offering new capabilities in natural language processing (NLP), content generation,...
12 MIN READ
Dec 17, 2024
Fine-Tuning Small Language Models to Optimize Code Review Accuracy
Generative AI is transforming enterprises by driving innovation and boosting efficiency across numerous applications. However, adopting large foundational...
15 MIN READ
Dec 17, 2024
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ
Dec 17, 2024
Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage
Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,...
8 MIN READ
Dec 16, 2024
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ
Dec 12, 2024
Advancing Solar Irradiance Prediction with NVIDIA Earth-2
As global electricity demand continues to rise, traditional sources of energy are increasingly unsustainable. Energy providers are facing pressure to reduce...
9 MIN READ
Dec 11, 2024
Deploying NVIDIA H200 NVL at Scale with New Enterprise Reference Architecture
Last month at the Supercomputing 2024 conference, NVIDIA announced the availability of NVIDIA H200 NVL, the latest NVIDIA Hopper platform. Optimized for...
8 MIN READ
Dec 11, 2024
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Dec 09, 2024
Just Released: NVIDIA VILA VLM
Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
1 MIN READ
Dec 06, 2024
Content Moderation and Safety Checks with NVIDIA NeMo Guardrails
Content moderation has become essential in retrieval-augmented generation (RAG) applications powered by generative AI, given the extensive volume of...
10 MIN READ
Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ