Retrieval-Augmented Generation (RAG)
Jan 16, 2025
How to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails
AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...
15 MIN READ
Dec 18, 2024
A Guide to Retrieval-Augmented Generation for AEC
Large language models (LLMs) are rapidly changing the business landscape, offering new capabilities in natural language processing (NLP), content generation,...
12 MIN READ
Dec 17, 2024
Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage
Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,...
8 MIN READ
Dec 16, 2024
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ
Dec 16, 2024
Insights, Techniques, and Evaluation for LLM-Driven Knowledge Graphs
Data is the lifeblood of modern enterprises, fueling everything from innovation to strategic decision making. However, as organizations amass ever-growing...
15 MIN READ
Dec 16, 2024
An Easy Introduction to Multimodal Retrieval-Augmented Generation for Video and Audio
Building a multimodal retrieval-augmented generation (RAG) system is challenging. The difficulty comes from capturing and indexing information from across...
12 MIN READ
Dec 12, 2024
Integration of NVIDIA BlueField DPUs with WEKA Client Boosts AI Workload Efficiency
WEKA, a pioneer in scalable software-defined data platforms, and NVIDIA are collaborating to unite WEKA's state-of-the-art data platform solutions with powerful...
5 MIN READ
Dec 11, 2024
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ
Nov 22, 2024
Spotlight: TCS Increases Automotive Software Testing Speeds by 2x Using NVIDIA Generative AI
Generative AI is transforming every aspect of the automotive industry, including software development, testing, user experience, personalization, and safety....
8 MIN READ
Nov 20, 2024
Advancing Neuroscience Research with Visual Question Answering and Multimodal Retrieval
Leading healthcare organizations are turning to generative AI to help build applications that can deliver life-saving impacts. These organizations include the...
8 MIN READ
Nov 08, 2024
Transforming Telecom Networks to Manage and Optimize AI Workloads
5G global connections numbered nearly 2 billion earlier this year, and are projected to reach 7.7 billion by 2028. While 5G has delivered faster speeds, higher...
7 MIN READ
Nov 04, 2024
Frictionless Collaboration and Rapid Prototyping in Hybrid Environments with NVIDIA AI Workbench
NVIDIA AI Workbench is a free development environment manager that streamlines data science, AI, and machine learning (ML) projects on systems of choice. The...
10 MIN READ
Oct 28, 2024
Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA
The rapid development of solutions using retrieval-augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...
11 MIN READ
Oct 28, 2024
NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models
Deploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
7 MIN READ
Oct 21, 2024
IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient
Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...
5 MIN READ