LangChain
Jul 30, 2024
Enhancing RAG Pipelines with Re-Ranking
In the rapidly evolving landscape of AI-driven applications, re-ranking has emerged as a pivotal technique to enhance the precision and relevance of enterprise...
8 MIN READ
May 31, 2024
Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails
An easily deployable reference architecture can help developers get to production faster with custom LLM use cases. LangChain Templates are a new way of...
7 MIN READ
May 08, 2024
Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints
Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more...
13 MIN READ
Apr 26, 2024
New LLM: Snowflake Arctic Model for SQL and Code Generation
Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
Apr 22, 2024
Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API
This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...
4 MIN READ
Mar 27, 2024
Develop Custom Enterprise Generative AI with NVIDIA NeMo
Generative AI is transforming computing, paving new avenues for humans to interact with computers in natural, intuitive ways. For enterprises, the prospect of...
14 MIN READ
Mar 18, 2024
How to Take a RAG Application from Pilot to Production in Four Steps
Generative AI has the potential to transform every industry. Human workers are already using large language models (LLMs) to explain, reason about, and solve...
8 MIN READ
Feb 27, 2024
Video: Build a RAG-Powered Chatbot in Five Minutes
Retrieval-augmented generation (RAG) is exploding in popularity as a technique for boosting large language model (LLM) application performance. From highly...
2 MIN READ
Feb 23, 2024
Evaluating Retriever for Enterprise-Grade RAG
The conversation about designing and evaluating Retrieval-Augmented Generation (RAG) systems is a long, multi-faceted discussion. Even when we look at retrieval...
14 MIN READ
Feb 21, 2024
Build an LLM-Powered API Agent for Task Execution
Developers have long been building interfaces like web apps to enable users to leverage the core products being built. To learn how to work with data in your...
10 MIN READ
Jan 04, 2024
Accelerating Inference on End-to-End Workflows with H2O.ai and NVIDIA
Data scientists are combining generative AI and predictive analytics to build the next generation of AI applications. In financial services, AI modeling and...
14 MIN READ
Dec 18, 2023
RAG 101: Retrieval-Augmented Generation Questions Answered
Data scientists, AI engineers, MLOps engineers, and IT infrastructure professionals must consider a variety of factors when designing and deploying a RAG...
10 MIN READ
Dec 18, 2023
RAG 101: Demystifying Retrieval-Augmented Generation Pipelines
Large language models (LLMs) have impressed the world with their unprecedented capabilities to comprehend and generate human-like responses. Their chat...
6 MIN READ
Dec 04, 2023
Create Lifelike Avatars with AI Animation and Speech Features in NVIDIA ACE
NVIDIA today unveiled major upgrades to the NVIDIA Avatar Cloud Engine (ACE) suite of technologies, bringing enhanced realism and accessibility to AI-powered...
3 MIN READ
Nov 30, 2023
Building Your First LLM Agent Application
When building a large language model (LLM) agent application, there are four key components you need: an agent core, a memory module, agent tools, and a...
10 MIN READ
Nov 15, 2023
Deploy Large Language Models at the Edge with NVIDIA IGX Orin Developer Kit
As large language models (LLMs) become more powerful and techniques for reducing their computational requirements mature, two compelling questions emerge....
9 MIN READ