Agentic AI / Generative AI

Deliver Personalized Retail Experiences with an AI-Powered Shopping Advisor

Aug 01, 2024

By Cynthia Countouris and Ruchika Kharwar

Discuss (0)

AI-Generated Summary

Dislike

The NVIDIA retail shopping advisor is a prebuilt AI workflow that provides a reference design for developing a retrieval-augmented generation application with large language models that can ingest product catalog data and offer human-like answers to customer inquiries.
This AI workflow includes a RAG model that leverages up-to-date product data, a sample dataset from the NVIDIA Employee Gear Store, and NVIDIA NIM microservices to ensure rapid deployment and optimized model performance.
Developers can use the provided JupyterLab Notebook server and sample notebook to prototype and experiment with their own data, and can deploy the application using NVIDIA NIM and a GPU-optimized Milvus Database to store vector embeddings.

AI-generated content may summarize information incompletely. Verify important information. Learn more

Imagine being able to put your best sales associate in front of every customer for every interaction. Your best sales associate offers product recommendations and how-to guidance, and explains related products customers might not have considered.

The NVIDIA retail shopping advisor, introduced in this post, is a prebuilt, end-to-end AI workflow. It provides a reference design that demonstrates how to develop a retrieval-augmented generation (RAG) application with large language models (LLMs) that can ingest product catalog data and use some of the latest generative AI features to provide a differentiated experience delivering contextually accurate, human-like answers to customers’ inquiries and recommendation requests.

Video 1. Watch a demo of the NVIDIA retail shopping advisor

Retail shopping advisor AI workflow

This retail shopping advisor AI workflow provides enterprises with a fast and advanced way to go from pilot to business value. It includes everything needed to make a consumer shopping experience conversational, precise, and accurate.

The retail shopping advisor reference architecture includes a RAG model that can leverage the most up to date product data when answering customer questions. Also included is a sample dataset of product data from the NVIDIA Employee Gear Store that represents a product catalog. You can use this reference example for adding your own product catalog and related data to create an interactive shopping advisor for your business.

Included with NVIDIA AI Enterprise, NVIDIA NIM microservices ensure rapid, enterprise-grade deployment and optimized model performance. The NVIDIA NeMo Retriever collection of NIM microservices enhance traditional LLM capabilities by effectively using a broad spectrum of enterprise data. This is just a small subset of the suite of software that developers can select from NVIDIA when constructing a shopping advisor application.

NVIDIA NIM is designed to streamline the deployment of generative AI applications, while ensuring security and scalability—key to successful enterprise adoption. NIM encapsulates models and integration code, which are deployed through a Kubernetes Helm chart. These can be deployed on the infrastructure of choice—on-premises or through a Cloud Service Provider. The setup of a NIM provides a path to elevate generative AI applications from a proof-of-concept to production, enabling “zero to inference in just 5 minutes.”

NeMo Retriever contains state-of-the-art, commercially-ready models for retrieval embedding and reranking. Access these microservices through the NVIDIA API catalog. The NVIDIA retail shopping advisor uses a GPU-optimized Milvus Database to store vector embeddings.

With NIM and NeMo Retriever microservices, you can construct your own retail shopping advisor application that can access data in real time, and is optimized to achieve greater recall compared to strictly open source frameworks. The performance benefits of NVIDIA NIM simplify solving a major customer issue of acquiring quality responses to lengthier search queries.

Follow along with a Jupyter Notebook

Included within this workflow is a JupyterLab Notebook server with a sample notebook that shows the solution’s features, so you can quickly prototype and experiment with your own data. Specifically, it covers how to:

Use LLMs
Use LLMs with retail product data
Create embeddings from product information
Use those embeddings to retrieve products most similar to a given query
Empower LLMs to make decisions, such as respond normally, or use tools like search or shopping cart APIs
Collect these pieces into a single ProductAdvisor utility class
Deploy this in a FastAPI backend and interact with that backend through a chat box in a React application

Get started

Learn more about how to build your own retail shopping advisor. When you’re ready to get started, apply for a 90-day free subscription to access the retail shopping advisor AI workflow for free. To experience additional NVIDIA NIM microservices, visit the NVIDIA API catalog.

For more details about how NVIDIA is helping retailers across novel use cases, check out the on-demand GTC session, How NVIDIA Accelerates Retailers on the Gen AI Journey.

To learn more about building RAG applications at your enterprise, visit NVIDIA/GenerativeAIExamples on GitHub. These examples will help you create your own shopping advisor that can accurately answer domain-specific questions around your enterprise’s products using the most viable information.

Discuss (0)

About the Authors

About Cynthia Countouris
A seasoned Silicon Valley executive, Cynthia has been at the forefront of emerging technologies, overseeing both product and marketing functions, and translating innovation into practical solutions. Her passion for the nexus of technology and human behavior started at Carnegie Mellon University where she gained her BS and MS, and continues with evangelizing the potential and power of AI for the Retail, CPG and QSR industries at NVIDIA. Prior to NVIDIA, Cynthia was SVP of product and marketing for MomentFeed, a leading mobile consumer engagement platform. She has served as executive and product leader for several Silicon Valley firms, including the founder of marketing automation, MarketFirst Software. Prior to MarketFirst, Cynthia led marketing technology and IT groups for industry leaders Autodesk and Silicon Graphics.

View all posts by Cynthia Countouris

About Ruchika Kharwar
Ruchika Kharwar is a product manager at NVIDIA with extensive experience in cloud computing and systems engineering. Before this, she was a Customer Engineer at Google, driving customer success and the adoption of Google Cloud Platform for nearly 6 years. Her background also includes roles at RedHat as a Cloud Success Architect and at Texas Instruments in software development. Ruchika has earned certifications like Google Cloud Certified Professional Cloud Architect, Data Engineer, and more. She excels at bridging technical and customer-facing roles to ensure successful technology adoption and customer satisfaction.

View all posts by Ruchika Kharwar