Chatbot
Dec 06, 2024
Content Moderation and Safety Checks with NVIDIA NeMo Guardrails
Content moderation has become essential in retrieval-augmented generation (RAG) applications powered by generative AI, given the extensive volume of...
10 MIN READ
Nov 11, 2024
Developing a 172B LLM with Strong Japanese Capabilities Using NVIDIA Megatron-LM
Generative AI has the ability to create entirely new content that traditional machine learning (ML) methods struggle to produce. In the field of natural...
6 MIN READ
Oct 23, 2024
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Oct 21, 2024
IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient
Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...
5 MIN READ
Sep 27, 2024
AI Chatbot Delivers Multilingual Support to African Farmers
Some of Africa’s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot that gives detailed...
4 MIN READ
Sep 26, 2024
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
8 MIN READ
Aug 13, 2024
New NIM Available: Mistral Large 2 Instruct LLM
The new model by Mistral excels at a variety of complex tasks including text summarization, multilingual translation and reasoning, programming, question and...
1 MIN READ
Jul 22, 2024
Gets Hands-On Training at SIGGRAPH 2024
Complimentary trainings on OpenUSD, Digital Humans, LLMs and more with hands-on labs for Full Conference and Experience attendees.
1 MIN READ
Jun 26, 2024
Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA
Experience and test Llama3-ChatQA models at scale with performance optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.
1 MIN READ
Feb 27, 2024
Video: Build a RAG-Powered Chatbot in Five Minutes
Retrieval-augmented generation (RAG) is exploding in popularity as a technique for boosting large language model (LLM) application performance. From highly...
2 MIN READ
Feb 06, 2024
Top Retrieval-Augmented Generation (RAG) Sessions at NVIDIA GTC 2024 Sessions
Join us in-person or virtually and learn about the power of RAG with insights and best practices from experts at NVIDIA, visionary CEOs, data scientists, and...
1 MIN READ
May 02, 2023
How Speech Recognition Improves Customer Service in Telecommunications
The telecommunication industry has seen a proliferation of AI-powered technologies in recent years, with speech recognition and translation leading the charge....
7 MIN READ
Apr 26, 2023
An Introduction to Large Language Models: Prompt Engineering and P-Tuning
ChatGPT has made quite an impression. Users are excited to use the AI chatbot to ask questions, write poems, imbue a persona for interaction, act as a personal...
10 MIN READ
Apr 25, 2023
Increasing Inference Acceleration of KoGPT with NVIDIA FasterTransformer
Transformers are one of the most influential AI model architectures today and are shaping the direction of future AI R&D. First invented as a tool for...
6 MIN READ
Apr 25, 2023
NVIDIA Enables Trustworthy, Safe, and Secure Large Language Model Conversational Systems
Large language models (LLMs) are incredibly powerful and capable of answering complex questions, performing feats of creative writing, developing, debugging...
7 MIN READ
Mar 13, 2023
Power-Up Your Skills and Credentials at NVIDIA GTC 2023
Last August, I wrote a post about GTC that asked, ‘What if you could spend 8 hours with an AI legend while getting hands-on experience using some of the most...
6 MIN READ