NIM

Feb 05, 2025
Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM
Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...
8 MIN READ

Feb 05, 2025
Streamline Collaboration Across Local and Cloud Systems with NVIDIA AI Workbench
NVIDIA AI Workbench is a free development environment manager to develop, customize, and prototype AI applications on your GPUs. AI Workbench provides a...
8 MIN READ

Jan 30, 2025
New NVIDIA AI Blueprint: Build a Customizable RAG Pipeline
Connect AI applications to enterprise data using embedding and reranking models for information retrieval.
1 MIN READ

Jan 24, 2025
Optimize AI Inference Performance with NVIDIA Full-Stack Solutions
The explosion of AI-driven applications has placed unprecedented demands on both developers, who must balance delivering cutting-edge performance with managing...
9 MIN READ

Jan 22, 2025
Horizontal Autoscaling of NVIDIA NIM Microservices on Kubernetes
NVIDIA NIM microservices are model inference containers that can be deployed on Kubernetes. In a production environment, it’s important to understand the...
8 MIN READ

Jan 21, 2025
Lessons Learned from Building an AI Sales Assistant
At NVIDIA, the Sales Operations team equips the Sales team with the tools and resources needed to bring cutting-edge hardware and software to market. Managing...
10 MIN READ

Jan 16, 2025
How to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails
AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...
15 MIN READ

Jan 13, 2025
Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design
Designing a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...
4 MIN READ

Jan 06, 2025
One-Click Deployments for the Best of NVIDIA AI with NVIDIA Launchables
AI development has become a core part of modern software engineering, and NVIDIA is committed to finding ways to bring optimized accelerated computing to every...
6 MIN READ

Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ

Dec 20, 2024
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ

Dec 18, 2024
A Guide to Retrieval-Augmented Generation for AEC
Large language models (LLMs) are rapidly changing the business landscape, offering new capabilities in natural language processing (NLP), content generation,...
12 MIN READ

Dec 17, 2024
Fine-Tuning Small Language Models to Optimize Code Review Accuracy
Generative AI is transforming enterprises by driving innovation and boosting efficiency across numerous applications. However, adopting large foundational...
15 MIN READ

Dec 17, 2024
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ

Dec 17, 2024
Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage
Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,...
8 MIN READ

Dec 16, 2024
Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization
2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
4 MIN READ