NVIDIA NeMo Customizer for Developers

NVIDIA NeMo™ Customizer is a high-performance, scalable microservice that simplifies fine-tuning and alignment of generative AI models for building domain-specific AI agents. Through an API-first approach, this microservice supports popular customization and post-training techniques such as low-rank adaptation (LoRA), full supervised fine-tuning (SFT), direct preference optimization (DPO), and Group Relative Policy Optimization (GRPO) with continued integration of the latest customization and alignment techniques. 

For broader reinforcement learning support with advanced RL algorithms and large-scale post-training, explore the open-source NeMo RL library, part of the NeMo framework.

NeMo Customizer, part of the NVIDIA NeMo software suite for managing the AI agent lifecycle, enables developers to seamlessly build data flywheels that continuously optimize AI agents for improved performance, efficiency, and cost.

Download NowRead Documentation


See NVIDIA NeMo Customizer in Action

Learn how NeMo Customizer enables developers to fine-tune large language models using domain-specific data—enabling the creation of tailored AI agents for tasks such as customer support, healthcare insights, enterprise automation, and many other applications.


How NVIDIA NeMo Customizer Works

NeMo Customizer provides an easy-to-use API that lets you customize generative AI models. Simply provide the dataset, model name, hyperparameters, and type of customization in the API payload. NeMo Customizer will initiate a job to tune the model, resulting in a customized version.

The architecture diagram below illustrates the flow for using NeMo Customizer, letting you seamlessly launch multiple customization jobs. In the depicted scenario, you can utilize NeMo Customizer to create two customization workflows: one for fine-tuning and one for alignment tuning. These outputs, along with NVIDIA NIM™, allow you to deploy a customized model tailored to your specific use case. 

NeMo Customizer currently supports DPO and GRPO for reinforcement learning (RL). For broader RL support with advanced algorithms and large-scale post-training, explore the open-source NeMo RL library part of the NeMo framework.

 A flowchart of how NVIDIA NeMo Customizer works

Introductory Blog

Read how NeMo Customizer simplifies the alignment and customization of generative AI models.

Tutorials

Explore tutorials designed to help you build custom generative AI models with the NeMo Customizer microservice.

Introductory Webinar

Learn how data flywheels enhance self-improving agentic AI systems and explore best practices for integrating NeMo components to optimize agent performance and cost-efficiency.

Watch Now

How-To Blog

Dive deeper into how NVIDIA NeMo microservices help build data flywheels with a case study and a quick overview of the steps in an end-to-end pipeline.


Ways to Get Started With NVIDIA NeMo Customizer

Get started with NeMo Customizer to simplify fine-tuning and alignment of large language models (LLMs) for domain-specific use cases, and for broader RL support with advanced algorithms and large-scale post-training, explore the open-source NeMo RL library, part of the NeMo framework.

Download icon

Download

Get free access to the NeMo Customizer microservice for research, development, and testing.

Download Microservices
Blueprint icon

Try

Jump-start building your AI solutions with NVIDIA AI Blueprints, customizable reference applications, available on the NVIDIA API catalog.

Try the Blueprint


Performance

NeMo Customizer uses several parallelism techniques to reduce the training time for large models with support for multi-GPU and multi-node infrastructure. These methods operate together to enhance the training process, ensuring optimal use of resources and improved training performance.

Experience 1.8x Faster Customization With NeMo Customizer

A chart showing 2x faster customization with NeMo Customizer

The benchmark represents customizing Llama-3-8B on one 8xH100 80G SXM with sequence packing (4096 pack size, 0.9958 packing efficiency).
On: customized with NeMo Customizer.
Off: customized with leading market alternatives.


Starter Kits

Start tuning your generative AI models with NeMo Customizer by accessing tutorials, best practices, and documentation for various use cases.

Customizing LLMs

Get started with popular customization techniques, such as LoRA, SFT, and p-tuning.

Data Flywheel

Enable self-improving agentic AI workflows by automating model optimization.

NeMo RL

Open-source library with support for advanced reinforcement learning algorithms and large-scale post-training of LLMs.


NVIDIA NeMo Customizer Learning Library


More Resources

Explore the Community

Get Training and Certification

Meet the Program for Startups

Ethical AI

NVIDIA’s platforms and application frameworks enable developers to build a wide array of AI applications. Consider potential algorithmic bias when choosing or creating the models being deployed. Work with the model’s developer to ensure that it meets the requirements for the relevant industry and use case; that the necessary instruction and documentation are provided to understand error rates, confidence intervals, and results; and that the model is being used under the conditions and in the manner intended.

Get started with NeMo Customizer today.

Download Now