Posts by Mehran Maghoumi
Generative AI
Sep 10, 2024
Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator
Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
16 MIN READ
Generative AI
Jul 31, 2024
Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator
In a recent post, we discussed how to use NVIDIA NeMo Curator to curate custom datasets for pretraining or continuous training use cases of large language...
11 MIN READ
Generative AI
May 21, 2024
Curating Custom Datasets for LLM Training with NVIDIA NeMo Curator
Data curation is the first, and arguably the most important, step in the pretraining and continuous training of large language models (LLMs) and small language...
14 MIN READ
Generative AI
Mar 27, 2024
Scale and Curate High-Quality Datasets for LLM Training with NVIDIA NeMo Curator
Enterprises are using large language models (LLMs) as powerful tools to improve operational efficiency and drive innovation. NVIDIA NeMo microservices aim to...
6 MIN READ