Posts by Nicole Luo
Generative AI / LLMs
Jul 10, 2024
Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator
Data curation plays a crucial role in the development of effective and fair large language models (LLMs). High-quality, diverse training data directly...
12 MIN READ
Generative AI / LLMs
Jul 08, 2024
Deploy Multilingual LLMs with NVIDIA NIM
Multilingual large language models (LLMs) are increasingly important for enterprises operating in today's globalized business landscape. As businesses expand...
9 MIN READ
Generative AI / LLMs
May 17, 2024
Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2
In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM’s tokenizer to form a multilingual tokenizer. In this post, we...
8 MIN READ
Generative AI / LLMs
May 17, 2024
Training Localized Multilingual LLMs with NVIDIA NeMo, Part 1
In today's globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs)...
14 MIN READ