Posts by Nicole Luo
Generative AI
Nov 13, 2024
Mastering LLM Techniques: Data Preprocessing
The advent of large language models (LLMs) marks a significant shift in how industries leverage AI to enhance operations and services. By automating routine...
14 MIN READ
Generative AI
Jul 10, 2024
Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator
Data curation plays a crucial role in the development of effective and fair large language models (LLMs). High-quality, diverse training data directly...
12 MIN READ
Generative AI
Jul 08, 2024
Deploy Multilingual LLMs with NVIDIA NIM
Multilingual large language models (LLMs) are increasingly important for enterprises operating in today's globalized business landscape. As businesses expand...
9 MIN READ
Generative AI
May 17, 2024
Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2
In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM’s tokenizer to form a multilingual tokenizer. In this post, we...
8 MIN READ
Generative AI
May 17, 2024
Training Localized Multilingual LLMs with NVIDIA NeMo, Part 1
In today's globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs)...
14 MIN READ