Amit Bleiweiss

Amit Bleiweiss is a senior solution engineer at NVIDIA, where he focuses on large language models and natural language processing. He has 25 years of experience in applied machine learning and deep learning, with over 50 patents and publications in the domain. Amit received his MSc from Hebrew University of Jerusalem, where he specialized in machine learning.
Amit Bleiweiss

Posts by Amit Bleiweiss

Decorative image of an LLM on a purple background with the text, "Part 2".
Generative AI

Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2

In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM’s tokenizer to form a multilingual tokenizer. In this post, we... 8 MIN READ
Decorative image of an LLM on a purple background with the text, "Part 1".
Generative AI

Training Localized Multilingual LLMs with NVIDIA NeMo, Part 1

In today's globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs)... 14 MIN READ
Decorative image of a globe surrounded by people speaking and texting in different languages, with the text Part 2.
Generative AI

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2

In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,... 11 MIN READ
Decorative image of a globe surrounded by people speaking and texting in different languages, with the text Part 1.
Generative AI

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1

Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of... 8 MIN READ
Decorative image of a RAG pipeline.
Generative AI

Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints

Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more... 13 MIN READ
Generative AI

Tune and Deploy LoRA LLMs with NVIDIA TensorRT-LLM

Large language models (LLMs) have revolutionized natural language processing (NLP) with their ability to learn from massive amounts of text and generate fluent... 15 MIN READ