NeMo

Decorative image of an LLM on a purple background with the text, "Part 2".

May 17, 2024

Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2

In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM’s tokenizer to form a multilingual tokenizer. In this post, we...

8 MIN READ

Decorative image of an LLM on a purple background with the text, "Part 1".

May 17, 2024

Training Localized Multilingual LLMs with NVIDIA NeMo, Part 1

In today's globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs)...

14 MIN READ

May 15, 2024

Develop Secure, Reliable Medical Apps with RAG and NVIDIA NeMo Guardrails

Imagine an application that can sift through mountains of patient data, intelligently searching and answering questions about diagnoses, health histories, and...

6 MIN READ

Decorative image of a globe surrounded by people speaking and texting in different languages, with the text Part 2.

May 13, 2024

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2

In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,...

11 MIN READ

Decorative image of a globe surrounded by people speaking and texting in different languages, with the text Part 1.

May 13, 2024

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1

Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of...

8 MIN READ

Decorative image of multimodal RAG workflow.

May 12, 2024

Advanced AI and Retrieval-Augmented Generation for Code Development in High-Performance Computing

In the rapidly evolving field of software development, AI tools such as chatbots and GitHub Copilot have significantly transformed how developers write and...

8 MIN READ

May 08, 2024

Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available

In the fast-evolving landscape of generative AI, the demand for accelerated inference speed remains a pressing concern. With the exponential growth in model...

9 MIN READ

nearly 100 training labs from GTC available on demand

May 07, 2024

NVIDIA GTC Training Labs On Demand Available Now

Missed GTC or want to replay your favorite training labs? Find it on demand with the NVIDIA GTC Training Labs playlist.

1 MIN READ

Image of a gridded cube with purple and green dots.

May 03, 2024

Explainer: What Is a Vector Database?

A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...

1 MIN READ

Apr 28, 2024

Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server

We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...

9 MIN READ

Apr 26, 2024

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...

10 MIN READ

Apr 23, 2024

Webinar: Enhance LLMs with RAG and Accelerate Enterprise AI with Pure Storage and NVIDIA

Join Pure Storage and NVIDIA on April 25 to discover the benefits of enhancing LLMs with RAG for enterprise-scale generative AI applications.

1 MIN READ

Apr 22, 2024

Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API

This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...

4 MIN READ

Decorative image of text and speech recognition processes encircling the globe.

Apr 18, 2024

New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model

NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...

4 MIN READ

Apr 18, 2024

Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT

NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...

6 MIN READ

Image of two people sitting in their cubicles with speech recognition visualizations in the background.

Apr 18, 2024

Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models

NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...

6 MIN READ