Generative AI / LLMs
May 01, 2024
Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD
With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...
5 MIN READ
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Apr 26, 2024
New LLM: Snowflake Arctic Model for SQL and Code Generation
Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
Apr 26, 2024
Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo
Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...
10 MIN READ
Apr 25, 2024
Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs
NVIDIA launched the initial release of the Confidential Computing (CC) solution in private preview for early access in July 2023 through NVIDIA LaunchPad....
3 MIN READ
Apr 23, 2024
Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud
GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...
7 MIN READ
Apr 23, 2024
Webinar: Enhance LLMs with RAG and Accelerate Enterprise AI with Pure Storage and NVIDIA
Join Pure Storage and NVIDIA on April 25 to discover the benefits of enhancing LLMs with RAG for enterprise-scale generative AI applications.
1 MIN READ
Apr 22, 2024
Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D
Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...
7 MIN READ
Apr 22, 2024
Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API
This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...
4 MIN READ
Apr 22, 2024
Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...
9 MIN READ
Apr 18, 2024
Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT
NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...
6 MIN READ
Apr 18, 2024
Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models
NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...
6 MIN READ
Apr 17, 2024
Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec
This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...
16 MIN READ
Apr 10, 2024
How Generative AI is Empowering Climate Tech with NVIDIA Earth-2
In the context of global warming, NVIDIA Earth-2 has emerged as a pivotal platform for climate tech, generating actionable insights in the face of increasingly...
14 MIN READ
Apr 05, 2024
Explainer: What Is Retrieval-Augmented Generation?
Retrieval-augmented generation enhances large language model prompts with relevant data for more practical, accurate responses.
1 MIN READ
Apr 03, 2024
New Lab: Generative AI Inference with NVIDIA NIM
Get started with NVIDIA NIM for deploying large language models (LLMs). Request access to a free, hands-on lab today.
1 MIN READ