featured

Image of a gridded cube with purple and green dots.

May 03, 2024

Explainer: What Is a Vector Database?

A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...

1 MIN READ

May 01, 2024

Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD

With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...

5 MIN READ

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ

3 sessions for data scientists to watch from NVIDIA GTC 2024

Apr 29, 2024

Top Data Science Sessions from NVIDIA GTC 2024 Now Available On Demand

At GTC 2024, experts from NVIDIA and our partners shared insights about GPU-accelerated tools, optimizations, and best practices for data scientists. From the...

2 MIN READ

Three reflective green spheres hovering above three white platforms on a neutral background.

Apr 29, 2024

GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads

Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...

7 MIN READ

Apr 28, 2024

Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server

We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...

9 MIN READ

Apr 26, 2024

Perception Model Training for Autonomous Vehicles with Tensor Parallelism

Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...

10 MIN READ

Apr 26, 2024

New LLM: Snowflake Arctic Model for SQL and Code Generation

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...

3 MIN READ

Apr 26, 2024

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...

10 MIN READ

Apr 25, 2024

Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs

NVIDIA launched the initial release of the Confidential Computing (CC) solution in private preview for early access in July 2023 through NVIDIA LaunchPad....

3 MIN READ

Decorative image of different workflows against a grey background.

Apr 23, 2024

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...

7 MIN READ

Apr 22, 2024

Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D

Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...

7 MIN READ

Apr 22, 2024

Just Released: NVIDIA Modulus v24.04

Modulus v24.04 delivers an optimized CorrDiff model and Earth2Studio for exploring weather AI models.

1 MIN READ

Apr 22, 2024

Developing Virtual Factory Solutions with OpenUSD and NVIDIA Omniverse

With NVIDIA AI, NVIDIA Omniverse, and the Universal Scene Description (OpenUSD) ecosystem, industrial developers are building virtual factory solutions that...

4 MIN READ

Apr 22, 2024

Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API

This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...

4 MIN READ

Photo of a cell tower at sunset among hills with fog.

Apr 22, 2024

Enhanced DU Performance and Workload Consolidation for 5G/6G with NVIDIA Aerial CUDA-Accelerated RAN

Aerial CUDA-Accelerated radio access network (RAN) enables acceleration of telco workloads, delivering new levels of spectral efficiency (SE) on a cloud-native...

14 MIN READ