General

May 08, 2024

Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available

In the fast-evolving landscape of generative AI, the demand for accelerated inference speed remains a pressing concern. With the exponential growth in model...

9 MIN READ

May 07, 2024

NVIDIA GTC Training Labs On Demand Available Now

Missed GTC or want to replay your favorite training labs? Find them on demand with the NVIDIA GTC Training Labs playlist.

1 MIN READ

May 07, 2024

Enhancing Anomaly Detection in Linux Audit Logs with AI

In cybersecurity, identifying threats swiftly and accurately is paramount to the success of the modern enterprise. Linux audit logs, which record system...

10 MIN READ

May 06, 2024

Intelligent Traffic Offload with Enhanced AI-powered 5G Security for Enterprises

Securing the private 5G and applications at the edge comes with many challenges. Sophisticated AI- and ML-based attack campaigns require security to respond...

6 MIN READ

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ

Apr 29, 2024

Top Data Science Sessions from NVIDIA GTC 2024 Now Available On Demand

At GTC 2024, experts from NVIDIA and our partners shared insights about GPU-accelerated tools, optimizations, and best practices for data scientists. From the...

2 MIN READ

Apr 29, 2024

GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads

Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...

7 MIN READ

Apr 28, 2024

Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server

We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...

9 MIN READ

Apr 26, 2024

New LLM: Snowflake Arctic Model for SQL and Code Generation

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...

3 MIN READ

Apr 26, 2024

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...

10 MIN READ

Apr 25, 2024

Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs

NVIDIA launched the initial release of the Confidential Computing (CC) solution in private preview for early access in July 2023 through NVIDIA LaunchPad....

3 MIN READ

Apr 23, 2024

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...

7 MIN READ

Apr 23, 2024

Webinar: Enhance LLMs with RAG and Accelerate Enterprise AI with Pure Storage and NVIDIA

Join Pure Storage and NVIDIA on April 25 to discover the benefits of enhancing LLMs with RAG for enterprise-scale generative AI applications.

1 MIN READ

Apr 22, 2024

Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API

This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...

4 MIN READ

Apr 18, 2024

New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model

NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...

4 MIN READ