featured
May 03, 2024
Explainer: What Is a Vector Database?
A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...
1 MIN READ
May 01, 2024
Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD
With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...
5 MIN READ
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ
Apr 29, 2024
Top Data Science Sessions from NVIDIA GTC 2024 Now Available On Demand
At GTC 2024, experts from NVIDIA and our partners shared insights about GPU-accelerated tools, optimizations, and best practices for data scientists. From the...
2 MIN READ
Apr 29, 2024
GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads
Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...
7 MIN READ
Apr 28, 2024
Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...
9 MIN READ
Apr 26, 2024
Perception Model Training for Autonomous Vehicles with Tensor Parallelism
Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...
10 MIN READ
Apr 26, 2024
New LLM: Snowflake Arctic Model for SQL and Code Generation
Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...
3 MIN READ
Apr 26, 2024
Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo
Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...
10 MIN READ
Apr 25, 2024
Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs
NVIDIA launched the initial release of the Confidential Computing (CC) solution in private preview for early access in July 2023 through NVIDIA LaunchPad....
3 MIN READ
Apr 23, 2024
Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud
GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...
7 MIN READ
Apr 22, 2024
Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D
Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...
7 MIN READ
Apr 22, 2024
Just Released: NVIDIA Modulus v24.04
Modulus v24.04 delivers an optimized CorrDiff model and Earth2Studio for exploring weather AI models.
1 MIN READ
Apr 22, 2024
Developing Virtual Factory Solutions with OpenUSD and NVIDIA Omniverse
With NVIDIA AI, NVIDIA Omniverse, and the Universal Scene Description (OpenUSD) ecosystem, industrial developers are building virtual factory solutions that...
4 MIN READ
Apr 22, 2024
Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API
This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...
4 MIN READ
Apr 22, 2024
Enhanced DU Performance and Workload Consolidation for 5G/6G with NVIDIA Aerial CUDA-Accelerated RAN
Aerial CUDA-Accelerated radio access network (RAN) enables acceleration of telco workloads, delivering new levels of spectral efficiency (SE) on a cloud-native...
14 MIN READ