AI Enterprise
Aug 28, 2024
Boosting Llama 3.1 405B Performance up to 1.44x with NVIDIA TensorRT Model Optimizer on NVIDIA H200 GPUs
The Llama 3.1 405B large language model (LLM), developed by Meta, is an open-source community model that delivers state-of-the-art performance and supports a...
7 MIN READ
Aug 14, 2024
Optimizing Inference Efficiency for LLMs at Scale with NVIDIA NIM Microservices
As large language models (LLMs) continue to evolve at an unprecedented pace, enterprises are looking to build generative AI-powered applications that maximize...
8 MIN READ
Jul 23, 2024
Supercharging Llama 3.1 across NVIDIA Platforms
Meta's Llama large language models are the most popular foundation models in the open-source community today, supporting a variety of use cases....
8 MIN READ
Jul 11, 2024
Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG
The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic...
7 MIN READ
Jun 10, 2024
NVIDIA Text Embedding Model Tops MTEB Leaderboard
The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...
6 MIN READ
Jun 02, 2024
Pegatron Simulates and Optimizes Factory Operations with AI-Enabled Digital Twins
Manufacturers face increased pressures to shorten production cycles, enhance productivity, and improve quality, all while reducing costs. To address these...
5 MIN READ
Jun 02, 2024
Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs
NVIDIA today launched the NVIDIA RTX AI Toolkit, a collection of tools and SDKs for Windows application developers to customize, optimize, and deploy AI models...
8 MIN READ
Mar 21, 2024
Speed Up Your AI Development: NVIDIA AI Workbench Goes GA
NVIDIA AI Workbench, a toolkit for AI and ML developers, is now generally available as a free download. It features automation that removes roadblocks for...
4 MIN READ
Mar 20, 2024
Powering Mission-Critical AI at the Edge with NVIDIA AI Enterprise IGX
NVIDIA SDKs have been instrumental in accelerating AI applications across a spectrum of use cases spanning smart cities, medical, and robotics. However,...
6 MIN READ
Mar 18, 2024
How to Take a RAG Application from Pilot to Production in Four Steps
Generative AI has the potential to transform every industry. Human workers are already using large language models (LLMs) to explain, reason about, and solve...
8 MIN READ
Mar 18, 2024
NVIDIA NIM Offers Optimized Inference Microservices for Deploying AI Models at Scale
The rise in generative AI adoption has been remarkable. Catalyzed by the launch of OpenAI’s ChatGPT in 2022, the new technology amassed over 100M users within...
6 MIN READ
Mar 07, 2024
Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform
Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by...
14 MIN READ
Mar 04, 2024
Solve Complex AI Tasks with Leaderboard-Topping Smaug 72B from NVIDIA AI Foundation Models
This week’s model release features the NVIDIA-optimized language model Smaug 72B, which you can experience directly from your browser. NVIDIA AI Foundation...
2 MIN READ
Feb 27, 2024
Unlock the Power of Small Language Model Phi-2 for Chat, Research, Coding, and More
This week’s model release features the NVIDIA-optimized language model Phi-2, which can be used for a wide range of natural language processing (NLP) tasks....
2 MIN READ
Jan 25, 2024
Advancing Production AI with NVIDIA AI Enterprise
While harnessing the potential of AI is a priority for many of today’s enterprises, developing and deploying an AI model involves time and effort. Often,...
7 MIN READ
Jan 24, 2024
Build Enterprise-Grade AI with NVIDIA AI Software
Following the introduction of ChatGPT, enterprises around the globe are realizing the benefits and capabilities of AI, and are racing to adopt it into their...
6 MIN READ