Generative AI

Sep 13, 2023
New Course: Generative AI Explained
Explore generative AI concepts and applications, along with challenges and opportunities in this self-paced course.
1 MIN READ

Sep 12, 2023
Power Your Business with NVIDIA AI Enterprise 4.0 for Production-Ready Generative AI
Crossing the chasm and reaching its iPhone moment, generative AI must scale to fulfill exponentially increasing demands. Reliability and uptime are critical for...
4 MIN READ

Sep 12, 2023
Generative AI and Accelerated Computing for Spear Phishing Detection
Spear phishing is the largest and most costly form of cyber threat, with an estimated 300,000 reported victims in 2021 representing $44 million in reported...
5 MIN READ

Sep 11, 2023
Accelerating Vector Search: Fine-Tuning GPU Index Algorithms
The first post in this series introduced vector search indexes, explained the role they play in enabling a wide range of important applications, and...
11 MIN READ

Sep 11, 2023
Leading MLPerf Inference v3.1 Results with NVIDIA GH200 Grace Hopper Superchip Debut
AI is transforming computing, and inference is how the capabilities of AI are deployed in the world’s applications. Intelligent chatbots, image and video...
13 MIN READ

Sep 11, 2023
Accelerating Vector Search: Using GPU-Powered Indexes with RAPIDS RAFT
In the AI landscape of 2023, vector search is one of the hottest topics due to its applications in large language models (LLMs) and generative AI. Semantic...
11 MIN READ

Sep 08, 2023
NVIDIA TensorRT-LLM Supercharges Large Language Model Inference on NVIDIA H100 GPUs
Large language models offer incredible new capabilities, expanding the frontier of what is possible with AI. But their large size and unique execution...
10 MIN READ

Sep 01, 2023
Speeding Up Text-To-Speech Diffusion Models by Distillation
Every year, as part of their coursework, students from the University of Warsaw, Poland get to work under the supervision of engineers from the NVIDIA Warsaw...
7 MIN READ

Aug 29, 2023
Streamline Generative AI Development with NVIDIA NeMo on GPU-Accelerated Google Cloud
Generative AI has become a transformative force of our era, empowering organizations spanning every industry to achieve unparalleled levels of productivity,...
9 MIN READ

Aug 11, 2023
Better 3D Meshes, from Reconstruction to Generative AI
Next-generation AI pipelines have shown incredible success in generating high-fidelity 3D models, ranging from reconstructions that produce a scene matching...
3 MIN READ

Aug 10, 2023
Selecting Large Language Model Customization Techniques
Large language models (LLMs) are becoming an integral tool for businesses to improve their operations, customer interactions, and decision-making processes....
12 MIN READ

Aug 08, 2023
Develop and Deploy Scalable Generative AI Models Seamlessly with NVIDIA AI Workbench
Developing custom generative AI models and applications is a journey, not a destination. It begins with selecting a pretrained model, such as a Large Language...
11 MIN READ

Aug 08, 2023
Unlocking the Power of Enterprise-Ready LLMs with NVIDIA NeMo
Generative AI has introduced a new era in computing, one promising to revolutionize human-computer interaction. At the forefront of this technological marvel...
10 MIN READ

Aug 08, 2023
Curating Trillion-Token Datasets: Introducing NVIDIA NeMo Data Curator
The latest developments in large language model (LLM) scaling laws have shown that when scaling the number of model parameters, the number of tokens used for...
8 MIN READ

Aug 04, 2023
Mitigating Stored Prompt Injection Attacks Against LLM Applications
Prompt injection attacks are a hot topic in the new world of large language model (LLM) application security. These attacks are unique due to how malicious...
10 MIN READ

Aug 03, 2023
Securing LLM Systems Against Prompt Injection
Prompt injection is a new attack technique specific to large language models (LLMs) that enables attackers to manipulate the output of the LLM. This attack is...
15 MIN READ