AI Models
Explore and deploy top AI models built by the community, accelerated by NVIDIA’s AI inference platform, and run on NVIDIA-accelerated infrastructure.
DeepSeek
DeepSeek is a family of open-source models, several of which use a mixture-of-experts (MoE) architecture to deliver advanced reasoning capabilities. DeepSeek models can be optimized for data center deployments with TensorRT-LLM. You can try the models for yourself with NIM or customize them with the open-source NeMo framework.
Explore
Explore sample applications to learn about different use cases for DeepSeek models.
Integrate
Get started with the right tools and frameworks for your development environment.
Optimize
Optimize inference workloads for LLMs with TensorRT-LLM. Learn how to set up and get started using DeepSeek in TensorRT-LLM.
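For orientation, here is a minimal sketch of TensorRT-LLM's high-level LLM API. The distilled DeepSeek-R1 checkpoint is an illustrative stand-in (the full MoE model needs a multi-GPU deployment recipe), and exact arguments can vary by release:

```python
# pip install tensorrt_llm  (a minimal sketch of the high-level LLM API)
from tensorrt_llm import LLM, SamplingParams

# A distilled DeepSeek-R1 checkpoint keeps this runnable on a single GPU.
llm = LLM(model="deepseek-ai/DeepSeek-R1-Distill-Qwen-7B")

prompts = ["Why does attention scale quadratically with sequence length?"]
for output in llm.generate(prompts, SamplingParams(max_tokens=256, temperature=0.6)):
    print(output.outputs[0].text)
```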
Quantize DeepSeek R1 to FP4 With TensorRT Model Optimizer
TensorRT Model Optimizer now has an experimental feature to deploy to vLLM. Check out the workflow.
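Below is a minimal post-training-quantization sketch using Model Optimizer's PyTorch API. The distilled checkpoint, the calibration prompts, and the NVFP4_DEFAULT_CFG config name are assumptions that may vary by release; the published FP4 recipe for the full DeepSeek R1 is more involved:

```python
# pip install nvidia-modelopt  (a minimal PTQ sketch, not the full DeepSeek R1 recipe)
import modelopt.torch.quantization as mtq
from transformers import AutoModelForCausalLM, AutoTokenizer

# A small distilled checkpoint keeps this runnable on one GPU; quantizing the
# full 671B MoE model follows a multi-GPU recipe in the Model Optimizer docs.
model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)

def forward_loop(m):
    # Calibration pass: run a handful of representative prompts through the model.
    for prompt in ["What is 17 * 23?", "Summarize the CUDA programming model."]:
        inputs = tokenizer(prompt, return_tensors="pt").to(m.device)
        m(**inputs)

# NVFP4_DEFAULT_CFG is the FP4 config name in recent releases; verify it
# against your installed version before relying on it.
model = mtq.quantize(model, mtq.NVFP4_DEFAULT_CFG, forward_loop)
```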
Get started with the models for your development environment.
Get Production-Ready DeepSeek Models With NVIDIA NIM
Rapid prototyping is just an API call away.
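For example, a hosted DeepSeek-R1 endpoint on the NVIDIA API Catalog can be called through the OpenAI-compatible client; the API key placeholder below is generated at build.nvidia.com:

```python
# pip install openai  (the API Catalog exposes an OpenAI-compatible endpoint)
from openai import OpenAI

client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",
    api_key="<NVIDIA_API_KEY>",  # generate a key at build.nvidia.com
)

completion = client.chat.completions.create(
    model="deepseek-ai/deepseek-r1",
    messages=[{"role": "user", "content": "Prove that the square root of 2 is irrational."}],
    temperature=0.6,
    max_tokens=1024,
)
print(completion.choices[0].message.content)
```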
NVIDIA DeepSeek R1 FP4
NVIDIA DeepSeek R1 FP4 is a quantized version of DeepSeek R1, an autoregressive language model that uses an optimized transformer architecture. The model is quantized with TensorRT Model Optimizer.
DeepSeek on Ollama
Ollama lets you deploy DeepSeek quickly to all your GPUs.
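A minimal sketch with the ollama Python client, assuming the deepseek-r1 model tag has already been pulled locally:

```python
# pip install ollama  (assumes `ollama pull deepseek-r1` has been run first)
import ollama

response = ollama.chat(
    model="deepseek-r1",
    messages=[{"role": "user", "content": "Explain mixture-of-experts routing briefly."}],
)
print(response["message"]["content"])
```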
Gemma
Gemma is Google DeepMind’s family of lightweight, open models. Gemma models span a variety of sizes and specialized domains to meet each developer's unique needs. NVIDIA has worked with Google to enable these models to run optimally across NVIDIA platforms, ensuring you get maximum performance on your hardware, from data center GPUs based on the NVIDIA Blackwell and NVIDIA Hopper architectures to Windows RTX and Jetson devices. Enterprise customers can deploy optimized containers using NVIDIA NIM microservices for production-grade support and customize using the end-to-end NeMo framework. With the latest release of Gemma 3n, these models are now natively multilingual and multimodal for your text, image, video, and audio data.
Explore
Explore sample applications to learn about different use cases for Gemma models.
Integrate
Use Gemma on your devices and make it your own.
Read the Blog: Run Google DeepMind’s Gemma 3n on NVIDIA Jetson and RTX
Optimize
Optimize inference workloads for LLMs with TensorRT-LLM. Learn how to set up and get started using Gemma in TensorRT-LLM.
Read the Blog: NVIDIA TensorRT-LLM Revs Up Inference for Google Gemma
Get started with the models for your development environment.
Get Started With Gemma Models With NVIDIA NIM
Gemma 3 is now featured on the NVIDIA API Catalog, enabling rapid prototyping with just an API call.
Gemma 3 Models on Ollama
Ollama lets you start experimenting in seconds with the most capable Gemma model that runs on a single NVIDIA H100 Tensor Core GPU.
Gemma-2b-it ONNX INT4
The Gemma-2b-it ONNX INT4 model is quantized with TensorRT Model Optimizer. Easily fine-tune and adapt the model to your unique requirements with Hugging Face’s Transformers library or your preferred development environment.
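As a starting point, here is a hedged sketch of loading the base instruction-tuned Gemma checkpoint with Transformers. The INT4 ONNX artifact itself is deployed through ONNX runtimes; fine-tuning and adaptation typically begin from the base weights assumed below:

```python
# pip install transformers accelerate  (loads the base instruction-tuned checkpoint)
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2b-it"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# Gemma's chat template wraps the prompt in the expected turn markers.
inputs = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Write a haiku about GPUs."}],
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```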
gpt-oss
NVIDIA and OpenAI began pushing the boundaries of AI with the launch of NVIDIA DGX™ back in 2016. That collaborative AI innovation continues with the launch of OpenAI's gpt-oss-20b and gpt-oss-120b. NVIDIA has optimized both new open-weight models for accelerated inference performance on the NVIDIA Blackwell architecture, delivering up to 1.5 million tokens per second (TPS) on an NVIDIA GB200 NVL72 system.
Explore
Explore open models and samples to learn about different use cases for NVIDIA-optimized gpt-oss models.
NVIDIA Launchable: Optimizing Inference With NVIDIA TensorRT-LLM
Integrate
Get started with the right tools and frameworks for your development environment, leveraging open gpt-oss models.
Optimize
NVIDIA has optimized both new open-weight models for accelerated inference performance on the NVIDIA Blackwell architecture.
Get started with the models for your development environment.
Explore gpt-oss models on Hugging Face
NVIDIA worked across several top open-source frameworks, including Hugging Face Transformers, Ollama, and vLLM, in addition to NVIDIA TensorRT-LLM, contributing optimized kernels and model enhancements.
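A minimal Transformers sketch, following the pattern on the public gpt-oss model card; argument names and the chat-style pipeline output may shift across Transformers releases:

```python
# pip install -U transformers accelerate  (gpt-oss requires a recent release)
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="openai/gpt-oss-20b",
    torch_dtype="auto",
    device_map="auto",  # places layers across available GPUs
)

messages = [{"role": "user", "content": "Explain CUDA graphs in two sentences."}]
result = pipe(messages, max_new_tokens=256)
print(result[0]["generated_text"][-1]["content"])  # last turn is the model's reply
```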
Explore gpt-oss on Ollama
Developers can experience these models through their favorite apps and SDKs using Ollama, Llama.cpp, or Microsoft AI Foundry Local.
Llama
Llama is Meta’s collection of open foundation models, most recently made multimodal with the 2025 release of Llama 4. NVIDIA worked with Meta to advance inference of these models with NVIDIA TensorRT™-LLM (TRT-LLM), getting maximum performance from data center GPUs based on the NVIDIA Blackwell and NVIDIA Hopper™ architectures. Optimized versions of several Llama models are available as NVIDIA NIM™ microservices for an easy-to-deploy experience. You can also customize Llama with your own data using the end-to-end NVIDIA NeMo™ framework.
Explore
Explore sample applications to learn about different use cases for Llama models.
Integrate
Get started with the right tools and frameworks for your AI model development environment.
Optimize
Optimize inference workloads for large language models (LLMs) with TensorRT-LLM. Learn how to set up and get started using Llama in TRT-LLM.
Get started with the models for your development environment.
Get Production-Ready Llama Models With NVIDIA NIM
The NVIDIA API Catalog enables rapid prototyping with just an API call.
Llama 4 on Ollama
Ollama enables you to deploy Llama 4 quickly to all your GPUs.
Quantized Llama 3.1 8B on Hugging Face
NVIDIA Llama 3.1 8B Instruct is optimized by quantization to FP8 using the open-source TensorRT Model Optimizer library and is compatible with data center GPUs and consumer devices.
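A hedged vLLM sketch for serving this checkpoint; it assumes vLLM's ModelOpt quantization backend accepts the FP8 weights, which may depend on your vLLM version:

```python
# pip install vllm  (assumes the ModelOpt quantization backend supports this checkpoint)
from vllm import LLM, SamplingParams

llm = LLM(model="nvidia/Llama-3.1-8B-Instruct-FP8", quantization="modelopt")

outputs = llm.generate(
    ["List three benefits of FP8 quantization."],
    SamplingParams(temperature=0.7, max_tokens=128),
)
print(outputs[0].outputs[0].text)
```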
NVIDIA Nemotron
The NVIDIA Nemotron™ family of open models, including Llama Nemotron, excels at reasoning along with a diverse set of agentic tasks. The models are optimized for different use cases: Nano offers cost-efficiency, Super balances accuracy and compute, and Ultra delivers maximum accuracy. With an open license, these models ensure commercial viability and data control.
Explore
Explore models, datasets, and sample applications to learn about different use cases for Nemotron models.
Integrate
Get started with the right tools and frameworks for your development environment, leveraging open Nemotron models and datasets for agentic AI.
Optimize
Optimize Nemotron with NVIDIA NeMo and build AI agents with NVIDIA NIM and NVIDIA Blueprints with customizable reference workflows.
Get started with the models for your development environment.
Nemotron Nano
Provides superior accuracy for PC and edge devices.
The newly announced Nemotron Nano 2 supports a configurable thinking budget, enabling enterprises to control token generation to reduce cost and deploy optimized agents on edge devices.
Llama Nemotron Super
Offers the highest accuracy and throughput on a single NVIDIA H100 Tensor Core GPU.
With FP4 precision, Llama Nemotron Super 1.5 is optimized for the NVIDIA Blackwell architecture using the NVFP4 format, delivering up to 6x higher throughput on NVIDIA B200 compared with FP8 on NVIDIA H100. A reasoning-control API sketch follows this list.
Llama Nemotron Ultra
Delivers the leading agentic AI accuracy for complex systems, optimized for multi-GPU data centers.
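As an illustration of the family's reasoning controls, the sketch below calls the Llama Nemotron Super endpoint on the NVIDIA API Catalog and enables reasoning through the system-prompt toggle documented on the model cards. The model id and the toggle string are assumptions to verify against the current catalog:

```python
# pip install openai  (reasoning is toggled via a documented system prompt)
from openai import OpenAI

client = OpenAI(base_url="https://integrate.api.nvidia.com/v1", api_key="<NVIDIA_API_KEY>")

completion = client.chat.completions.create(
    model="nvidia/llama-3.3-nemotron-super-49b-v1",
    messages=[
        {"role": "system", "content": "detailed thinking on"},  # or "detailed thinking off"
        {"role": "user", "content": "Plan the steps for a retrieval-augmented agent."},
    ],
    temperature=0.6,
    max_tokens=1024,
)
print(completion.choices[0].message.content)
```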
Phi
Microsoft Phi is a family of small language models (SLMs) that provide efficient performance for commercial and research tasks. These models are trained on high-quality data and excel at mathematical reasoning, code generation, advanced reasoning, summarization, long-document QA, and information retrieval. Due to their small size, Phi models can be deployed in single-GPU environments, such as Windows RTX PCs and Jetson devices. With the launch of the Phi-4 series, Phi has expanded to include advanced reasoning and multimodality.
Explore
Explore sample applications to learn about different use cases for Phi models.
Integrate
Get started with the right tools and frameworks for your development environment.
Optimize
Optimize inference workloads for LLMs with TensorRT-LLM. Learn how to set up and get started using Phi in TRT-LLM.
Get started with the models for your development environment.
Get Production-Ready Phi Models With NVIDIA NIM
The NVIDIA API Catalog enables rapid prototyping with just an API call.
Phi on Ollama
Ollama lets you deploy Phi quickly to all your GPUs.
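A minimal sketch against Ollama's local REST API; the phi3.5 model tag is an assumption to check against the Ollama library:

```python
# Plain REST call to a local Ollama server (default port 11434);
# assumes `ollama pull phi3.5` has been run first.
import requests

resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "phi3.5",
        "messages": [{"role": "user", "content": "Solve for x: 2x + 6 = 20."}],
        "stream": False,
    },
    timeout=120,
)
print(resp.json()["message"]["content"])
```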
Phi-3.5-mini-Instruct INT4 ONNX
The Phi-3.5-mini-Instruct INT4 ONNX model is the quantized version of the Microsoft Phi-3.5-mini-Instruct model, which has 3.8 billion parameters.
Qwen
Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models, 235B-A22B (235B total parameters, 22B active) and 30B-A3B, and six dense models: 0.6B, 1.7B, 4B, 8B, 14B, and 32B. With ultra-fast token generation, developers can efficiently integrate and deploy Qwen3 models into production applications on NVIDIA GPUs using frameworks such as NVIDIA TensorRT-LLM, Ollama, SGLang, and vLLM.
Explore
Explore sample applications to learn about different use cases for Qwen models.
Integrate
Get started with the right tools and frameworks for your development environment.
Optimize
Optimize inference workloads for LLMs with TensorRT-LLM. Learn how to set up and get started using Qwen in TRT-LLM.
Get started with the models for your development environment.
Qwen Models on NVIDIA API Catalog
Try out these powerful hybrid-reasoning models, which achieve significantly enhanced performance on downstream tasks, especially hard problems.
NVIDIA NeMo canary-qwen-2.5b
NVIDIA NeMo Canary-Qwen-2.5B is an English speech recognition model that achieves state-of-the-art performance on multiple English speech benchmarks.
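A heavily hedged loading sketch: Canary-Qwen-2.5B is a speech-language model, so the generic ASRModel entry point below is an assumption, and the model card's documented loading class may differ; the audio path is a placeholder:

```python
# pip install -U "nemo_toolkit[asr]"  (loading class is an assumption; check the model card)
import nemo.collections.asr as nemo_asr

# Assumption: the checkpoint resolves through NeMo's generic from_pretrained entry point.
model = nemo_asr.models.ASRModel.from_pretrained("nvidia/canary-qwen-2.5b")

# "speech.wav" is a placeholder 16 kHz mono English recording.
print(model.transcribe(["speech.wav"])[0])
```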
Qwen on Ollama
Ollama enables you to deploy a variety of Qwen models quickly to all your NVIDIA GPUs. Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
More Resources
Ethical AI
NVIDIA believes Trustworthy AI is a shared responsibility, and we have established policies and practices to enable development for a wide array of AI applications. When downloading or using a model in accordance with our terms of service, developers should work with their supporting model team to ensure it meets the requirements of the relevant industry and use case and addresses unforeseen product misuse. Please report security vulnerabilities or NVIDIA AI concerns here.
Try top community models today.