Posts by Anu Srivastava
Agentic AI / Generative AI
Jun 10, 2026
Run DiffusionGemma on NVIDIA for Developer-Ready, High-Throughput Text Generation
Developers building real-time AI—such as chat assistants, copilots, and agentic workflows—are often constrained by token-by-token generation speed. This...
4 MIN READ
Agentic AI / Generative AI
May 28, 2026
Run Step 3.7 Flash on NVIDIA GPUs with Enterprise-Ready Multimodal AI
AI applications are moving beyond text generation to multimodal systems that can perceive, search, and reason across images, documents, video, and...
3 MIN READ
Agentic AI / Generative AI
Apr 24, 2026
Build with DeepSeek V4 Using NVIDIA Blackwell and GPU-Accelerated Endpoints
DeepSeek just launched its fourth generation of flagship models with DeepSeek-V4-Pro and DeepSeek-V4-Flash, both targeted at enabling highly efficient...
6 MIN READ
Agentic AI / Generative AI
Apr 11, 2026
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications
The release of MiniMax M2.7 adds enhancements to the popular MiniMax M2.5 model, built for agentic harnesses,...
4 MIN READ
Agentic AI / Generative AI
Apr 02, 2026
Bringing AI Closer to the Edge and On-Device with Gemma 4
The Gemmaverse expands with the launch of the latest Gemma 4 multimodal and multilingual models, designed to scale across the full spectrum of deployments, from...
6 MIN READ
Agentic AI / Generative AI
Feb 27, 2026
Develop Native Multimodal Agents with Qwen3.5 VLM Using NVIDIA GPU-Accelerated Endpoints
Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this series is a ~400B parameter native...
3 MIN READ