Shashank Verma

Shashank Verma is a deep learning technical marketing engineer at NVIDIA. He is responsible for developing and presenting developer-focused content on various deep learning frameworks. He obtained his master's in electrical engineering from the University of Wisconsin-Madison, where he focused on computer vision, security aspects in data science, and HPC.

Posts by Shashank Verma

Generative AI / LLMs

Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model

Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation... 12 MIN READ
Generative AI / LLMs

Mastering LLM Techniques: Inference Optimization

Stacking transformer layers to create large models results in better accuracies, few-shot learning capabilities, and even near-human emergent abilities on a... 25 MIN READ
Generative AI / LLMs

NVIDIA AI Foundation Models: Build Custom Enterprise Chatbots and Co-Pilots with Production-Ready LLMs

Large language models (LLMs) are revolutionizing data science, enabling advanced capabilities in natural language understanding, AI, and machine learning.... 12 MIN READ
Data Science

Scaling Recommendation System Inference with NVIDIA Merlin Hierarchical Parameter Server

Recommendation systems are widely used today to personalize user experiences and improve customer engagement in various settings like e-commerce, social media,... 11 MIN READ
Recommenders / Personalization

Fast, Terabyte-Scale Recommender Training Made Easy with NVIDIA Merlin Distributed-Embeddings

Embeddings play a key role in deep learning recommender models. They are used to map encoded categorical inputs in data to numerical values that can be... 8 MIN READ
Simulation / Modeling / Design

Building and Deploying Conversational AI Models Using NVIDIA TAO Toolkit

Conversational AI is a set of technologies enabling human-like interactions between humans and devices based... 25 MIN READ