Posts by Zhilin Wang
Generative AI
Oct 03, 2024
New Reward Model Helps Improve LLM Alignment with Human Preferences
Reinforcement learning from human feedback (RLHF) is essential for developing AI systems that are aligned with human values and preferences. RLHF enables the...
4 MIN READ
Generative AI
Nov 27, 2023
Announcing HelpSteer: An Open-Source Dataset for Building Helpful LLMs
NVIDIA recently announced the NVIDIA NeMo SteerLM technique as part of the NVIDIA NeMo framework. This technique enables users to control large language model...
6 MIN READ
Generative AI
Oct 11, 2023
Announcing NVIDIA SteerLM: A Simple and Practical Technique to Customize LLMs During Inference
With the advent of large language models (LLMs) such as GPT-3, Megatron-Turing, Chinchilla, PaLM 2, Falcon, and Llama 2, remarkable progress in natural language...
10 MIN READ