Learning path
Alignment and RLHF: from first principles to frontier techniques
How do you take a raw language model and make it helpful, honest, and safe? This path traces the full arc — from the basics of reinforcement learning to the specific algorithms (RLHF, PPO, DPO, GRPO) that shape model behavior, and the labs and ideas pushing the frontier of alignment research. Take the steps in order; each one builds the vocabulary the next one needs.
Mixed level9 steps~62 min
9 steps