Brief posts I've written. RLHF 101: A Technical Tutorial on Reinforcement Learning from Human Feedback Causal Confounds in Sequential Decision Making A Unifying, Game-Theoretic Framework for Imitation Learning