This schedule is tentative and subject to change.
WEEK | TOPIC | DATE | LECTURE/DISCUSSION |
---|---|---|---|
1 | Intro and RL-post-training of LLMs | 09/24 | Introduction to course, syllabus. RL basics - Lecture (slides) (recording) |
09/26 | RL post-training history and RLHF intro - Lecture (slides) (recording) | ||
2 | RL from Human Feedback (RLHF) (for LLMs) | 10/1 | RLHF - Discussion |
10/3 | Personalized RLHF - Lecture (slides) (recording) | ||
3 | RLVR and reasoning, embodied instruction following | 10/8 | RLVR, reasoning, and technical aspects of RL post-training - Discussion |
10/10 | Embodied instruction following - Discussion | ||
4 | Learning from humans beyond LLMs | 10/15 | Inverse RL and other ways to learn from humans - Lecture |
10/17 | PbRL / robot learning from human feedback - Discussion | ||
5 | Multi-agent RL | 10/22 | Deep multi-agent RL intro - Lecture |
10/24 | State-of-the-art MARL, theory, fun MARL papers, and social skills like theory of Mind - Discussion | ||
6 | Human-AI coordination | 10/29 | Zero-shot coordination with humans - Lecture |
10/31 | Human-agent coordination and population-based training - Discussion | ||
7 | Emergent complexity | 11/5 | Emergent Complexity - Lecture |
11/7 | Emergent Complexity and Open-Endedness - Discussion | ||
8 | Social learning | 11/12 | Lecture |
11/14 | Social Learning - Discussion | ||
9 | Multi-agent RL for LLMs | 11/19 | Multi-agent RL for LLM - Lecture |
11/21 | MARL for LLMs, Multi-agent LLM systems, Red-teaming - Lecture | ||
10 | Extra topics | 11/26 | Bonus Lecture |
11/28 | No class | ||
11 | Final project presentations | 12/3 | Final project presentations |
12/5 | Final project presentations |