How can we accelerate AI when learning in an environment with other intelligent agents? This course focuses on Social Reinforcement Learning in multi-agent and human-AI interactions. From studying the natural world, we know that social learning is an incredibly powerful mechanism that helps both humans and animals rapidly adapt to new circumstances, coordinate with others, and drives the emergence of complex learned behaviors. From recent advances in AI, we know that reinforcement learning from human feedback (RLHF) is an incredibly powerful mechanism for improving the capabilities and alignment of large models. This course will link these two perspectives, examining the complexities of modeling, learning from, and coordinating with other agents, whether those agents are humans or other RL agents in a simulation. We will study how social learning can address fundamental issues in AI like learning and generalization, as well as improving the ability of AI to coordinate with and interact with people.
Although we will cover a brief introduction to reinforcement learning (RL), familiarity with RL and deep learning is encouraged. The course is a project course; in addition to reading and discussing relevant research papers, students will submit a team-based final project in the form of a research paper.
Instructor: Natasha Jaques (nj at cs)
TAs:
Yancheng Liang (yancheng at cs)
Lecture: WF: 10:00 AM - 11:20 AM (80 min lectures) Location: Johnson Hall (JHN) 175
Sept 24 - Dec 5, Nov 28 is a holiday
Remote attendance: https://meet.google.com/tge-vofz-zdw
Office Hours:
Natasha: 2:30-3:30pm on Fridays in Gates 234. No office hours Nov 28 and Dec 5.
Yancheng Liang (TA): 1:30-2:30pm on Wednesdays in Gates 151. No office hours Oct 1 and Dec 3.
Important Links
Schedule overview (by week):
Classes will be split so that on approximately half the days Natasha will present a ~50 minute lecture followed by ~30 minutes of questions and discussion, and on the remaining days we will have student-led paper presentations and discussions.
Each student in the class will sign up to present in one of the slots in the syllabus. They will prepare a 10 minute presentation on one of the papers for that discussion day. Each presentation will be followed by 5 minutes of clarifying question. After all the presentations finish, we will break into groups to discuss each of the papers, the themes that connect them, and interesting and impactful research directions that relate to them.
Class participation (15%)
Because this is primarily a discussion course, to make it work we need students to attend class in person, ask questions, and participate in paper discussions. Therefore, a portion of your grade depends on doing this. To get the full 15%, you need to read the relevant papers, show up on time, and make comments or ask questions in at least 16/19 classes (you are allowed to miss 3; see the Course Policies section). However, we hope you will go beyond this minimum requirement and make the most of the class by actively participating and sharing your thoughts and questions on the research we are learning about.
Paper reflections (10%)
We will have 9 Discussion classes for which you will prepare a 300-word paper reflection on one of the suggested papers, to be uploaded to EdSTEM. Reflections will be graded on a pass/fail basis. As per the Course Policies, you can miss submitting 1 reflection with no penalty. Presenters still need to submit reflections for the papers they are presenting. We are aware that it would be extremely easy to generate these summaries with an LLM. My only comment on this is if you want to learn something from the course, you’ll need to actually read the papers. If you’re not planning to read the papers, it would be better to drop and give up your spot to one of the other 55 students petitioning to get in. Also, if you and another student review the same paper and submit very similar LLM-generated summaries in public Ed posts, that might be a little embarrassing.
Lead discussion (10%)
During the quarter, every student is expected to present a paper at least once. Use the class schedule spreadsheet to sign up for a particular presentation time, and choose the paper you would like to present. Add a link to your presentation slides to the spreadsheet by at least 11:59pm two days before class.
Course project (50%)
Proposal (5%): due 11:59pm Oct 17, 2024
Writeup (35%): due 11:59pm Dec 1, 2025
Project presentation (10%): in class on Dec 3 and Dec 5
See the Class Project section for more information.
Peer review (15%)
Peer review is a big part of research, and in this class we will learn how to write high quality peer reviews. We will upload our course projects to a peer review system, such as OpenReview. You will be responsible for submitting 3 reviews of other students’ papers in the system, which will be due 1 week after the project is due, by 11:59pm on Dec 8. Note that this review procedure will be single-blind, since students will have seen the class project presentations. Each review will be worth 5% of your grade and will be graded based on whether it is thorough, complete, fair, and whether it gives the authors useful feedback for improving their paper. Note that the peer review feedback will not be used to determine the final grade for students’ papers, they will be graded by the instructors.
Late submissions and absences
Inclusion, feedback, accommodations, and other policies
We thank Brian Hou, Abhishek Gupta and Zoey Chen for providing us the template for the website.