Gradescope (Entry Code: 8DW6EX), Ed Board, Canvas, Exam Archive, Project Archive
Visualization generated via World Labs' Marble model.
This course is a deep dive into the details of deep learning algorithms, architectures, and tasks, with a focus on end-to-end models. We begin by grounding deep learning advancements particularly for the task of image classification; later, we generalize these ideas to many other tasks. During the 10-week course, students learn to implement and train their own neural networks and gain a detailed understanding of cutting-edge research in deep learning. Additionally, the final assignment provides the opportunity to train and apply multi-million parameter networks on student-chosen real-world problems. Through multiple hands-on assignments and the final course project, students acquire the toolset for setting up deep learning tasks and practical engineering tricks for training and fine-tuning deep neural networks.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
||
|---|---|---|---|---|---|---|---|
![]() |
![]() |
|||||
|---|---|---|---|---|---|---|
Please default to using EdStem for communicating with the course staff. Email is best reserved for situations where you wish to only communicate with a certain subset of the course staff (e.g., you are following up with a specific TA based on a conversation you had at their office hours). If you have private matters which you wish to only communicate with the instructors, please send correspondence to Zixian or Ranjay to ensure a timely response.
Calculus (Math 126) and Linear Algebra (Math 208) are essential. Probability (CSE 312 or Math 394) is recommended.
CSE 446 is NOT a prerequisite. The neccessary fundamentals of machine learning will be covered in this class.
This class consists of lectures & recitations, 5 assignments, a midterm exam, a quiz, and a course project.