![Title image: The Ancient Secrets of Computer Vision](images/title.jpg)
## Course Information ##
This class is a general introduction to computer vision. It covers standard techniques in image processing like filtering, edge detection, stereo, flow, etc. (old-school vision), as well as newer, machine-learning based computer vision.
Course will be offered in a variety of modalities:
- In-person: [CSE2 G20](https://www.washington.edu/classroom/CSE2+G20)
- Remotely: [Zoom](https://washington.zoom.us/j/96650928585)
- Asynchronously: See below for lecture recordings
Participate in whatever way best suits your needs this quarter! **Please do not come to in-person class if you are sick or have reason to suspect you may be sick.**
### Instructor ###
Joseph Redmon
- Email: pjreddie@cs.washington.edu
### Office Hours ###
- Monday
- Yifan 1:00-2:00pm at https://washington.zoom.us/j/4601411492?pwd=Ym81d1FSNzlsa3R4RXFCY2xndzN4Zz09
- Tuesday
- Aaron 1:00-2:00pm at https://washington.zoom.us/j/96818345065?pwd=SzFpVkE3Sk5nZTVHT3A2azFFeWVnUT09
- Wednesday
- Varich 3:00-4:00pm at Gates Center 121
- Fatwir 3:30-4:30pm at https://washington.zoom.us/j/99495185111
- Thursday
- Joe 4:00-5:00pm at https://washington.zoom.us/j/3362756951
- Friday
- Hamsa 10:00-11:00am at https://washington.zoom.us/j/99049939766
- Aashna 1:00-2:00pm at https://washington.zoom.us/j/6336862950
### Resources ###
- Ed Discussion Board: https://edstem.org/us/courses/38147/discussion/
- Canvas: https://canvas.uw.edu/courses/1633294
- Zoom: https://washington.zoom.us/j/96650928585
Slides are a mishmash of lots of other people's work. Special thanks to: Rob Fergus, Linda Shapiro, Harvey Rhody, Rick Szeliski, Ali Farhadi, Robert Collins. Lectures 8 and 9 on Flow, 3d, and stereo are by [Connor Schenck](https://homes.cs.washington.edu/~schenckc/).
All of the slides, videos, and homeworks are free to use, modify, redistribute as you like without permission. Just make your own copy of the slides on Google Docs, don't ask to modify mine!
## Homeworks ##
The class has 6 homeworks where you will build out a computer vision library in C. We cover basic image manipulations, filtering, features, stitching, optical flow, machine learning, and convolutional neural networks.
Most of the homeworks will use [this repository](https://github.com/pjreddie/uwimg/). The individual homeworks can be found in the `src/` folder.
- [Homework 0: Fun with Color!](https://github.com/pjreddie/uwimg/tree/main/src/hw0), Due
- [Homework 1: Resizing](https://github.com/pjreddie/uwimg/tree/main/src/hw1), Due
- [Homework 2: Filtering and Convolutions](https://github.com/pjreddie/uwimg/tree/main/src/hw2), Due
- [Homework 3: Panoramas!](https://github.com/pjreddie/uwimg/tree/main/src/hw3), Due
- [Homework 4: Optical Flow](https://github.com/pjreddie/uwimg/tree/main/src/hw4), Due
- [Homework 5: Neural Networks and Machine Learning](https://github.com/pjreddie/uwimg/tree/main/src/hw5), Due
**Note:** due date subject to change if we haven't covered relevant material in time for the assignment.
**COVID Policy:** If you get COVID don't worry about doing your homework, rest, recover, do what you need to do to get better. If you feel like doing computer vision while sick go for it but also know you can take some time off. Once you are well please reach out to the course staff and we can figure out how to get you back on track with assingments and any missed classes. You will not be penalized for turning in assingments late due to COVID (or if you're having trouble getting caught back up afterward).
**Late Policy:**
- One “free” late day per assignment
- If you still cannot turn in on time:
- write a comment on canvas for the assignment, tell me what day you will turn in
- turn it in by that day
## Final Project ##
There is a final project worth 20% of the final grade.
Pick any area of computer vision that interests you and pursue some independent work in that area. Each project should have a significant technical component, software implementation, or large-scale study. Projects can focus on developing new techniques or tools in computer vision or applying existing tools to a new domain. If you don't have an idea you can train a classifier on birds and compete in the Kaggle competition posted on the Ed discussion board.
## Lectures ##
### Week 1: Image Basics ###
This week we cover the basics of computer vision. There's an introduction to the three levels of vision, **low-level** vision mostly concerns the pixels or groups of nearby pixels, **mid-level** vision starts to connect images to each other and the real world, and **high-level** vision connects images to semantics and meaning. There's background information on the human visual system, color, light, what an image actually is, and how it's stored in a computer. All fun stuff! Once you've learned the basics you should be ready for [Homework 0](https://github.com/pjreddie/uwimg/tree/main/src/hw0), which is mostly an introduction to the codebase we'll be using for the assignments.
#### Lecture 1: Introduction
- [Slides](https://docs.google.com/presentation/d/1vZuncM3rrJZakza94UU8FvpEd9Ph-9E9I3Hj4Xkr91U/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/gftZRD1FSuOKJSk6R3LHnE57ahfstbx5gsEQdgmMkX_rE4QfazFzVOet8rwONSQuJA4XkBUnC-0xXRDe.cxN3t5CZ9V-H5Gud)
#### Lecture 2: Human Vision, Color Spaces, Transforms ####
- [Slides](https://docs.google.com/presentation/d/1kTvnMCG7qZ8eoA4NF79Q-Qyirt-ZMq6RNv4xAkQh2bY/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/kLEgSk5Qbwz4Ih9565TRgE-JQ14iXe1aWwFPtJHcBmkhS-zF6u1MrUSwywrKsD92omsqXlN39ek2AA0.hK0GsIsJKs7jeSTN)
--------
### Week 2: Image Transformations ###
In week 2 we start to dive into low-level vision and image processing. You learn how to manipulate images and perform operations like resizing, sharpening, smoothing, and more. You'll apply this knowledge as you get started on [Homework 1](https://github.com/pjreddie/uwimg/tree/main/src/hw1).
#### Lecture 3: Image Coordinates, Resizing
- [Slides](https://docs.google.com/presentation/d/1imYQe7kDCahP2YO69L7FBiwqcEc4LgVEDXW7pDMFhOs/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/RTbJpxiAsdNvjL4GPCS1M5v0tXQy1lWLZek_5D-7nyOevtpj8cEenSEzwImw1gVdCq757nSNpbgKH3p6.2aE-C6ly-QpDmSo4)
#### Lecture 4: Resizing, Filters, Convolutions
- [Slides](https://docs.google.com/presentation/d/1owZEUBbFp-Iz5kLNo-KvppjtzJGF91wEzfviUphLWqs/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/H4YuURpOldWiF3mTqFtmDtadb6jq1rFLRaATnK2sNWWLBK0u1b5orUFji8RlDkFLZXMgbWX2HwpZvjOT.ldk7pqK8QP0fvyNh)
#### Tutorial: Debugging With GDB and Valgrind
- [Video](https://washington.zoom.us/rec/play/p8asnssLFVNhxeiBvl-AuxL-kL0KE4p60xnDwzVu5aawb_kNNtyk3Md4NfrS7qC9ieOLKCFcJxoKYLWR.1XQu0kV7WwvcYm7f)
--------
### Week 3: Edges and Features
Time to put those convolutions to use! For week 3 we delve in to what makes images interesting, what makes them unique, how to find correspondences between images, and how to fit models with a large number of outliers in the data.
#### Lecture 5: Edges and Features
- [Slides](https://docs.google.com/presentation/d/1WqcYCxTnYzbPmhAsi5N7heb1GACzYd5Rt441Z36cp28/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/share/WYsu57I7Dfxh5kxNCHh0YCTY-7_Ei377Oo--r6sn8dR1PoFhpKglyBK2VtXAwoNp.yv09_mP5wk4L2vkM)
#### Lecture 6: Harris, Matching, RANSAC
- [Slides](https://docs.google.com/presentation/d/1_HFh3SdmdyZ_j-sFS4Tw17DmhKfjmYZqvSp7TmfdD_M/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/Bsb9QehHLa3SsacjoAl9gb1NtpjZELVXgL38I3Lp5F6-xmfxVRhp7LHiEqIqLFL8I48f2H7la33Zayoq.nm-I49EicEY9Dx2y)
-------
### Week 4: Features and Flow
#### Lecture 7: Matching, RANSAC, HOG, and SIFT
- [Slides](https://docs.google.com/presentation/d/1Ti2o4HPX8xEIYpfgEm6W3EvGc2bWjiz0ZuyAZfTAuD4/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/TXjgDzdsf-ahgfC0Ll_eeBA1XOyyBbdDCNEjEneQ9R1r7aCl_bwZ_m06t4rBu1SmUjJUSt6PuU80P4Mv.w1RjbMOq8RiJeHan)
#### Lecture 8: Optical Flow
- [Slides](https://docs.google.com/presentation/d/1Z6xyKI5SWJ3Qs4aWcbP_bOlj34VW40WIHfhf3b4EN10/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/cENCWxDLGxq0LLRkIchmugQjdXme8Y-pQSqOksU_fS9pgOva0NF8xp6y8PTwtrMS1EJVRQFlJHsSUR5-.k3YyY58zolKgwfDb)
---------
### Week 5: Depth and ML Review
#### Lecture 9: 3D, Depth, and Stereo
- [Slides](https://docs.google.com/presentation/d/1e8RSHMYzrRtA3s4wfb7xXoZ1CTH8lKXFlNhPFQVD9KI/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/BxI7u9LGa0_SYguH6NKm7TINgfD72q8ytCzFroNCff6x5JKkJYYCgSyp3ZrHk1nMcnYx4tbVeosvgKnw.nllgdjg5s9lv5Dj_)
#### Lecture 10: Machine Learning for Computer Vision
- [Slides](https://docs.google.com/presentation/d/1vYhq30b9BEDzzl3rwP8uY6VF1x4jmtzJaxQGK0P2XAI/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/q8XH3uq6AZG7okO5kgugu3Z7uTpYzeeDQJK-aYDiCfOJHhq_vtaIYZWcsT5hchkzde_8HGK3xddQ2Odf.nDdBmiz3F6KpqHWq)
--------
### Week 6: Machine Learning and Neural Networks
#### Lecture 11: More Machine Learning for Computer Vision
- [Slides](https://docs.google.com/presentation/d/1c5qjBEQVhcwejWY6I88BwHGOdZiPvkDD6su1bbqqd2c/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/skP7YBude3-amlfiiNVZNMGwFd98QLe2x6vgEIcjcKO__BC1PwX2rXQ1yShVeX0vzXYcQt1vtbX-wYZe.xI8MmK98S9wZTSOu)
#### Lecture 12: Neural Networks
- [Slides](https://docs.google.com/presentation/d/1HQOET7oM3fzQba2KAcw20yEtqe_3ZLE9VTtfoPBgb8A/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/H1Ngn7LXY_tTY9Qell7ovuL85miUZyF1bTRtlKrxHx1mlrsRqGWuoPNYT84OzBaNivLK2eZQ13SaABr-.hexHbYjp3rldcpnI)
--------
### Week 7: Convolutional Neural Networks
#### Lecture 13: Convolutional Neural Networks
- [Slides](https://docs.google.com/presentation/d/1szC_xsXx4kBtVaM-6yVjdJRPc2SGgOq3rcpzKM06-UM/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/uVYzsOYE0V149F5B6xR-ihRPIXO07TtLsOgSMViiwAI9dIBWzmpGtg56t92pSIRdhclmDjC3xwXtaTQD.u_4YKhtM7Zdc1PTt)
#### Tutorial 1: Neural Networks in PyTorch
- [iPynb](https://colab.research.google.com/drive/1JR9vh7NpFB8f95cixhT0dQ9ky4xXY8Hi?usp=sharing)
#### Lecture 14: Network Architectures
- [Slides](https://docs.google.com/presentation/d/1IIJ6uxpwT3NdOzO2L-KDVsSNOfQM0uBcf4F0WFy-jVc)
- [Video](https://washington.zoom.us/rec/play/GlNJWWOkq-BEZsJUhvTcutnMh4KTVYEhOtHP5hYWfSE-AApeNmIfNjR6iG380uD7jxDUrMuoXQ9GLj1Y.nopw4YoU5ROCJQFa)
---------
### Week 8
#### Lecture 15: More Network Architecture Stuff
- [Slides](https://docs.google.com/presentation/d/1XFr96QGJFHoRr0HyydnrLntulH7vNlIQd_SHm88yLNk/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/6V48yIHUNVctwOSFxwLAGMq_Nhq1dsb5b4Z12yhuw_yakEVqU_Rwzz---n9JFDBLmMVrcER0VSipavfN.wtkWn-fFvu_TFa28)
#### Lecture 16: Semantic Segmentation
- [Slides](https://docs.google.com/presentation/d/1XFr96QGJFHoRr0HyydnrLntulH7vNlIQd_SHm88yLNk/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/sHSf3pyL_AMTMaWyyehGeqKWNDUBueiwCsxHAds0RGIJVnA-EKDBgdP77B9s-Wq8t8_QDuHPsyBs63Q.LgvtOzVrZsfiIYoJ)
#### Tutorial: CNNs in PyTorch
- [iPynb](https://colab.research.google.com/github/pjreddie/uwimg/blob/main/tutorial2-cnns-in-pytorch.ipynb)
----------
### Week 9
#### Lecture 16: Object Detection
- [Slides](https://docs.google.com/presentation/d/1CLVryBPNddEn8TwM2ngy1bHnlJC4t33OhERzVixFGj4/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/xrnuDXSN8qcG_nZ4ozVWji4vhjb5smHlH49fJFB4ZB7tyUCDcWoSMqR0e1NGDj00UMS40cMPW6As8Hch.4jfbF4mKfqkenBa6)
#### Lecture 18: Vision and Language
- [Slides](https://docs.google.com/presentation/d/1UJOl-C3qDUMw7ftgKJh7GEELxuNh0KRAtQpzhg3PWrc/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/WrFMPGTe575zGoPTs6VyRGALLGWD4GJyzP11r1CinSlxdCyigs2x5xgD2LNF3DALmbvowmVzrLJFY2qH.QHKub2GPDKaUlzDD)
#### Tutorial: Transfer Learning in PyTorch
- [iPynb - ImageNet and Transfer Learning](https://colab.research.google.com/drive/1EBz4feoaUvz-o_yeMI27LEQBkvrXNc_4?usp=sharing)
- [iPynb - Transfer Learning to Birds](https://colab.research.google.com/drive/1kHo8VT-onDxbtS3FM77VImG35h_K_Lav?usp=sharing)
-----------
### Week 10
#### Lecture 19: Generative Adversarial Networks
- [Slides](https://docs.google.com/presentation/d/1NgicaHQhluKQ0r39U4IWo8AtOBJUAQ2aBGCDq_DpgkA/edit?usp=sharing)
- [Video](https://washington.zoom.us/rec/play/TKDf9p_t11_aPYp5huw_LmhkwBtzNREGwdu0rCezUCJw3eahuqYl6WIKiGhSEOF64_EAia-Upu5zEHMk.sqSxdYY3Tj2n7a33)
#### Lecture 20: Diffusion Models
- [Slides](https://docs.google.com/presentation/d/1vm-nQ7igsO_GerOmGjAbTDP58MvH_sgf04Q7bxxvTKM/edit?usp=sharing)