Lectures: Thursdays 6:30 - 9:20 pm, Johnson Hall (JHN) 075
Office Hours: Wednesdays 5:00 - 6:00 pm, CSE 674, or by appointment
Matthew Brown, Richard Newcombe, Rob Gens, David Rosen, Steven Lovegrove, Jonathan Huang, Jakob Engel, Yuting Ye
TAs: Tanner Schmidt (tws10@cs.washington.edu), Melody Su (yhsu83@uw.edu)
A professional masters course in computer vision, emphasizing fundamentals of geometry and image formation as well as deep learning and image understanding. Covers low-level image processing, image geometry, motion and depth estimation, object recognition, deep learning and optimization techniques, and case studies of current research.
Grading:The grade is based on four projects. Each project will be a mix of coding and written answers. See the course overview below for handin dates. Late policy is as follows: Late assignments will be accepted up to 5 days after the deadline, but with 10% subtracted from the mark per day.
Week | Date | Topic | Description |
---|---|---|---|
1 | 3/29 | Introduction | |
Image Formation | Geometric and Photometric Image Formation, Pinhole Camera, Lenses, Sensors, Colour, Gamma, DCT, Image Coding | ||
Week 1 Notes | |||
2 | 4/5 | Filtering and Pyramids | Linear + Non-linear filtering, Correlation, Convolution, Gaussian + Laplacian Pyramids, Sampling and Aliasing |
Features and Matching | Detection, Correspondence, Edges, Corners, Regions, Patch matching, SIFT, Shape Context, Learning Features | ||
Week 2 Notes | |||
Project 1 start | |||
3 | 4/12 | Planar Geometry | Image Alignment, Linear and Projective Cameras + 2D Transforms |
RANSAC | Robust 2-view geometry estimation | ||
Epipolar Geometry | Epipolar Lines, Plane Constraint, Fundamental Matrix, Stereo Matching | ||
Week 3 Notes | |||
4 | 4/19 | Multiview Geometry | SFM/SLAM, 3D points, Camera Pose and Calibration, Bundle Adjustment |
Optimization | Noise Modelling, Non-L2 residuals, Convexity, Problem Solving | ||
Project 2 start | |||
4/22 | Project 1 due | ||
5 | 4/26 | Camera Tracking | Sparse vs dense, brightness constancy, optical flow, coarse to fine, lucas kanade, incremental rotation, dense tracking + mapping, ICP |
Dense Reconstruction | Stereo, disparity space, aggregation/block matching, plane sweep, baseline, denoising, depth priors, signed distance functions + depth fusion, voxel colouring | ||
LucasKanade ipynb | |||
PlaneSweep ipynb | |||
6 | 5/3 | Machine Learning for Vision | Intro, inductive learning, decision trees, instance-based learning |
Project 3 start (ipynb, html) | |||
5/6 | Project 2 due | ||
7 | 5/10 | Convolutional Neural Networks | Neural Networks, Backpropagation, Convolutional Neural Nets, Training Neural Networks, Additional slides |
8 | 5/17 | Object Detection | Sliding windows, detection with convnets, anchor regression+classification, SSD, Faster R-CNN, R-FCN, evaluation, IOU, AP, instance segmentation, keypoint detection |
Depth Estimation | Unsupervised Monocular Depth [Godard et al.] | ||
Project 4 start | |||
5/20 | Project 3 due | ||
9 | 5/24 | Hand Tracking | |
Dense Reconstruction | |||
SLAM | SLAM+VO: (Jakob Engel) Direct/indirect, sparse/dense, DSO, ORB-SLAM, photometric calibration, practical tips, evaluation, analysis | ||
10 | 5/31 | History and Futurology | |
6/3 | Project 4 due |