Course Description:

Visual recognition is the process of automatically extracting valuable information from visual data (images and videos). This includes: knowing what objects are in a picture, where they are, what's happening, and where it is happening. Given the nature of the problem and the abundanc e of data available today, visual recognition is typically formulated as a machine learning problem.

In this course we study the problems of instance-level object recognition ("what is this?"), category-level object recognition ("what kind of thing is this?"), attributes ("what is this thing like?"), parts ("what is this thing composed of?"), action recognition ("what's going on ?"), pose estimation ("what is the configuration of this thing?"), detection ("where is this thing in the image?"), and segmentation ("where e xactly is this thing in the image?"). We will cover a wide range of supervised, semi-supervised, and unsupervised models (consisting of fully- , partially-, and un-labeled data), as well as transferable models in object and activity recognition (learning from one dataset or task and applying to a different one).

Students are recommended to be familiar with either computer vision or machine learning. If you are interested in this course and you have been exposed to neither computer vision nor machine learning please ask for instructor's approval. Throughout the quarter I will provide necessary background material.

This course will consist of lectures, student presentations, and a course project. Students who register for only one credit are not required to do a course project.

Syllabus Overview: