CSE 590V: Computer vision seminar (Fall 2014)

Hand with Reflecting Sphere by M. C. Escher.
Recolored by xenomorph1138

Course description

CSE 590V is a seminar/reading group focused on recent work in computer vision. We will cover papers from recent and upcoming conferences related to computer vision (CVPR, ICCV, ECCV, NIPS, SIGGRAPH). The seminar is open to everyone. We especially encourage first-year graduate students who may be considering research in computer vision or related areas to participate.

Logistics

Time: Fridays 11am-12pm

Location: CSE 203

Organizers: Hamid Izadinia (izadinia @ cs washington edu) and Ricardo Martin (rmartin @ cs washington edu)

Class mailing list: cse590v @ cs washington edu (subscribe here)

Presentations

Each week we will cover a recent topic in computer vision by reading and discussing one or more relevant papers. A person will lead the discussion by presenting the chosen paper(s) for the week. We encourage all attendees to read the paper(s) beforehand and to actively participate in the discussion.

Each registered student will attend all classes and prepare a presentation (duration to be determined) on one or more selected papers. We will assign topics/papers during the first week based on preferences.

Each presenter will meet with the organizers before the class date to discuss the upcoming presentation, show prepared slides, and resolve any questions.

Schedule

Date	Presenters	Papers	Slides
Oct 3rd	Ezgi Mercan and Yao Lu	Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation R. Girshick, J. Donahue, T. Darrell, J. Malik, CVPR 2014 (PDF) Edge Boxes: Locating Object Proposals from Edges C. Lawrence Zitnick and Piotr Dollár, ECCV 2014 (PDF)	R-CNN slides, Object proposal slides
Oct 10th	Supasorn Suwajanakorn and Alon Milchgrub	Learning to be a Depth Camera for Close-Range Human Capture and Interaction Sean Ryan Fanello, Cem Keskin, Shahram Izadi, Pushmeet Kohli, David Kim, David Sweeney, Antonio Criminisi, Jamie Shotton, Sing Bing Kang, and Tim Paek, SIGGRAPH 2014 (PDF) Deep Learning Face Representation from Predicting 10,000 Classes Yi Sun, Xiaogang Wang, Xiaoou Tang, CVPR 2014 (PDF)	Depth camera slides, Face slides
Oct 17th	Alicia Clark and Nancy Wang	PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding Yinda Zhang, Shuran Song, Ping Tan, Jianxiong Xiao, ECCV 2014 (PDF) Assessing the Quality of Actions Hamed Pirsiavash, Carl Vondrick, Antonio Torralba, ECCV 2014 (PDF)	Pano context slides, Actions quality slides
Oct 24th	Juliet Fiss and Edward Zhang	Eulerian Video Magnification for Revealing Subtle Changes in the World Hao-Yu Wu, Michael Rubinstein, Eugene Shih, John Guttag, Fredo Durand and William T. Freeman, SIGGRAPH 2012 (PDF) Phase-Based Video Motion Processing Neal Wadhwa, Michael Rubinstein, Fredo Durand and William T. Freeman, SIGGRAPH 2013 (PDF) The Visual Microphone: Passive Recovery of Sound from Video Abe Davis, Michael Rubinstein, Neal Wadhwa, Gautham Mysore, Fredo Durand and William T. Freeman, SIGGRAPH 2014 (PDF)	Eulerian Magnification slides, Phase-based Magnification slides, Visual Microphone slides
Oct 31st	Daniel Gordon and Shu Liang	Sliding Shapes for 3D Object Detection in Depth Images Shuran Song and Jianxiong Xiao, ECCV 2014 (PDF) Real-time Non-rigid Reconstruction using an RGB-D Camera Michael Zollhöfer, Matthias Nießner, Shahram Izadi, Christoph Rhemann, Christopher Zach, Matthew Fisher, Chenglei Wu, Andrew Fitzgibbon, Charles Loop, Christian Theobalt, Marc Stamminger, TOG 2014 (PDF)	Sliding Shapes slides, Reconstruction slides
Nov 7th	Daniel Miller and Alex Mariakakis	Zero-shot Recognition with Unreliable Attributes Dinesh Jayaraman, Kristen Grauman, NIPS 2014 (PDF) Programmable Automotive Headlights Robert Tamburo, Eriko Nurvitadhi, Abhishek Chugh, Mei Chen, Anthony Rowe, Takeo Kanade, Srinivasa Narasimhan, ECCV 2014 (PDF)	Headlight slides
Nov 14th	CVPR deadline	No seminar
Nov 21st	Fereshteh Sadeghi and Aditya Sankar	Reasoning About Object Affordances in a Knowledge Base Representation Yuke Zhu, Alireza Fathi, and Li Fei-Fei, ECCV 2014 (PDF) Piecewise Planar and Compact Floorplan Reconstruction from Images Ricardo Cabral and Yasutaka Furukawa, CVPR 2014 (PDF)	Object Affordance slides, Floorplan Reconstruction slides
Nov 28th	Thanksgiving	No seminar
Dec 5th	Aleksander Holynski and Hamid Izadinia	Robust Global Translations with 1DSfM Kyle Wilson and Noah Snavely, ECCV 2014 (PDF) Visualizing and Understanding Convolutional Networks Matthew Zeiler, Rob Fergus, ECCV 2014 (PDF)	1DSfM slides, Visualize CNN slides

Paper List

Geometric Scene Understanding (reconstruction/recognition/segmentation)

PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding (PDF, website)
Y. Zhang, S. Song, P. Tan, and J. Xiao
ECCV 2014
Scene Chronology (PDF, website)
Kevin Matzen and Noah Snavely
ECCV 2014
Unfolding an Indoor Origami World (PDF, website)
David F. Fouhey, Abhinav Gupta, Martial Hebert
ECCV 2014
Reconstructing PASCAL VOC (PDF, website)
S. Vicente, J. Carreira, L. Agapito and J. Batista
CVPR 2014
LSD-SLAM: Large-Scale Direct Monocular SLAM (PDF, website)
Jakob Engel, Thomas Schöps, Daniel Cremers
ECCV 2014

Semantics/Scene understanding

Learning Deep Features for Scene Recognition using PLACES Database
Bolei Zhou, Jianxiong Xiao, Agata Garcia, Aude Oliva, Antonio Torralba
NIPS 2014
Reasoning About Object Affordances in a Knowledge Base Representation (PDF)
Yuke Zhu, Alireza Fathi, and Li Fei-Fei
ECCV 2014
Multi-scale Orderless Pooling of Deep Convolutional Activation Features (PDF)
Yunchao Gong, Liwei Wang, Ruiqi Guo, Svetlana Lazebnik
ECCV 2014
Patch to the Future: Unsupervised Visual Prediction (PDF)
Jacob Walker, Abhinav Gupta, and Martial Hebert
CVPR 2014

Reconstruction/Depth estimation

Color Map Optimization for 3D Reconstruction with Consumer Depth Cameras (PDF, website)
Qian-Yi Zhou and Vladlen Koltun
SIGGRAPH 2014
Robust Global Translations with 1DSfM (PDF, website)
Kyle Wilson and Noah Snavely
ECCV 2014

Automatic Object Discovery

Context as Supervisory Signal: Discovering Objects with Predictable Context (PDF, website)
Carl Doersch, Abhinav Gupta, Alexei Efros
ECCV 2014
Geodesic Object Proposals (PDF, website)
Philipp Krähenbühl and Vladlen Koltun
ECCV 2014
Edge Boxes: Locating Object Proposals from Edges (PDF, website)
C. Lawrence Zitnick and Piotr Dollár
ECCV 2014
Associative embeddings for large-scale knowledge transfer with self-assessment (PDF, website)
Alexander Vezhnevets, Vittorio Ferrari
CVPR 2014

Object Detection/Recognition

Part-based R-CNNs for Fine-grained Category Detection (PDF)
Ning Zhang, Jeff Donahue, Ross Girshick, Trevor Darrell
ECCV 2014
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation (PDF)
R. Girshick, J. Donahue, T. Darrell, J. Malik
CVPR 2014
Large-Scale Object Classification using Label Relation Graphs (PDF)
Jia Deng, Nan Ding, Yangqing Jia, Andrea Frome, Kevin Murphy, Samy Bengio, Yuan Li, Hartmut Neven, Hartwig Adam
ECCV 2014
Sliding Shapes for 3D Object Detection in Depth Images (PDF, website)
S. Song and J. Xiao
ECCV 2014
From Large-Scale Object Classifiers to Large-Scale Object Detectors: An Adaptation Approach (PDF)
Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Jeff Donahue, Trevor Darrell, Kate Saenko, Ross Girshick
NIPS 2014
Deformable Part Models are Convolutional Neural Networks (PDF)
Ross Girshick, Forrest Iandola, Trevor Darrell, Jitendra Malik
Tech Report

Face

Deep Learning Face Representation from Predicting 10,000 Classes (PDF)
Yi Sun, Xiaogang Wang, and Xiaoou Tang
CVPR 2014
Eigen-PEP for Video Face Recognition (PDF)
Haoxiang Li, Gang Hua, Xiaohui Shen, Zhe Lin, and Jonathan Brandt
ACCV 2014
Hybrid Deep Learning for Face Verification (PDF)
Yi Sun, Xiaogang Wang, and Xiaoou Tang
ICCV 2013
Towards Pose Robust Face Recognition (PDF)
Dong Yi, Zhen Lei, and Stan Z. Li
CVPR 2013
DeepFace: Closing the Gap to Human-Level Performance in Face Verification (PDF, website)
Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
CVPR 2014
Real-time Non-rigid Reconstruction using an RGB-D Camera (PDF, website)
Chenglei Wu, Michael Zollhöfer, Matthias Nießner, Marc Stamminger, Shahram Izadi, Christian Theobalt
SIGGRAPH 2014

Human Pose estimation

Pose Machines: Articulated Pose Estimation via Inference Machines (PDF)
Varun Ramakrishna, Daniel Munoz, Martial Hebert, J. Andrew (Drew) Bagnell, and Yaser Ajmal Sheikh
ECCV 2014
Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation (PDF)
Jonathan Tompson, Arjun Jain, Yann LeCun, Christoph Bregler
NIPS 2014

Attributes

Zero-shot Recognition with Unreliable Attributes (PDF)
Dinesh Jayaraman, Kristen Grauman
NIPS 2014
Transient Attributes for High-Level Understanding and Editing of Outdoor Scenes (PDF, website)
Pierre-Yves Laffont, Zhile Ren, Xiaofeng Tao, Chao Qian, James Hays
SIGGRAPH 2014

Action Recognition

Weakly Supervised Action Labeling in Videos Under Ordering Constraints (PDF)
Piotr Bojanowski, Remi Lajugie, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid, Josef Sivic
ECCV 2014
Assessing the Quality of Actions (PDF, website)
Hamed Pirsiavash, Carl Vondrick, Antonio Torralba
ECCV 2014
Large-Scale Video Classification with Convolutional Neural Networks (PDF, website)
Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei
CVPR 2014
Parsing videos of actions with segmental grammars (PDF)
Hamed Pirsiavash, Deva Ramanan
CVPR 2014

Localization and Camera Tracking

Predicting Matchability (PDF)
W. Hartmann, M. Havlena, K. Schindler
CVPR 2014
Recognizing City Identity Via Attribute Analysis of Geo-tagged Images (PDF, website)
Bolei Zhou, Liu Liu, Aude Oliva, Antonio Torralba
ECCV 2014

Computational Photography

Learning to be a Depth Camera for Close-Range Human Capture and Interaction (PDF, website)
Sean Ryan Fanello, Cem Keskin, Shahram Izadi, Pushmeet Kohli, Jamie Shotton, Antonio Criminisi, David Kim, David Sweeney, Sing Bing Kang
SIGGRAPH 2014

Low level/descriptors

Filter Forests for Learning Data-Dependent Convolutional Kernels (PDF, website)
Sean Ryan Fanello, Cem Keskin, Pushmeet Kohli, Shahram Izadi, Jamie Shotton, Antonio Criminisi, Ugo Pattacini, Tim Paek
CVPR 2014
Visualizing and Understanding Convolutional Networks (PDF)
Matthew Zeiler, Rob Fergus
ECCV 2014

Applications

First-person Hyper-lapse videos (PDF, website)
Johannes Kopf, Michael Cohen, Richard Szeliski
SIGGRAPH 2014
The Visual Microphone: Passive Recovery of Sound from Video (PDF, website)
Abe Davis, Michael Rubinstein, Neal Wadhwa, Gautham Mysore, Fredo Durand, William T. Freeman
SIGGRAPH 2014
Programmable Automotive Headlights (PDF)
Robert Tamburo, Eriko Nurvitadhi, Abhishek Chugh, Mei Chen, Anthony Rowe, Takeo Kanade, Srinivasa Narasimhan
ECCV 2014

CVPR 2014 best paper award winners

What Camera Motion Reveals About Shape with Unknown BRDF (PDF)
Manmohan Chandraker
CVPR 2014
Partial Optimality by Pruning for MAP-inference with General Graphical Models (PDF)
Paul Swoboda, Bogdan Savchynskyy, Joerg Kappes, Christoph Schnörr
CVPR 2014
3D Shape and Indirect Appearance by Structured Light Transport (PDF)
Matthew O'Toole, John Mather, Kyros Kutulakos
CVPR 2014