CSE 590V: Computer vision seminar

Fall 2014


Hand with Reflecting Sphere by M. C. Escher.
Recolored by xenomorph1138

Course description

CSE 590V is a seminar/reading group focused on recent work in computer vision. We will cover papers from recent and upcoming conferences related to computer vision (CVPR, ICCV, ECCV, NIPS, SIGGRAPH). The seminar is open to everyone. We especially encourage first year graduate students who may be considering research in computer vision or related areas to participate.


Logistics

Time: Fridays 11am-12pm

Location: CSE 203

Organizers: Hamid Izadinia (izadinia @ cs washington edu) and Ricardo Martin (rmartin @ cs washington edu)

Class mailing list: cse590v @ cs washington edu (subscribe here)


Presentations

Each week we will cover a recent topic in computer vision by reading and discussing one or more relevant papers. A person will lead the discussion by presenting the chosen paper(s) for the week. We encourage all attendees to read the paper(s) beforehand and to actively participate in the discussion.

Each registered student will attend all classes and prepare a presentation (duration to be determined) on a selected paper(s). We will assign topics/papers during the first week based on preferences.

Each presenter will meet with the organizers before the class date to discuss the upcoming presentation, show prepared slides, and resolve any questions.


Schedule

Date Presenters Papers Slides
Oct 3rd Ezgi Mercan and Yao Lu

  • Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
    R. Girshick, J. Donahue, T. Darrell, J. Malik, CVPR 2014 (PDF)
  • Edge Boxes: Locating Object Proposals from Edges
    Lawrence C. Zitnick and Piotr Dollár, ECCV 2014 (PDF)

R-CNN slides, Object proposal slides
Oct 10th Supasorn Suwajanakorn and Alon Milchgrub

  • Learning to be a Depth Camera for Close-Range Human Capture and Interaction
    Sean Ryan Fanello, Cem Keskin, Shahram Izadi, Pushmeet Kohli, David Kim, David Sweeney, Antonio Criminisi, Jamie Shotton, Sing Bing Kang, and Tim Paek, SIGGRAPH 2014 (PDF)
  • Deep Learning Face Representation from Predicting 10,000 Classes
    Yi Sun, Xiaogang Wang, Xiaoou Tang, CVPR 2014 (PDF)

Depth camera slides, Face slides
Oct 17th Alicia Clark and Nancy Wang

  • PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding
    Yinda Zhang, Shuran Song, Ping Tan, Jianxiong Xiao, ECCV 2014 (PDF)
  • Assessing the Quality of Actions
    Hamed Pirsiavash, Carl Vondrick, Antonio Torralba, ECCV 2014 (PDF)

Pano context slides, Actions quality slides
Oct 24th Juliet Fiss and Edward Zhang

  • Eulerian Video Magnification for Revealing Subtle Changes in the World
    Hao-Yu Wu, Michael Rubinstein, Eugene Shih, John Guttag, Fredo Durand and William T. Freeman, SIGGRAPH 2012, (PDF)
  • Phase-Based Video Motion Processing
    Neal Wadhwa, Michael Rubinstein, Fredo Durand and William T. Freeman, SIGGRAPH 2013 (PDF)
  • The Visual Microphone: Passive Recovery of Sound from Video
    Abe Davis, Michael Rubinstein, Neal Wadhwa, Gautham Mysore, Fredo Durand and William T. Freeman, SIGGRAPH 2014 (PDF)

Eulerian Magnification slides, Phase-based Magnification slides, Visual Microphone slides
Oct 31st Daniel Gordon and Shu Liang

  • Sliding Shapes for 3D Object Detection in Depth Images
    Shuran Song and Jianxiong Xiao, ECCV 2014 (PDF)
  • Real-time Non-rigid Reconstruction using an RGB-D Camera
    Michael Zollhöfer, Matthias Nießner, Shahram Izadi, Christoph Rehmann, Christopher Zach, Matthew Fisher, Chenglei Wu, Andrew Fitzgibbon, Charles Loop, Christian Theobalt, Marc Stamminger, TOG 2014 (PDF)

Sliding Shapes slides, Reconstruction slides
Nov 7th Daniel Miller and Alex Mariakakis

  • Zero-shot Recognition with Unreliable Attributes
    Dinesh Jayaraman, Kristen Grauman, NIPS 2014 (PDF)
  • Programmable Automotive Headlights
    Robert Tamburo, Eriko Nurvitadhi, Abhishek Chugh, Mei Chen, Anthony Rowe, Takeo Kanade, Srinivasa Narasimhan, ECCV 2014 (PDF)

Headlight slides
Nov 14th CVPR deadline No seminar
Nov 21st Fereshteh Sadeghi and Aditya Sankar

  • Reasoning About Object Affordances in a Knowledge Base Representation
    Yuke Zhu, Alireza Fathi, and Li Fei-Fei, ECCV 2014 (PDF)
  • Piecewise Planar and Compact Floorplan Reconstruction from Images
    Ricardo Cabral and Yasutaka Furukawa, CVPR 2014 (PDF)

Object Affordance slides Floorplan Reconstruction slides
Nov 28th Thanksgiving No seminar
Dec 5th Aleksander Holynski and Hamid Izadinia

  • Robust Global Translations with 1DSfM
    Kyle Wilson and Noah Snavely, ECCV 2014 (PDF)
  • Visualizing and Understanding Convolutional Networks
    Matthew Zeiler, Rob Fergus, ECCV 2014 (PDF)

1DSfM slides Visualize CNN slides

Paper List

Geometric Scene Understanding (reconstruction/recognition/segmentation)

  • PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding (PDF, website)
    Y. Zhang, S. Song, P. Tan, and J. Xiao
    ECCV 2014
  • Scene Chronology (PDF, website)
    Kevin Matzen and Noah Snavely
    ECCV 2014
  • Unfolding an Indoor Origami World (PDF, website)
    David F. Fouhey, Abhinav Gupta, Martial Hebert
    ECCV 2014
  • Reconstructing PASCAL VOC (PDF, website)
    S. Vicente, J. Carreira, L. Agapito and J. Batista
    CVPR 2014
  • LSD-SLAM: Large-Scale Direct Monocular SLAM (PDF, website)
    Jakob Engel, Thomas Schöps, Daniel Cremers
    ECCV14

Semantics/Scene understanding

  • Learning Deep Features for Scene Recognition using PLACES Database
    Bolei Zhou, Jianxiong Xiao, Agata Garcia, Aude Oliva, Antonio Torralba
    NIPS 2014
  • Reasoning About Object Affordances in a Knowledge Base Representation (PDF)
    Yuke Zhu, Alireza Fathi, and Li Fei-Fei
    ECCV 2014
  • Multi-scale Orderless Pooling of Deep Convolutional Activation Features (PDF)
    Yunchao Gong, Liwei Wang, Ruiqi Guo, Svetlana Lazebnik
    ECCV 2014
  • Patch to the Future: Unsupervised Visual Prediction (PDF)
    Jacob Walker, Abhinav Gupta, and Martial Hebert
    CVPR 2014

Reconstruction/Depth estimation

  • Color Map Optimization for 3D Reconstruction with Consumer Depth Cameras (PDF, website)
    Qian-Yi Zhou and Vladlen Koltun
    SIGGRAPH 2014
  • Robust Global Translations with 1DSfM (PDF, website)
    Kyle Wilson and Noah Snavely
    ECCV 2014

Automatic Object Discovery

  • Context as Supervisory Signal: Discovering Objects with Predictable Context (PDF, website)
    Carl Doersch, Abhinav Gupta, Alexei Efros
    ECCV 2014
  • Geodesic Object Proposals (PDF, website)
    Philipp Krähenbühl and Vladlen Koltun
    ECCV 2014
  • Edge Boxes: Locating Object Proposals from Edges (PDF, website)
    Lawrence C. Zitnick and Piotr Dollár
    ECCV 2014
  • Associative embeddings for large-scale knowledge transfer with self-assessment (PDF, website)
    Alexander Vezhnevets, Vittorio Ferrari
    CVPR 2014

Object Detection/Recognition

  • Part-based R-CNNs for Fine-grained Category Detection (PDF)
    Ning Zhang, Jeff Donahue, Ross Girshick, Trevor Darrell
    ECCV 2014
  • Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation (PDF)
    R. Girshick, J. Donahue, T. Darrell, J. Malik
    CVPR 2014
  • Large-Scale Object Classification using Label Relation Graphs (PDF)
    Jia Deng, Nan Ding, Yangqing Jia, Andrea Frome, Kevin Murphy, Samy Bengio, Yuan Li, Hartmut
    Neven, Hartwig Adam
    ECCV 2014
  • Sliding Shapes for 3D Object Detection in Depth Images (PDF, website)
    S. Song, and J. Xiao.
    ECCV 2014
  • From Large-Scale Object Classifiers to Large-Scale Object Detectors: An Adaptation Approach (PDF)
    Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Jeff Donahue, Trevor Darrell, Kate Saenko, Ross Girshick
    NIPS 2014
  • Deformable Part Models are Convolutional Neural Networks (PDF)
    Ross Girshick, Forrest Iandola, Trevor Darrell, Jitendra Malik
    Tech Report

Face

  • Deep Learning Face Representation from Predicting 10,000 Classes (PDF)
    Yi Sun, Xiaogang Wang, and Xiaoou Tang
    CVPR 2014
  • Eigen-PEP for Video Face Recognition (PDF)
    Haoxiang Li, Gang Hua, Xiaohui Shen, Zhe Lin, and Jonathan Brandt
    ACCV 2014
  • Hybrid Deep Learning for Face Verification (PDF)
    Yi Sun, Xiaogang Wang, and Xiaoou Tang
    ICCV 2013
  • Towards Pose Robust Face Recognition (PDF)
    Dong Yi, Zhen Lei, and Stan Z. Li
    CVPR 2013
  • DeepFace: Closing the Gap to Human-Level Performance in Face Verification (PDF, website)
    Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
    CVPR 2014
  • Real-time Non-rigid Reconstruction using an RGB-D Camera (PDF, website)
    Chenglei Wu, Michael Zollhöfer, Matthias Nießner, Marc Stamminger, Shahram Izadi, Christian Theobalt
    SIGGRAPH 2014

Human Pose estimation

  • Pose Machines: Articulated Pose Estimation via Inference Machines (PDF)
    Varun Ramakrishna, Daniel Munoz, Martial Hebert, J. Andrew (Drew) Bagnell, and Yaser Ajmal Sheikh
    ECCV 2014
  • Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation (PDF)
    Jonathan Tompson, Arjun Jain, Yann LeCun, Christoph Bregler
    NIPS 2014

Attributes

  • Zero-shot Recognition with Unreliable Attributes (PDF)
    Dinesh Jayaraman, Kristen Grauman
    NIPS 2014
  • Transient Attributes for High-Level Understanding and Editing of Outdoor Scenes(PDF, website)
    Pierre-Yves Laffont and Zhile Ren and Xiaofeng Tao and Chao Qian and James Hays
    SIGGRAPH 2014

Action Recognition

  • Weakly Supervised Action Labeling in Videos Under Ordering Constraints (PDF)
    Piotr Bojanowski, Remi Lajugie, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid, Josef Sivic
    ECCV 2014
  • Assessing the Quality of Actions (PDF, website)
    Hamed Pirsiavash, Carl Vondrick, Antonio Torralba
    ECCV 2014
  • Large-Scale Video Classification with Convolutional Neural Networks (PDF, website)
    Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei
    CVPR 2014
  • Parsing videos of actions with segmental grammars (PDF)
    Hamed Pirsiavash, Deva Ramanan
    CVPR 2014

Localization and Camera Tracking

  • Predicting Matchability (PDF)
    W. Hartmann, M. Havlena, K. Schindler
    CVPR 2014
  • Recognizing City Identity Via Attribute Analysis of Geo-tagged Images (PDF, website)
    Bolei Zhou, Liu Liu, Aude Oliva, Antonio Torralba
    ECCV 2014

Computational Photography

  • Learning to be a depth camera for close-range human capture and interaction (PDF, website)
    Sean Ryan Fanello, Cem Keskin, Shahram Izadi, Pushmeet Kohli, Jamie Shotton, Antonio Criminisi, David Kim, David Sweeney, Sing Bing Kang
    SIGGRAPH 2014

Low level/descriptors

  • Filter Forests for Learning Data-Dependent Convolutional Kernels (PDF, website)
    Sean Ryan Fanello, Cem Keskin, Pushmeet Kohli, Shahram Izadi, Jamie Shotton, Antonio Criminisi, Ugo Pattacini, Tim Paek
    CVPR 2014
  • Visualizing and Understanding Convolutional Networks (PDF)
    Matthew Zeiler, Rob Fergus
    ECCV 2014

Applications

  • First-person Hyper-lapse videos (PDF, website)
    Johannes Kopf, Michael Cohen, Richard Szeliski
    SIGGRAPH 2014
  • The Visual Microphone: Passive Recovery of Sound from Video (PDF, website)
    Myers Davis, Michael Rubinstein, Neal Wadhwa, Gautham Mysore, Fredo Durand, William Freeman
    SIGGRAPH 2014
  • Programmable Automotive Headlights (PDF)
    Robert Tamburo, Eriko Nurvitadhi, Abhishek Chugh, Mei Chen, Anthony Rowe, Takeo Kanade, Srinivasa Narasimhan
    ECCV 2014

CVPR14 best paper award winners

  • What Camera Motion Reveals About Shape with Unknown BRDF (PDF)
    Manmohan Chandraker
    CVPR 2014
  • Partial Optimality by Pruning for MAP-inference with General Graphical Models (PDF)
    Paul Swoboda, Bogdan Savchynskyy, Joerg Kappes, Christoph Schnörr
    CVPR 2014
  • 3D Shape and Indirect Appearance by Structured Light Transport (PDF)
    Matthew O'Toole, John Mather, Kyros Kutulakos
    CVPR 2014