CSE 590V: Computer vision seminar

Fall 2016


Hand with Reflecting Sphere by M. C. Escher.
Recolored by xenomorph1138

Course description

CSE 590V is a seminar/reading group focused on recent work in computer vision. We will cover papers from recent and upcoming conferences related to computer vision (CVPR, ICCV, ECCV, SIGGRAPH, NIPS). The seminar is open to everyone. We especially encourage first-year graduate students who may be considering research in computer vision or related areas to participate.


Logistics

Time: Fridays 11am-12pm

Location: CSE 403

Organizers: Konstantinos Rematas (krematas @ cs washington edu) and Chris Sweeney (csweeney @ cs washington edu)


Presentations

Each week we will cover a recent topic in computer vision by reading and discussing one or more relevant papers. A person will lead the discussion by presenting the chosen paper(s) for the week. We encourage all attendees to read the paper(s) beforehand and to actively participate in the discussion.

Each registered student will attend all classes and prepare a presentation (duration to be determined) on one or more selected papers. We will assign topics/papers during the first week based on preferences.

Each presenter will meet with the organizers during office hours to discuss the upcoming presentation, show prepared slides, and resolve any questions.


Office hours: Wednesdays 3-4pm at CSE 284

We are available to help you review and clarify the papers you will be reading. We are also here to help you make your presentation as awesome as possible.

**It is mandatory to come to office hours if you are presenting that week.**

Schedule

Date Presenters Papers Slides
September 30 Organizational Meeting
October 7
  • Yuguan
  • Max Horton

  • High-Quality Depth from Uncalibrated Small Motion Clip (Project Page)
    Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon
    CVPR 2016
  • Deep Residual Learning for Image Recognition (PDF)
    Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
    CVPR 2016

Residual Networks slides
October 14
  • Edward Zhang, Aleks
  • Aditya Sankar, Bindita Chaudhuri

  • Shading-aware Multi-view Stereo (PDF)
    Fabian Langguth, Kalyan Sunkavalli, Sunil Hadap, and Michael Goesele
    ECCV 2016
  • Colorful Image Colorization (PDF, Web)
    Richard Zhang, Phillip Isola, Alexei Efros
    ECCV 2016

Shading-aware MVS slides
Colorful Colorization slides
October 21
  • Aravind Rajeswaran, Jin Qu
  • Roy, Supasorn Suwajanakorn

  • Learning to Poke by Poking: Experiential Learning of Intuitive Physics (PDF)
    Pulkit Agrawal, Ashvin Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine
    NIPS 2016
  • Force from Motion: Decoding Physical Sensation in a First Person Video (PDF)
    Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi
    CVPR 2016

October 28
  • Rowan Zellers, Qi Hu
  • JJ Park, Qinyu

  • Fast R-CNN (PDF, website)
    Ross Girshick
    ICCV 2015
  • DynamicFusion: Reconstruction and Tracking of Non-rigid Scenes in Real-Time (PDF)
    Richard Newcombe, Dieter Fox, Steve Seitz
    CVPR 2015

November 4
  • Ezgi Mercan
  • Xuan Luo, Isaac

  • Understanding and Predicting Image Memorability at a Large Scale (Project Page)
    A. Khosla, A. S. Raju, A. Torralba, and A. Oliva
    ICCV 2015
  • Practical Multispectral Lighting Reproduction (Project Page)
    C. LeGendre et al.
    SIGGRAPH 2016

November 11: No seminar (CVPR submission deadline)
November 18
  • Chung-Yi Weng, Junha Roh
  • Eric, Chunjue Tang

  • Band-Sifting Decomposition for Image Based Material Editing (Project Page)
    Boyadzhiev et al.
    ACM TOG 2016
  • Face2Face: Real-time Face Capture and Reenactment of RGB Videos (Project Page)
    J. Thies et al.
    CVPR 2016

November 25: No seminar (Thanksgiving)
December 2
  • Kiana Ehsani, Tsung-Wei Huang
  • Patrick Lancaster, Keunhong Park
  • Learning to Refine Object Segments (Paper)
    Pedro O. Pinheiro, Ronan Collobert, Piotr Dollar
    ECCV 2016
  • Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields (Paper)
    Liu et al.
    arXiv 2015
December 9
  • Aaron Walsman, Hessam
  • Daniel Gordon, Lucas
TBD TBD

Paper List (to be updated)

Geometric Scene Understanding (Reconstruction/Recognition/Segmentation)

  • Shading-aware Multi-view Stereo (PDF)
    Fabian Langguth, Kalyan Sunkavalli, Sunil Hadap, and Michael Goesele
    ECCV 2016
  • High-Quality Depth from Uncalibrated Small Motion Clip (Project Page)
    Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon
    CVPR 2016
  • 3D Modeling on the Go: Interactive 3D Reconstruction of Large-Scale Scenes on Mobile Devices (PDF)
    Thomas Schops, Torsten Sattler, Christian Hane, Marc Pollefeys
    3DV 2015
  • Structured Indoor Modeling (PDF)
    Satoshi Ikehata, Hang Yan, Yasutaka Furukawa
    ICCV 2015
  • Learning to Generate Chairs With Convolutional Neural Networks (PDF)
    Alexey Dosovitskiy, Jost Tobias Springenberg, Thomas Brox
    CVPR 2015
  • Single Image 3D Without a Single 3D Image (PDF)
    David Fouhey, Muhammad Wajahat Hussain, Abhinav Gupta, Martial Hebert
    ICCV 2015
  • Accurate Depth Map Estimation From a Lenslet Light Field Camera (PDF)
    Hae-Gon Jeon, Jaesik Park, Gyeongmin Choe, Jinsun Park, Yunsu Bok, Yu-Wing Tai, In So Kweon
    CVPR 2015
  • 3D Scanning Deformable Objects With a Single RGBD Sensor (PDF)
    Mingsong Dou, Jonathan Taylor, Henry Fuchs, Andrew Fitzgibbon, Shahram Izadi
    CVPR 2015
  • Pose Induction for Novel Object Categories (PDF)
    Shubham Tulsiani, Joao Carreira, Jitendra Malik
    ICCV 2015

Cameras and Computational Displays

  • Cinema 3D: Large Scale Automultiscopic Display (PDF, website)
    Netalee Efrat, Piotr Didyk, Mike Foshey, Wojciech Matusik, Anat Levin
    SIGGRAPH 2016
  • Practical Multispectral Lighting Reproduction (PDF, website)
    Chloe LeGendre, Xueming Yu, Dai Liu, Jay Busch, Andrew Jones, Sumanta Pattanaik, Paul Debevec
    SIGGRAPH 2016
  • Emulating Displays with Continuously Varying Frame Rates (PDF, website)
    Krzysztof Templin, Piotr Didyk, Karol Myszkowski, Hans-Peter Seidel
    SIGGRAPH 2016
  • Additive Light Field Displays: Realization of Augmented Reality with Holographic Optical Elements (PDF, website)
    Seungjae Lee, Changwon Jang, Seokil Moon, Jaebum Cho
    SIGGRAPH 2016

View Synthesis

  • Learning-Based View Synthesis for Light Field Cameras (PDF)
    Nima Khademi Kalantari, Ting-Chun Wang, and Ravi Ramamoorthi
    SIGGRAPH 2016
  • View Synthesis by Appearance Flow (PDF)
    Tinghui Zhou, Shubham Tulsiani, Weilun Sun, Jitendra Malik, Alexei A. Efros
    ECCV 2016
  • DeepStereo: Learning to Predict New Views from the World’s Imagery (PDF)
    John Flynn, Ivan Neulander, James Philbin, Noah Snavely
    CVPR 2016

Material and Light

  • Unsupervised Texture Transfer from Images to Model Collections (Web)
    Tuanfeng Wang, Hao Su, Qixing Huang, Jingwei Huang, Leonidas Guibas, Niloy J. Mitra
    SIGGRAPH Asia 2016
  • Reflectance Modeling by Neural Texture Synthesis (Web)
    Miika Aittala, Timo Aila and Jaakko Lehtinen
    SIGGRAPH 2016
  • Time-varying Weathering in Texture Space (PDF)
    Rachele Bellini, Yanir Kleiman, Daniel Cohen-Or
    SIGGRAPH 2016

Reinforcement Learning

  • Learning Visual Predictive Models of Physics for Playing Billiards (PDF)
    Katerina Fragkiadaki, Pulkit Agrawal, Sergey Levine, Jitendra Malik
    ICLR 2016
  • The Curious Robot: Learning Visual Representations via Physical Interactions (PDF)
    Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park and Abhinav Gupta
    ECCV 2016
  • Learning to Poke by Poking: Experiential Learning of Intuitive Physics (PDF)
    Pulkit Agrawal, Ashvin Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine
    NIPS 2016
  • A Comparative Evaluation of Approximate Probabilistic Simulation and Deep Neural Networks as Accounts of Human Physical Scene Understanding (PDF)
    Renqiao Zhang, Jiajun Wu, Chengkai Zhang, William T. Freeman, Joshua B. Tenenbaum
    arXiv 2016
  • Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning (website)
    Xue Bin Peng, Glen Berseth, Michiel van de Panne
    SIGGRAPH 2016

Vision and Language

  • Generative Adversarial Text to Image Synthesis (PDF, website)
    Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele and Honglak Lee
    ICML 2016
  • VQA: Visual Question Answering (PDF, website)
    Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, Larry Zitnick, Devi Parikh
    ICCV 2015
  • Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books (PDF, website)
    Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
    ICCV 2015
  • CIDEr: Consensus-Based Image Description Evaluation (PDF, website)
    Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
    CVPR 2015
  • Show and Tell: A Neural Image Caption Generator (PDF)
    Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan
    CVPR 2015

Scene Understanding/Semantics

  • Understanding and Predicting Image Memorability at a Large Scale (PDF, website)
    Aditya Khosla, Akhil Raju, Antonio Torralba, Aude Oliva
    ICCV 2015
  • What makes an object memorable? (PDF, website)
    Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, and Bernard Ghanem
    ICCV 2015
  • Learning informative edge maps for indoor scene layout prediction
    Arun Mallya, Svetlana Lazebnik
    ICCV 2015

Automatic Object Discovery

  • Boosting Object Proposals: From Pascal to COCO (PDF)
    Jordi Pont-Tuset, Luc Van Gool
    ICCV 2015
  • Unsupervised Object Discovery and Tracking in Video Collections (PDF)
    Suha Kwak, Minsu Cho, Jean Ponce, Cordelia Schmid, Ivan Laptev
    ICCV 2015
  • Unsupervised Object Discovery and Localization in the Wild: Part-Based Matching With Bottom-Up Region Proposals (PDF)
    Minsu Cho, Suha Kwak, Cordelia Schmid, Jean Ponce
    CVPR 2015

Object Detection/Recognition

  • Fast R-CNN (PDF, website)
    Ross Girshick
    ICCV 2015
  • Semantic Guidance of Visual Attention for Localizing Objects in Scenes
    Juan Caicedo, Svetlana Lazebnik
    ICCV 2015
  • Webly Supervised Learning of Convolutional Networks (PDF)
    Xinlei Chen, Abhinav Gupta
    ICCV 2015
  • Love Thy Neighbors: Image Annotation by Exploiting Social Metadata (PDF)
    Lamberto Ballan, Justin Johnson, Fei-Fei Li
    ICCV 2015

Face

  • Face Flow (PDF)
    Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou
    ICCV 2015
  • Web-Scale Training for Face Identification (PDF)
    Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
    CVPR 2015
  • Detailed Spatio-Temporal Reconstruction of Eyelids (PDF)
    Amit Bermano, Thabo Beeler, Yeara Kozlov, Derek Bradley, Bernd Bickel, Markus Gross
    SIGGRAPH 2015
  • Dynamic 3D Avatar Creation from Hand-held Video Input (PDF)
    Alexandru-Eugen Ichim, Sofien Bouaziz, Mark Pauly
    SIGGRAPH 2015
  • High-Quality Hair Modeling from A Single Portrait Photo (PDF)
    Menglei Chai, Linjie Luo, Kalyan Sunkavalli, Nathan Carr, Sunil Hadap, Kun Zhou
    SIGGRAPH Asia 2015
  • A Mouth Full of Words: Visually Consistent Acoustic Redubbing (PDF)
    Sarah Taylor, Barry-John Theobald, Iain Matthews
    ICASSP 2015
  • Learning To Look Up: Realtime Monocular Gaze Correction Using Machine Learning (PDF)
    Daniil Kononenko, Victor Lempitsky
    CVPR 2015
  • Skin Microstructure Deformation with Displacement Map Convolution (PDF)
    Koki Nagano, Graham Fyffe, Oleg Alexander, Jernej Barbic, Hao Li, Abhijeet Ghosh, Paul Debevec
    SIGGRAPH 2015
  • Time-offset interaction with a Holocaust survivor (PDF)
    Ron Artstein, David Traum, Oleg Alexander, Anton Leuski, Andrew Jones, Kallirroi Georgila, Paul Debevec, William Swartout, Heather Maio, Stephen Smith
    IUI 2014
  • Driving High-Resolution Facial Scans with Video Performance Capture (PDF)
    Graham Fyffe, Andrew Jones, Oleg Alexander, Ryosuke Ichikari, Paul Debevec
    IUI 2014
  • Near-instant capture of high-resolution facial geometry and reflectance (PDF)
    Paul Graham, Graham Fyffe, Borom Tunwattanapong, Abhijeet Ghosh, Paul Debevec
    SIGGRAPH 2015

Human Pose Estimation

  • General Automatic Human Shape and Motion Capture Using Volumetric Contour Cues (PDF, Web)
    Helge Rhodin, Nadia Robertini, Dan Casas, Christian Richardt, Hans-Peter Seidel, Christian Theobalt
    ECCV 2016
  • Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image (PDF)
    Federica Bogo, Angjoo Kanazawa, Christoph Lassner, Peter Gehler, Javier Romero and Michael J. Black
    ECCV 2016
  • Convolutional Pose Machines (Web)
    Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh
    CVPR 2016
  • A Deep Learning Framework For Character Motion Synthesis and Editing (Web)
    Daniel Holden, Jun Saito, Taku Komura
    SIGGRAPH 2016
  • Dyna (Web)
    Gerard Pons-Moll, Javier Romero, Naureen Mahmood, Michael J. Black
    SIGGRAPH 2015
  • Flowing ConvNets for Human Pose Estimation in Videos (PDF)
    Tomas Pfister, James Charles, Andrew Zisserman
    ICCV 2015
  • Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation (PDF)
    Sijin Li, Weichen Zhang, Antoni Chan
    ICCV 2015

Action Recognition/Temporal Prediction

  • Dynamic Image Networks for Action Recognition (PDF)
    Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi and Stephen Gould
    CVPR 2016
  • Contextual Action Recognition with R*CNN (PDF)
    Georgia Gkioxari, Ross Girshick, Jitendra Malik
    ICCV 2015
  • Dense Optical Flow Prediction from a Static Image (PDF)
    Jacob Walker, Abhinav Gupta, Martial Hebert
    ICCV 2015
  • Temporal Perception and Prediction in Ego-Centric Video
    Yipin Zhou, Tamara Berg
    ICCV 2015
  • Learning Temporal Embeddings for Complex Video Analysis
    Vignesh Ramanathan, Kevin Tang, Greg Mori, Fei-Fei Li
    ICCV 2015
  • Storyline Representation of Egocentric Videos with an Applications to Story-based Search
    Bo Xiong, Leonid Sigal, Gunhee Kim
    ICCV 2015
  • Space-Time Tree Ensemble for Action Recognition
    Shugao Ma, Leonid Sigal, Stan Sclaroff
    CVPR 2015

Computational Photography

  • Colorful Image Colorization (PDF)
    Richard Zhang, Phillip Isola, Alexei Efros
    ECCV 2016
  • Occlusion-aware depth estimation using light-field cameras (PDF)
    Ting-Chun Wang, Alexei Efros, Ravi Ramamoorthi
    ICCV 2015
  • Visual Vibrometry: Estimating Material Properties From Small Motion in Video (PDF, website)
    Abe Davis, Katherine L. Bouman, Justin G. Chen, Michael Rubinstein, Fredo Durand, William T. Freeman
    ICCV 2015
  • Fast Bilateral-Space Stereo for Synthetic Defocus (PDF)
    Jonathan T. Barron, Andrew Adams, YiChang Shih, Carlos Hernandez
    ICCV 2015

Low Level/Descriptors

  • LIFT: Learned Invariant Feature Transform (PDF)
    Kwang Moo Yi, Eduard Trulls, Vincent Lepetit, and Pascal Fua
    ECCV 2016
  • Universal Correspondence Network (PDF)
    C. Choy, J. Gwak, S. Savarese and M.K. Chandraker
    NIPS 2016
  • Learning image representations equivariant to ego-motion (PDF)
    Dinesh Jayaraman, Kristen Grauman
    ICCV 2015
  • FlowWeb: Joint Image Set Alignment by Weaving Consistent, Pixel-wise Correspondences (PDF, website)
    Tinghui Zhou, Yong Jae Lee, Stella Yu, and Alexei A. Efros
    CVPR 2015
  • Unsupervised Visual Representation Learning by Context Prediction (PDF)
    Carl Doersch, Abhinav Gupta, Alexei Efros
    ICCV 2015
  • PatchMatch-based Automatic Lattice Detection for Near-Regular Textures
    Siying Liu, Tian-Tsong Ng, Minh Do, Kalyan Sunkavalli, Eli Shechtman, Nathan Carr
    ICCV 2015
  • EpicFlow: Edge-Preserving Interpolation of Correspondences for Optical Flow
    Jerome Revaud, Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid
    ICCV 2015
  • Understanding Image Representations by Measuring Their Equivariance and Equivalence (PDF)
    Karel Lenc, Andrea Vedaldi
    CVPR 2015

Applications

  • Learning Affordance for Direct Perception in Autonomous Driving (PDF, website)
    Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao
    ICCV 2015
  • Where to Buy It: Matching Street Clothing Photos in Online Shops (PDF)
    Mohammadhadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alex Berg, Tamara Berg
    ICCV 2015
  • Joint Photo Stream and Blog Post Summarization and Exploration (PDF)
    Gunhee Kim, Seungwhan Moon, Leonid Sigal
    CVPR 2015
  • Social Saliency Prediction (PDF)
    Hyun Soo Park, Jianbo Shi
    CVPR 2015
  • Panoptic Studio: A Massively Multiview System for Social Motion Capture (PDF)
    Hanbyul Joo, Hao Liu, Lei Tan, Lin Gui, Shohei Nobuhara, Yaser Sheikh
    ICCV 2015

CVPR16 Best paper award winners

  • Deep Residual Learning for Image Recognition (PDF)
    Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
    CVPR 2016
  • Structural-RNN: Deep Learning on Spatio-Temporal Graphs (PDF)
    Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
    CVPR 2016

CVPR16 Best paper honorable mention

  • Sublabel-Accurate Relaxation of Nonconvex Energies (PDF)
    Thomas Möllenhoff, Emanuel Laude, Michael Moeller, Jan Lellmann, Daniel Cremers
    CVPR 2016