CSE 590V: Computer vision seminar (Fall 2016)

CSE 590V: Computer vision seminar

Fall 2016

Hand with Reflecting Sphere by M. C. Escher.
Recolored by xenomorph1138

Course description

CSE 590V is a seminar/reading group focused on recent work in computer vision. We will cover papers from recent and upcoming conferences related to computer vision (CVPR, ICCV, ECCV, SIGGRAPH, NIPS). The seminar is open to everyone. We especially encourage first year graduate students who may be considering research in computer vision or related areas to participate.

Logistics

Time: Fridays 11am-12pm

Location: CSE 403

Organizers: Konstantinos Rematas (krematas @ cs washington edu) and Chris Sweeney (csweeney @ cs washington edu)

Presentations

Each week we will cover a recent topic in computer vision by reading and discussing one or more relevant papers. A person will lead the discussion by presenting the chosen paper(s) for the week. We encourage all attendees to read the paper(s) beforehand and to actively participate in the discussion.

Each registered student will attend all classes and prepare a presentation (duration to be determined) on a selected paper(s). We will assign topics/papers during the first week based on preferences.

Each presenter will meet with the organizers during office hours to discuss the upcoming presentation, show prepared slides, and resolve any questions.

Office hours: Wednesdays 3-4pm at CSE 284

We are available to help review and help clarify the papers you will be reading. Additionally, we are here to help you make your presentation as awesome as possible.

**It is mandatory to come to office hours if you are presenting that week.**

Schedule

Date	Presenters	Papers	Slides
September 30	Organizational Meeting
October 7	Yuguan Max Horton	High-Quality Depth from Uncalibrated Small Motion Clip (Project Page) Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon CVPR 2016 Deep Residual Learning for Image Recognition (PDF) Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun CVPR 2016	Residual Networks slides
October 14	Edward Zhang, Aleks Aditya Sankar, Bindita Chaudhuri	Shading-aware Multi-view Stereo (PDF) Fabian Langguth, Kalyan Sunkavalli, Sunil Hadap, and Michael Goesele ECCV 2016 Colorful Image Colorization (PDF, Web) Richard Zhang, Phillip Isola, Alexei Efros ECCV 2016	Shading-aware MVS slides Colorful Colorization slides
October 21	Aravind Rajeswaran, Jin Qu Roy, Supasorn Suwajanakorn	Learning to Poke by Poking: Experiential Learning of Intuitive Physics ( PDF) Pulkit Agrawal, Ashvin Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine NIPS 2016 Force from Motion: Decoding Physical Sensation in a First Person Video (PDF) Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi CVPR 2016
October 28	Rowan Zellers, Qi Hu JJ Park, Qinyu	Fast R-CNN (PDF,website) Ross Girshick ICCV 2015 DynmamicFusion: Reconstruction and Tracking of Non-rigid Scenes in Real-Time (PDF) Richard Newcombe, Dieter Fox, Steve Seitz CVPR 2015
November 4	Ezgi Mercan Xuan Luo, Isaac	Understanding and Predicting Memorability at Large Scale (Project Page) A. Khosla, AS Raju, A. Torralba, and A. Oliva ICCV 2015 Practical Multispectral Lighting Reproduction (Project Page) C. LeGendre et. al. SIGGRAPH 2016
November 11	CVPR submission deadline	No seminar	--
November 18	Chung-Yi Weng, Junha Roh Eric, Chunjue Tang	Band-Sifting Decomposition for Image Based Material Editing (Project Page) Boyadzhiev et. al. ACM TOG 2016 Face2Face: Real-time Face Capture and Reenactment of RGB Videos (Project Page) J. Thies et. al. CVPR 2016
November 25	Thanksgiving	No seminar	--
December 2	Kiana Ehsani, Tsung-Wei Huang Patrick Lancaster, Keunhong Park	Learning to Refine Object Segments (Paper) Pedro O. Pinheiro, Ronan Collobert, Piotr Dollar ECCV 2016 Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields (Paper) Liu et. al. Arxiv 2015
December 9	Aaron Walsman, Hessam Daniel Gordon, Lucas	TBD	TBD

Paper List (to be updated)

Geometric Scene Understanding (Reconstruction/Recognition/Segmentation)

Shading-aware Multi-view Stereo (PDF)
Fabian Langguth, Kalyan Sunkavalli, Sunil Hadap, and Michael Goesele
ECCV 2016

High-Quality Depth from Uncalibrated Small Motion Clip (Project Page)
Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon
CVPR 2016

3D Modeling on the Go: Interactive 3D Reconstruction of Large-Scale Scenes on Mobile Devices (PDF)
Thomas Schops, Torsten Sattler, Christian Hane, Marc Pollefeys
3DV 2015

Structured Indoor Modeling (PDF)
Satoshi Ikehata, Hang Yan, Yasutaka Furukawa
ICCV 2015

Learning to Generate Chairs With Convolutional Neural Networks (PDF)
Alexey Dosovitskiy, Jost Tobias Springenberg, Thomas Brox
CVPR 2015

Single Image 3D Without a Single 3D Image (PDF)
David Fouhey, Muhammad Wajahat Hussain, Abhinav Gupta, Martial Hebert
ICCV 2015

Accurate Depth Map Estimation From a Lenslet Light Field Camera (PDF)
Hae-Gon Jeon, Jaesik Park, Gyeongmin Choe, Jinsun Park, Yunsu Bok, Yu-Wing Tai, In So Kweon
CVPR 2015

3D Scanning Deformable Objects With a Single RGBD Sensor (PDF)
Mingsong Dou, Jonathan Taylor, Henry Fuchs, Andrew Fitzgibbon, Shahram Izadi
CVPR 2015

Pose Induction for Novel Object Categories (PDF)
Shubham Tulsiani, Joao Carreira, Jitendra Malik
ICCV 2015

Cameras and Computational Displays

Cinema 3D: Large Scale Automultiscopic Display(PDF, website)
Netalee Efrat, Piotr Didyk, Mike Foshey, Wojciech Matusik, Anat Levin
SIGGRAPH 2016

Practical Multispectral Lighting Reproduction(PDF, website)
Chloe LeGendre, Xueming Yu, Dai Liu, Jay Busch, Andrew Jones, Sumanta Pattanaik, Paul Debevec
SIGGRAPH 2016

Emulating Displays with Continuously Varying Frame Rates(PDF, website)
Netalee Efrat, Piotr Didyk, Mike Foshey, Wojciech Matusik, Anat Levin
SIGGRAPH 2016

Additive Light Field Displays: Realization of Augmented Reality with Holographic Optical Elements(PDF, website)
Seungjae Lee, Changwon Jang, Seokil Moon, Jaebum Cho
SIGGRAPH 2016

View Synthesis

Learning-Based View Synthesis for Light Field Cameras ( PDF)
Nima Khademi Kalantari, Ting-Chun Wang, and Ravi Ramamoorthi
SIGGRAPH 2016
View Synthesis by Appearance Flow ( PDF)
Tinghui Zhou, Shubham Tulsiani, Weilun Sun, Jitendra Malik, Alexei A. Efros
ECCV 2016
DeepStereo: Learning to Predict New Views from the World’s Imagery ( PDF)
John Flynn, Ivan Neulander, James Philbin, Noah Snavely
CVPR 2016

Material and Light

Unsupervised Texture Transfer from Images to Model Collections (Web)
Tuanfeng Wang, Hao Su, Qixing Huang, Jingwei Huang, Leonidas Guibas, Niloy J. Mitra
SIGGRAPH Asia 2016

Reflectance Modeling by Neural Texture Synthesis (Web)
Miika Aittala, Timo Aila and Jaakko Lehtinen
SIGGRAPH 2016

Time-varying Weathering in Texture Space (PDF)
Rachele Bellini, Yanir Kleiman, Daniel Cohen-Or
SIGGRAPH 2016

Reinforcement Learning

Learning Visual Predictive Models of Physics for Playing Billiards ( PDF)
Katerina Fragkiadaki, Pulkit Agrawal, Sergey Levine, Jitendra Malik
ICLR 2016
The Curious Robot: Learning Visual Representations via Physical Interactions ( PDF)
Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park and Abhinav Gupta
ECCV 2016
Learning to Poke by Poking: Experiential Learning of Intuitive Physics ( PDF)
Pulkit Agrawal, Ashvin Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine
NIPS 2016
A Comparative Evaluation of Approximate Probabilistic Simulation and Deep Neural Networks as Accounts of Human Physical Scene Understanding ( PDF)
Renqiao Zhang, Jiajun Wu, Chengkai Zhang, William T. Freeman, Joshua B. Tenenbaum
arxiv 2016
Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning( website)
Xue Bin Peng, Glen Berseth, Michiel van de Panne
SIGGRAPH 2016

Vision and Language

Generative Adversarial Text to Image Synthesis (PDF,website)
Scott Reed, Zeynep Akata , Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele and Honglak Lee
ICML 2016

VQA: Visual Question Answering (PDF,website)
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, Larry Zitnick, Devi Parikh
ICCV 2015

Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books (PDF,website)
Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
ICCV 2015

CIDEr: Consensus-Based Image Description Evaluation (PDF,website)
Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
CVPR 2015

Show and Tell: A Neural Image Caption Generator (PDF)
Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan
CVPR 2015

Scene Understanding/Semantics

Understanding and Predicting Memorability at a Large-scale (PDF,website)
Aditya Khosla, Akhil Raju, Antonio Torralba, Aude Oliva
ICCV 2015

What makes an object memorable? (PDF,website)
Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, and Bernard Ghanem
ICCV 2015

Learning informative edge maps for indoor scene layout prediction
Arun Mallya, Svetlana Lazebnik
ICCV 2015

Automatic Object Discovery

Boosting Object Proposals: From Pascal to COCO (PDF)
Jordi Pont-Tuset, Luc Van Gool
ICCV 2015

Unsupervised Object Discovery and Tracking in Video Collections (PDF)
Suha Kwak, Minsu Cho, Jean Ponce, Cordelia Schmid, Ivan Laptev
ICCV 2015

Unsupervised Object Discovery and Localization in the Wild: Part-Based Matching With Bottom-Up Region Proposals (PDF)
Minsu Cho, Suha Kwak, Cordelia Schmid, Jean Ponce
CVPR 2015

Object Detection/Recognition

Fast R-CNN (PDF,website)
Ross Girshick
ICCV 2015

Semantic Guidance of Visual Attention for Localizing Objects in Scenes
Juan Caicedo, Svetlana Lazebnik
ICCV 2015

Webly Supervised Learning of Convolutional Networks (PDF)
Xinlei Chen, Abhinav Gupta
ICCV 2015

Love Thy Neighbors: Image Annotation by Exploiting Social Metadata (PDF)
Lamberto Ballan, Justin Johnson, Fei-Fei Li
ICCV 2015

Face

Face Flow (PDF)
Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou
ICCV 2015

Web-Scale Training for Face Identification (PDF)
Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
CVPR 2015

Detailed Spatio-Temporal Reconstruction of Eyelids (PDF)
Amit Bermano, Thabo Beeler, Yeara Kozlov, Derek Bradely, Bernd Bickel, Markus Gross
Siggraph 2015

Dynamic 3D Avatar Creation from Hand-held Video Input (PDF)
Alexandru-Eugen Ichim, Sofien Bouaziz, Mark Pauly
Siggraph 2015

High-Quality Hair Modeling from A Single Portrait Photo(PDF)
Menglei Chai, Linjie Luo, Kalyan Sunkavalli, Nathan Carr†,Sunil Hadap, Kun Zhou
Siggraph Asia 2015

A Mouth Full of Words: Visually Consistent Acoustic Redubbing (PDF)
Sarah Taylor, Barry-John Theobald, Iain Matthews
ICASSP 2015

Learning To Look Up: Realtime Monocular Gaze Correction Using Machine Learning(PDF)
Daniil Kononenko, Victor Lempitsky
CVPR 2015

Skin Microstructure Deformation with Displacement Map Convolution(PDF)
Koki Nagano, Graham Fyffe, Oleg Alexander, Jernej Barbic, Hao Li, Abhijeet Ghosh, Paul Debevec
Siggraph 2015

Time-offset interaction with a holocaust survivor(PDF)
Ron Artstein, David Traum, Oleg Alexander, Anton Leuski, Andrew Jones, Kallirroi Georgila, Paul Debevec, William Swartout, Heather Maio, Stephen Smith
IUI 2014

Driving High-Resolution Facial Scans with Video Performance Capture(PDF)
Graham Fyffe, Andrew Jones, Oleg Alexander, Ryosuke Ichikari, Paul Debevec
IUI 2014

Near-instant capture of high-resolution facial geometry and reflectance(PDF)
Paul Graham, Graham Fyffe, Borom Tonwattanapong, Abhijeet Ghosh, Paul Debevec
Siggraph 2015

Human Pose Estimation

General Automatic Human Shape and Motion Capture Using Volumetric Contour Cues (PDF, Web)
Helge Rhodin, Nadia Robertini, Dan Casas, Christian Richardt, Hans-Peter Seidel, Christian Theobalt
ECCV 2016

Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image (PDF)
Federica Bogo, Angjoo Kanazawa, Christoph Lassner, Peter Gehler, Javier Romero and Michael J. Black
ECCV 2016

Convolutional Pose Machines (Web)
Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh
CVPR 2016

A Deep Learning Framework For Character Motion Synthesis and Editing (Web)
Daniel Holden, Jun Saito, Taku Komura
SIGGRAPH 2016

Dyna(Web)
Gerard Pons-Moll, Javier Romero, Naureen Mahmood, Michael J. Black
Siggraph 2015

Flowing ConvNets for Human Pose Estimation in Videos (PDF)
Tomas Pfister, James Charles, Andrew Zisserman
ICCV 2015

Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation (PDF)
Sijin Li, Weichen Zhang, Antoni Chan
ICCV 2015

Action Recognition/Temporal Prediction

Dynamic Image Networks for Action Recognition (PDF)
Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi and Stephen Gould
CVPR 2016

Contextual Action Recognition with R*CNN (PDF)
Georgia Gkioxari, Ross Girshick, Jitendra Malik
ICCV 2015

Dense Optical Flow Prediction from a Static Image (PDF)
Jacob Walker, Abhinav Gupta, Martial Hebert
ICCV 2015

Temporal Perception and Prediction in Ego-Centric Video
Yipin Zhou, Tamara Berg
ICCV 2015

Learning Temporal Embeddings for Complex Video Analysis
Vignesh Ramanathan, Kevin Tang, Greg Mori, Fei-Fei Li
ICCV 2015

Storyline Representation of Egocentric Videos with an Applications to Story-based Search
Bo Xiong, Leonid Sigal, Gunhee Kim
ICCV 2015

Space-Time Tree Ensemble for Action Recognition
Shugao Ma, Leonid Sigal, Stan Sclaroff
CVPR 2015

Computational Photography

Colorful Image Colorization (PDF)
Richard Zhang, Phillip Isola, Alexei Efros
ECCV 2016

Occlusion-aware depth estimation using light-field cameras (PDF)
Ting-Chun Wang, Alexei Efros, Ravi Ramamoorthi
ICCV 2015

Visual Vibrometry: Estimating Material Properties From Small Motion in Video (PDF,website)
Abe Davis, Katherine L. Bouman, Justin G. Chen, Michael Rubinstein, Fredo Durand, William T. Freeman
ICCV 2015

Fast Bilateral-Space Stereo for Synthetic Defocus (PDF)
Jonathan T. Barron, Andrew Adams, YiChang Shih, Carlos Hernandez
ICCV 2015

Low Level/Descriptors

LIFT: Learned Invariant Feature Transform(PDF)
Kwang Moo Yi, Eduard Trulls, Vincent Lepetit, and Pascal Fua
ECCV 2016

Universal Correspondence Network(PDF)
C. Choy, J. Gwak, S. Savarese and M.K. Chandraker
NIPS 2016

Learning image representations equivariant to ego-motion (PDF)
Dinesh Jayaraman, Kristen Grauman
ICCV 2015

FlowWeb: Joint Image Set Alignment by Weaving Consistent, Pixel-wise Correspondences (PDF,website)
Tinghui Zhou, Yong Jae Lee, Stella Yu, and Alexei A. Efros
CVPR 2015

Unsupervised Visual Representation Learning by Context Prediction (PDF)
Carl Doersch, Abhinav Gupta, Alexei Efros
ICCV 2015

PatchMatch-based Automatic Lattice Detection for Near-Regular Textures
Siying Liu, Tian-Tsong Ng, Minh Do, Kalyan Sunkavalli, Eli Shechtman, Nathan Carr
ICCV 2015

EpicFlow: Edge-Preserving Interpolation of Correspondences for Optical Flow
Jerome Revaud, Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid
ICCV 2015

Understanding Image Representations by Measuring Their Equivariance and Equivalence (PDF)
Karel Lenc, Andrea Vedaldi
CVPR 2015

Applications

Learning Affordance for Direct Perception in Autonomous Driving (PDF,website)
Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao
ICCV 2015

Where to Buy It: Matching Street Clothing Photos in Online Shops (PDF)
Mohammadhadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alex Berg, Tamara Berg
ICCV 2015

Joint Photo Stream and Blog Post Summarization and Exploration (PDF)
Gunhee Kim, Seungwhan Moon, Leonid Sigal
CVPR 2015

Social Saliency Prediction (PDF)
Hyun Soo Park, Jianbo Shi
CVPR 2015

Panoptic Studio: A Massively Multiview System for Social Motion Capture (PDF)
Hanbyul Joo, Hao Liu, Lei Tan, Lin Gui, Shohei Nobuhara, Yaser Sheikh
ICCV 2015

CVPR16 Best paper award winners

Deep Residual Learning for Image Recognition (PDF)
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
CVPR 2016

Structural-RNN: Deep Learning on Spatio-Temporal Graphs (PDF)
Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
CVPR 2016

CVPR16 Best paper honorable mention

Sublabel-Accurate Relaxation of Nonconvex Energies (PDF)
Thomas Möllenhoff, Emanuel Laude, Michael Moeller, Jan Lellmann, Daniel Cremers
CVPR 2016