Course description
CSE 590V is a seminar/reading group focused on recent work in computer vision. We will cover
papers from recent and upcoming conferences related to computer vision (CVPR, ICCV, ECCV,
SIGGRAPH, NIPS). The seminar is open to everyone. We especially encourage first year graduate
students who may be considering research in computer vision or related areas to participate.
Logistics
Time: Fridays 11am-12pm
Location: CSE 403
Organizers: Konstantinos Rematas (krematas @
cs washington edu) and Chris Sweeney (csweeney @ cs washington edu)
Presentations
Each week we will cover a recent topic in computer vision by reading and discussing one or more
relevant papers. A person will lead the discussion by presenting the chosen paper(s) for the
week. We encourage all attendees to read the paper(s) beforehand and to actively participate in
the discussion.
Each registered student will attend all classes and prepare a presentation (duration to be
determined) on a selected paper(s). We will assign topics/papers during the first week based on
preferences.
Each presenter will meet with the organizers during office hours to discuss the upcoming
presentation, show prepared slides, and resolve any questions.
Office hours: Wednesdays 3-4pm at CSE 284
We are available to help review and help clarify the papers you will be reading. Additionally, we are here to help you make your presentation as awesome as possible.
**It is mandatory to come to office hours if you are presenting that week.**
Schedule
Date |
Presenters |
Papers |
Slides |
September 30 |
Organizational Meeting |
|
|
October 7 |
|
- High-Quality Depth from Uncalibrated Small Motion Clip (Project Page)
Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon
CVPR 2016
- Deep Residual Learning for Image Recognition (PDF)
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
CVPR 2016
|
Residual Networks slides
|
October 14 |
- Edward Zhang, Aleks
- Aditya Sankar, Bindita Chaudhuri
|
- Shading-aware Multi-view Stereo (PDF)
Fabian Langguth, Kalyan Sunkavalli, Sunil Hadap, and Michael Goesele
ECCV 2016
- Colorful Image Colorization (PDF, Web)
Richard Zhang, Phillip Isola, Alexei Efros
ECCV 2016
|
Shading-aware MVS slides
Colorful Colorization slides
|
October 21 |
- Aravind Rajeswaran, Jin Qu
- Roy, Supasorn Suwajanakorn
|
- Learning to Poke by Poking: Experiential Learning of Intuitive Physics (
PDF)
Pulkit Agrawal, Ashvin Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine
NIPS 2016
-
Force from Motion: Decoding Physical Sensation in a First Person Video (PDF)
Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi
CVPR 2016
|
|
October 28 |
- Rowan Zellers, Qi Hu
- JJ Park, Qinyu
|
- Fast R-CNN (PDF,website)
Ross Girshick
ICCV 2015
-
DynmamicFusion: Reconstruction and Tracking of Non-rigid Scenes in Real-Time (PDF)
Richard Newcombe, Dieter Fox, Steve Seitz
CVPR 2015
|
|
November 4 |
- Ezgi Mercan
- Xuan Luo, Isaac
|
-
Understanding and Predicting Memorability at Large Scale (Project Page)
A. Khosla, AS Raju, A. Torralba, and A. Oliva
ICCV 2015
-
Practical Multispectral Lighting Reproduction (Project Page)
C. LeGendre et. al.
SIGGRAPH 2016
|
|
November 11 |
CVPR submission deadline |
No seminar |
-- |
November 18 |
- Chung-Yi Weng, Junha Roh
- Eric, Chunjue Tang
|
-
Band-Sifting Decomposition for Image Based Material Editing (Project Page)
Boyadzhiev et. al.
ACM TOG 2016
-
Face2Face: Real-time Face Capture and Reenactment of RGB Videos (Project Page)
J. Thies et. al.
CVPR 2016
|
|
November 25 |
Thanksgiving |
No seminar |
-- |
December 2 |
- Kiana Ehsani, Tsung-Wei Huang
- Patrick Lancaster, Keunhong Park
|
-
Learning to Refine Object Segments (Paper)
Pedro O. Pinheiro, Ronan Collobert, Piotr Dollar
ECCV 2016
-
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields (Paper)
Liu et. al.
Arxiv 2015
|
|
December 9 |
- Aaron Walsman, Hessam
- Daniel Gordon, Lucas
|
TBD |
TBD |
Paper List (to be updated)
Geometric Scene Understanding (Reconstruction/Recognition/Segmentation)
- Shading-aware Multi-view Stereo (PDF)
Fabian Langguth, Kalyan Sunkavalli, Sunil Hadap, and Michael Goesele
ECCV 2016
- High-Quality Depth from Uncalibrated Small Motion Clip (Project Page)
Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon
CVPR 2016
- 3D Modeling on the Go: Interactive 3D Reconstruction of Large-Scale Scenes on Mobile Devices (PDF)
Thomas Schops, Torsten Sattler, Christian Hane, Marc Pollefeys
3DV 2015
- Structured Indoor Modeling (PDF)
Satoshi Ikehata, Hang Yan, Yasutaka Furukawa
ICCV 2015
- Learning to Generate Chairs With Convolutional Neural Networks (PDF)
Alexey Dosovitskiy, Jost Tobias Springenberg, Thomas Brox
CVPR 2015
- Single Image 3D Without a Single 3D Image (PDF)
David Fouhey, Muhammad Wajahat Hussain, Abhinav Gupta, Martial Hebert
ICCV 2015
- Accurate Depth Map Estimation From a Lenslet Light Field Camera (PDF)
Hae-Gon Jeon, Jaesik Park, Gyeongmin Choe, Jinsun Park, Yunsu Bok, Yu-Wing Tai, In So
Kweon
CVPR 2015
- 3D Scanning Deformable Objects With a Single RGBD Sensor (PDF)
Mingsong Dou, Jonathan Taylor, Henry Fuchs, Andrew Fitzgibbon, Shahram Izadi
CVPR 2015
- Pose Induction for Novel Object Categories (PDF)
Shubham Tulsiani, Joao Carreira, Jitendra Malik
ICCV 2015
Cameras and Computational Displays
- Cinema 3D: Large Scale Automultiscopic Display(PDF,
website)
Netalee Efrat, Piotr Didyk, Mike Foshey, Wojciech Matusik, Anat Levin
SIGGRAPH 2016
- Practical Multispectral Lighting Reproduction(PDF,
website)
Chloe LeGendre, Xueming Yu, Dai Liu, Jay Busch, Andrew Jones, Sumanta Pattanaik, Paul Debevec
SIGGRAPH 2016
- Emulating Displays with Continuously Varying Frame Rates(PDF,
website)
Netalee Efrat, Piotr Didyk, Mike Foshey, Wojciech Matusik, Anat Levin
SIGGRAPH 2016
- Additive Light Field Displays: Realization of Augmented Reality with Holographic Optical Elements(PDF,
website)
Seungjae Lee, Changwon Jang, Seokil Moon, Jaebum Cho
SIGGRAPH 2016
View Synthesis
- Learning-Based View Synthesis for Light Field Cameras (
PDF)
Nima Khademi Kalantari, Ting-Chun Wang, and Ravi Ramamoorthi
SIGGRAPH 2016
- View Synthesis by Appearance Flow (
PDF)
Tinghui Zhou, Shubham Tulsiani, Weilun Sun, Jitendra Malik, Alexei A. Efros
ECCV 2016
- DeepStereo: Learning to Predict New Views from the World’s Imagery (
PDF)
John Flynn, Ivan Neulander, James Philbin, Noah Snavely
CVPR 2016
Material and Light
- Unsupervised Texture Transfer from Images to Model Collections (Web)
Tuanfeng Wang, Hao Su, Qixing Huang, Jingwei Huang, Leonidas Guibas, Niloy J. Mitra
SIGGRAPH Asia 2016
- Reflectance Modeling by Neural Texture Synthesis (Web)
Miika Aittala, Timo Aila and Jaakko Lehtinen
SIGGRAPH 2016
- Time-varying Weathering in Texture Space (PDF)
Rachele Bellini, Yanir Kleiman, Daniel Cohen-Or
SIGGRAPH 2016
Reinforcement Learning
- Learning Visual Predictive Models of Physics for Playing Billiards (
PDF)
Katerina Fragkiadaki, Pulkit Agrawal, Sergey Levine, Jitendra Malik
ICLR 2016
- The Curious Robot: Learning Visual Representations via Physical Interactions (
PDF)
Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park and Abhinav Gupta
ECCV 2016
- Learning to Poke by Poking: Experiential Learning of Intuitive Physics (
PDF)
Pulkit Agrawal, Ashvin Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine
NIPS 2016
- A Comparative Evaluation of Approximate Probabilistic Simulation and Deep Neural Networks as Accounts of Human Physical Scene Understanding (
PDF)
Renqiao Zhang, Jiajun Wu, Chengkai Zhang, William T. Freeman, Joshua B. Tenenbaum
arxiv 2016
- Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning(
website)
Xue Bin Peng, Glen Berseth, Michiel van de Panne
SIGGRAPH 2016
Vision and Language
- Generative Adversarial Text to Image Synthesis (PDF,website)
Scott Reed, Zeynep Akata , Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele and Honglak Lee
ICML 2016
- VQA: Visual Question Answering (PDF,website)
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, Larry
Zitnick, Devi Parikh
ICCV 2015
- Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and
Reading Books (PDF,website)
Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan salakhutdinov, Raquel Urtasun, Antonio
Torralba, Sanja Fidler
ICCV 2015
- CIDEr: Consensus-Based Image Description Evaluation (PDF,website)
Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
CVPR 2015
- Show and Tell: A Neural Image Caption Generator (PDF)
Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan
CVPR 2015
Scene Understanding/Semantics
- Understanding and Predicting Memorability at a Large-scale (PDF,website)
Aditya Khosla, Akhil Raju, Antonio Torralba, Aude Oliva
ICCV 2015
- What makes an object memorable? (PDF,website)
Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, and Bernard Ghanem
ICCV 2015
- Learning informative edge maps for indoor scene layout prediction
Arun Mallya, Svetlana Lazebnik
ICCV 2015
Automatic Object Discovery
- Boosting Object Proposals: From Pascal to COCO (PDF)
Jordi Pont-Tuset, Luc Van Gool
ICCV 2015
- Unsupervised Object Discovery and Tracking in Video Collections (PDF)
Suha Kwak, Minsu Cho, Jean Ponce, Cordelia Schmid, Ivan Laptev
ICCV 2015
- Unsupervised Object Discovery and Localization in the Wild: Part-Based Matching With
Bottom-Up Region Proposals (PDF)
Minsu Cho, Suha Kwak, Cordelia Schmid, Jean Ponce
CVPR 2015
Object Detection/Recognition
- Semantic Guidance of Visual Attention for Localizing Objects in Scenes
Juan Caicedo, Svetlana Lazebnik
ICCV 2015
- Webly Supervised Learning of Convolutional Networks (PDF)
Xinlei Chen, Abhinav Gupta
ICCV 2015
- Love Thy Neighbors: Image Annotation by Exploiting Social Metadata (PDF)
Lamberto Ballan, Justin Johnson, Fei-Fei Li
ICCV 2015
Face
- Face Flow (PDF)
Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou
ICCV 2015
- Web-Scale Training for Face Identification (PDF)
Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
CVPR 2015
- Detailed Spatio-Temporal Reconstruction of Eyelids (PDF)
Amit Bermano, Thabo Beeler, Yeara Kozlov, Derek Bradely, Bernd Bickel, Markus Gross
Siggraph 2015
- Dynamic 3D Avatar Creation from Hand-held Video Input (PDF)
Alexandru-Eugen Ichim, Sofien Bouaziz, Mark Pauly
Siggraph 2015
- High-Quality Hair Modeling from A Single Portrait Photo(PDF)
Menglei Chai, Linjie Luo, Kalyan Sunkavalli, Nathan Carr†,Sunil Hadap, Kun Zhou
Siggraph Asia 2015
- A Mouth Full of Words: Visually Consistent Acoustic Redubbing (PDF)
Sarah Taylor, Barry-John Theobald, Iain Matthews
ICASSP 2015
- Learning To Look Up: Realtime Monocular Gaze Correction Using Machine Learning(PDF)
Daniil Kononenko, Victor Lempitsky
CVPR 2015
- Skin Microstructure Deformation with Displacement Map Convolution(PDF)
Koki Nagano, Graham Fyffe, Oleg Alexander, Jernej Barbic, Hao Li, Abhijeet Ghosh, Paul
Debevec
Siggraph 2015
- Time-offset interaction with a holocaust survivor(PDF)
Ron Artstein, David Traum, Oleg Alexander, Anton Leuski, Andrew Jones, Kallirroi
Georgila, Paul Debevec, William Swartout, Heather Maio, Stephen Smith
IUI 2014
- Driving High-Resolution Facial Scans with Video Performance Capture(PDF)
Graham Fyffe, Andrew Jones, Oleg Alexander, Ryosuke Ichikari, Paul Debevec
IUI 2014
- Near-instant capture of high-resolution facial geometry and reflectance(PDF)
Paul Graham, Graham Fyffe, Borom Tonwattanapong, Abhijeet Ghosh, Paul Debevec
Siggraph 2015
Human Pose Estimation
-
General Automatic Human Shape and Motion Capture Using Volumetric Contour Cues (PDF,
Web)
Helge Rhodin, Nadia Robertini, Dan Casas, Christian Richardt, Hans-Peter Seidel, Christian Theobalt
ECCV 2016
-
Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image (PDF)
Federica Bogo, Angjoo Kanazawa, Christoph Lassner, Peter Gehler, Javier Romero and Michael J. Black
ECCV 2016
- Convolutional Pose Machines (Web)
Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh
CVPR 2016
- A Deep Learning Framework For Character Motion Synthesis and Editing (Web)
Daniel Holden, Jun Saito, Taku Komura
SIGGRAPH 2016
- Dyna(Web)
Gerard Pons-Moll, Javier Romero, Naureen Mahmood, Michael J. Black
Siggraph 2015
- Flowing ConvNets for Human Pose Estimation in Videos (PDF)
Tomas Pfister, James Charles, Andrew Zisserman
ICCV 2015
- Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation (PDF)
Sijin Li, Weichen Zhang, Antoni Chan
ICCV 2015
Action Recognition/Temporal Prediction
- Dynamic Image Networks for Action Recognition (PDF)
Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi and Stephen Gould
CVPR 2016
- Contextual Action Recognition with R*CNN (PDF)
Georgia Gkioxari, Ross Girshick, Jitendra Malik
ICCV 2015
- Dense Optical Flow Prediction from a Static Image (PDF)
Jacob Walker, Abhinav Gupta, Martial Hebert
ICCV 2015
- Temporal Perception and Prediction in Ego-Centric Video
Yipin Zhou, Tamara Berg
ICCV 2015
- Learning Temporal Embeddings for Complex Video Analysis
Vignesh Ramanathan, Kevin Tang, Greg Mori, Fei-Fei Li
ICCV 2015
- Storyline Representation of Egocentric Videos with an Applications to Story-based Search
Bo Xiong, Leonid Sigal, Gunhee Kim
ICCV 2015
- Space-Time Tree Ensemble for Action Recognition
Shugao Ma, Leonid Sigal, Stan Sclaroff
CVPR 2015
Computational Photography
- Colorful Image Colorization (PDF)
Richard Zhang, Phillip Isola, Alexei Efros
ECCV 2016
- Occlusion-aware depth estimation using light-field cameras (PDF)
Ting-Chun Wang, Alexei Efros, Ravi Ramamoorthi
ICCV 2015
- Visual Vibrometry: Estimating Material Properties From Small Motion in Video (PDF,website)
Abe Davis, Katherine L. Bouman, Justin G. Chen, Michael Rubinstein, Fredo Durand,
William T. Freeman
ICCV 2015
- Fast Bilateral-Space Stereo for Synthetic Defocus (PDF)
Jonathan T. Barron, Andrew Adams, YiChang Shih, Carlos Hernandez
ICCV 2015
Low Level/Descriptors
- LIFT: Learned Invariant Feature Transform(PDF)
Kwang Moo Yi, Eduard Trulls, Vincent Lepetit, and Pascal Fua
ECCV 2016
- Universal Correspondence Network(PDF)
C. Choy, J. Gwak, S. Savarese and M.K. Chandraker
NIPS 2016
- Learning image representations equivariant to ego-motion (PDF)
Dinesh Jayaraman, Kristen Grauman
ICCV 2015
- FlowWeb: Joint Image Set Alignment by Weaving Consistent, Pixel-wise Correspondences (PDF,website)
Tinghui Zhou, Yong Jae Lee, Stella Yu, and Alexei A. Efros
CVPR 2015
- Unsupervised Visual Representation Learning by Context Prediction (PDF)
Carl Doersch, Abhinav Gupta, Alexei Efros
ICCV 2015
- PatchMatch-based Automatic Lattice Detection for Near-Regular Textures
Siying Liu, Tian-Tsong Ng, Minh Do, Kalyan Sunkavalli, Eli Shechtman, Nathan Carr
ICCV 2015
- EpicFlow: Edge-Preserving Interpolation of Correspondences for Optical Flow
Jerome Revaud, Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid
ICCV 2015
- Understanding Image Representations by Measuring Their Equivariance and Equivalence (PDF)
Karel Lenc, Andrea Vedaldi
CVPR 2015
Applications
- Learning Affordance for Direct Perception in Autonomous Driving (PDF,website)
Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao
ICCV 2015
- Where to Buy It: Matching Street Clothing Photos in Online Shops (PDF)
Mohammadhadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alex Berg, Tamara Berg
ICCV 2015
- Joint Photo Stream and Blog Post Summarization and Exploration (PDF)
Gunhee Kim, Seungwhan Moon, Leonid Sigal
CVPR 2015
- Social Saliency Prediction (PDF)
Hyun Soo Park, Jianbo Shi
CVPR 2015
- Panoptic Studio: A Massively Multiview System for Social Motion Capture (PDF)
Hanbyul Joo, Hao Liu, Lei Tan, Lin Gui, Shohei Nobuhara, Yaser Sheikh
ICCV 2015
CVPR16 Best paper award winners
- Deep Residual Learning for Image Recognition (PDF)
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
CVPR 2016
- Structural-RNN: Deep Learning on Spatio-Temporal Graphs (PDF)
Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
CVPR 2016
CVPR16 Best paper honorable mention
- Sublabel-Accurate Relaxation of Nonconvex Energies (PDF)
Thomas Möllenhoff, Emanuel Laude, Michael Moeller, Jan Lellmann, Daniel Cremers
CVPR 2016
|