on computer workshops (iccvw2013) file2013ieeeinternational conference oncomputer visionworkshops...
TRANSCRIPT
2013 IEEE International
Conference on Computer Vision
Workshops
(ICCVW 2013)
Sydney, Australia
2-8 December 2013
4 IEEE IEEE Catalog Number: CFP1391A-POD
ISBN: 978-1-4799-3023-4
2013 IEEE International
Conference on ComputerVision Workshops
ICCVW 2013
Table of Contents
Message from the General Chairs xvii
Message from the Program Chairs xix
Organizing Committee and Area Chairs xxi
Sponsors xxiii
Workshop 1: Graphical Models for Scene Understanding:Challenges and Perspectives
Multi-instance Object Segmentation with Exemplars 1
Xuming He and Stephen Gould
Hierarchical Segment Support for Categorical Image Labeling 5
Michael Donoser and Hayko Riemenschneider
Convex Optimization for Scene Understanding 9
Mohamed Souiai, Claudia Nieuwenhuis, Evgeny Strekalovskiy, and Daniel Cremers
Proximity Priors for Variational Semantic Segmentation and Recognition 15
Julia Bergbauer, Claudia Nieuwenhuis, Mohamed Souiai, and Daniel Cremers
Workshop 2: 3rd Workshop on Consumer Depth Cameras
for Computer Vision (CDC4CV)
Session 1: Segmentation
Saliency Cut in Stereo Images 22
Jianteng Peng, Jianbing Shen, Yunde Jia, and Xuelong Li
Depth Interpolation via Smooth Surface Segmentation Using Tangent Planes
Based on the Superpixels of a Color Image 29
Kiyoshi Matsuo and Yoshimitsu Aoki
External Mask Based Depth and Light Field Camera 37
Dikpal Reddy, Jiamin Bai, and Ravi Ramamoorthi
V
Session 2: Tracking and Events
Asynchronous Stereo Vision for Event-Driven Dynamic Stereo Sensor Using
an Adaptive Cooperative Approach 45
Ewa Piatkowska, Ahmed Nabil Belbachir, and Margrit Gelautz
Tracking an RGB-D Camera Using Points and Planes 51
Esra Ataer-Cansizoglu, Yuichi Taguchi, Srikumar Ramalingam, and Tyler Garaas
Reliable Left Luggage Detection Using Stereo Depth and Intensity Cues 59
Csaba Beleznai, Peter Gemeiner, and Christian Zinner
Session 3: Face Analysis
Compact and Accurate 3-D Face Modeling Using an RGB-D Camera: Let's
Open the Door to 3-D Video Conference 67
Pavan Kumar Anasosalu, Diego Thomas, and Akihiro Sugimoto
Automatic Detection of Emotion Valence on Faces Using Consumer Depth
Cameras 75
Arman Savran, Ruben Gur, and Ragini Verma
Session 4: Actions and Gestures
Real-Time Sign Language Recognition Using a Consumer Depth Camera 83
Alina Kuznetsova, Laura Leal-Taixe, and Bodo Rosenhahn
Fusion of Skeletal and Silhouette-Based Features for Human Action
Recognition with RGB-D Devices 91
Alexandros Andre Chaaraoui, Jose Ramon Padilla-Lopez,
and Francisco Florez-Revuelta
vi
Workshop 3: IEEE Workshop on the VOT2013 Visual ObjectTracking Challenge
Matej Kristan, Roman Pflugfelder, Ales Leonardis, Jiri Matas, Fatih Porikli,
Luka Cehovin, Georg Nebehay, Gustavo Fernandez, Tomas Vojir, Adam Gatt,
Ahmad Khajenezhad, Ahmed Salahledin, AH Soltani-Farani, AH Zarezade,
Alfredo Petrosino, Anthony Milton, Behzad Bozorgtabar, Bo Li,
Chee Seng Chan, Cherkeng Heng, Dale Ward, David Kearney,
Dorothy Monekosso, Hakki Can Karaimer, Hamid R. Rabiee, Jianke Zhu,
Jin Gao, Jingjing Xiao, Junge Zhang, Junliang Xing, Kaiqi Huang,
Karel Lebeda, Lijun Cao, Mario Edoardo Maresca, Mei Kuan Lim,
Mohamed El Helw, Michael Felsberg, Paolo Remagnino, Richard Bowden,
Roland Goecke, Rustam Stolkin, Samantha Yueying Lim, Sara Maher,
Sebastien Poullot, Sebastien Wong, Shin'lchi Satoh, Weihua Chen,
Weiming Hu, Xiaoqin Zhang, Yang Li, and Zhiheng Niu
Robust Real-Time Tracking with Diverse Ensembles and Random Projections 112
Ahmed Salaheldin, Sara Maher, and Mohamed El Helw
Enhanced Distribution Field Tracking Using Channel Representations 121
Michael Felsberg
An Adaptive Combination of Multiple Features for Robust Tracking in Real
Scene 129
Weihua Chen, Lijun Cao, Junge Zhang, and Kaiqi Huang
An Enhanced Adaptive Coupled-Layer LGTracker++ 137
Jingjing Xiao, Rustam Stolkin, and Ales Leonardis
Graph Embedding Based Semi-supervised Discriminative Tracker 145
Jin Gao, Junliang Xing, Weiming Hu, and Xiaoqin Zhang
Long-Term Tracking through Failure Cases 153
Karel Lebeda, Simon Hadfield, Jiri Matas, and Richard Bowden
Workshop 6: Workshop on Computer Vision for Accelerated
Bioscience
The Visual Object Tracking VOT2013 Challenge Results .98
Virtual 3D Models of Insects for Accelerated Quarantine Control
Chuong Nguyen, David Lovell, Rolf Oberprieler, Debbie Jennings, Matt Adcock,
Eleanor Gates-Stuart, and John La Salle
161
Insect Soup Challenge: Segmentation, Counting, and Simple Classification
Katarina Mele
168
3D Plant Modelling via Hyperspectral ImagingJie Liang, AH Zia, Jun Zhou, and Xavier Sirault
172
vii
Super-Resolution 3D Reconstruction of Thick Biological Samples: A Computer
Vision Perspective 178
Alessio Del Bue, Franceses Cella Zanacchi, and Alberto Diaspro
Extended Gaussian-Filtered Local Binary Patterns for Colonoscopy Image
Classification 184
Siyamalan Manivannan, Ruixuan Wang, and Emanuele Trucco
Learning to Detect Basal Tubules of Nematocysts in SEM Images 190
Michael Lam, Janardhan Rao Doppa, Xu Hu, Sinisa Todorovic,
Thomas Dietterich, Abigail Reft, and Marymegan Daly
Dirichlet Process Mixtures of Multinomials for Data Mining in Mice Behaviour
Analysis 197
Matteo Zanotto, Diego Sona, Vittorio Murino, and Francesco Papaleo
Zero-Shot Learning and Detection of Teeth in Images of Bat Skulls 203
Xu Hu, Michael Lam, Sinisa Todorovic, Thomas G. Dietterich,
Maureen A. O'Leary, Andrea L. Cirranello, Nancy B. Simmons,
and Paul M. Velazco
High Precision Localization of Bacterium and Scientific Visualization 210
Mohammadreza Hosseini, Arcot Sowmya, Pascal Vallotton, and Tomasz Bednarz
Workshop 7: Computer Vision for Autonomous Driving
Making Bertha See 214
Uwe Franke, David Pfeiffer, Clemens Rabe, Carsten Knoeppel,
Markus Enzweiler, Fridtjof Stein, and RalfG. Herrtwich
Visual Odometry by Multi-frame Feature Integration 222
Hernan Badino, Akihiro Yamamoto, and Takeo Kanade
Integrated Pedestrian and Direction Classification Using a Random Decision
Forest 230
Junli Tao and Reinhard Klette
Priors for Stereo Vision under Adverse Weather Conditions 238
Stefan Gehrig, Maxim Reznitskii, Nicolai Schneider, Uwe Franke,
and Joachim Weickert
Spatio-temporal Good Features to Track 246
Christoph Feichtenhofer and Axel Pinz
viii
Workshop 8: Inference for Probabilistic Graphical Models
(PGMs)
Supervised Hierarchical Dirichlet Processes with Variational Inference 254
Cheng Zhang, Carl Henrik Ek, Xavi Gratal, Florian T. Pokorny,
and Hedvig Kjellstrom
Infinite Latent Conditional Random Fields 262
Yun Jiang and Ashutosh Saxena
Getting Feasible Variable Estimates from Infeasible Ones: MRF Local
Polytope Study 267
Bogdan Savchynskyy and Stefan Schmidt
Superpixel Coherency and Uncertainty Models for Semantic Segmentation 275
Seungryul Baek, Taegyu Lim, Yong Seok Heo, Sungbum Park, Hantak Kwak,
and Woosung Shim
Video Object Segmentation by Salient Segment Chain Composition 283
Dan Banica, Alexandru Agape, Adrian Ion, and Cristian Sminchisescu
Workshop 9: 2nd International Workshop on Dynamic ShapeCapture and Analysis (4DMOD)
A Convex Relaxation Approach to Space Time Multi-view 3D Reconstruction 291
Martin R. Oswald and Daniel Cremers
One-Shot Entire Shape Scanning by Utilizing Multiple Projector-Camera
Constraints of Grid Patterns 299
Nozomu Kasuya, Ryusuke Sagawa, Ryo Furukawa, and Hiroshi Kawasaki
Single-View RGBD-Based Reconstruction of Dynamic Human Geometry 307
Charles Malleson, Martin Klaudiny, Adrian Hilton, and Jean-Yves Guillemaut
Robust Model-Based 3D Torso Pose Estimation in RGB-D Sequences 315
Markos Sigalas, Maria Pateraki, lason Oikonomidis, and Panos Trahanias
Workshop 10: 2nd International Workshop on Large-ScaleVideo Search and Mining (LSVSM'13)
Learning Non-linear Calibration for Score Fusion with Applications to Image
and Video Classification 323
Tianyang Ma, Sangmin Oh, Amitha Perera, and Longin Jan Latecki
Pedestrian Attribute Classification in Surveillance: Database and Evaluation 331
Jianqing Zhu, Shengcai Liao, Zhen Lei, Dong Yi, and Stan Z. Li
An Adaptive Query Prototype Modeling Method for Image Search Reranking 339
Hong Lu, Guobao Jiang, Bohong Yang, and Xiangyang Xue
ix
Thematic Saliency Detection Using Spatial-Temporal Context 347
Ye Luo, Gangqiang Zhao, and Junsong Yuan
Workshop 11: Faces in-the-Wild Challenge (300-W)
Constrained Local Neural Fields for Robust Facial Landmark Detection in
the Wild 354
Tadas Baltrusaitis, Peter Robinson, and Louis-Philippe Morency
Localizing Facial Keypoints with Global Descriptor Search, Neighbour
Alignment and Locally Linear Models 362
Md. Kamrul Hasan, Christopher Pal, and Sharon Moalem
Guided Unsupervised Learning of Mode Specific Models for Facial Point
Detection in the Wild 370
Shashank Jaiswal, Timur R. Almaev, and Michel F. Valstar
Multiview Active Shape Models with SIFT Descriptors for the 300-W Face
Landmark Challenge 378
Stephen Milborrow, Tom E. Bishop, and Fred Nicolls
Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional
Network Cascade 386
Erjin Zhou, Haoqiang Fan, Zhimin Cao, Yuning Jiang, and Qi Yin
Learn to Combine Multiple Hypotheses for Accurate Face Alignment 392
Junjie Yan, Zhen Lei, Dong Yi, and Stan Z. Li
300 Faces in-the-Wild Challenge: The First Facial Landmark Localization
Challenge 397
Christos Sagonas, Georgios Tzimiropoulos, Stefanos Zafeiriou, and Maja Pantic
Workshop 13: Large Scale Visual Commerce
Discovering Pictorial Brand Associations from Large-Scale Online Image Data 404
Gunhee Kim and Eric P. Xing
Workshop 14: THUMOS Challenge: Action Recognition with
a Large Number of Classes
Ordered Trajectories for Large Scale Human Action Recognition 412
O. V. Ramana Murthy and Roland Goecke
A Spatio-temporal Feature Based on Triangulation of Dense SURF 420
Do Hang Nga and Keiji Yanai
X
W15: The First International Workshop on Visual Domain
Adaptation and Dataset Bias
Workshop 16: Wearable Computer Vision Systems (WCVS)
Wearable Computer Vision Systems for a Cortical Visual Prosthesis 428
Wai Ho Li
Wearable Smartphone: Wearable Hybrid Framework for Hand and Foot
Gesture Interaction on Smartphone 436
Zhihan Lv
A Smartphone-Based Obstacle Detection and Classification System
for Assisting Visually Impaired People 444
Ruxandra Tapu, Bogdan Mocanu, Andrei Bursuc, and Titus Zaharia
Workshop 17: 1st IEEE International Workshop on ComputerVision for Converging Perspectives
Observing the Natural World with Flickr 452
Jingya Wang, Mohammed Korayem, and David J. Crandall
Processing Geotagged Image Sets for Collaborative Compositing and View
Construction 460
Levente Kovacs
From Label Maps to Label Strokes: Semantic Segmentation for Street Scenes
from Incomplete Training Data 468
Shengqi Zhu, Yiqing Yang, and Li Zhang
Workshop 18: 1st Workshop on Understanding HumanActivities: Context and Interactions (HACI 2013)
Iterative Action and Pose Recognition Using Global-and-Pose Features
and Action-Specific Models 476
Norimichi Ukita
A Multi-scale Approach to Gesture Detection and Recognition 484
Natalia Neverova, Christian Wolf, Giulio Pad, Giacomo Sommavilla,
Graham W. Taylor, and Florian Nebout
Context-Sensitive Conditional Ordinal Random Fields for Facial Action
Intensity Estimation 492
Ognjen Rudovic, Vladimir Pavlovic, and Maja Pantic
Temporal Poselets for Collective Activity Detection and Recognition 500
Moin Nabi, Alessio Del Bue, and Vittorio Murino
xi
Spatio-temporal Human-Object Interactions for Action Recognition in Videos 508
Victor Escorcia and Juan Carlos Niebles
Less Is More: Video Trimming for Action Recognition 515
Borislav Antic, Timo Milbich, and Bjorn Ommer
Workshop 19: 3D Representation and Recognition (3dRR-13)
Oral Session I: Modeling Object Shape and Pose
Cubistic Representation for Real-Time 3D Shape and Pose Estimation
of Unknown Rigid Object 522
Hiromasa Yoshimoto and Yuichi Nakamura
Object Detection by 3D Aspectlets and Occlusion Reasoning 530
Yu Xiang and Silvio Savarese
Efficient Object Localization and Pose Estimation with 3D Wireframe Models 538
Erdem Yoruk and Rene Vidal
Oral Session II: 3D Scenes and Fine-Grained 3D ObjectCategorization
Behind the Scenes: What Moving Targets Reveal about Static Scene
Geometry 546
Geoffrey Taylor and Fei Mai
3D Object Representations for Fine-Grained Categorization 554
Jonathan Krause, Michael Stark, Jia Deng, and Li Fei-Fei
Oral Session III: Features and Cues for Recognition
The Mesh-LBP: Computing Local Binary Patterns on Discrete Manifolds 562
Naoufel Werghi, Stefano Berretti, Alberto Del Bimbo, and Pietro Pala
Multiscale TILT Feature Detection with Application to Geometric Image
Segmentation 570
Chi-Pang Lam, Allen Y. Yang, Ehsan Elhamifar, and S. Shankar Sastry
Which Edges Matter? 578
Aayush Bansal, Adarsh Kowdle, Devi Parikh, Andrew Gallagher, and Larry Zitnick
xii
Workshop 20: Computer Vision in Vehicle Technology: From
Earth to Mars
Enhanced Target Tracking in UAV Imagery with P-N Learning and Structural
Constraints 586
Mennatullah Siam and Mohammed Elhelw
Evaluating Color Representations for On-Line Road Detection 594
Jose M. Alvarez, Theo Gevers, and Antonio M. Lopez
Direct Generation of Regular-Grid Ground Surface Map from In-Vehicle Stereo
Image Sequences 600
Shigeki Sugimoto, Kouma Motooka, and Masatoshi Okutomi
From Video Matching to Video Grounding 608
Georgios Evangelidis, Penan Diego, and Radu Horaud
Various Approaches for Driver and Driving Behavior Monitoring: A Review 616
Hang-Bong Kang
Evaluation of the Capabilities of Confidence Measures for Assessing Optical
Flow Quality 624
Patricia Marquez-Valle, Debora Gil, and Aura Hernandez-Sabate
Exploiting Sparsity for Real Time Video Labelling 632
Lachlan Home, Jose M. Alvarez, and Nick Barnes
Workshop 21: Big Data in 3D Computer Vision (BigData3DCV)A Novel Local Surface Description for Automatic 3D Object Recognition in Low
Resolution Cluttered Scenes 638
Syed Afaq AH Shah, Mohammed Bennamoun, Farid Boussaid,
and Amar A. El-Sallam
A Scalable Collaborative Online System for City Reconstruction 644
Ole Untzelmann, Torsten Sattler, Sven Middelberg, and LeifKobbelt
Fury of the Swarm: Efficient and Very Accurate Triangulation for Multi-view
Scene Reconstruction 652
Shawn Recker, Mauricio Hess-Flores, and Kenneth I. Joy
Sparse Approximations of 3D Mesh Geometry Using Frames as Overcomplete
Dictionaries 660
Maja Krivokuca, Waleed H. Abdulla, and Burkhard C. Wunsche
Targetless Calibration of a Lidar - Perspective Camera Pair 668
Levente Tamas and Zoltan Kato
Optimal Reduction of Large Image Databases for Location Recognition 676
Michal Havlena, Wilfried Hartmann, and Konrad Schindler
xiii
Three Dimensional Motion Trail Model for Gesture Recognition 684
Bin Liang and Lihong Zheng
3D Surface Extraction Using Incremental Tetrahedra Carving 692
Takayuki Sugiura, Akihiko Torii, and Masatoshi Okutomi
Fast and Accurate Large-Scale Stereo Reconstruction Using Variational
Methods 700
Georg Kuschk and Daniel Cremers
Kinect Shadow Detection and Classification 708
Teng Deng, Hui Li, Jianfei Cai, Tat-Jen Cham, and Henry Fuchs
Semantic Parsing of Street Scene Images Using 3D LiDAR Point Cloud 714
Pouria Babahajiani, Lixin Fan, and Moncef Gabbouj
Memory Efficient 3D Integral Volumes 722
Martin Urschler, Alexander Bornik, and Michael Donoser
Workshop 22: Decoding Subtle Cues from Social Interactions
Joint Alignment and Modeling of Correlated Behavior Streams 730
Liliana Lo Presti, Stan Sclaroff, and Agata Rozga
Markov Random Field Structures for Facial Action Unit Intensity Estimation 738
Georgia Sandbach, Stefanos Zafeiriou, and Maja Pantic
Video Based Children's Social Behavior Classification in Peer-Play Scenarios 746
Lu Tian, Dingrui Duan, Jinshi Cui, Li Wang, Hongbin Zha, and Hamid Aghajan
Trusting Skype: Learning the Way People Chat for Fast User Recognition
and Verification 748
Giorgio Roffo, Marco Cristani, Loris Bazzani, Ha Quang Minh, and Vittorio Murino
Self-Stimulatory Behaviours in the Wild for Autism Diagnosis 755
Shyam Sundar Rajagopalan, Abhinav Dhall, and Roland Goecke
Home Alone: Social Robots for Digital Ethnography of Toddler Behavior 762
Mohsen Malmir, Deborah Forster, Kendall Youngstrom, Lydia Morrison,
and Javier R. Movellan
Hand Gestures for Intelligent Tutoring Systems: Dataset, Techniques &
Evaluation 769
Suchitra Sathyanarayana, Gwen Littlewort, and Mamie Bartlett
Human Body-Parts Tracking for Fine-Grained Behavior Classification 777
Norimichi Ukita and Atsushi Nakazawa
xiv
Workshop 23: 5th International Workshop on Video Event
Categorization, Tagging, and Retrieval (VECTaR2013)
Oral Session 1
Spatio-temporal Context Modeling for BoW-Based Video Classification 779
Saehoon Yi and Vladimir Pavlovic
Semantic Video-to-Video Search Using Sub-graph Grouping and Matching 787
Tae Eun Choe, Hongli Deng, Feng Guo, Mun Wai Lee, and Niels Haering
Oral Session 2
NSH: Normality Sensitive Hashing for Anomaly Detection 795
Hirotaka Hachiya and Masakazu Matsugu
Dynamic Scene Classification Using Spatial and Temporal Cues 803
Arun Balajee Vasudevan, Srikanth Muralidharan, Shiva Pratheek Chintapalli,
and Shanmuganathan Raman
VGRAPH: An Effective Approach for Generating Static Video Summaries 811
Karim M. Mahmoud, Nagia M. Ghanem, and MohamedA. Ismail
Workshop 24: Underwater Vision Workshop
A Pixel-Wise Varifocal Camera Model for Efficient Forward Projection
and Linear Extrinsic Calibration of Underwater Cameras with Flat Housings 819
Ryo Kawahara, Shohei Nobuhara, and Takashi Matsuyama
Transmission Estimation in Underwater Single Images 825
P. Drews Jr., E. do Nascimento, F. Moraes, S. Botelho, and M. Campos
An Analysis of Monochrome Conversions and Normalizations on the Local
Binary Patterns Texture Descriptors 831
Navid Nourani-Vatani, Mark De Deuge, Bertrand Douillard, and Stefan B. Williams
Categorization of Underwater Habitats Using Dynamic Video Textures 838
Jun Hu, Han Zhang, Anastasia Miliou, Thodoris Tsimpidis, Hazel Thornton,
and Vladimir Pavlovic
Workshop 25: Color and Photometry in Computer Vision
(CVPV)
XV
Oral Session 1
Quick Approximation of Camera's Spectral Response from Casual Lighting 844
Dilip K. Prasad, Rang Nguyen, and Michael S. Brown
Polarization-Based Dehazing Using Two Reference Objects 852
Daisuke Miyazaki, Daisuke Akiyama, Masashi Baba, Ryo Furukawa,
Shinsaku Hiura, and Naoki Asada
Approximate Cross Channel Color Mapping from Sparse Color
Correspondences 860
Hasan Sheikh Faridul, Jurgen Stauder, Jonathan Kervec, and Alain Tremeau
Oral Session 2
Multiplex Image Projection Using Multi-band Projectors 868
Makoto Nonoyama, Fumihiko Sakaue, and Jun Sato
Mixing Paints for Generating Metamerism Art under 2 Lights and 3 Object
Colors 874
Daisuke Miyazaki, Kanami Takahashi, Masashi Baba, Hirooki Aoki,
Ryo Furukawa, Masahito Aoyama, and Shinsaku Hiura
Visual Material Traits: Recognizing Per-Pixel Material Context 883
Gabriel Schwartz and Ko Nishino
Separating Specular and Diffuse Reflection Components in the HSI Color
Space 891
Jianwei Yang, Lixing Liu, and Stan Z. Li
Colour Constancy from Both Sides of the Shadow Edge 899
Stuart E. Lynch, Mark S. Drew, and Graham D. Finlayson
Verification of Sky Models for Image Calibration 907
Rishi Ramakrishnan, Juan Nieto, and Steve Scheding
BRDF Estimation of Structural Color Object by Using Hyper Spectral Image 915
Yoshie Kobayashi, Tetsuro Morimoto, Imari Sato, Yasuhiro Mukaigawa,
and Katsushi Ikeuchi
Author Index 923
xvi