on computer workshops (iccvw2013) file2013ieeeinternational conference oncomputer visionworkshops...

2013 IEEE International

Conference on Computer Vision

Workshops

(ICCVW 2013)

Sydney, Australia

2-8 December 2013

4 IEEE IEEE Catalog Number: CFP1391A-POD

ISBN: 978-1-4799-3023-4

2013 IEEE International

Conference on ComputerVision Workshops

ICCVW 2013

Table of Contents

Message from the General Chairs xvii

Message from the Program Chairs xix

Organizing Committee and Area Chairs xxi

Sponsors xxiii

Workshop 1: Graphical Models for Scene Understanding:Challenges and Perspectives

Multi-instance Object Segmentation with Exemplars 1

Xuming He and Stephen Gould

Hierarchical Segment Support for Categorical Image Labeling 5

Michael Donoser and Hayko Riemenschneider

Convex Optimization for Scene Understanding 9

Mohamed Souiai, Claudia Nieuwenhuis, Evgeny Strekalovskiy, and Daniel Cremers

Proximity Priors for Variational Semantic Segmentation and Recognition 15

Julia Bergbauer, Claudia Nieuwenhuis, Mohamed Souiai, and Daniel Cremers

Workshop 2: 3rd Workshop on Consumer Depth Cameras

for Computer Vision (CDC4CV)

Session 1: Segmentation

Saliency Cut in Stereo Images 22

Jianteng Peng, Jianbing Shen, Yunde Jia, and Xuelong Li

Depth Interpolation via Smooth Surface Segmentation Using Tangent Planes

Based on the Superpixels of a Color Image 29

Kiyoshi Matsuo and Yoshimitsu Aoki

External Mask Based Depth and Light Field Camera 37

Dikpal Reddy, Jiamin Bai, and Ravi Ramamoorthi

V

Session 2: Tracking and Events

Asynchronous Stereo Vision for Event-Driven Dynamic Stereo Sensor Using

an Adaptive Cooperative Approach 45

Ewa Piatkowska, Ahmed Nabil Belbachir, and Margrit Gelautz

Tracking an RGB-D Camera Using Points and Planes 51

Esra Ataer-Cansizoglu, Yuichi Taguchi, Srikumar Ramalingam, and Tyler Garaas

Reliable Left Luggage Detection Using Stereo Depth and Intensity Cues 59

Csaba Beleznai, Peter Gemeiner, and Christian Zinner

Session 3: Face Analysis

Compact and Accurate 3-D Face Modeling Using an RGB-D Camera: Let's

Open the Door to 3-D Video Conference 67

Pavan Kumar Anasosalu, Diego Thomas, and Akihiro Sugimoto

Automatic Detection of Emotion Valence on Faces Using Consumer Depth

Cameras 75

Arman Savran, Ruben Gur, and Ragini Verma

Session 4: Actions and Gestures

Real-Time Sign Language Recognition Using a Consumer Depth Camera 83

Alina Kuznetsova, Laura Leal-Taixe, and Bodo Rosenhahn

Fusion of Skeletal and Silhouette-Based Features for Human Action

Recognition with RGB-D Devices 91

Alexandros Andre Chaaraoui, Jose Ramon Padilla-Lopez,

and Francisco Florez-Revuelta

vi

Workshop 3: IEEE Workshop on the VOT2013 Visual ObjectTracking Challenge

Matej Kristan, Roman Pflugfelder, Ales Leonardis, Jiri Matas, Fatih Porikli,

Luka Cehovin, Georg Nebehay, Gustavo Fernandez, Tomas Vojir, Adam Gatt,

Ahmad Khajenezhad, Ahmed Salahledin, AH Soltani-Farani, AH Zarezade,

Alfredo Petrosino, Anthony Milton, Behzad Bozorgtabar, Bo Li,

Chee Seng Chan, Cherkeng Heng, Dale Ward, David Kearney,

Dorothy Monekosso, Hakki Can Karaimer, Hamid R. Rabiee, Jianke Zhu,

Jin Gao, Jingjing Xiao, Junge Zhang, Junliang Xing, Kaiqi Huang,

Karel Lebeda, Lijun Cao, Mario Edoardo Maresca, Mei Kuan Lim,

Mohamed El Helw, Michael Felsberg, Paolo Remagnino, Richard Bowden,

Roland Goecke, Rustam Stolkin, Samantha Yueying Lim, Sara Maher,

Sebastien Poullot, Sebastien Wong, Shin'lchi Satoh, Weihua Chen,

Weiming Hu, Xiaoqin Zhang, Yang Li, and Zhiheng Niu

Robust Real-Time Tracking with Diverse Ensembles and Random Projections 112

Ahmed Salaheldin, Sara Maher, and Mohamed El Helw

Enhanced Distribution Field Tracking Using Channel Representations 121

Michael Felsberg

An Adaptive Combination of Multiple Features for Robust Tracking in Real

Scene 129

Weihua Chen, Lijun Cao, Junge Zhang, and Kaiqi Huang

An Enhanced Adaptive Coupled-Layer LGTracker++ 137

Jingjing Xiao, Rustam Stolkin, and Ales Leonardis

Graph Embedding Based Semi-supervised Discriminative Tracker 145

Jin Gao, Junliang Xing, Weiming Hu, and Xiaoqin Zhang

Long-Term Tracking through Failure Cases 153

Karel Lebeda, Simon Hadfield, Jiri Matas, and Richard Bowden

Workshop 6: Workshop on Computer Vision for Accelerated

Bioscience

The Visual Object Tracking VOT2013 Challenge Results .98

Virtual 3D Models of Insects for Accelerated Quarantine Control

Chuong Nguyen, David Lovell, Rolf Oberprieler, Debbie Jennings, Matt Adcock,

Eleanor Gates-Stuart, and John La Salle

161

Insect Soup Challenge: Segmentation, Counting, and Simple Classification

Katarina Mele

168

3D Plant Modelling via Hyperspectral ImagingJie Liang, AH Zia, Jun Zhou, and Xavier Sirault

172

vii

Super-Resolution 3D Reconstruction of Thick Biological Samples: A Computer

Vision Perspective 178

Alessio Del Bue, Franceses Cella Zanacchi, and Alberto Diaspro

Extended Gaussian-Filtered Local Binary Patterns for Colonoscopy Image

Classification 184

Siyamalan Manivannan, Ruixuan Wang, and Emanuele Trucco

Learning to Detect Basal Tubules of Nematocysts in SEM Images 190

Michael Lam, Janardhan Rao Doppa, Xu Hu, Sinisa Todorovic,

Thomas Dietterich, Abigail Reft, and Marymegan Daly

Dirichlet Process Mixtures of Multinomials for Data Mining in Mice Behaviour

Analysis 197

Matteo Zanotto, Diego Sona, Vittorio Murino, and Francesco Papaleo

Zero-Shot Learning and Detection of Teeth in Images of Bat Skulls 203

Xu Hu, Michael Lam, Sinisa Todorovic, Thomas G. Dietterich,

Maureen A. O'Leary, Andrea L. Cirranello, Nancy B. Simmons,

and Paul M. Velazco

High Precision Localization of Bacterium and Scientific Visualization 210

Mohammadreza Hosseini, Arcot Sowmya, Pascal Vallotton, and Tomasz Bednarz

Workshop 7: Computer Vision for Autonomous Driving

Making Bertha See 214

Uwe Franke, David Pfeiffer, Clemens Rabe, Carsten Knoeppel,

Markus Enzweiler, Fridtjof Stein, and RalfG. Herrtwich

Visual Odometry by Multi-frame Feature Integration 222

Hernan Badino, Akihiro Yamamoto, and Takeo Kanade

Integrated Pedestrian and Direction Classification Using a Random Decision

Forest 230

Junli Tao and Reinhard Klette

Priors for Stereo Vision under Adverse Weather Conditions 238

Stefan Gehrig, Maxim Reznitskii, Nicolai Schneider, Uwe Franke,

and Joachim Weickert

Spatio-temporal Good Features to Track 246

Christoph Feichtenhofer and Axel Pinz

viii

Workshop 8: Inference for Probabilistic Graphical Models

(PGMs)

Supervised Hierarchical Dirichlet Processes with Variational Inference 254

Cheng Zhang, Carl Henrik Ek, Xavi Gratal, Florian T. Pokorny,

and Hedvig Kjellstrom

Infinite Latent Conditional Random Fields 262

Yun Jiang and Ashutosh Saxena

Getting Feasible Variable Estimates from Infeasible Ones: MRF Local

Polytope Study 267

Bogdan Savchynskyy and Stefan Schmidt

Superpixel Coherency and Uncertainty Models for Semantic Segmentation 275

Seungryul Baek, Taegyu Lim, Yong Seok Heo, Sungbum Park, Hantak Kwak,

and Woosung Shim

Video Object Segmentation by Salient Segment Chain Composition 283

Dan Banica, Alexandru Agape, Adrian Ion, and Cristian Sminchisescu

Workshop 9: 2nd International Workshop on Dynamic ShapeCapture and Analysis (4DMOD)

A Convex Relaxation Approach to Space Time Multi-view 3D Reconstruction 291

Martin R. Oswald and Daniel Cremers

One-Shot Entire Shape Scanning by Utilizing Multiple Projector-Camera

Constraints of Grid Patterns 299

Nozomu Kasuya, Ryusuke Sagawa, Ryo Furukawa, and Hiroshi Kawasaki

Single-View RGBD-Based Reconstruction of Dynamic Human Geometry 307

Charles Malleson, Martin Klaudiny, Adrian Hilton, and Jean-Yves Guillemaut

Robust Model-Based 3D Torso Pose Estimation in RGB-D Sequences 315

Markos Sigalas, Maria Pateraki, lason Oikonomidis, and Panos Trahanias

Workshop 10: 2nd International Workshop on Large-ScaleVideo Search and Mining (LSVSM'13)

Learning Non-linear Calibration for Score Fusion with Applications to Image

and Video Classification 323

Tianyang Ma, Sangmin Oh, Amitha Perera, and Longin Jan Latecki

Pedestrian Attribute Classification in Surveillance: Database and Evaluation 331

Jianqing Zhu, Shengcai Liao, Zhen Lei, Dong Yi, and Stan Z. Li

An Adaptive Query Prototype Modeling Method for Image Search Reranking 339

Hong Lu, Guobao Jiang, Bohong Yang, and Xiangyang Xue

ix

Thematic Saliency Detection Using Spatial-Temporal Context 347

Ye Luo, Gangqiang Zhao, and Junsong Yuan

Workshop 11: Faces in-the-Wild Challenge (300-W)

Constrained Local Neural Fields for Robust Facial Landmark Detection in

the Wild 354

Tadas Baltrusaitis, Peter Robinson, and Louis-Philippe Morency

Localizing Facial Keypoints with Global Descriptor Search, Neighbour

Alignment and Locally Linear Models 362

Md. Kamrul Hasan, Christopher Pal, and Sharon Moalem

Guided Unsupervised Learning of Mode Specific Models for Facial Point

Detection in the Wild 370

Shashank Jaiswal, Timur R. Almaev, and Michel F. Valstar

Multiview Active Shape Models with SIFT Descriptors for the 300-W Face

Landmark Challenge 378

Stephen Milborrow, Tom E. Bishop, and Fred Nicolls

Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional

Network Cascade 386

Erjin Zhou, Haoqiang Fan, Zhimin Cao, Yuning Jiang, and Qi Yin

Learn to Combine Multiple Hypotheses for Accurate Face Alignment 392

Junjie Yan, Zhen Lei, Dong Yi, and Stan Z. Li

300 Faces in-the-Wild Challenge: The First Facial Landmark Localization

Challenge 397

Christos Sagonas, Georgios Tzimiropoulos, Stefanos Zafeiriou, and Maja Pantic

Workshop 13: Large Scale Visual Commerce

Discovering Pictorial Brand Associations from Large-Scale Online Image Data 404

Gunhee Kim and Eric P. Xing

Workshop 14: THUMOS Challenge: Action Recognition with

a Large Number of Classes

Ordered Trajectories for Large Scale Human Action Recognition 412

O. V. Ramana Murthy and Roland Goecke

A Spatio-temporal Feature Based on Triangulation of Dense SURF 420

Do Hang Nga and Keiji Yanai

X

W15: The First International Workshop on Visual Domain

Adaptation and Dataset Bias

Workshop 16: Wearable Computer Vision Systems (WCVS)

Wearable Computer Vision Systems for a Cortical Visual Prosthesis 428

Wai Ho Li

Wearable Smartphone: Wearable Hybrid Framework for Hand and Foot

Gesture Interaction on Smartphone 436

Zhihan Lv

A Smartphone-Based Obstacle Detection and Classification System

for Assisting Visually Impaired People 444

Ruxandra Tapu, Bogdan Mocanu, Andrei Bursuc, and Titus Zaharia

Workshop 17: 1st IEEE International Workshop on ComputerVision for Converging Perspectives

Observing the Natural World with Flickr 452

Jingya Wang, Mohammed Korayem, and David J. Crandall

Processing Geotagged Image Sets for Collaborative Compositing and View

Construction 460

Levente Kovacs

From Label Maps to Label Strokes: Semantic Segmentation for Street Scenes

from Incomplete Training Data 468

Shengqi Zhu, Yiqing Yang, and Li Zhang

Workshop 18: 1st Workshop on Understanding HumanActivities: Context and Interactions (HACI 2013)

Iterative Action and Pose Recognition Using Global-and-Pose Features

and Action-Specific Models 476

Norimichi Ukita

A Multi-scale Approach to Gesture Detection and Recognition 484

Natalia Neverova, Christian Wolf, Giulio Pad, Giacomo Sommavilla,

Graham W. Taylor, and Florian Nebout

Context-Sensitive Conditional Ordinal Random Fields for Facial Action

Intensity Estimation 492

Ognjen Rudovic, Vladimir Pavlovic, and Maja Pantic

Temporal Poselets for Collective Activity Detection and Recognition 500

Moin Nabi, Alessio Del Bue, and Vittorio Murino

xi

Spatio-temporal Human-Object Interactions for Action Recognition in Videos 508

Victor Escorcia and Juan Carlos Niebles

Less Is More: Video Trimming for Action Recognition 515

Borislav Antic, Timo Milbich, and Bjorn Ommer

Workshop 19: 3D Representation and Recognition (3dRR-13)

Oral Session I: Modeling Object Shape and Pose

Cubistic Representation for Real-Time 3D Shape and Pose Estimation

of Unknown Rigid Object 522

Hiromasa Yoshimoto and Yuichi Nakamura

Object Detection by 3D Aspectlets and Occlusion Reasoning 530

Yu Xiang and Silvio Savarese

Efficient Object Localization and Pose Estimation with 3D Wireframe Models 538

Erdem Yoruk and Rene Vidal

Oral Session II: 3D Scenes and Fine-Grained 3D ObjectCategorization

Behind the Scenes: What Moving Targets Reveal about Static Scene

Geometry 546

Geoffrey Taylor and Fei Mai

3D Object Representations for Fine-Grained Categorization 554

Jonathan Krause, Michael Stark, Jia Deng, and Li Fei-Fei

Oral Session III: Features and Cues for Recognition

The Mesh-LBP: Computing Local Binary Patterns on Discrete Manifolds 562

Naoufel Werghi, Stefano Berretti, Alberto Del Bimbo, and Pietro Pala

Multiscale TILT Feature Detection with Application to Geometric Image

Segmentation 570

Chi-Pang Lam, Allen Y. Yang, Ehsan Elhamifar, and S. Shankar Sastry

Which Edges Matter? 578

Aayush Bansal, Adarsh Kowdle, Devi Parikh, Andrew Gallagher, and Larry Zitnick

xii

Workshop 20: Computer Vision in Vehicle Technology: From

Earth to Mars

Enhanced Target Tracking in UAV Imagery with P-N Learning and Structural

Constraints 586

Mennatullah Siam and Mohammed Elhelw

Evaluating Color Representations for On-Line Road Detection 594

Jose M. Alvarez, Theo Gevers, and Antonio M. Lopez

Direct Generation of Regular-Grid Ground Surface Map from In-Vehicle Stereo

Image Sequences 600

Shigeki Sugimoto, Kouma Motooka, and Masatoshi Okutomi

From Video Matching to Video Grounding 608

Georgios Evangelidis, Penan Diego, and Radu Horaud

Various Approaches for Driver and Driving Behavior Monitoring: A Review 616

Hang-Bong Kang

Evaluation of the Capabilities of Confidence Measures for Assessing Optical

Flow Quality 624

Patricia Marquez-Valle, Debora Gil, and Aura Hernandez-Sabate

Exploiting Sparsity for Real Time Video Labelling 632

Lachlan Home, Jose M. Alvarez, and Nick Barnes

Workshop 21: Big Data in 3D Computer Vision (BigData3DCV)A Novel Local Surface Description for Automatic 3D Object Recognition in Low

Resolution Cluttered Scenes 638

Syed Afaq AH Shah, Mohammed Bennamoun, Farid Boussaid,

and Amar A. El-Sallam

A Scalable Collaborative Online System for City Reconstruction 644

Ole Untzelmann, Torsten Sattler, Sven Middelberg, and LeifKobbelt

Fury of the Swarm: Efficient and Very Accurate Triangulation for Multi-view

Scene Reconstruction 652

Shawn Recker, Mauricio Hess-Flores, and Kenneth I. Joy

Sparse Approximations of 3D Mesh Geometry Using Frames as Overcomplete

Dictionaries 660

Maja Krivokuca, Waleed H. Abdulla, and Burkhard C. Wunsche

Targetless Calibration of a Lidar - Perspective Camera Pair 668

Levente Tamas and Zoltan Kato

Optimal Reduction of Large Image Databases for Location Recognition 676

Michal Havlena, Wilfried Hartmann, and Konrad Schindler

xiii

Three Dimensional Motion Trail Model for Gesture Recognition 684

Bin Liang and Lihong Zheng

3D Surface Extraction Using Incremental Tetrahedra Carving 692

Takayuki Sugiura, Akihiko Torii, and Masatoshi Okutomi

Fast and Accurate Large-Scale Stereo Reconstruction Using Variational

Methods 700

Georg Kuschk and Daniel Cremers

Kinect Shadow Detection and Classification 708

Teng Deng, Hui Li, Jianfei Cai, Tat-Jen Cham, and Henry Fuchs

Semantic Parsing of Street Scene Images Using 3D LiDAR Point Cloud 714

Pouria Babahajiani, Lixin Fan, and Moncef Gabbouj

Memory Efficient 3D Integral Volumes 722

Martin Urschler, Alexander Bornik, and Michael Donoser

Workshop 22: Decoding Subtle Cues from Social Interactions

Joint Alignment and Modeling of Correlated Behavior Streams 730

Liliana Lo Presti, Stan Sclaroff, and Agata Rozga

Markov Random Field Structures for Facial Action Unit Intensity Estimation 738

Georgia Sandbach, Stefanos Zafeiriou, and Maja Pantic

Video Based Children's Social Behavior Classification in Peer-Play Scenarios 746

Lu Tian, Dingrui Duan, Jinshi Cui, Li Wang, Hongbin Zha, and Hamid Aghajan

Trusting Skype: Learning the Way People Chat for Fast User Recognition

and Verification 748

Giorgio Roffo, Marco Cristani, Loris Bazzani, Ha Quang Minh, and Vittorio Murino

Self-Stimulatory Behaviours in the Wild for Autism Diagnosis 755

Shyam Sundar Rajagopalan, Abhinav Dhall, and Roland Goecke

Home Alone: Social Robots for Digital Ethnography of Toddler Behavior 762

Mohsen Malmir, Deborah Forster, Kendall Youngstrom, Lydia Morrison,

and Javier R. Movellan

Hand Gestures for Intelligent Tutoring Systems: Dataset, Techniques &

Evaluation 769

Suchitra Sathyanarayana, Gwen Littlewort, and Mamie Bartlett

Human Body-Parts Tracking for Fine-Grained Behavior Classification 777

Norimichi Ukita and Atsushi Nakazawa

xiv

Workshop 23: 5th International Workshop on Video Event

Categorization, Tagging, and Retrieval (VECTaR2013)

Oral Session 1

Spatio-temporal Context Modeling for BoW-Based Video Classification 779

Saehoon Yi and Vladimir Pavlovic

Semantic Video-to-Video Search Using Sub-graph Grouping and Matching 787

Tae Eun Choe, Hongli Deng, Feng Guo, Mun Wai Lee, and Niels Haering

Oral Session 2

NSH: Normality Sensitive Hashing for Anomaly Detection 795

Hirotaka Hachiya and Masakazu Matsugu

Dynamic Scene Classification Using Spatial and Temporal Cues 803

Arun Balajee Vasudevan, Srikanth Muralidharan, Shiva Pratheek Chintapalli,

and Shanmuganathan Raman

VGRAPH: An Effective Approach for Generating Static Video Summaries 811

Karim M. Mahmoud, Nagia M. Ghanem, and MohamedA. Ismail

Workshop 24: Underwater Vision Workshop

A Pixel-Wise Varifocal Camera Model for Efficient Forward Projection

and Linear Extrinsic Calibration of Underwater Cameras with Flat Housings 819

Ryo Kawahara, Shohei Nobuhara, and Takashi Matsuyama

Transmission Estimation in Underwater Single Images 825

P. Drews Jr., E. do Nascimento, F. Moraes, S. Botelho, and M. Campos

An Analysis of Monochrome Conversions and Normalizations on the Local

Binary Patterns Texture Descriptors 831

Navid Nourani-Vatani, Mark De Deuge, Bertrand Douillard, and Stefan B. Williams

Categorization of Underwater Habitats Using Dynamic Video Textures 838

Jun Hu, Han Zhang, Anastasia Miliou, Thodoris Tsimpidis, Hazel Thornton,

and Vladimir Pavlovic

Workshop 25: Color and Photometry in Computer Vision

(CVPV)

XV

Oral Session 1

Quick Approximation of Camera's Spectral Response from Casual Lighting 844

Dilip K. Prasad, Rang Nguyen, and Michael S. Brown

Polarization-Based Dehazing Using Two Reference Objects 852

Daisuke Miyazaki, Daisuke Akiyama, Masashi Baba, Ryo Furukawa,

Shinsaku Hiura, and Naoki Asada

Approximate Cross Channel Color Mapping from Sparse Color

Correspondences 860

Hasan Sheikh Faridul, Jurgen Stauder, Jonathan Kervec, and Alain Tremeau

Oral Session 2

Multiplex Image Projection Using Multi-band Projectors 868

Makoto Nonoyama, Fumihiko Sakaue, and Jun Sato

Mixing Paints for Generating Metamerism Art under 2 Lights and 3 Object

Colors 874

Daisuke Miyazaki, Kanami Takahashi, Masashi Baba, Hirooki Aoki,

Ryo Furukawa, Masahito Aoyama, and Shinsaku Hiura

Visual Material Traits: Recognizing Per-Pixel Material Context 883

Gabriel Schwartz and Ko Nishino

Separating Specular and Diffuse Reflection Components in the HSI Color

Space 891

Jianwei Yang, Lixing Liu, and Stan Z. Li

Colour Constancy from Both Sides of the Shadow Edge 899

Stuart E. Lynch, Mark S. Drew, and Graham D. Finlayson

Verification of Sky Models for Image Calibration 907

Rishi Ramakrishnan, Juan Nieto, and Steve Scheding

BRDF Estimation of Structural Color Object by Using Hyper Spectral Image 915

Yoshie Kobayashi, Tetsuro Morimoto, Imari Sato, Yasuhiro Mukaigawa,

and Katsushi Ikeuchi

Author Index 923

xvi

on computer workshops (iccvw2013) file2013ieeeinternational conference oncomputer visionworkshops...

Documents