
Lecture Notes in Computer Science 8325
Commenced Publication in 1973
Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

Editorial Board

David Hutchison
Lancaster University, UK

Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA

Josef Kittler
University of Surrey, Guildford, UK

Jon M. Kleinberg
Cornell University, Ithaca, NY, USA

Alfred Kobsa
University of California, Irvine, CA, USA

Friedemann Mattern
ETH Zurich, Switzerland

John C. Mitchell
Stanford University, CA, USA

Moni Naor
Weizmann Institute of Science, Rehovot, Israel

Oscar Nierstrasz
University of Bern, Switzerland

C. Pandu Rangan
Indian Institute of Technology, Madras, India

Bernhard Steffen
TU Dortmund University, Germany

Madhu Sudan
Microsoft Research, Cambridge, MA, USA

Demetri Terzopoulos
University of California, Los Angeles, CA, USA

Doug Tygar
University of California, Berkeley, CA, USA

Gerhard Weikum
Max Planck Institute for Informatics, Saarbruecken, Germany

Cathal Gurrin, Frank Hopfgartner, Wolfgang Hurst, Håvard Johansen, Hyowon Lee, Noel O’Connor (Eds.)

MultiMedia Modeling
20th Anniversary International Conference, MMM 2014
Dublin, Ireland, January 6-10, 2014
Proceedings, Part I


Volume Editors

Cathal Gurrin
Dublin City University, Ireland

Frank Hopfgartner
Technische Universität Berlin / DAI-Labor, Germany

Wolfgang Hurst
Universiteit Utrecht, The Netherlands

Håvard Johansen
UiT The Arctic University of Norway

Hyowon Lee
Singapore University of Technology and Design, Singapore

Noel O’Connor
Dublin City University, Ireland

ISSN 0302-9743
e-ISSN 1611-3349
ISBN 978-3-319-04113-1
e-ISBN 978-3-319-04114-8
DOI 10.1007/978-3-319-04114-8
Springer Cham Heidelberg New York Dordrecht London

Library of Congress Control Number: 2013955783
CR Subject Classification (1998): H.3, H.5, I.5, H.2.8, H.4, I.4, I.2
LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web and HCI

© Springer International Publishing Switzerland 2014

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. Exempted from this legal reservation are brief excerpts in connection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher’s location, in its current version, and permission for use must always be obtained from Springer. Permissions for use may be obtained through RightsLink at the Copyright Clearance Center. Violations are liable to prosecution under the respective Copyright Law.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

While the advice and information in this book are believed to be true and accurate at the date of publication, neither the authors nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein.

Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India

Printed on acid-free paper

Springer is part of Springer Science+Business Media (www.springer.com)

Preface

These proceedings contain the papers presented at MMM 2014, the 20th Anniversary International Conference on MultiMedia Modeling. The conference was organized by the Insight Centre for Data Analytics, Dublin City University, and was held during January 6-10, 2014, at the wonderful venue of the Guinness Storehouse in Dublin, Ireland. We greeted the attendees at MMM 2014 with the following address: “Táimid an-bhrodúil fáilte a chur romhaibh chuig Baile Átha Cliath agus chuig an fichiú Comhdháil Idirnáisiúnta bliain ar Samhaltú Ilmheán. Tá súil againn go mbeidh am iontach agaibh anseo in Éirinn agus go mbeidh bhur gcuairt taitneamhnach agus sásúil. Táimid an-bhrodúil go háirithe fáilte a chur roimh na daoine ón oiread sin tíortha difriúla agus na daoine a tháinig as i bhfad i gcéin. Tá an oiread sin páipéar curtha isteach chuigh an chomhdháil seo go bhfuil caighdeán na bpáipéar, na bpóstaer agus na léiriú an-ard ar fad agus táimid ag súil go mór le hócaid iontach. We are delighted to welcome you to Dublin for the 20th Anniversary International Conference on Multimedia Modeling. We hope that the attendees have a wonderful stay in Ireland and that their visits are both enjoyable and rewarding. We are very proud to welcome visitors from both Ireland and abroad and we are delighted to be able to include in the proceedings such high-quality papers, posters, and demonstrations.”

MMM 2014 received a total of 176 submissions across four categories, including 103 full-paper submissions, 24 short-paper submissions, and 12 demonstration submissions. Of these submissions, 55% were from Europe, 41% from Asia, 3% from the Americas, and 1% from the Middle East. All full-paper submissions were reviewed by at least three members of the 110-person Program Committee, to whom we owe a debt of gratitude for providing their valuable time to MMM 2014. Of the 103 full papers submitted, 30 were selected for oral presentation, which equates to a 29% acceptance rate. A further 16 papers were chosen for poster presentation. For short papers, a total of 11 were accepted for poster presentation, representing a 45% acceptance rate. In addition, nine demonstrations from a total of 12 submissions were accepted for MMM 2014. We accepted 28 special session submissions across the five special sessions and six Video Browser Showdown (VBS 2014) submissions. The accepted contributions represent the state of the art in multimedia modeling research and cover a diverse range of topics including: applications of multimedia modeling, interactive retrieval, image and video collections, 3D and augmented reality, temporal analysis of multimedia content, compression and streaming.

As in recent years, MMM 2014 included VBS 2014. This year we made the VBS a half-day workshop, which took place on January 7, 2014. For the first time in 2014, we also co-located the Winter School on Multimedia Processing and Applications (WMPA 2014), which ran during January 6-7, 2014.

As is usual for MMM, there were a number of special sessions accepted for inclusion in MMM 2014. Each special session paper was also reviewed by at least three members of the Program Committee. The following five special sessions were selected for inclusion in MMM 2014:

– Social Geo-Media Analytics and Retrieval
– Multimedia Hyperlinking and Retrieval
– 3D Multimedia Computing and Modeling
– Multimedia Analysis for Surveillance Video and Security
– Mediadrom: Artful Post-TV Scenarios

We would like to thank our invited keynote speakers (Anil Kokaram from Google and a representative of Narrative/Memoto) for their stimulating contributions to the conference. Special thanks go to the short papers co-chairs, Neil O’Hare and Richang Hong, and the demonstrations co-chairs, Udo Kruschwitz and Hideo Joho. We are also fortunate to have worked with wonderful supporting chairs, such as Yantao Zheng (sponsorship chair and publicity chair), Alex Hauptmann, Susanne Boll, and Jialie Shen (international liaisons). We also acknowledge the commitment of our local organization team, including Rami Albatal (student sponsorship and local organization chair) and our two designers and webmasters, David Scott and Yang Yang. Special mention goes to Alan Smeaton for his constant availability to provide support and advice. Finally, we wish to thank the wonderful local organization team, Lijuan Marissa Zhou, ZhenXing Zhang, Zhengwei Qiu, Brian Moynagh, Stefan Terziyski, Teng Qi He, Na Li and Zaher Hinbarji. In addition, we wish to thank all authors who spent their time and effort to submit their work to MMM 2014, and all of the participants and student volunteers for their contributions and valuable support. Our gratitude also goes to the MMM 2014 Program Committee members, the Award Committee members, and the other invited reviewers for the 500 reviews required for MMM 2014.

We are grateful to the sponsors for generously providing financial support for the conference, including Dublin City University, Science Foundation Ireland, Fáilte Ireland, Google, ISCA and the Insight Centre for Data Analytics. We would also like to thank the School of Computing at Dublin City University, in particular David Sinclair, Mark Roantree, and Rory O’Connor for their support as the heads of the School of Computing. Our special thanks go to the Insight admin team, Margaret Malone, Deirdre Sheridan, Barbara Flynn, and Anne Troy from the DCU Finance Department, Ana Terres from the DCU Office of the Vice-President for Research, Harald Weinreich from ConfTool, and our contacts in the Guinness Storehouse and the Smock Alley Theatre. Finally, we thank all of the attendees who made the trip to Dublin for MMM 2014, VBS 2014, and WMPA 2014.

January 2014

Cathal Gurrin
Noel O’Connor
Wolfgang Hurst
Hyowon Lee
Frank Hopfgartner
Håvard Johansen

Organization

MMM 2014 was organized by Dublin City University, Ireland.

Organizing Committee

General Chair

Cathal Gurrin Dublin City University, Ireland

Program Co-chairs

Noel O’Connor Dublin City University, Ireland
Wolfgang Hürst University of Utrecht, The Netherlands
Hyowon Lee Singapore University of Technology and Design, Singapore

Special Session Co-chairs

Frank Hopfgartner Technische Universität Berlin, Germany
Håvard Johansen University of Tromsø, Norway

Short Paper Co-chairs

Neil O’Hare Yahoo Labs, Spain
Richang Hong Hefei University of Technology, China

Demonstration Co-chairs

Hideo Joho University of Tsukuba, Japan
Udo Kruschwitz University of Essex, UK

Student Support Chair

Rami Albatal Dublin City University, Ireland

Advertising/Sponsorship Chair

Yantao Zheng Google, USA


International Liaisons

USA: Alex Hauptmann Carnegie Mellon University, USA
Europe: Susanne Boll University of Oldenburg, Germany
Asia: Jialie Shen Singapore Management University, Singapore

Local Organizing Co-chairs

Rami Albatal Dublin City University, Ireland
Lijuan Zhou Dublin City University, Ireland

Website

Yang Yang Dublin City University, Ireland
David Scott Dublin City University, Ireland

Program Committee

Amin Ahmadi Dublin City University, Ireland
Rami Albatal Dublin City University, Ireland
Laurent Amsaleg CNRS-IRISA, France
Noboru Babaguchi Osaka University, Japan
Jenny Benois-Pineau LABRI/University of Bordeaux, France
Laszlo Boeszoermenyi Klagenfurt University, Austria
Susanne Boll University of Oldenburg, Germany
Vincent Charvillat University of Toulouse, France
Gene Cheung National Institute of Informatics, Japan
Liang-Tien Chia Nanyang Technological University, Singapore
Insook Choi Columbia College Chicago, USA
Konstantinos Chorianopoulos Ionian University, Greece
Wei-Ta Chu National Chung Cheng University, Taiwan
Tat-Seng Chua National University of Singapore, Singapore
Kathy M. Clawson University of Ulster, UK
Matthew Cooper FX Palo Alto Laboratory, USA
W. Bas de Haas Utrecht University, The Netherlands
Francois Destelle Dublin City University, Ireland
Cem Direkoglu Dublin City University, Ireland
Ajay Divakaran SRI International, USA
Lingyu Duan Peking University, China
Stéphane Dupont University of Mons, Belgium
Thierry Dutoit University of Mons, Belgium
Maria Eskevich Dublin City University, Ireland
Jianping Fan University of North Carolina, USA
Gerald Friedland ICSI Berkeley, USA
Yue Gao National University of Singapore, Singapore


William Grosky University of Michigan, USA
Cathal Gurrin Dublin City University, Ireland
Martin Halvey Glasgow Caledonian University, UK
Allan Hanbury Vienna University of Technology, Austria
Andreas Henrich University of Bamberg, Germany
Richang Hong Hefei University of Technology, China
Frank Hopfgartner Technische Universität Berlin, Germany
Jun-Wei Hsieh National Taiwan Ocean University, Taiwan
Winston Hsu National Taiwan University, Taiwan
Benoit Huet EURECOM, France
Wolfgang Hürst University of Utrecht, The Netherlands
Ichiro Ide Nagoya University, Japan
Rongrong Ji Xiamen University, China
Yu-Gang Jiang Fudan University, China
Håvard Johansen University of Tromsø, Norway
Mohan Kankanhalli National University of Singapore, Singapore
Yoshihiko Kawai NHK, Japan
Yiannis Kompatsiaris Information Technologies Institute, Greece
Udo Kruschwitz University of Essex, UK
Martha Larson Delft University of Technology, The Netherlands
Duy-Dinh Le National Institute of Informatics, Japan
Hyowon Lee Singapore University of Technology and Design, Singapore
Michael Lew Leiden University, The Netherlands
Haojie Li Dalian University of Technology, China
Ke Liang Advanced Digital Sciences Center, Singapore
Suzanne Little Dublin City University, Ireland
Dong Liu Columbia University, USA
Xiaobai Liu University of California at Los Angeles, USA
Yan Liu The Hong Kong Polytechnic University, Hong Kong
Yuan Liu Ricoh Software Research Center, China
Guojun Lu Monash University, Australia
Nadia Magnenat-Thalmann University of Geneva (MIRALab), Switzerland
Jose M. Martinez Universidad Autónoma de Madrid, Spain
Davide Andrea Mauro Institut Mines-Télécom, TÉLÉCOM ParisTech, CNRS-LTCI, France
Kevin McGuinness Dublin City University, Ireland
Robert Mertens Hochschule Weserbergland, Germany
Florian Metze Carnegie Mellon University, USA
David Monaghan Dublin City University, Ireland
Henning Müller HES-SO Valais, Switzerland
Chong-Wah Ngo City University of Hong Kong, Hong Kong
Naoko Nitta Osaka University, Japan


Lyndon Nixon STI International GmbH, Austria
Noel O’Connor Dublin City University, Ireland
Neil O’Hare Yahoo Labs, Spain
Vincent Oria New Jersey Institute of Technology, USA
Marco Paleari Italian Institute of Technology, Italy
Fernando Pereira Instituto Superior Técnico - Instituto de Telecomunicações, Portugal
Miriam Redi Eurecom, France
Mukesh Kumar Saini University of Ottawa, Canada
Jitao Sang Institute of Automation, Chinese Academy of Sciences, China
Shin’ichi Satoh National Institute of Informatics, Japan
Klaus Schoeffmann Klagenfurt University, Austria
David Scott Dublin City University, Ireland
Caifeng Shan Philips, The Netherlands
Jialie Shen Singapore Management University, Singapore
Koichi Shinoda Tokyo Institute of Technology, Japan
Mei-Ling Shyu University of Miami, USA
Alan Smeaton Dublin City University, Ireland
Jia Su Nippon Telegraph and Telephone Corporation, Japan
Yongqing Sun NTT Media Intelligence Laboratories, Japan
Robby Tan Utrecht University, The Netherlands
Shuhei Tarashima NTT Media Intelligence Laboratories, Japan
Xinmei Tian University of Science and Technology of China, China
Dian Tjondronegoro Queensland University of Technology, Australia
Shingo Uchihashi Fuji Xerox Co., Ltd., Japan
Egon L. van den Broek Utrecht University, The Netherlands
Nico van der Aa Utrecht University, The Netherlands
Coert van Gemeren Utrecht University, The Netherlands
Jingdong Wang Microsoft Research Asia, China
Xin-Jing Wang Microsoft Research Asia, China
Lai Kuan Wong Multimedia University, Malaysia
Marcel Worring University of Amsterdam, The Netherlands
Feng Wu Microsoft Research Asia, China
Peng Wu Hewlett-Packard, USA
Qiang Wu University of Technology Sydney, Australia
Xiao Wu Southwest Jiaotong University, China
Changsheng Xu Institute of Automation, Chinese Academy of Sciences, China
Keiji Yanai University of Electro-Communications, Japan
You Yang Huazhong University of Science and Technology, China
Zheng-Jun Zha Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, China


Cha Zhang Microsoft Research, USA
Lijuan Zhou Dublin City University, Ireland
Roger Zimmermann National University of Singapore, Singapore

Additional Reviewers

Zheng Song National University of Singapore
Hanwang Zhang National University of Singapore


Sponsoring Institutions


In Cooperation with

Bernauer-Budiman Inc., Reading, Mass.
The Hofmann-International Company, San Louis Obispo, Cal.
Kramer Industries, Heidelberg, Germany

Table of Contents – Part I

Interactive Indexing and Retrieval

A Comparative Study on the Use of Multi-label Classification Techniques for Concept-Based Video Indexing and Annotation
Fotini Markatopoulou, Vasileios Mezaris, and Ioannis Kompatsiaris

Coherence Analysis of Metrics in LBP Space for Interactive Face Retrieval
Yuchun Fang, Ying Tan, and Chanjuan Yu

A Hybrid Machine-Crowd Approach to Photo Retrieval Result Diversification
Anca-Livia Radu, Bogdan Ionescu, María Menéndez, Julian Stöttinger, Fausto Giunchiglia, and Antonella De Angeli

Visual Saliency Weighting and Cross-Domain Manifold Ranking for Sketch-Based Image Retrieval
Takahiko Furuya and Ryutarou Ohbuchi

A Novel Approach for Semantics-Enabled Search of Multimedia Documents on the Web
Lydia Weiland and Ansgar Scherp

Video to Article Hyperlinking by Multiple Tag Property Exploration
Zhineng Chen, Bailan Feng, Hongtao Xie, Rong Zheng, and Bo Xu

Rebuilding Visual Vocabulary via Spatial-temporal Context Similarity for Video Retrieval
Lei Wang, Eyad Elyan, and Dawei Song

Approximating the Signature Quadratic Form Distance Using Scalable Feature Signatures
Jakub Lokoč

A Novel Human Action Representation via Convolution of Shape-Motion Histograms
Teck Wee Chua and Karianto Leman

How Do Users Search with Basic HTML5 Video Players?
Claudiu Cobârzan and Klaus Schoeffmann


Multimedia Collections

Visual Recognition by Exploiting Latent Social Links in Image Collections
Li-Jia Li, Xiangnan Kong, and Philip S. Yu

Collections for Automatic Image Annotation and Photo Tag Recommendation
Philip J. McParlane, Yashar Moshfeghi, and Joemon M. Jose

Graph-Based Multimodal Clustering for Social Event Detection in Large Collections of Images
Georgios Petkos, Symeon Papadopoulos, Emmanouil Schinas, and Yiannis Kompatsiaris

Tag Relatedness Using Laplacian Score Feature Selection and Adapted Jensen-Shannon Divergence
Hatem Mousselly-Sergieh, Mario Döller, Elöd Egyed-Zsigmond, Gabriele Gianini, Harald Kosch, and Jean-Marie Pinon

User Intentions in Digital Photo Production: A Test Data Set
Mathias Lux, Desara Xhura, and Alexander Kopper

Personal Media Reunion: Re-collecting Media Content Scattered over Smart Devices and Social Networks
Mohamad Rabbath and Susanne Boll

Summarised Presentation of Personal Photo Sets
Nuno Datia, João Moura-Pires, and Nuno Correia

Applications

MOSRO: Enabling Mobile Sensing for Real-Scene Objects with Grid Based Structured Output Learning
Heng-Yu Chi, Wen-Huang Cheng, Ming-Syan Chen, and Arvin Wen Tsui

TravelBuddy: Interactive Travel Route Recommendation with a Visual Scene Interface
Cheng-Yao Fu, Min-Chun Hu, Jui-Hsin Lai, Hsuan Wang, and Ja-Ling Wu

Who’s the Best Charades Player? Mining Iconic Movement of Semantic Concepts
Yung-Huan Hsieh, Shintami C. Hidayati, Wen-Huang Cheng, Min-Chun Hu, and Kai-Lung Hua


Tell Me about TV Commercials of This Product
Cai-Zhi Zhu, Siriwat Kasamwattanarote, Xiaomeng Wu, and Shin’ichi Satoh

A Data-Driven Personalized Digital Ink for Chinese Characters
Tianyang Yi, Zhouhui Lian, Yingmin Tang, and Jianguo Xiao

Local Segmentation for Pedestrian Tracking in Dense Crowds
Clement Creusot

An Optimization Model for Aesthetic Two-Dimensional Barcodes
Chengfang Fang, Chunwang Zhang, and Ee-Chien Chang

Live Key Frame Extraction in User Generated Content Scenarios for Embedded Mobile Platforms
Alexandro Sentinelli and Luca Celetto

Understanding Affective Content of Music Videos through Learned Representations
Esra Acar, Frank Hopfgartner, and Sahin Albayrak

Robust Image Restoration via Reweighted Low-Rank Matrix Recovery
Yigang Peng, Jinli Suo, Qionghai Dai, Wenli Xu, and Song Lu

Learning to Infer Public Emotions from Large-Scale Networked Voice Data
Zhu Ren, Jia Jia, Lianhong Cai, Kuo Zhang, and Jie Tang

Joint People Recognition across Photo Collections Using Sparse Markov Random Fields
Markus Brenner and Ebroul Izquierdo

Temporal Analysis

Event Detection by Velocity Pyramid
Zhuolin Liang, Nakamasa Inoue, and Koichi Shinoda

Fusing Appearance and Spatio-temporal Features for Multiple Camera Tracking
Nam Trung Pham, Karianto Leman, Richard Chang, Jie Zhang, and Hee Lin Wang

A Dense SURF and Triangulation Based Spatio-temporal Feature for Action Recognition
Do Hang Nga and Keiji Yanai

Resource Constrained Multimedia Event Detection
Zhen-Zhong Lan, Yi Yang, Nicolas Ballas, Shoou-I Yu, and Alexander Hauptmann


Random Matrix Ensembles of Time Correlation Matrices to Analyze Visual Lifelogs
Na Li, Martin Crane, Heather J. Ruskin, and Cathal Gurrin

3D and Augmented Reality

Exploring Distance-Aware Weighting Strategies for Accurate Reconstruction of Voxel-Based 3D Synthetic Models
Hani Javan Hemmat, Egor Bondarev, and Peter H.N. de With

Exploitation of Gaze Data for Photo Region Labeling in an Immersive Environment
Tina Walber, Ansgar Scherp, and Steffen Staab

MR Simulation for Re-wallpapering a Room in a Free-Hand Movie
Masashi Ueda, Itaru Kitahara, and Yuichi Ohta

Segment and Label Indoor Scene Based on RGB-D for the Visually Impaired
Zhe Wang, Hong Liu, Xiangdong Wang, and Yueliang Qian

A Low-Cost Head and Eye Tracking System for Realistic Eye Movements in Virtual Avatars
Yingbo Li, Haolin Wei, David S. Monaghan, and Noel E. O’Connor

Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data
Georgios Th. Papadopoulos, Apostolos Axenopoulos, and Petros Daras

Kinect vs. Low-cost Inertial Sensing for Gesture Recognition
Marc Gowing, Amin Ahmadi, François Destelle, David S. Monaghan, Noel E. O’Connor, and Kieran Moran

Yoga Posture Recognition for Self-training
Hua-Tsung Chen, Yu-Zhen He, Chun-Chieh Hsu, Chien-Li Chou, Suh-Yin Lee, and Bao-Shuh P. Lin

Real-Time Gaze Estimation Using a Kinect and a HD Webcam
Yingbo Li, David S. Monaghan, and Noel E. O’Connor

Compression, Transcoding and Streaming

A Framework of Video Coding for Compressing Near-Duplicate Videos
Hanli Wang, Ming Ma, Yu-Gang Jiang, and Zhihua Wei


An Improved Similarity-Based Fast Coding Unit Depth Decision Algorithm for Inter-frame Coding in HEVC
Rui Fan, Yongfei Zhang, Zhe Li, and Ning Wang

Low-Complexity Rate-Distortion Optimization Algorithms for HEVC Intra Prediction
Zhe Sheng, Dajiang Zhou, Heming Sun, and Satoshi Goto

Factor Selection for Reinforcement Learning in HTTP Adaptive Streaming
Tingyao Wu and Werner Van Leekwijck

Stixel on the Bus: An Efficient Lossless Compression Scheme for Depth Information in Traffic Scenarios
Qing Rao, Christian Grünler, Markus Hammori, and Samarjit Chakraborty

A New Saliency Model Using Intra Coded High Efficiency Video Coding (HEVC) Frames
Matthew Oakes and Charith Abhayaratne

Multiple Reference Frame Transcoding from H.264/AVC to HEVC
Antonio Jesus Diaz-Honrubia, Jose Luis Martinez, and Pedro Cuenca

Author Index

Table of Contents – Part II

Special Session: Mediadrom: Artful Post-TV Scenarios

Organising Crowd-Sourced Media Content via a Tangible Desktop Application
Sema Alaçam, Yekta İpek, Özgün Balaban, and Ceren Kayalar

Scenarizing Metropolitan Views: FlanoGraphing the Urban Spaces
Bénédicte Jacobs, Laure-Anne Jacobs, Christian Frisson, Willy Yvart, Thierry Dutoit, and Sylvie Leleu-Merviel

Scenarizing CADastre Exquisse: A Crossover between Snoezeling in Hospitals/Domes, and Authoring/Experiencing Soundful Comic Strips
Cédric Sabato, Aurélien Giraudet, Virginie Delattre, Yves Desnos, Christian Frisson, Rudi Giot, Willy Yvart, François Rocca, Stéphane Dupont, Guy Vandem Bemden, Sylvie Leleu-Merviel, and Thierry Dutoit

An Interactive Device for Exploring Thematically Sorted Artworks
Aurélie Baltazar, Pascal Baltazar, and Christian Frisson

Special Session: MM Analysis for Surveillance Video and Security Applications

Hierarchical Audio-Visual Surveillance for Passenger Elevators
Teck Wee Chua, Karianto Leman, and Feng Gao

An Evaluation of Local Action Descriptors for Human Action Classification in the Presence of Occlusion
Iveel Jargalsaikhan, Cem Direkoglu, Suzanne Little, and Noel E. O’Connor

Online Identification of Primary Social Groups
Dimitra Matsiki, Anastasios Dimou, and Petros Daras

Gait Based Gender Recognition Using Sparse Spatio Temporal Features
Matthew Collins, Paul Miller, and Jianguo Zhang

Perspective Multiscale Detection and Tracking of Persons
Marcos Nieto, Juan Diego Ortega, Andoni Cortes, and Seán Gaines


Human Action Recognition in Video via Fused Optical Flow and Moment Features – Towards a Hierarchical Approach to Complex Scenario Recognition
Kathy Clawson, Min Jing, Bryan Scotney, Hui Wang, and Jun Liu

Special Session: 3D Multimedia Computing and Modeling

Sparse Patch Coding for 3D Model Retrieval
Zhenbao Liu, Shuhui Bu, Junwei Han, and Jun Wu

3D Object Classification Using Deep Belief Networks
Biao Leng, Xiangyang Zhang, Ming Yao, and Zhang Xiong

Pursuing Detector Efficiency for Simple Scene Pedestrian Detection
De-Dong Yuan, Jie Dong, Song-Zhi Su, Shao-Zi Li, and Rong-Rong Ji

Multi-view Action Synchronization in Complex Background
Longfei Zhang, Shuo Tang, Shikha Singhal, and Gangyi Ding

Parameter-Free Inter-view Depth Propagation for Mobile Free-View Video
Binbin Xiong, Weimin Wu, Haojie Li, Hongtao Yu, and Hanzi Mao

Coverage Field Analysis to the Quality of Light Field Rendering
Changjian Zhu, Li Yu, and Peng Zhou

Special Session: Social Geo-Media Analytics and Retrieval

Personalized Recommendation by Exploring Social Users’ Behaviors
Guoshuai Zhao, Xueming Qian, and He Feng

Where Is the News Breaking? Towards a Location-Based Event Detection Framework for Journalists
Bahareh Rahmanzadeh Heravi, Donn Morrison, Prashant Khare, and Stephane Marchand-Maillet

Location-Aware Music Artist Recommendation
Markus Schedl and Dominik Schnitzer

Task-Driven Image Retrieval Using Geographic Information
Peixiang Dong, Kuizhi Mei, Ji Zhang, Hao Lei, and Jianping Fan

The Evolution of Research on Multimedia Travel Guide Search and Recommender Systems
Junge Shen, Zhiyong Cheng, Jialie Shen, Tao Mei, and Xinbo Gao


Special Session: Multimedia Hyperlinking and Retrieval

Average Precision: Good Guide or False Friend to Multimedia SearchEffectiveness? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239

Robin Aly, Dolf Trieschnigg, Kevin McGuinness,Noel E. O’Connor, and Franciska de Jong

An Investigation into Feature Effectiveness for MultimediaHyperlinking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251

Shu Chen, Maria Eskevich, Gareth J.F. Jones, and Noel E. O’Connor

Mining the Web for Multimedia-Based Enriching . . . . . . . . . . . . . . . . . . . . . 263Mathilde Sahuguet and Benoit Huet

Short Papers

Spatial Similarity Measure of Visual Phrases for Image Retrieval . . . . . 275
Jiansong Chen, Bailan Feng, and Bo Xu

Semantic Based Background Music Recommendation for Home Videos . . . . . 283
Yin-Tzu Lin, Tsung-Hung Tsai, Min-Chun Hu, Wen-Huang Cheng, and Ja-Ling Wu

Smoke Detection Based on a Semi-supervised Clustering Model . . . . . 291
Haiqian He, Liqun Peng, Deshun Yang, and Xiaoou Chen

Empirical Exploration of Extreme SVM-RBF Parameter Values for Visual Object Classification . . . . . 299
Rami Albatal and Suzanne Little

Real-World Event Detection Using Flickr Images . . . . . 307
Naoko Nitta, Yusuke Kumihashi, Tomochika Kato, and Noboru Babaguchi

Spectral Classification of 3D Articulated Shapes . . . . . 315
Zhenbao Liu, Feng Zhang, and Shuhui Bu

Improving Scene Detection Algorithms Using New Similarity Measures . . . . . 323
Stefan Zwicklbauer, Britta Meixner, and Harald Kosch

EvoTunes: Crowdsourcing-Based Music Recommendation . . . . . 331
Jun-Ho Choi and Jong-Seok Lee

Affect Recognition Using Magnitude Models of Motion . . . . . 339
Oussama Hadjerci, Adel Lablack, Ioan Marius Bilasco, and Chaabane Djeraba

Effects of Audio Compression on Chord Recognition . . . . . 345
Aiko Uemura, Kazumasa Ishikura, and Jiro Katto

The Perceptual Characteristics of 3D Orientation . . . . . 353
Wang Heng, Zhang Cong, Hu Ruimin, Tu Weiping, and Wang Xiaochen

Demonstrations

Folkioneer: Efficient Browsing of Community Geotagged Images on a Worldwide Scale . . . . . 361
Hatem Mousselly-Sergieh, Daniel Watzinger, Bastian Huber, Mario Döller, Elöd Egyed-Zsigmond, and Harald Kosch

Muithu: A Touch-Based Annotation Interface for Activity Logging in the Norwegian Premier League . . . . . 365
Magnus Stenhaug, Yang Yang, Cathal Gurrin, and Dag Johansen

FoodCam: A Real-Time Mobile Food Recognition System Employing Fisher Vector . . . . . 369
Yoshiyuki Kawano and Keiji Yanai

The LIRE Request Handler: A Solr Plug-In for Large Scale Content Based Image Retrieval . . . . . 374
Mathias Lux and Glenn Macstravic

M3 + P3 + O3 = Multi-D Photo Browsing . . . . . 378
Björn Thór Jónsson, Áslaug Eiríksdóttir, Ólafur Waage, Grímur Tómasson, Hlynur Sigurthórsson, and Laurent Amsaleg

Tools for User Interaction in Immersive Environments . . . . . 382
Noel E. O’Connor, D. Alexiadis, K. Apostolakis, Petros Daras, E. Izquierdo, Y. Li, D.S. Monaghan, F. Rivera, C. Stevens, S. Van Broeck, J. Wall, and H. Wei

RESIC: A Tool for Music Stretching Resistance Estimation . . . . . 386
Jun Chen and Chaokun Wang

A Visual Information Retrieval System for Radiology Reports and the Medical Literature . . . . . 390
Dimitrios Markonis, René Donner, Markus Holzer, Thomas Schlegl, Sebastian Dungs, Sascha Kriewel, Georg Langs, and Henning Müller

Eolas: Video Retrieval Application for Helping Tourists . . . . . 394
Zhenxing Zhang, Yang Yang, Ran Cui, and Cathal Gurrin

Video Browser Showdown

Audio-Visual Classification Video Browser . . . . . 398
David Scott, Zhenxing Zhang, Rami Albatal, Kevin McGuinness, Esra Acar, Frank Hopfgartner, Cathal Gurrin, Noel E. O’Connor, and Alan F. Smeaton

Content-Based Video Browsing with Collaborating Mobile Clients . . . . . 402
Claudiu Cobârzan, Marco A. Hudelist, and Manfred Del Fabro

Browsing Linked Video Collections for Media Production . . . . . 407
Werner Bailer, Wolfgang Weiss, Christian Schober, and Georg Thallinger

VERGE: An Interactive Search Engine for Browsing Video Collections . . . . . 411
Anastasia Moumtzidou, Konstantinos Avgerinakis, Evlampios Apostolidis, Vera Aleksić, Fotini Markatopoulou, Christina Papagiannopoulou, Stefanos Vrochidis, Vasileios Mezaris, Reinhard Busch, and Ioannis Kompatsiaris

Signature-Based Video Browser . . . . . 415
Jakub Lokoč, Adam Blažek, and Tomáš Skopal

NII-UIT: A Tool for Known Item Search by Sequential Pattern Filtering . . . . . 419
Thanh Duc Ngo, Vu Hoang Nguyen, Vu Lam, Sang Phan, Duy-Dinh Le, Duc Anh Duong, and Shin’ichi Satoh

Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423