graph-based multimodal clustering for social event detection in large collections of images

MMM 2014

Graph-based multimodal clustering for social event

detection in large collections of images

Georgios Petkos, Symeon Papadopoulos, Emmanouil Schinas, Yiannis KompatsiarisInformation Technologies Institute (ITI)Centre for Research & Technologies Hellas (CERTH)

MMM 2014 Georgios Petkos et al.#2

Overview

• The problem of social event detection• Existing approaches• Proposed approach• Evaluation• Summary & future work

MMM 2014 Georgios Petkos et al.

the problem

entertainment

personal

wedding / birthday / drinks

concert / play / sports

demonstration / riot / speech

Social events?

Attended by people and represented by multimedia content shared online

Pope Francis

Pope Benedict

2007: iPhone release

2008: Android release

2010: iPad release

http://petapixel.com/2013/03/14/a-starry-sea-of-cameras-at-the-unveiling-of-pope-francis/

Social event detection

Social event detection involves the automatic organization of a multimedia collection C into groups of items, each (group) of which corresponds to a distinct event. Can be treated as a multimodal clustering problemCOLLECTION

EVENT DETECTION

EVENT SET

existing approaches

Supervised event detection

• Rationale: use a large number of “known” event assignments to “learn” how to identify “same event” / “same cluster” relationships

Two variants:• Item-to-item: learn whether two items belong to the same

event cluster or not. – Model Input: the set of per modality distances between two images.

• Item-to-cluster: learn whether a new item belongs to a given event cluster or not. – Model input: the set of per modality distances between an image and

a prototype representation of the event.

Utilizing the “same event” model for clustering

• Item-to-item: – (Incremental). For each incoming image, average all item-to-item SE

scores for all items in each cluster. Assign to best-matching cluster if average above threshold or create new cluster (Becker et. al.).

– (Batch). Compute all item-item SE scores between each image and all other images and form an indicator vector. Cluster indicator vectors (Petkos et. al.).

• Item-to-cluster: – (Incremental). For each cluster maintain a multimodal representation.

Compute SE score between each incoming item and the existing prototype event representations. Assign to best-matching cluster if above threshold or create new cluster (Becker et. al). Alternatively use a second model for deciding if a new cluster should be added or not (Reuter et. al.).

proposed approach

Overview of proposed approach

• Item-to-item SE model utilized.• Candidate neighbours selection step (first appears in (Reuter et. al)) using a set of per modality indexes. • Graph representation.• Community detection on graph. Two variants of the algorithm:

• Batch: SCAN• Incremental: QCA

Proposed approach: advantages

• Item-to-cluster methods may suffer from incorrect prototype representations (due to averaging). • Candidate neighbours selection step makes the application of the method much more scalable.• Graph representation: in order to introduce a scalable item-to-item approach without averaging.

evaluation

Evaluation setup

• Used the dataset of the 2012 SED task of MediaEval• Ground truth: 7,779 photos clustered around 149

events (18 technical, 79 soccer, 52 Indignados)• Assess the following aspects:

– accuracy of same-event classification– compare clustering quality between item-to-cluster and

the two versions of item-to-item (batch & incremental)– measure contributions of different features– study generalization abilities of same event model

Evaluation setup

Features:• Uploader identity.• Actual image content:

– GIST– SURF, aggregated using the VLAD scheme

• Textual features: title, description and tags. Either a TF-IDF or a BM25 weighting scheme is utilized.

• Time of media creation.• Location, when available (geodesic distance).

Appropriate indices are utilized in order to rapidly fetch the candidate neighbours for each modality.

Evaluation: SE accuracy & clustering quality

• Same event classification accuracy 98.58% (SVM)– 10K pos/neg training, 10K pos/neg testing (random)

• Clustering quality (NMI): 30/119 training/testing events [10 random splits]– Incremental same or better than batch– Item-to-item better than item-to-cluster (significant at 0.95 confidence)

• When non-event photos enter the dataset, NMI degrades quickly

BATCH INCREMENTAL ITEM-TO-CLUSTER

AVG 0.924 0.934 0.898

STD 0.019 0.021 0.027

NON-EVENT BATCH INCREMENTAL ITEM-TO-CLUSTER

5% 0.4824 0.5164 0.3954

10% 0.3421 0.3683 0.2899

* In the second table, results were obtained using sed2011 for training and sed2012 for testing.

Evaluation: contribution of features

• Same experiments using limited sets of features

• Repeating the same experiments without the use of blocking led to significantly worse results– e.g. 0.030 for visual, 0.7148 for textual

• Time is an extremely important feature

FEATUERS BATCH INCREMENTAL

VISUAL 0.8020 ∓ 0.0193 0.8179 ∓ 0.0151

TEXTUAL 0.7925 ∓ 0.0255 0.7792 ∓ 0.0310

VISUAL+TIME 0.9244 ∓ 0.0195 0.9360 ∓ 0.0183

TEXTUAL+TIME 0.9016 ∓ 0.0173 0.9049 ∓ 0.0209

Evaluation: generalizing same event model

• Train using one event type > test on a different one• In most cases negative impact• In few cases, performance is very high!

soccer technical Indignados

soccer - 0.8658 0.8494

technical 0.7967 - 0.8977

Indignados 0.9645 0.8456 -

INCREMENTAL

soccer technical Indignados

soccer - 0.8892 0.8667

technical 0.7661 - 0.7735

Indignados 0.9845 0.8482 -

summary & future work

Summary

• Scalable item-to-item multimodal clustering approach for SED

• Key characteristics:– Item-to-item “same event” model– Candidate neighbor selection – Organization of “same event” relationships to a graph– Efficient graph clustering algorithms: SCAN (batch) / QCA

(incremental)

• In general though, item-to-item approaches are less scalable than item-to-cluster approaches

Future work

• Extend method so that non-event images are properly handled

• Multiple sources of multimedia

• The MediaEval datasets are somewhat limited. Investigate the effect of crawling / image collection to the quality of results

thank you!

questions?Acknowledgements

online clustering of same-event graph

QCA maintains community structure incrementally following graph change operations: node & edge addition (removal operations not applicable in same event graph): based on the concept of community attraction forces

X new nodenew edge

force from Cu to Cz

force from Cz to Cu

• Depending on a test (computed based on local graph structure), community structure could remain the same, X assigned to Cu or A to Cz.

• If A is assigned to Cu, all its neighbours will be checked for potential reassignment.

graph clustering :: SCAN

outlier

(μ,ε)- corestructural similarity

• resilient to spurious links (e.g. visual links that connect unrelated images)

• very fast (scales linearly to the number of edges)• leaves less-/ and over-connected items out of the clustering

References

• Reuter, T., & Cimiano, P. (2012, June). Event-based classification of social media streams. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (p. 22). ACM.

• Petkos, G., Papadopoulos, S., & Kompatsiaris, Y. (2012). Social event detection using multimodal clustering and integrating supervisory signals. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (p. 23). ACM.

• Becker, H., Naaman, M. & Gravano, L.. Learning similarity metrics for event identification in social media. In Proceedings of the third ACM International Conference on Web search and Data Mining, WSDM ’10, pages 291–300, New York.

• Nguyen, N., Dinh, T., Xuan, Y., & Thai, M.. Adaptive algorithms for detecting community structure in dynamic social networks. In INFOCOM 2011. 30th IEEE International Conference on Computer Communications, Joint Conference of the IEEE Computer and Communications Societies, 10-15 April 2011, Shanghai, China, pages 2282–2290. IEEE, 2011.

• Xu, X., Yuruk, N., Feng, Z. & Schweiger, T.. SCAN: a structural clustering algorithm for networks. In Proceedings of the 13th ACM SIGKDD, KDD ’07, pages 824–833, NY, USA, 2007. ACM

graph-based multimodal clustering for social event detection in large collections of images

clustering item

item approach

batch item

new item

itemitem se scores

incoming item

scalable item

advantages item

Technology

social event detection using multimodal clustering and...

p-dbscan: a density based clustering algorithm for ... ·...

fuzzy clustering 2009/2010. 2 what is data clustering? ...

konstantin vorontsov - bigartm: open source library for...

clustering clustering

monitoria multimodal cerebral multimodal monitoring[2]

bigartm: open source library for regularized multimodal...

classification and clustering methods development and...

brain connectivity analysis: from unimodal to multimodal ·...

fiata and multimodal coridors - unece · pdf filemode or...

adaptive isolation model using data clustering for...

2016 multimodal asset performance report€¦ · table of...

deep multimodal clustering for unsupervised audiovisual...

deep multimodal clustering for unsupervised audiovisual...

multimodal analysis of video collections: visual exploration...

automatic face annotation in personal photo collections...

automatic classification for various images collections...

affinity clustering: hierarchical clustering at...

multimodal data visualization, denoising and clustering

temporal event clustering for digital photo collections...