exploiting ontologies for automatic image annotation munirathnam srikanth, joshua varner, mitchell...

17
Exploiting Ontologies fo r Automatic Image Annota tion Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR 2004

Upload: nigel-ward

Post on 18-Jan-2016

230 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

Exploiting Ontologies for Automatic Image Annotation

Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan

Language Computer CorporationSIGIR 2004

Page 2: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

INTRODUCTION This paper exploits ontological relationshi

ps between annotation words The hierarchy is used in the context of gen

erating a visual vocabulary The hierarchy is used in proposed hierarc

hical classification approach for automatic image annotation

Page 3: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

MOTIVATION

Ronaldo seals Brazil's place in the last eightwith a shot through Geert de Vlieger's legs late onto eliminate Belgium

Q: What is the jersey color of Brazil's national soccer team?

Page 4: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

MOTIVATION It is possible to annotate images with

words that do not have explicit examples in the training set EX. Corel data set can annotate an image

as an animal even when no image in the training set that is so annotated

tiger, cougar, cat, horse, cow, etc., can be placed under the concept node of animal

Page 5: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

Ontology-induced Visual Vocabulary

Instead of a random selection of initial cluster centers for the K-means clustering, the hierarchy in the annotation words are used to perform the selection

Given the hierarchy of annotation words images regions are grouped under each node in the hierarchy

Page 6: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

Ontology-induced Visual Vocabulary An image region r in image I is placed

under a concept w if w is an annotation word for the image I One of its hyponyms annotates the imag

e I Averaging the feature vectors of the i

mages regions in the set produces a semantically-motivated initial cluster center

Page 7: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

Weighted K-means clustering Weights can be assigned to image regions

quantifying their contribution to the particular cluster

Page 8: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

IMAGE ANNOTATION BYHIERARCHICAL CLASSIFICATION

Page 9: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

IMAGE ANNOTATION BYHIERARCHICAL CLASSIFICATION

The dependencies between the annotation words are identified by the ISA hierarchy in WordNet

As an example, assume the hierarchy of 'tiger' ISA 'cat‘ ISA 'animal' ISA 'ROOT'

Page 10: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

INDUCING HIERARCHIES INWORD ANNOTATIONS

we made a coarser assumption that a particular word is used in only one sense in the the whole corpus

The sense for a word is selected based on hyponyms and hypernyms appear as annotations of an image in the collection

Page 11: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

EXPERIMENTS The experiments are performed on the Cor

el Data set 5000 annotated images split into a training set

of 4500 images with the remaining 500 used for testing

A total of 371 words annotate the data set with up to 5 annotation words for each image

714 nodes is generated for the 371 unique annotation words

Page 12: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

EXPERIMENTS The number of words predicted as

an annotation at least once for any image in the test set

Page 13: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

EXPERIMENTS

Page 14: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

EXPERIMENTS

Page 15: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

EXPERIMENTS

Page 16: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

CONCLUSIONS This paper proposed methods to use

hierarchies induced on annotation words to improve automatic image annotation

Presents a method for generating visual vocabularies based on the semantics of the annotation words and their hierarchical organization

Page 17: Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR

CONCLUSIONS The hierarchy derived from WordNet is used

as a framework to generate classification models

Both classification methods performed better than translation model in our experiments

We intend to verify the improvements obtained using the proposed methods on a larger and more challenging data set like TREC Video dataset