a generic annotation tool for multimodal dialogue

20
Anvil A Generic Annotation Tool for Multimodal Dialogue Michael Kipp University of the Saarland & DFKI Germany [email protected]

Upload: others

Post on 12-Sep-2021

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Generic Annotation Tool for Multimodal Dialogue

Anvil

A Generic Annotation Tool for Multimodal Dialogue

Michael KippUniversity of the Saarland & DFKI

Germany

[email protected]

Page 2: A Generic Annotation Tool for Multimodal Dialogue

AnViLAnnotation of Video and Spoken Language

Page 3: A Generic Annotation Tool for Multimodal Dialogue

Motivation• Find new gestures and their semantic and

temporal relation to speech• Analysis of empirical data on video files• Notepad for parallel events (video synchronized):

– speech– head movements– gestures

• Parallel information layers = Tracks• Notes for one video = Annotation

Page 4: A Generic Annotation Tool for Multimodal Dialogue

Annotation

Welcome to Anvilwords

head movements

gestures

Elements within one track must not overlap!

Tracks Track Elements

Page 5: A Generic Annotation Tool for Multimodal Dialogue

words

head movements

gestures

Tracks Track Elements

token = „Welcome“gesture class = deicticrepetition = falsecertainty = 4

Annotation

token: Stringgesture class: ValueSet {deictic, emblem, iconic}repetition: Booleancertainty: Number(1,4)

Page 6: A Generic Annotation Tool for Multimodal Dialogue

Track Hierarchiesprimary and secondary tracks

sentences

words

gestures

Welcome to Anvil

gesture phases

gesture cluster

Page 7: A Generic Annotation Tool for Multimodal Dialogue

• Track hierarchies express a very specific kind of relation

• How to code general interdependenciesbetween elements in the same or different tracks?

• Put logical links into a new type of attribute

Page 8: A Generic Annotation Tool for Multimodal Dialogue

Links

Welcome to Anvilwords

head movements

gestures

category = „deictic“lexical affiliate = { }

token = „Anvil“

Page 9: A Generic Annotation Tool for Multimodal Dialogue

Reciprocal Links

Welcome to Anvilwords

head movements

gestures

category = „deictic“lexical affiliate = { }

token = „Anvil“backlink = { }

lexical affiliate: ReciprocalLink(backlink)

Page 10: A Generic Annotation Tool for Multimodal Dialogue

Specification• Specify

– Tracks name + type + reference track

– For each track: Attributes name + type• String, Boolean, Number(x, y)• ValueSet {set of user-defined labels}• MultiLink, ReciprocalLink(A)

• This is the annotation scheme= specification file (XML)

Page 11: A Generic Annotation Tool for Multimodal Dialogue

main window video window(frame-accurate control)

track window

annotationboard

activetrack

tracks track elements playback lineselectedelement

waveform

Page 12: A Generic Annotation Tool for Multimodal Dialogue

Project Tool

• Anvil only good for one annotation at a time• A project is a set of annotations with equal

specification (same scheme)• What can you do with projects?

– overview (coder, date, progress)– search & find across annotations– export found units for analysis

Page 13: A Generic Annotation Tool for Multimodal Dialogue

annotation files

annotation info

• coder• last modification time• summary/notes• each track‘s size

ProjectTool

search engine

Page 14: A Generic Annotation Tool for Multimodal Dialogue

list of hits

click tojump/load

Page 15: A Generic Annotation Tool for Multimodal Dialogue

Import/Export• Anvil is not suited for speech transcription or

statistical analysis• Import PRAAT tier to Anvil track• Import XWAVES speech transcription• Export single track of an annotation as ASCII text• Export single track of a project to text tables• Import/Export by working with Anvil XML

annotation files (e.g. using XSL)

Page 16: A Generic Annotation Tool for Multimodal Dialogue

Extending Anvilusing Plug-ins

• File reader/writers/converters(other formats!)

• Different views on the annotation:tables, matrices, trees, 3D graphics, etc.

• Analysis modules

Page 17: A Generic Annotation Tool for Multimodal Dialogue

GestureAnalysisPlug-in

list of lexical affiliates

relative timing

handedness

single gesture

Page 18: A Generic Annotation Tool for Multimodal Dialogue

Technical Data

• Developed on PentiumIII PC 500MHz 256MB

• Platform-independent (Java + JMF)• XML-based• Supports QuickTime and AVI• Small (1.2MB) and easy to install• Documented 43 page manual + papers + demo files

FREEWARE

Page 19: A Generic Annotation Tool for Multimodal Dialogue

Circulation of Anvil• Papers + presentations at

– Agents 2001 (Workshop)– Oralité et Gestualité (Orage) 2001– EuroSpeech 2001

• Part of ISLE (formerly EAGLES) survey• Anvil mailing list: 80 entries

(HCI, Ling., Phonetics, CA, Psycholing., Ethology, Animation, Tools/Corpora - Research & Teaching)

• Model function for NITE, LDC, Observerand TalkBank project

Page 20: A Generic Annotation Tool for Multimodal Dialogue

Summary

• Anvil is a free video annotation tool for– coding single annotations– browsing multiple annotations

• Clean separation of tool and coding scheme• Leaves special tasks to other programs

speech transcription, statistical analysis• Extensible via plug-in architecture

www.dfki.de/~kipp/anvil