exploring the "betrothed lovers" and other literary works

31
Exploring the “Betrothed Lovers” and other literary works Andrea Bolioli, Riccardo Tasso ”If you enjoy it, you understand it”

Upload: celi

Post on 08-Jul-2015

276 views

Category:

Technology


1 download

DESCRIPTION

DH Workshop in memory of Emanuele Pianta, Trento 10 December, 2013 As part of the activities of Digital Humanities group in FBK, a one-day workshop on "Digital Humanities: Current state and Future challenges". Exploring the “Betrothed Lovers” and other literary works by Andrea Bolioli, Riccardo Tasso 2 Our company: Cross Library:Spin-off of FBK (Trento) and CELI (Torino) Digital Humanities and School Our claim: If you enjoy it, you understand it! Our product: the "crunched" book 3 A propotype for literature: I promessi sposi 2.0 4 Exploring literary works 5 A research project: Sèduco 6 HLT tasks for literature processing 7 The Annotation Framework 8 Our Annotation Model: An annotation is a span of text characterized by a 9 Our Annotation Model: An annotation may have attributes 10 Our Annotation Model: An annotation may be classified 11 Our Annotation Model: An annotation may be related 12 Object Store 13-14 Text Store 15-20 The annotation query engine 21 Crunched Book SNA 22 Actors Graph 23 Pinocchio Actors (1) 24 Pinocchio Actors (1) 25 Speakers Graph 26 Promessi Sposi Speakers 27 Pinocchio Speakers 28 Romeo and Juliet 29 Crunched Book SNA (speakers) 30 Future works 31 Thank You! @CrossLib http://www.cross-library.com

TRANSCRIPT

Page 1: Exploring the "Betrothed Lovers" and other literary works

Exploring the “Betrothed Lovers”

and other literary works

Andrea Bolioli, Riccardo Tasso

”If you enjoy it, you understand it”

Page 2: Exploring the "Betrothed Lovers" and other literary works

Our claim: If you enjoy it,

you understand it!

Our product: the "crunched" book

Spin-off of FBK (Trento)

and CELI (Torino)

Digital Humanities and School

www.cross-library.com

Our company: Cross Library

Page 3: Exploring the "Betrothed Lovers" and other literary works

A propotype for literature: I promessi sposi 2.0

«The Betrothed», by Alessandro Manzoni www.crunchedbook.com

Page 4: Exploring the "Betrothed Lovers" and other literary works

Exploring literary works

NARRATIVE SEQUENCES

CHARACTERS SOCIAL NETWORKS

LOCATIONS

Page 5: Exploring the "Betrothed Lovers" and other literary works

A research project: Sèduco

Sharing Educational Content

www.seduco.it

Partners: Cross Library,

OpenContent,

FBK, IPRASE

and 4 high schools

«Exploring the Betrothed Lovers»,

A. Bolioli, M. Casu, M. Lana, R. Roda,

Computational Models of Narrative workshop CMN 2013,

Hamburg 4-6 august 2013

Page 6: Exploring the "Betrothed Lovers" and other literary works

HLT tasks for literature processing

• Automatic text segmentation:

narrative sequences, quoted speech,

other text units

• Entity mention annotation:

speakers, mentions of characters

(agents) and locations (not only GPEs,

e.g. "castello dell'Innominato" - castle

of the Unnamed, osteria della Luna

piena" - tavern of the Full Moon)

• Quoted speech attribution

Page 7: Exploring the "Betrothed Lovers" and other literary works

The Annotation Framework

Page 8: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation is a span of text characterized by

a <begin, end>

Page 9: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation may have attributes:

Page 10: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation may be classified:

Page 11: Exploring the "Betrothed Lovers" and other literary works

Our Annotation Model

An annotation may be related:

Page 12: Exploring the "Betrothed Lovers" and other literary works

Object Store

An annotation is persisted:

“A graph database stores data in a graph, the

most generic of data structures, capable of

elegantly representing any kind of data in a

highly accessible way”

An annotation is persisted:

Page 13: Exploring the "Betrothed Lovers" and other literary works

Text Store

Annotations, annotations, annotations... But what about text?

Page 14: Exploring the "Betrothed Lovers" and other literary works

Text Store

Annotations, annotations, annotations... But what about text?

Page 15: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

And (finally) you can search and find annotations

Page 16: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Choose a MAIN annotation filter:

{ "main": { "@class": "Sequence" } }

Returns all the Annotations: whose class is Sequence

Page 17: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Specify annotation's attributes:

{ "main": { "@class": "Fragment", "type": "speech" } }

Returns all the Annotations: whose class is Fragment of (sub)type "speech"

Page 18: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Specify annotation's relations:

{ "main": { "@class": "Sequence", "out('actor')": "pinocchio", "out('place')": "paese_balocchi" } }

Returns all the Annotations: whose class is Sequence with an actor relation to "pinocchio" with a place relation to "paese_balocchi"

Page 19: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Choose second level filter:

{ "main": { "@class": "Sequence" }, "filter": { "@class": "@Fragment", "type": "speech" } }

Returns all the Annotations: whose class is Sequence which CONTAIN a given annotation (speech)

Page 20: Exploring the "Betrothed Lovers" and other literary works

The annotation query engine

Full text search:

{ "main": { "@class": "Sequence", "out('actor')": "pinocchio" }, "@text": "storia" }

Returns all the Annotations: whose class is Sequence with an actor relation to "pinocchio" whose text contains "storia" keyword

Page 21: Exploring the "Betrothed Lovers" and other literary works

Crunched Book SNA

Page 22: Exploring the "Betrothed Lovers" and other literary works

Actors Graph

Page 23: Exploring the "Betrothed Lovers" and other literary works

Pinocchio Actors (1)

Page 24: Exploring the "Betrothed Lovers" and other literary works

Pinocchio Actors (2)

Page 25: Exploring the "Betrothed Lovers" and other literary works

Speakers Graph

Page 26: Exploring the "Betrothed Lovers" and other literary works

Promessi Sposi Speakers

Page 27: Exploring the "Betrothed Lovers" and other literary works

Pinocchio Speakers

Page 28: Exploring the "Betrothed Lovers" and other literary works

Romeo and Juliet

Page 29: Exploring the "Betrothed Lovers" and other literary works

Crunched Book SNA (speakers)

Promessi Sposi Pinocchio Romeo & Juliet

nodes 86 62 35

edges 182 104 236

diameter 6 6 3

density 0.061 0.055 0.397

connected components 1 1 1

communities 6 11 3

clustering coefficient 0.528 0.614 0.813

avg. path length 2.814 2.395 1.64

Page 30: Exploring the "Betrothed Lovers" and other literary works

Future works

Other crunched books (in January):

«Le avventure di Pinocchio», «Romeo and Juliet»

Next DH projects:

• Annotating and visualizing ancient places in latin literature

• A multilingual work (latin, english, italian and chinese)

Page 31: Exploring the "Betrothed Lovers" and other literary works

Thank You!

@CrossLib

http://www.cross-library.com

”If you enjoy it, you understand it”