dialogue structure in microtext · dialogue structure in microtext aaai-11 workshop on analyzing...

50
Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Upload: others

Post on 24-Jul-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Dialogue Structure in MicrotextAAAI-11 Workshop on Analyzing Microtext

Micha Elsner

School of InformaticsUniversity of Edinburgh

8 August, 2011

Page 2: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Unconventional interactions

2

Page 3: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Where do we fit in?

Speech

I Prosody, tone of voiceI No latencyI Short utterancesI Strictly sequential

Text

I Just wordsI Ultra-high latencyI Long documentsI Hierarchical / searchable

3

Page 4: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Interfaces matter!

I Mostly text, some multimediaI Low latency (IM) vs high latency (web forum)I Short turns (twitter) vs long turns (email)I Sequential (Huffington comments) vs structured (Slashdot

comments)

These differences affect the language we see!

4

Page 5: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Overview

Introduction

Case study: IRC chat

IRC vs speech

Text and speech models for disentangling IRC

Conclusions

5

Page 6: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Speech: conversational structure

I One speaker at a timeI Has the floor (Sacks et al)

I Speaker signals intent tokeep talking or finish

I Coordination via shortutterances:

I Filled pauses “uh”,backchannels “yeah”

(Fox, Sudderth et al: A Sticky HDP-HMM)

6

Page 7: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Turn-taking in IRC chat

0 10 100 10000

20

40

seconds

Fre

quen

cy

I More tolerant of long pausesI And possibly of “interruption”

I Some backchannel-like utterances:I ∼ 10% one-word comments: “lol”, “ok”I Switchboard: ∼ 17% backchannels: “yeah”, “uh-huh”

7

Page 8: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Turn-taking in IRC chat

0 10 100 10000

20

40

seconds

Fre

quen

cy

I More tolerant of long pausesI And possibly of “interruption”

I Some backchannel-like utterances:I ∼ 10% one-word comments: “lol”, “ok”I Switchboard: ∼ 17% backchannels: “yeah”, “uh-huh”

7

Page 9: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Multiple floorsUsually several conversations at a time

I Between 2 and 3 active during each utterance

Chatters participate in many conversationsI The more one speaks, the more threads they speak in

0 10 20 30 40 50 60Utterances

0

1

2

3

4

5

6

7

8

9

10

Threads

8

Page 10: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Questions

What information is useful?How well do text/speech models adapt?

9

Page 11: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Disentanglement (threading)

(Elsner+Charniak ACL 08, CL 10, ACL 11)

10

Page 12: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Preliminaries

Six annotators marked 800 lines of chatI From a Linux tech support forum on IRC

11

Page 13: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

An initial model

Correlation clustering framework:I Classify each pair of utterances as “same thread” or

“different thread”I Partition the transcript to keep “same” utterances together

and split “different” ones apartI NP-hard, so we use heuristics

12

Page 14: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

An initial model

Correlation clustering framework:I Classify each pair of utterances as “same thread” or

“different thread”I Partition the transcript to keep “same” utterances together

and split “different” ones apartI NP-hard, so we use heuristics

12

Page 15: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Classifying utterances

Pair of utterances: same conversation or different?

Chat-based features (F 66%)I Time between utterancesI Same speakerI Speaker’s name mentioned

Discourse features (F 58%)

Word overlap (F 56%)

Combined model (F 71%)

13

Page 16: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Classifying utterances

Pair of utterances: same conversation or different?

Chat-based features (F 66%)

Discourse features (F 58%)I Questions, answers, greetings, etc.

Word overlap (F 56%)

Combined model (F 71%)

13

Page 17: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Classifying utterances

Pair of utterances: same conversation or different?

Chat-based features (F 66%)

Discourse features (F 58%)

Word overlap (F 56%)I Weighted by word probability in corpusI Simplistic coherence feature

Combined model (F 71%)

13

Page 18: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Classifying utterances

Pair of utterances: same conversation or different?

Chat-based features (F 66%)

Discourse features (F 58%)

Word overlap (F 56%)

Combined model (F 71%)

13

Page 19: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Assigning a single sentence

It’s easy to maximize the objective locally...I Even though the global problem is hard

AccuracySame as previous 56Corr. Clustering 76

14

Page 20: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Models from text and speech

Models which may apply here...I Initially designed for putting sentences in orderI Distinguish coherent sequence of utterances from

randomnessI Many different aspects of languageI Not all our own work.

15

Page 21: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Entity grid

Model of transitions from sentence to sentence(Lapata+Barzilay ‘05,Barzilay+Lapata ‘05):

Text Syntactic roleSuddenly a White Rabbit ran by her. subject

Alice heard the Rabbit say “I shall be late!” objectThe Rabbit took a watch out of its pocket. subjectAlice started to her feet. missing

16

Page 22: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Topical entity grid

Relationships between different words“a crow infected with West Nile...”“the outbreak was the first...”

Our own work.

I Represents words in a “semantic space”: LDA (Blei+al ‘01)I Entity-grid-like model of transitionsI “Semantics” can be noisy...

I More sensitive than the Entity Grid, but easy to fool!

17

Page 23: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

IBM Model 1

Single sentence of contextLearns word-to-word relationships directly

18

Page 24: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Pronouns

Detect passages with stranded pronouns:(Charniak+Elsner ‘09), (Elsner+Charniak ‘08)

19

Page 25: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Old vs new information

New information needs complex packaging“Secretary of State Hillary Clinton”

Old information doesn’t“Clinton”

Soft constraints: put the “new”-looking phrasefirst(Elsner+Charniak ‘08) following (Poesio+al ‘05)

Works well for news, poorly on speech and chatI Entities introduced in different ways

20

Page 26: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Old vs new information

New information needs complex packaging“Secretary of State Hillary Clinton”

Old information doesn’t“Clinton”

Soft constraints: put the “new”-looking phrasefirst(Elsner+Charniak ‘08) following (Poesio+al ‘05)

Works well for news, poorly on speech and chatI Entities introduced in different ways

20

Page 27: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Synthetic speech transcripts

21

Page 28: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Speech results

Chance 50Corr. Clustering 76EGrid 77Topical EGrid 78IBM-1 69Pronouns 52Time 58Combined 83

I Coherence approach outperforms previous

I Topical model is usefulI Pronouns very bad

Best models: sensitive, many-sentence context

22

Page 29: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Speech results

Chance 50Corr. Clustering 76EGrid 77Topical EGrid 78IBM-1 69Pronouns 52Time 58Combined 83

I Coherence approach outperforms previousI Topical model is useful

I Pronouns very bad

Best models: sensitive, many-sentence context

22

Page 30: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Speech results

Chance 50Corr. Clustering 76EGrid 77Topical EGrid 78IBM-1 69Pronouns 52Time 58Combined 83

I Coherence approach outperforms previousI Topical model is usefulI Pronouns very bad

Best models: sensitive, many-sentence context

22

Page 31: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Speech results

Chance 50Corr. Clustering 76EGrid 77Topical EGrid 78IBM-1 69Pronouns 52Time 58Combined 83

I Coherence approach outperforms previousI Topical model is usefulI Pronouns very bad

Best models: sensitive, many-sentence context

22

Page 32: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Pronominals in speech and text

Different usage patterns...Corpus Deictics Pronouns 3rd person pronounsWSJ .04 0.64 0.52Switchboard .12 1.18 0.39IRC .09 0.92 0.31

News models totally inadequate here...I Microtext also differs from speech pattern

23

Page 33: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

IRC results

Chat-specific 74

Corr. Clustering 76Chat+EGrid 79Chat+Topical EGrid 77Chat+IBM-1 76Chat+Pronouns 74

I Coherence still outperform previousI Lexical models not as good

I Lack of data: trained on phone conversations

I Pronouns same as before

24

Page 34: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

IRC results

Chat-specific 74Corr. Clustering 76

Chat+EGrid 79Chat+Topical EGrid 77Chat+IBM-1 76Chat+Pronouns 74

I Coherence still outperform previousI Lexical models not as good

I Lack of data: trained on phone conversations

I Pronouns same as before

24

Page 35: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

IRC results

Chat-specific 74Corr. Clustering 76Chat+EGrid 79

Chat+Topical EGrid 77Chat+IBM-1 76Chat+Pronouns 74

I Coherence still outperform previous

I Lexical models not as good

I Lack of data: trained on phone conversations

I Pronouns same as before

24

Page 36: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

IRC results

Chat-specific 74Corr. Clustering 76Chat+EGrid 79Chat+Topical EGrid 77Chat+IBM-1 76

Chat+Pronouns 74

I Coherence still outperform previousI Lexical models not as good

I Lack of data: trained on phone conversations

I Pronouns same as before

24

Page 37: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

IRC results

Chat-specific 74Corr. Clustering 76Chat+EGrid 79Chat+Topical EGrid 77Chat+IBM-1 76Chat+Pronouns 74

I Coherence still outperform previousI Lexical models not as good

I Lack of data: trained on phone conversationsI Pronouns same as before

24

Page 38: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

More data

800 utterances not enough for you?I Much larger corpora from (Martell+Adams ‘08)

I Using our annotation software and protocolI ∼ 20000 total utterances from three newsgroups

M+A corporaCorr. Clustering 89EGrid 93

25

Page 39: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Full-scale disentanglement

Unfortunately, scalability problems withadvanced models...

I So results only for simple model

Annotators

Best Baseline Corr. Clustering

Agreement 53

35 (Pause 35) 41

a

Best overall result: (Wang+Oard ‘09) 47

26

Page 40: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Full-scale disentanglement

Unfortunately, scalability problems withadvanced models...

I So results only for simple model

Annotators Best Baseline

Corr. Clustering

Agreement 53 35 (Pause 35)

41

a

Best overall result: (Wang+Oard ‘09) 47

26

Page 41: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Full-scale disentanglement

Unfortunately, scalability problems withadvanced models...

I So results only for simple model

Annotators Best Baseline Corr. ClusteringAgreement 53 35 (Pause 35) 41

a

Best overall result: (Wang+Oard ‘09) 47

26

Page 42: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Full-scale disentanglement

Unfortunately, scalability problems withadvanced models...

I So results only for simple model

Annotators Best Baseline Corr. ClusteringAgreement 53 35 (Pause 35) 41

a

Best overall result: (Wang+Oard ‘09) 47

26

Page 43: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

What we’ve learned

IRC chat is like speechI Turn-taking and floor controlI Models based on lexical/entity coherence

I But resource-poor (should be surmountable)

Real differences existI Floors more fluidI Referring behavior

I Full NPsI Pronominals

27

Page 44: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Conclusions

Chat disentanglementI Sophisticated models can help!I Still technical problems

I Scaling inference, building topic models...I Some real differences from speech

I Coreference is a new challenge

MicrotextI Interface determines communication behaviorI May vary from any previous mode of communication

I Important to consider before applying off-the-shelf models

28

Page 45: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Conclusions

Chat disentanglementI Sophisticated models can help!I Still technical problems

I Scaling inference, building topic models...I Some real differences from speech

I Coreference is a new challenge

MicrotextI Interface determines communication behaviorI May vary from any previous mode of communication

I Important to consider before applying off-the-shelf models

28

Page 46: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

Thanks!

Thanks to...I Eugene Charniak, Mark Johnson, Regina BarzilayI Former labmates at Brown UniversityI Google Fellowship for NLPI Craig Martell for NPS dataset

Corpus and software availablecs.brown.edu/∼ melsnerbitbucket.org/melsner/browncoherence

29

Page 47: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

One-to-one overlap

30

Page 48: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

One-to-one overlap

30

Page 49: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

One-to-one overlap

30

Page 50: Dialogue Structure in Microtext · Dialogue Structure in Microtext AAAI-11 Workshop on Analyzing Microtext Micha Elsner School of Informatics University of Edinburgh 8 August, 2011

One-to-one overlap

30