1st international workshop on computational approaches to … · 2019-08-08 · 14:30 –16:00...

45
1 st International Workshop on Computational Approaches to Historical Language Change LChange´19 Nina Tahmasebi, Lars Borin, Adam Jatowt, Yang Xu In conjunction with ACL2019 Florence, Italy, August 2, 2019 LChange'19 • ACL2019 • N. Tahmasebi

Upload: others

Post on 14-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

1st International Workshop on Computational Approaches to

Historical Language ChangeLChange´19

Nina Tahmasebi, Lars Borin, Adam Jatowt, Yang Xu

In conjunction with ACL2019

Florence, Italy, August 2, 2019LChange'19 • ACL2019 • N. Tahmasebi

Submission info

34 accepted

57 reviewers!

55 submissions

full papers; 18

short papers; 14

abstracts; 2

LChange'19 • ACL2019 • N. Tahmasebi

Dis

trib

uti

on

of

top

ics

application

compositional meaning

dialectal use

grammatical change

testset for historical emb. paces

lexical replacement

semchange

sound change

syntactic change

variant analysis

visualizationsword-order changes

LChange'19 • ACL2019 • N. Tahmasebi

Dat

a se

ts

Survey of Computational Approaches to Lexical Semantic Change,Tahmasebi, Borin & Jatowt, 2018, https://arxiv.org/abs/1811.06278LChange'19 • ACL2019 • N. Tahmasebi

Lan

guag

es w

ork

ed o

nChineseDanishFrisian

Indo-AryanNorwegian

Romanian

Russian

Swedish

Dutch

Icelandic

Italian

Portugeese

Latin

Spanish

French

German

English

LChange'19 • ACL2019 • N. Tahmasebi

20132008 201220102009 2011 2014 2015 2016 2017 2018

embeddings

dynamic embeddings

neural embeddings

topic models

word sense induction

LChange'19 • ACL2019 • N. Tahmasebi

2019

33 papers at Lchange’19!

3 more papers at ACL

Met

ho

ds

benroulli em with context features BoW

dynamic subword-incorporated embedding

(DSE)

dynamic topic models with genre

edit distance

Frequency

Gaussian processes

hierarchical clustering

Jaccard

LDA

LSA

LSTM

LSTM part-of-speech tagger

parallell sets technique

Random Forest

RNN

split-and-mergerSteamgraph/sankey diagrams

structural skip-gram WordEmbedding Networks (WEN)

CBOWFastText

logistic regression

PPMI

SVD

W2V

LChange'19 • ACL2019 • N. Tahmasebi

Met

ho

ds

neural emb, 26

62%

other methods,

1638%

LChange'19 • ACL2019 • N. Tahmasebi

Unique meeting

LChange'19 • ACL2019 • N. Tahmasebi

Let’s talk

LChange'19 • ACL2019 • N. Tahmasebi

Bring together our communities

Users

• Who need the end-result for

their work

Qualitative approach

• Empirical

• Corpus-based

• Theory-driven

• Smaller-scale

• High-quality

• Case-studies

Quantitative approach

• Empirical

• Corpus-based

• Larger-scale

• Lower accuracy

LChange'19 • ACL2019 • N. Tahmasebi

Future work:

• Evaluation (First keynote)

• Semantic change – sense-differentiated

• Methods for smaller data

• Moving forward: not reconstructing change but planning for future

• Connect computational methods with lingustic theory (Second keynote)

• Solve problems that exist

LChange'19 • ACL2019 • N. Tahmasebi

Survey of Computational Approaches to Lexical Semantic Change,Tahmasebi, Borin & Jatowt, 2018, https://arxiv.org/abs/1811.06278

LChange'19 • ACL2019 • N. Tahmasebi

2019 – 2022The Swedish Research Council

Wp1: Swedish word sense induction

Wp2: Semantic change

Wp3: Lexical replacements

Wp4: Applications

WP*: Evaluation• Integrated in all work packages

Sign up for our news-list to keep posted

Today’s schedule9:00 – 9:15 Introduction

9:15 – 10:30 Haim Dubossarsky: Semantic Change in

the Time of Machine Learning, Doing it Right! (Keynote)

(Chair: Lea Frermann)

10:30 – 10:45 Coffee Break

10:45 – 12:30 Session 2 (Chair: Richard Johansson)

12:30 – 13:30 Lunch Break

13:30 – 14:30 Claire Bowern: Semantic Change and

Semantic Stability: Variation is Key (Keynote)

(Chair: Dominik Schlechtweg)

14:30 – 16:00 Poster Session with coffee

16:00 – 16:40 Session 5 (Chair: Simon Hengchen)

16:40 – 17:15 Discussion and Closing

Test your presentations 15 mins before your session!

Don’t forget to send them to us!

LChange'19 • ACL2019 • N. Tahmasebi

Lexical Semantic Change from a computational perspective

LChange'19 • ACL2019 • N. Tahmasebi

time

Spelling change

Teutschland Deutschland

LChange'19 • ACL2019 • N. Tahmasebi

Lexical replacement:Named entity change

time

St. Petersburg St. Petersburg

Petrograd

Leningrad

LChange'19 • ACL2019 • N. Tahmasebi

LChange'19 • ACL2019 • N. Tahmasebi

time

Lexical replacement:

Petrograd St. Petersburg

felitious happy

LChange'19 • ACL2019 • N. Tahmasebi

awesomeHe was an

awesome leader!

He was an

awesome leader!

time

LChange'19 • ACL2019 • N. Tahmasebi

time

What is the problem?

St. Petersburg

Petrograd

Finding

LChange'19 • ACL2019 • N. Tahmasebi

What is the problem?

St. Petersburg

Finding Interpreting

time

Petrograd

LChange'19 • ACL2019 • N. Tahmasebi

girl

criminal

Wolf ‘varg’

LChange'19 • ACL2019 • N. Tahmasebi

Evaluation

LChange'19 • ACL2019 • N. Tahmasebi

collective text

individual

individual text

signaltopic, cluster, vector…

signal change

Evaluation

LChange'19 • ACL2019 • N. Tahmasebi

LSC – individually trained embedding spaces

Single-point embedding space

ti

multiple time points

Track an individual word w over time

Change point/degree

detection

align

Vector space image: Nieto Pina and

Johansson, RANLP’15LChange'19 • ACL2019 • N. Tahmasebi

signal change

Evaluationminimum optimum medium

collective text

LChange'19 • ACL2019 • N. Tahmasebi

Evaluation

individual

individual text

signal change

minimum optimum medium

LChange'19 • ACL2019 • N. Tahmasebi

Evaluation

individual text

signaltopic, cluster, vector…

minimum optimum medium

collective text

LChange'19 • ACL2019 • N. Tahmasebi

Evaluation

individual

individual text

signaltopic, cluster, vector…

signal change

minimum optimum medium

collective text

LChange'19 • ACL2019 • N. Tahmasebi

Evaluation

signal change

minimum optimum medium

LChange'19 • ACL2019 • N. Tahmasebi

Currently, we do noneof the previous

Pre-determined list ofPositive examplesNegative examplesPairs

Evaluation

Controlled data

Top/bottomresults

3ways

LChange'19 • ACL2019 • N. Tahmasebi

Unsupervised Lexical Semantic Change Detection

English LatinGerman Swedish

https://languagechange.org/[email protected]

Dates:Trial data July 31, 2019Training data Sept. 4, 2019Test data Dec. 3, 2019Evaluation Jan. 10, 2020Evaluation Jan. 31, 2020Paper subm. Febr. 23, 2020

Dominik SchlechtwegUniversity of Stuttgart

Barbara McGillivrayThe Alan Turing Institute / University of Cambridge

Simon HengchenUniversity of Helsinki

Haim DubossarskyUniversity of Cambridge

Nina TahmasebiUniversity of Gothenburg

SemEval 2020 Task 1:

Singal and Signal change

signaltopic, cluster, vector…

signal change

LChange'19 • ACL2019 • N. Tahmasebi

Interpretable change signal

Find changes automatically:find what changes, how it

changed and when it changedStone

Music

Lifestyle

Rock

LChange'19 • ACL2019 • N. Tahmasebi

Change type

Novel related ws

Novel unrelated ws

Broadening

Join

Narrowing

Split

Death

Novel word sense

Novel word

Change

Single-senseSense-differentiated

LChange'19 • ACL2019 • N. Tahmasebi

Return to individual instances:

Given a word in a document at time t

a a

1Mark words that are likely to have a changed meaning

2Find all changes to the word

LChange'19 • ACL2019 • N. Tahmasebi

Who is the end-user?

LChange'19 • ACL2019 • N. Tahmasebi

What is the problem?

St. Petersburg

Finding Interpreting

time

Petrograd

LChange'19 • ACL2019 • N. Tahmasebi

LChange'19 • ACL2019 • N. Tahmasebi

Study interlinked changesNot cases in isolation

Solve problems that exist!

Our second keynote:Claire Bowern

LChange'19 • ACL2019 • N. Tahmasebi

Not only reconstruct changebut moving forward:

• Construct archives, store data to avoid problems in the future?

• Temporal IR?

LChange'19 • ACL2019 • N. Tahmasebi

Open challenges

• Sense-differentiated embeddings

• Methods for smaller-scale data

• More languages

• Methods for testing & evaluation

• Evaluating usefulness for follow-up analysis

• What is computational meaning and how does it affect change detection?

Stone

Music

Lifestyle

Rock

LChange'19 • ACL2019 • N. Tahmasebi

LChange'19 • ACL2019 • N. Tahmasebi

Thank you for attending!www.languagechange.org