heterogeneous supervision for relation extraction: a ...mintz et al. “distant supervision for...

54
Heterogeneous Supervision for Relation Extraction: A Representation Learning Approach Liyuan Liu , Xiang Ren , Qi Zhu , Huan Gui , Shi Zhi , Heng Ji and Jiawei Han University of Illinois at Urbana-Champaign, Urbana, IL, USA Computer Science Department, Rensselaer Polytechnic Institute, USA

Upload: others

Post on 04-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

HeterogeneousSupervisionforRelationExtraction:ARepresentationLearningApproach

Liyuan Liu❖,XiangRen❖,QiZhu❖, Huan Gui❖,ShiZhi❖,Heng Ji◇ andJiaweiHan❖

❖UniversityofIllinoisatUrbana-Champaign,Urbana, IL,USA◇ComputerScienceDepartment,RensselaerPolytechnic Institute,USA

Page 2: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Relation Extraction

• Goal: acquirestructuredknowledgefromunstructuredtext

2

“HusseinwasborninAmmanon14November1935.”

Hussein

Amman 14 Nov1935

born-in born-on

Page 3: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Relation Extraction

• Formal Definition:• Sentence-level relation extraction:

• Classify a relationmention into a set of relation types of interest orNot-Target-Type (None)

3

Hussein wasborn inAmmanon14November1935.

Hussein Amman

entity pair

sentence / context

Born-in

President-of

Died-in

Parents-of

None

Page 4: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Relation Extraction

• Formal Definition:• Sentence-level relation extraction:

• Classify a relationmention into a set of relation types of interest orNot-Target-Type (None)

4

Hussein wasborn inAmmanon14November1935.

Hussein Amman

entity pair

sentence / context

Born-in

President-of

Died-in

Parents-of

Nonemulti-class classification

Page 5: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Related Work

5

• Supervised Learning:

• Multi-class classification

Page 6: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Related Work

6

• Supervised Learning:

• Multi-class classification

Dataset with human annotation is the bottleneck

Limited, might even notexisted for many domains

Hard to get, and costly

Slow, and sometimes outdated

……

Page 7: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Related Work

7

• Bootstrap learning:

• Start with a set of seed patterns / annotations, iteratively generate more

• Suffers from semantic shift

Mintzetal.“Distantsupervisionforrelationextractionwithoutlabeleddata”,ACL2009

Page 8: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Related Work

8

• Distant Supervision:

• Automatically generate annotationsby Knowledge Base

Mintzetal.“Distantsupervisionforrelationextractionwithoutlabeleddata”,ACL2009

Page 9: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Related Work

9

• Distant Supervision:

• Automatically generate annotationsby Knowledge Base• ("Obama","USA",ObamawasborninHonolulu,Hawaii,USAashehasalwayssaid)

• Born-in (correct)• President-of (wrong).

Page 10: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Related Work

10

• Distant Supervision:

• Automatically generate annotationsby Knowledge Base

Distant supervision only encodes KB, while we have more than KB

Page 11: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision

• Provide a general framework to encode knowledge for supervision:• Knowledge Base, domain-specific patterns, ……

• Labelling functions:

11

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Domain-specificPatterns

KnowledgeBase

Ratner,AlexanderJ.,etal."Dataprogramming:Creatinglargetrainingsets,quickly."NIPS.2016.

Page 12: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision & Distant Supervision:

• Heterogeneous Supervision is an extension of Distant Supervision:• Both encode external informationand providesupervision,• Heterogeneous Supervision can encodemore.

12

Information typeKBP NYT

# of Relation Types # of Relation Mentions # of Relation Types # of Relation Mentions

Knowledge Base 7 133955 25 530767

Domain-specific Patterns 13 225977 16 43820

Table1. Statistic of Heterogeneous Supervision

Page 13: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision & Distant Supervision:

• Heterogeneous Supervision is an extension of Distant Supervision:• Both encode external informationand providesupervision,• Heterogeneous Supervision can encodemore.

13

Information typeKBP NYT

# of Relation Types # of Relation Mentions # of Relation Types # of Relation Mentions

Knowledge Base 7 133955 25 530767

Domain-specific Patterns 13 225977 16 43820

Table1. Statistic of Heterogeneous Supervision

Page 14: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1 c3c2

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1c3 c2

Challenges

• Relation Extraction• Resolve Conflicts among Heterogeneous Supervision

14

Page 15: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

ReHession• Heterogeneous Supervision

• Our Solution: A Representation Learning Approach

• RelationMentionRepresentation

• True Label Discovery component

• Relation Extraction component

• Experiments

15

Page 16: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• Most simple way: majority voting

16

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1 c3c2

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1c3 c2

Page 17: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• How to resolve conflicts among Heterogeneous Supervision?• Works for C3 and C2, but not work for C1

17

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1 c3c2

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1c3 c2

Page 18: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• For more complicated models, several principles have been proposed:• Truth Discovery:

• Some sources (labeling functions)would bemore reliable than others

• Refer the reliability of different sources and the true label at the same time

• Source ConsistencyAssumption: a sourceislikelytoprovidetrueinformationwith thesameprobabilityforallinstances.

18Li,Yaliang,etal."Asurveyon truthdiscovery."AcmSigkddExplorationsNewsletter17.2(2016):1-16.

Page 19: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• For more complicated models, several principles have been proposed:• Truth Discovery:

• May not fit our scenario very well

19

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1 c3c2

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1e2c2

c3

c1e1

e1

e2

e2

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2�1

�2

�4

�3

c1c3 c2

Page 20: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• For more complicated models, several principles have been proposed:• Truth Discovery:

• These models are context-agnostic,while context is important for Relation Extraction

20

Page 21: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• For more complicated models, several principles have been proposed:• Truth Discovery:

• Distant Supervision:

• Partial-label associationhas been proposed to resolve conflicts amongDistantSupervision, and proved to be effective.

𝑙(𝑧,𝑂&) = max{0, 1 − max1∈34

𝜙(𝑧, 𝑟) − max17∉34

𝜙(𝑧, 𝑟7) }

21

Most likely positive relation type Most likely negative relation type

XiangRen, et al. “CoType: JointExtractionofTypedEntitiesandRelationswithKnowledgeBases”,WWW,2017.

Page 22: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• For more complicated models, several principles have been proposed:• Truth Discovery:

• Distant Supervision:

• Partial-label associationhas been proposed to resolve conflicts amongDistantSupervision, and proved to be effective.

𝑙(𝑧,𝑂&) = max{0, 1 − max1∈34

𝜙(𝑧, 𝑟) − max17∉34

𝜙(𝑧, 𝑟7) }

• For Distant Supervision, all annotations come from KnowledgeBase.• For Heterogeneous Supervision, annotations are fromdifferent sources, and some couldbe more reliable than others.

22

Page 23: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Conflicts among Heterogeneous Supervision

• To fit our problem, we introduce context awareness to truth discovery,and modified the assumption:

• A sourceislikelytoprovidetrueinformationwith thesameprobabilityforinstanceswith similar context.

23

Page 24: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision

• To fit our assumption, we add one constraint to labeling functions:• each labeling function can annotate only one relation type based on onesource of information

• Reasons:• Different information sources often have different reliabilities

• Some sources annotate different relation types without consistency• KB-based labeling functionmayhave higher recall on ‘president-of’ than ‘born-in’

24

Page 25: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision

• To fit our assumption, we add one constraint to labeling functions:• each labeling function can annotate only one relation type based on onesource of information

25

return r for<e1,e2,s> ifr (e1,e2)inKB

Page 26: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision

• To fit our assumption, we add one constraint to labeling functions:• each labeling function can annotate only one relation type based on onesource of information

26

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Domain-specificPatterns

KnowledgeBase

Page 27: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

ReHession• Heterogeneous Supervision

• Our Solution: A Representation Learning Approach

• RelationMentionRepresentation

• True Label Discovery component

• Relation Extraction component

• Experiments

27

Page 28: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision for Relation Extraction

• Relation Extraction:

• Matching contextwith proper relation type

• Heterogeneous Supervision:

• Refer true labels in a context-aware manner

28

Page 29: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Heterogeneous Supervision for Relation Extraction

• Relation Extraction:

• Matching contextwith proper relation type

• Heterogeneous Supervision:

• Refer true labels in a context-aware manner

29

context

Page 30: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

A Representation Learning Approach

30

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ). Truth Discovery View

Relation Extraction View

infer `true' label

training RelationExtraction model

Interact through context representation

representation of proficient subsetrepresentation of relation mention representation of relation type born_in

representation of relation type died_in

D e1e2c2

c3

c1e1

e1

e2

e2

‘true’

�1

�2

�4

�3

c1c3 c2

Labeling Functions

Relation Mentions

Vector Space

training TruthDiscovery model

proficient subset

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Page 31: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

A Representation Learning Approach

31

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ). Truth Discovery View

Relation Extraction View

infer `true' label

training RelationExtraction model

Interact through context representation

representation of proficient subsetrepresentation of relation mention representation of relation type born_in

representation of relation type died_in

D e1e2c2

c3

c1e1

e1

e2

e2

‘true’

�1

�2

�4

�3

c1c3 c2

Labeling Functions

Relation Mentions

Vector Space

training TruthDiscovery model

proficient subset

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Heterogeneous Supervision generation

RelationMention Representation

Relation Extraction

True Label Discovery

Page 32: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

A Representation Learning Approach

32

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ). Truth Discovery View

Relation Extraction View

infer `true' label

training RelationExtraction model

Interact through context representation

representation of proficient subsetrepresentation of relation mention representation of relation type born_in

representation of relation type died_in

D e1e2c2

c3

c1e1

e1

e2

e2

‘true’

�1

�2

�4

�3

c1c3 c2

Labeling Functions

Relation Mentions

Vector Space

training TruthDiscovery model

proficient subset

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Heterogeneous Supervision generation

RelationMention Representation

Relation Extraction

True Label Discovery

Page 33: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Relation Mention Representation

• Text Feature Extraction

• Text Feature Representation

• Relation Mention Representation

33

HEAD_EM1_HusseinTKN_EM1_Hussein

bornHEAD_EM2_Amman

……

Text Feature Extraction

Text Feature Representation

Mapping from Text Embedding to Relation Mention Embedding:

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1 e2c2

c1e1 e2

Hussein ( ) was born in Amman ( ) on 14 November 1935.c3 e1 e2

tanh(W · 1

|fc1 |X

fi2fc1

vi)vi 2 Rnv

zc 2 Rnz

Page 34: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Text Feature Extraction

We adopted texture features, POS-tagging and brown clustering to extract features

34

C3: Hussein was born in Ammanon 14 November 1935

HEAD_EM1_HusseinTKN_EM1_Hussein

bornHEAD_EM2_Amman

……

Text Feature Extraction

Text Feature Representation

Mapping from Text Embedding to Relation Mention Embedding:

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Gofraid ( ) died in 989, said to be killed in Dal Riata ( ).

D e1 e2c2

c1e1 e2

Hussein ( ) was born in Amman ( ) on 14 November 1935.c3 e1 e2

tanh(W · 1

|fc1 |X

fi2fc1

vi)vi 2 Rnv

zc 2 Rnz

Page 35: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Text Feature Representation

• Leverage features’ co-occurrence information to learn the representation , andhelp the model generalize better.

• Loss function of this part:

35

co-occurrence here refers to featuresoccur in the same relation mentioninstead of the same shifting window

Negative sampling

Featureembeddingfor feature 𝑓;

Featureset of 𝑐

Page 36: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Relation Mention Representation

• Here, we adopted the bag-of-features assumption, and add transformationweights to allow representation of relationmention and features to be indifferent semantic space.

36

Page 37: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

A Representation Learning Approach

37

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ). Truth Discovery View

Relation Extraction View

infer `true' label

training RelationExtraction model

Interact through context representation

representation of proficient subsetrepresentation of relation mention representation of relation type born_in

representation of relation type died_in

D e1e2c2

c3

c1e1

e1

e2

e2

‘true’

�1

�2

�4

�3

c1c3 c2

Labeling Functions

Relation Mentions

Vector Space

training TruthDiscovery model

proficient subset

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Heterogeneous Supervision generation

RelationMention Representation

Relation Extraction

True Label Discovery

Page 38: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• Assume:• A labeling function would annotate similar instances with the same reliability

38

Context Information: 𝐳

Page 39: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• Assume:• A labeling function would annotate similar instances with the same reliability

39

Context Information: 𝐳

foreachlabeling function, thereexistsanproficient subset,containing instancesthat it canpreciselyannotate.

Page 40: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• How to decide which label is correct?

• Probabilitymodel and maximum likelihoodestimate

40

Corresponding to ourassumption and setting Identify the true label

Page 41: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• Probability Model:• Describing the generationof Heterogeneous Supervision?• Different from crowdsourcing.E.g., ONE worker may annotate:

• ("Obama","USA",ObamawasborninHonolulu,Hawaii,USAashehasalwayssaid)• Born-in• President-of• Citizen-of• …

41

Cautious Worker

Careless Worker

Exists some randomness

Page 42: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• Probability Model:• Describing the generationof Heterogeneous Supervision?• Different from crowdsourcing.E.g., ONE worker may annotate:

• ("Obama","USA",ObamawasborninHonolulu,Hawaii,USAashehasalwayssaid)• Born-in• President-of• Citizen-of• …

• But One labeling function can only annotate One relation type:• Randomness exists in the correctness, not in the choice of relation type

42

Page 43: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• Describing the correctness of Heterogeneous Supervision

43

zc|C|

|⇤|lisc,i⇢c,i

|O|

𝜌?,; = 𝛿(𝑜?,; == 𝑜?∗)

underlying true label

correctness of annotation 𝑜?,;

observed annotation

Representation of relation mention

Representation of labeling function

whether c belongs to the proficient subset of 𝐥𝐢

Page 44: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

True label discovery

• Describing the correctness of Heterogeneous Supervision

• 𝑝 𝜌?,; = 1 = 𝑝 𝜌?,; = 1 𝑠?,; = 1 ∗ 𝑝 𝑠?,; = 1 + 𝑝 𝜌?,; = 1 𝑠?,; = 0 ∗ 𝑝 𝑠?,; = 0

• 𝑝 𝑠?,; = 1 = 𝜎(𝐥;I ∗ 𝐳?)

44

zc|C|

|⇤|lisc,i⇢c,i

|O|

Page 45: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

A Representation Learning Approach

45

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ). Truth Discovery View

Relation Extraction View

infer `true' label

training RelationExtraction model

Interact through context representation

representation of proficient subsetrepresentation of relation mention representation of relation type born_in

representation of relation type died_in

D e1e2c2

c3

c1e1

e1

e2

e2

‘true’

�1

�2

�4

�3

c1c3 c2

Labeling Functions

Relation Mentions

Vector Space

training TruthDiscovery model

proficient subset

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Heterogeneous Supervision generation

RelationMention Representation

Relation Extraction

True Label Discovery

Page 46: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Relation Extraction

• Adopts soft-max as the relation extractor:

• Loss function: KL-Divergence:

46

Page 47: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

A Representation Learning Approach

47

Robert Newton "Bob" Ford was an American outlaw best known for killing his gang leader Jesse James ( ) in Missouri ( )

Hussein ( ) was born in Amman ( ) on 14 November 1935.Gofraid ( ) died in 989, said to be killed in Dal Riata ( ). Truth Discovery View

Relation Extraction View

infer `true' label

training RelationExtraction model

Interact through context representation

representation of proficient subsetrepresentation of relation mention representation of relation type born_in

representation of relation type died_in

D e1e2c2

c3

c1e1

e1

e2

e2

‘true’

�1

�2

�4

�3

c1c3 c2

Labeling Functions

Relation Mentions

Vector Space

training TruthDiscovery model

proficient subset

return died_in for < , , s> if DiedIn( , ) in KBreturn born_in for < , , s> if match(‘ * born in * ’, s)

return died_in for < , , s> if match(‘ * killed in * ’, s)

return born_in for < , , s> if BornIn( , ) in KB

⇤ e1 e2�4

�2 e1 e2 e1 e2e1 e2�3

�1 e1 e2 e1 e2

Heterogeneous Supervision generation

RelationMention Representation

Relation Extraction

True Label Discovery

Page 48: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Model Learning

• Joint optimize three components

48

Page 49: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

ReHession• Heterogeneous Supervision

• Our Solution: A Representation Learning Approach

• RelationMentionRepresentation

• True Label Discovery component

• Relation Extraction component

• Experiments

49

Page 50: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Experiments

• 1. Relation extraction (with None) and Relation classification (withoutNone):• NL: train relation extractor with all annotations• TD: train relation extractor with `true’ label inferred by Investment (comparedtrue label discoverymodel)

50

Page 51: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Experiments

51

Page 52: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Experiments

• 2. Effectiveness of proposed true label discovery component:• Ori: with proposed context-aware true label discovery component• LD: with Investment(compared true label discovery model)

52

Page 53: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Case Study

53

Page 54: Heterogeneous Supervision for Relation Extraction: A ...Mintz et al. “Distant supervision for relation extraction without labeled data”, ACL 2009. Related Work 8 • Distant Supervision:

Thank YouQ & A

54