table-to-text generation by structure-aware seq2seq learning · neural text generation from...

27
| Table-to-text Generation by Structure-aware Seq2seq Learning 1 Authors: Tianyu Liu, Kexiang Wang, Lei Sha, Baobao Chang and Zhifang Sui Affiliation: Key Laboratory of Computational Linguistics(ICL),Peking University, Beijing, China Speaker: Tianyu Liu E-mail: [email protected]

Upload: others

Post on 03-Oct-2020

13 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Table-to-text Generation by Structure-aware Seq2seq Learning

1

Authors:Tianyu Liu,KexiangWang,LeiSha,BaobaoChangandZhifang SuiAffiliation:KeyLaboratoryofComputationalLinguistics(ICL),PekingUniversity,Beijing,ChinaSpeaker:Tianyu LiuE-mail:[email protected]

Page 2: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Knowledge Modeling

2

Machine ReadingUnstructured text Structured data

MachineWritingStructured data Unstructured text

Page 3: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

| 3

Knowledge Modeling

Machine ReadingUnstructured text Structured data

MachineWritingStructured data Unstructured text

Page 4: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Data-to-text

4

WisemanS,Shieber SM,RushAM.Challengesindata-to-documentgeneration EMNLP 2017

Page 5: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Data-to-text

5

MeiH,BansalM,WalterMR.Whattotalkaboutandhow?selectivegenerationusinglstms withcoarse-to-finealignment NAACL2016.

Page 6: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Data-to-text

6

DušekO,Jurčíček F. Sequence-to-sequence generationforspoken dialogueviadeepsyntaxtreesandstrings. ACL 2016.

Page 7: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Data-to-text

7

GeorgeMikell (born Jurgis Mikelaitis;4April1929)isaLithuanian-Australianactorandwriterbestknown forhisperformances in TheGunsofNavarone (1961)and TheGreatEscape (1963).

Our work !

Page 8: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

How to represent a table ?

8

1. Key-value map(Name, George Mikell); (Nationality, Lithuanian Australian); (Years Active, 1957--present) ….

Page 9: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

How to represent a table ?

9

1. Key-value map

2. Triple(George Mikell, FullName, George Mikell);(George Mikell, Nationality, Lithuanian Australian);(George Mikell, Known for, The Guns of Navarone); (The Guns of Navarone, Directed by, J.Lee Thompson) ….

(Name, George Mikell); (Nationality, Lithuanian Australian); (Years Active, 1957--present) ….

Page 10: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

How to represent a table ?

10

1. Key-value map

2. Triple(George Mikell, FullName, George Mikell);(George Mikell, Nationality, Lithuanian Australian);(George Mikell, Known for, The Guns of Navarone); (The Guns of Navarone, Directed by, J.Lee Thompson) ….

(Name, George Mikell); (Nationality, Lithuanian Australian); (Years Active, 1957--present) ….

3. Multi-element Tuple

Page 11: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

How to represent a table ?

11

1. Key-value map

2. Triple(George Mikell, FullName, George Mikell);(George Mikell, Nationality, Lithuanian Australian);(George Mikell, Known for, The Guns of Navarone); (The Guns of Navarone, Directed by, J.Lee Thompson) ….

(Name, George Mikell); (Nationality, Lithuanian Australian); (Years Active, 1957--present) ….

3. Multi-element Tuple

Page 12: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|12

Table Encoder

ℎ#

George Mikell

(name,1,2)

(name,2,1)

ℎ$ ℎ%

Jurgis

(birthname,1,2)

……

……

The

ℎ$&

Great

ℎ$' ℎ%(

Escape

(known for,5,3)

(known for,6,2)

(known for,7,1)

Field-gating Tableencoder

𝑧# 𝑧$ 𝑧% 𝑧$& 𝑧$' 𝑧%(

𝑠%

George Mikell is

𝑠+ 𝑠,

an Australian<sos>

George Mikell is an Australian actor

𝑠-𝑠$𝑠#

Description Decoder

Word

Field & Pos

Page 13: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|13

(name, George Mikell)Word embedding + Structure embedding

Table Encoder

Page 14: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|14

(name, George Mikell)Word embedding + Field embedding

Table Encoder

Page 15: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|15

(name, George Mikell)Structure embedding (Field type embedding + Position embedding)

Table Encoder

Page 16: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|16

(name, George Mikell)

Lebret R,Grangier D,Auli M.Neuraltextgenerationfromstructureddatawithapplicationtothebiographydomain. EMNLP 2016

Table EncoderStructure embedding (Field type embedding + Position embedding)

Page 17: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|17

Field-gating Encoder

ℎ#

George Mikell

(name,1,2)

(name,2,1)

ℎ$ ℎ%

Jurgis

(birthname,1,2)

……

……

The

ℎ$&

Great

ℎ$' ℎ%(

Escape

(known for,5,3)

(known for,6,2)

(known for,7,1)

Field-gating Tableencoder

𝑧# 𝑧$ 𝑧% 𝑧$& 𝑧$' 𝑧%(

Vanilla LSTM

Undate cell state

Page 18: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Human-like Generation

18

GeorgeMikell (born Jurgis Mikelaitis;4April1929)isaLithuanian-Australianactorandwriterbestknown forhisperformances in The Guns ofNavarone (1961)and The Great Escape(1963).

name GeorgeMikell

birthname Jurgis Mikelaitis

birthdate 4April1929 (age 88)

birthplace Bildeniai, Lithuania

nationality Lithuanian,Australian

occupation Actor,writer

years active 1957–present

known for The Guns of NavaroneThe Great Escape

Page 19: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Human-like Generation

19

GeorgeMikell (born Jurgis Mikelaitis;4April1929)isaLithuanian-Australianactorandwriterbestknown forhisperformances in The Guns ofNavarone (1961)and The Great Escape(1963).

name GeorgeMikell

birthname Jurgis Mikelaitis

birthdate 4April1929 (age 88)

birthplace Bildeniai, Lithuania

nationality Lithuanian,Australian

occupation Actor,writer

years active 1957–present

known for The Guns of NavaroneThe Great Escape

Page 20: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Human-like Generation

20

GeorgeMikell (born Jurgis Mikelaitis;4April1929)isaLithuanian-Australian actorandwriterbestknown forhisperformances in The Guns ofNavarone (1961)and The Great Escape(1963).

name GeorgeMikell

birthname Jurgis Mikelaitis

birthdate 4April1929 (age 88)

birthplace Bildeniai, Lithuania

nationality Lithuanian,Australian

occupation Actor,writer

years active 1957–present

known for The Guns of NavaroneThe Great Escape

Page 21: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Human-like Generation

21

GeorgeMikell (born Jurgis Mikelaitis;4April1929)isaLithuanian-Australianactorandwriterbestknown forhisperformances in The Guns ofNavarone (1961)and The Great Escape(1963).

name GeorgeMikell

birthname Jurgis Mikelaitis

birthdate 4April1929 (age 88)

birthplace Bildeniai, Lithuania

nationality Lithuanian,Australian

occupation Actor,writer

years active 1957–present

known for The Guns of NavaroneThe Great Escape

Page 22: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|22

Word-level & Field-level Dual Attention

ℎ#

George Mikell

(name,1,2)

(name,2,1)

ℎ$ ℎ%

Jurgis

(birthname,1,2)

……

……

The

ℎ$&

Great

ℎ$' ℎ%(

Escape

(known for,5,3)

(known for,6,2)

(known for,7,1)

𝑠%

George Mikell is

𝑠+ 𝑠,

an Australian<sos>

George Mikell is an Australian actor

𝑠,

……⊙

⊙ ……

……

……

⨁Embeddingfor ‘Australian’

dual attentionvector

Word level attention

Field level attention

⊗Dual attention

Input for 𝑠,

Field-gating Tableencoder

𝑠-𝑠$𝑠#

𝑧# 𝑧$ 𝑧% 𝑧$& 𝑧$' 𝑧%(

Description Decoder

Word-level Attention:

Field-level Attention:

Dual Attention:

Context Vector:

Page 23: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|23

Case study

Page 24: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Dataset: WIKIBIO

24

WIKIBIO (Lebret,Grangier,andAuli (2016)) contains728,321 articlesfromEnglishWikipedia(Sep2015).Thedatasetusesthefirstsentenceofeacharticleasthedescriptionoftherelated infobox.

GeorgeMikell (born JurgisMikelaitis;4April1929)isaLithuanian-Australian actorandwriterbestknown forhisperformancesin The Guns ofNavarone (1961)and The Great Escape(1963).

name GeorgeMikell

birthname Jurgis Mikelaitis

birthdate 4April1929 (age 88)

birthplace Bildeniai, Lithuania

nationality Lithuanian, Australian

occupation Actor,writer

years active 1957–present

known for The Guns of NavaroneThe Great Escape

Infobox Description

Lebret R,Grangier D,Auli M.Neuraltextgenerationfromstructureddatawithapplicationtothebiographydomain. EMNLP 2016

Page 25: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Experiments

25

Statistical Baselines:KN & Template KN:TheKneser-Ney(KN)model isawidelyusedlanguagemodel.TemplateKNisaKNmodelovertemplateswhichalsoservesasabaselinein[Lebret etal.,2016]

NLM& Table NLM:NLMisanaivestatisticallan-guage modelproposedbyLebret etal.2016.TableNLM includeslocalandglobalconditioning overthetables

Neural Network Baseline:Seq2seq: vanillaseq2seqneuralarchitectureisalsoprovidedasastrongbaselinewhichusestheconcatenationofwordembedding, fieldembedding andpositionembedding asthemodelinput.

Page 26: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|

Does the order matter?

26

name GeorgeMikell

birthname Jurgis Mikelaitis

birthdate 4April1929 (age 88)

birthplace Bildeniai, Lithuania

nationality Lithuanian, Australian

occupation Actor,writer

years active 1957–present

known for The Guns of NavaroneThe Great Escape

years active 1957–present

occupation Actor,writer

known for The Guns of NavaroneThe Great Escape

birthplace Bildeniai, Lithuania

nationality Lithuanian, Australian

birthname Jurgis Mikelaitis

name GeorgeMikell

birthdate 4April1929 (age 88)

Shuffle

Page 27: Table-to-text Generation by Structure-aware Seq2seq Learning · Neural text generation from structured data with application to the biography domain. EMNLP2016 | Experiments 25 Statistical

|27

Thanks.