question answering over freebase with multi-column convolutional neural networks · 2017. 9....

24
Question Answering over Freebase with Multi-Column Convolutional Neural Networks Li Dong 1 , Furu Wei 2 , Ming Zhou 2 , Ke Xu 1 1 SKLSDE, Beihang University, Beijing, China 2 Microsoft Research, Beijing, China

Upload: others

Post on 08-Feb-2021

9 views

Category:

Documents


0 download

TRANSCRIPT

  • Question Answering over Freebase with Multi-Column Convolutional Neural Networks

    Li Dong1, Furu Wei2, Ming Zhou2, Ke Xu11SKLSDE, Beihang University, Beijing, China

    2Microsoft Research, Beijing, China

  • Freebase Large-scale knowledge base A rich resource to answer open-domain questions

    Challenge natural language questions ~ structured semantics of Freebase How to bridge the gap?

    Question Answering over Freebase

    when did Avatar release in UK 2009-12-17Question: Answer:

  • Semantic parsing (Berant et al., 2013; Bao et al., 2014; etc.) Question Formal Meaning Representation Structured Queries Answer Example

    Utterance: Which college did Obama go to Logical form: (and (Type University) (Education BarackObama)) Denotation: Occidental College, Columbia University

    Challenges Huge search space Lexical triggers

    Mainstream Methods (1/2)

    Example is borrowed from the website of SEMPRE

  • Information extraction over knowledge base 1. Retrieve candidate answers from Freebase 2. Extract features 3. Classification / Ranking

    Mainstream Methods (2/2)

    Question

    Candidate Answers

    Features

    Classifier

    Correct Answer

    (Yao and Van Durme, 2014)

    Question

    Candidate Answers

    Ranking score

    Correct Answer

    (Bordes et al., 2014a; 2014b)

    Sum of Word Embeddings

    Candidate Embedding

  • Question answering -> Constraint matching Answer type, answer path (relation), answer context

    Question understanding with convolutional neural networks

    Proposed Method

    Question

    Candidate Answers

    Type

    Multi-columnConvolutional Neural Networks

    Relation Context Type Relation Context

    Matching Score

    Ranker Answer

  • when did Avatar release in UK

    Convolutional Layer

    Max-Pooling Layer

    Shared Word Representations

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_by

    type.object.type

    people.person

    film.producer

    type.object.typem.09w09jk

    film.film.release_date_s

    type.object.typefilm.film_regional_release_date

    United Statesof Americam.09c7w0

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-18

    datetime

    value_type

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Score Layer

    Score

    + +Dot Product

    Answer Path

    Answer Context

    Answer Type

    Model Overview

  • when did Avatar release in UK

    Convolutional Layer

    Max-Pooling Layer

    Shared Word Representations

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_by

    type.object.type

    people.person

    film.producer

    type.object.typem.09w09jk

    film.film.release_date_s

    type.object.typefilm.film_regional_release_date

    United Statesof Americam.09c7w0

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-18

    datetime

    value_type

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Score Layer

    Score

    + +Dot Product

    Answer Path

    Answer Context

    Answer Type

    Model Overview

    Avatar.0bth54

    cted_byfilm.film.release

    _date_s

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Answer Context

    Answer Type

    Answer Path

    Candidate

  • Learn vector representations for candidate answers (Bordes et al., 2014a; Bordes et al., 2014b)

    Answer path relations between the candidate node and the entity asked in

    question 𝑎𝑎𝑎𝑎𝑎𝑎 𝒓𝒓𝟏𝟏, 𝒓𝒓𝟐𝟐, … ,𝒓𝒓𝒏𝒏 : average of relation embeddings

    Answer contextAnswer type

    Embedding Candidate Answers

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_byfilm.film.release

    _date_s

    film.film region

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Answer Path

    Answer Context

    Answer Type

    Candidate

    Asked entity

  • Learn vector representations for candidate answers (Bordes et al., 2014a; Bordes et al., 2014b)

    Answer context 1-hop entities and relations connected to the answer path 𝑎𝑎𝑎𝑎𝑎𝑎 𝒄𝒄𝟏𝟏, 𝒄𝒄𝟐𝟐, … , 𝒄𝒄𝒏𝒏 : average of context entity and relation embeddings

    Answer pathAnswer type

    Embedding Candidate Answers

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_byfilm.film.release

    _date_s

    film.film region

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Answer Path

    Answer Context

    Answer Type

  • Learn vector representations for candidate answers (Bordes et al., 2014a; Bordes et al., 2014b)

    Answer type common.topic.notable_types, value type (e.g., float, string, datetime) 𝑎𝑎𝑎𝑎𝑎𝑎 𝒕𝒕𝟏𝟏, 𝒕𝒕𝟐𝟐, … , 𝒕𝒕𝒏𝒏 : average of type embeddings

    Answer pathAnswer context

    Embedding Candidate Answers

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_byfilm.film.release

    _date_s

    film.film region

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Answer Path

    Answer Context

    Answer Type

  • when did Avatar release in UK

    Convolutional Layer

    Max-Pooling Layer

    Shared Word Representations

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_by

    type.object.type

    people.person

    film.producer

    type.object.typem.09w09jk

    film.film.release_date_s

    type.object.typefilm.film_regional_release_date

    United Statesof Americam.09c7w0

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-18

    datetime

    value_type

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Score Layer

    Score

    + +Dot Product

    Answer Path

    Answer Context

    Answer Type

    Model Overview

    when did Avatar release in UK

    Convolutional Layer

    Max-Pooling Layer

    Shared Word Representations

    t

    people.pe

  • when did Avatar release in UK

    Convolutional Layer

    Max-Pooling Layer

    Shared Word Representations

    Avatarm.0bth54

    James Cameronm.03_gd

    film.film.directed_by

    type.object.type

    people.person

    film.producer

    type.object.typem.09w09jk

    film.film.release_date_s

    type.object.typefilm.film_regional_release_date

    United Statesof Americam.09c7w0

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-18

    datetime

    value_type

    m.0gdp17zfilm.film.release_date_s

    type.object.type

    film.film_regional_release_date

    United Kingdomm.07ssc

    film.film_regional_release_date.film_release_region

    film.film_regional_release_date.release_date

    2009-12-17

    datetime

    value_type

    Score Layer

    Score

    + +Dot Product

    Answer Path

    Answer Context

    Answer Type

    Model Overview

    film film

    dat

    Score Layer

    Score

    + +Dot Product

    Answer Path

    Answer Context

    Answer Type

  • Negative instance 𝑎𝑎𝑎 is randomly sampled from the set of candidate answersHinge loss for (𝑞𝑞, 𝑎𝑎) and (𝑞𝑞, 𝑎𝑎𝑎)

    Objective function 𝐴𝐴𝑞𝑞: set of correct answers 𝑅𝑅𝑞𝑞 ⊆ 𝐶𝐶𝑞𝑞\A𝑞𝑞: set of wrong answers

    Back-propagation, AdaGrad, max-norm regularization

    Model Training

    , where

  • Inference (During Test)

    when did Avatar release in UKCandidate Answers

    (2-hop entities/attributes)

    Type

    Multi-columnConvolutional Neural Networks

    Relation Context Type Relation Context

    Matching Score

    Ranker Answer

    Avatar 1. Link to entity in Freebase

    2. Retrieve candidates

    3. Compute vector representations

    4. Compute scores

  • If there are more than one correct answers Use the margin 𝑚𝑚 in objective function as threshold Candidates whose scores are not far from the best answer are

    regarded as predicted results

    Inference (During Test)

  • Question understanding results of paraphrases should be same who is the father of A who is A’s father

    So, the vectors of paraphrases computed by neural networks should be similar Hinge loss Negative instance is randomly sampled

    Question Paraphrases for Multi-Task Learning

  • WebQuestions (Berant et al., 2013) wh- questions collected by querying Google Suggest API Annotated in Amazon Mechanical Turk Train: 3023, Dev: 755, Test: 2032

    Example Question: what is the name of justin bieber brother? Url: http://www.freebase.com/view/en/justin_bieber Answers: {Jazmyn Bieber, Jaxon Bieber}

    Paraphrases (Fader et al., 2013) Collected from the WikiAnswers website ~2.4M questions, grouped into ~355k paraphrase clusters

    Experiments

    http://www.freebase.com/view/en/justin_bieber

  • Better or comparable results than baseline methods

    Experimental Results

    Semantic Parsing

    Information Extraction

  • Ablation experiments w/o path/type/context

    without the specific column w/o multi-column

    tying parameters of multiple columns w/o paraphrase

    without question paraphrases 1-hop

    1-hop paths to generate candidates

    Model Analysis

  • Salience score How much a word affects question understanding Replace a word with stop words, how much the vectors are affected

    Salient Question Words Detection

    is the microsoft located

    where is the microsoft located Multi-ColumnNeural Networks Type Relation Context

    Type Relation Context

    Multi-ColumnNeural Networks

    vector distance is salience score

  • Observations wh- words nouns dependent of the wh- words

    type/country/leader verbs

    speak/located

    Salient Question Words Detection

  • Question answering over unstructured text

    Future Work

    … Titanic is a 1997 American epic romantic disaster film directed, written, co-produced, and co-edited by James Cameron. …

    Who is the director of Titanic?

    Type: Person

    Type

    Multi-columnConvolutional Neural Networks

    Relation Context Type Relation Context

    Matching Score

    Sentence Modeling

    https://en.wikipedia.org/wiki/James_Cameron

  • THANKS!

  • Question Answering over Freebase with Multi-Column Convolutional Neural NetworksQuestion Answering over FreebaseMainstream Methods (1/2)Mainstream Methods (2/2)Proposed MethodModel OverviewModel OverviewEmbedding Candidate AnswersEmbedding Candidate AnswersEmbedding Candidate AnswersModel OverviewModel OverviewModel TrainingInference (During Test)Inference (During Test)Question Paraphrases for Multi-Task LearningExperimentsExperimental ResultsModel AnalysisSalient Question Words DetectionSalient Question Words DetectionFuture WorkTHANKS!幻灯片编号 24