Transcript
Page 1: Semantics Interpretation

1

Semantics Interpretation

Allen’s Chapter 9

J&M’s Chapter 15

Page 2: Semantics Interpretation

2

Rule-by-rule semantic interpretation

• Computing Logical forms (i.e., Semantic Interpretation) can be coupled with parsing

• A syntactic tree can be generated from a specified logical form (i.e., Semantic Realization)

• To couple syntax and semantic, the meaning of all constituent must be determined

• By using features, the relation between the meaning of a constituent and that of its sub constituents can be specified in the grammar

• Each syntactical rule is associated with a semantic interpretation rule (this is called Rule-by-Rule semantic interpretation)

Page 3: Semantics Interpretation

3

Semantic interpretation and compositionality

• Semantic interpretation is assumed to be a compositional process (similar to parsing)

• The meaning of a constituent is derived from the meaning of its sub constituents

• Interpretations can be built incrementally from the interpretations of sub phrases

• Compositional models make grammars easier to extend and maintain

Page 4: Semantics Interpretation

4

Semantic interpretation and compositionality

• Integrating parsing and semantic interpretation is harder than it may seem

• One classic obstacle is the inconsistency between syntactical structures, and corresponding structures of the logical forms

• Syntactical structure of Jill loves every dog is: ((Jill) (loves (every dog)))

• Its unambiguous logical form is:(EVERY d1 : (DOG1 d1) (LOVES1 l1 (NAME j1 “Jill”) d1)

Page 5: Semantics Interpretation

5

Difficulties of Semantic interpretation via compositionality

• In the syntactical structure, every dog is a sub constituent of VP, whereas in the logical form, the situation is reversed

• How the meaning of every dog could be represented in isolation and then used to construct the meaning of the sentence?

• Using quasi-logical forms is one way around this problem

(LOVES1 l1 (NAME j1 “Jill”) <EVERY d1 DOG1>)

Page 6: Semantics Interpretation

6

Problem with Idioms

• Another obstacle for the compositionality assumption is the presence of idiomsJack kicked the bucket = Jack died

• Solution1: semantic meaning to be assigned to the entire phrase, rather than building it incrementally

• What about: The bucket was kicked by Jack?• Jack kicked the bucket is ambiguous between:

– (KICK1 k1 (NAME j1 “Jack”) <THE b1 BUCKET1>)– (DIE1 d1 (NAME j1 “Jack”))

• Solution2: adding a new sense of die for the verb kick with sub categorization for an object BUCKET1

Page 7: Semantics Interpretation

7

Semantic interpretation and compositionality

• In compositional semantic interpretation, we should be able to assign a semantic structure to any syntactic constituent

• For instance, we should be able to assign a uniform form of meaning to every verb phrase in any rule involving a VP

• The meaning of the VP in Jack laughed can be shown by a unary predicate: (LAUGHED1 l1 (NAME j1 “Jack”))

• The VP in, Jack loved Sue, should also be represented by a unary predicate:

(LOVES1 l2 (NAME j1 “Jack”) (NAME s1 “Sue”))• But, how can we express such complex unary predicates?• Lambda calculus provides a formalism for representing such

predicates

Page 8: Semantics Interpretation

8

Lambda calculus

• Using lambda calculus, loved sue is represented as:( x (LOVES1 l1 x (NAME s1 “SUE”)))

• We can apply a lambda expression to an argument, by a process called lambda reduction( x (LOVES1 l1 x (NAME s1 “SUE”))(NAME j1 “Jack”)

= (LOVES1 l1 (NAME j1 “Jack”) (NAME s1 “Sue”))

Page 9: Semantics Interpretation

9

Lambda calculus

• Lambda-calculus provides a handy tool to couple syntax and semantics

• Now verb phrases with even different structures can easily be conjoinedSue laughs and opens the door( a (LAUGHES1 l1 a)) and ( a (OPENS1 o1 a) <THE d1 DOOR1>) can be conjoined into:( a (& (LAUGHES1 l1 a) (OPENS1 o1 a) <THE d1 DOOR1>)))Applying it to (NAME s1 “Sue”) would produce the logical form:(& (LAUGHES1 l1 (NAME s1 “Sue”))

(OPENS1 o1 (NAME s1 “Sue”)) <THE d1 DOOR1>))

Page 10: Semantics Interpretation

10

Lambda calculus (Cont.)

• Propositional phrase modifiers in noun phrases could be handled in different ways

1. Search for location of modifiers in noun phases and incorporate them into the interpretations

• Works for the man in the store, but not for the man is in the store or the man was thought to be in the store

2. Give and independent meaning to prepositional phrases– in the store is represented by a unary predicate

( o (IN-LOC1 o <THE s1 STORE1>)• The man in the store

(<THE m1 (& (MAN1 m1) (IN-LOC1 m1 <THE s1 STORE1>))>)

• The man is in the store(IN-LOC1 <THE m1 MAN1> <THE s1 STORE1>)

Page 11: Semantics Interpretation

11

Compositional approach to semantics

• In general, each major syntactic phrase corresponds to a particular semantic construction:– VPs and PPs map to unary predicates, – sentences map to propositions, – NPs map to terms, and – minor categories are used in building major

categories

Page 12: Semantics Interpretation

12

A simple grammar and lexicon for semantic interpretation

• By using features, logical forms can be computed during parsing

• The main extension needed is a SEM feature, which is added to each lexical entry and rule

• Example:(S SEM (?semvp ?semnp)) (NP SEM ?semnp) (VP SEM ?semvp)

NP with SEM (NAME m1 “Mary”)

VP with SEM ( a (SEES1 e8 a (NAME j1 “Jack)))

• Then the SEM feature of S is:(( a (SEES1 e8 a (NAME j1 “Jack))) (NAME m1 “Mary”)), is reduced to

(SEES1 e8 (NAME m1 “Mary”) (NAME j1 “Jack)

Page 13: Semantics Interpretation

13

Mary sees Jack

Page 14: Semantics Interpretation

14

Sample lexicon with SEM features

Page 15: Semantics Interpretation

15

A sample grammar with SEM and VAR features

• A feature called VAR is also needed to store the discourse variable that corresponds to the constituents

• The VAR feature is automatically generated when a lexical constituent is constructed from a word

• Then the VAR feature is passed up the parse tree by being treated as a head feature

• This guarantees that the discourse variables are always unique • If ?v is m1, SEM of the man is: <THE m1 (MAN1 m1)>

Page 16: Semantics Interpretation

16

Morphological rules with SEM feature

• The lexical rules for morphological derivations must also be modified to handle SEM features

• For instance, the rule for converting singular nouns to plural ones take the SEM of a singular noun and adds a PLUR operator (L7)

• A similar technique is used for tense operators (L1, L2, and L3)

Page 17: Semantics Interpretation

17

Using Chart Parsing for semantic interpretation

• Only two simple modifications are needed to use the standard Chart Parser to handle semantic interpretation:

• When a lexical rule is instantiated, the VAR feature is set to a new discourse variable

• Whenever a constituent is built, its SEM is simplified by possible lambda reductions

Page 18: Semantics Interpretation

18

Jill saw the dog

Page 19: Semantics Interpretation

19

Handling auxiliary verbs

• (VP SEM ( a1 (?semaux (?semvp a1)))) (AUX SUBCAT ?v SEM ?semaux)

(VP VFORM ?v SEM ?semvp)

• If ?semaux is the modal operator CAN1, and ?semvp is ( x (LAUGHS1 e3 X))

• Then the SEM of the VP can laugh will be( a1 (CAN1 (( x (LAUGHS1 e3 x)) a1))

( a1 (CAN1 (LAUGHS1 e3 a1)))

Page 20: Semantics Interpretation

20

Handling Prepositional Phrases

• Propositional phrases play two different semantic roles:

1. PP can be a modifier to a noun phrase or a verb phrase (cry in the corner)

2. PP may be needed by a head word, and the preposition acts more as a term than as an independent predicate (Jill decided on a couch)

Page 21: Semantics Interpretation

21

PP as a modifier of a noun phrase

• The appropriate rule for interpreting PP is:(PP SEM ( y (?semp y ?semnp)))

(P SEM ?semp) (NP SEM ?

semnp)

• if SEM of P is: IN-LOC1, and SEM of NP is: <THE c1 (CORNER1 c1)>,

• Then the SEM of PP in the corner will be:( y (IN-LOC1 y <THE c1 CORNER1>)),

Page 22: Semantics Interpretation

22

PP as a modifier of a noun phrase

• If the PP is used in the rule:(CNP SEM ( n (& (?semcnp n) (?sempp n))))

(CNP SEM ?semcnp) (PP SEM ?sempp)• Then SEM of CNP man in the corner will be: ( n (& (MAN1 n) (( y (IN-LOC1 y <THE c1 CORNER1>)) n)))• That is reduced to:

( n (& (MAN1 n) (IN-LOC1 n <THE c1 CORNER1>)))• Then using rule

(NP VAR ?v SEM <?semart ?v (?semcnp ?v)>) (ART SEM ?semart) (CNP SEM ?semcnp)

• The SEM of the man in the corner will be:<THE m1 (( y (& (MAN1 y) (IN-LOC1 y <THE c1 CORNER1>))) m1)>

• Which is reduced to:<THE m1 (& (MAN1 m1) (IN-LOC1 m1 <THE c1 CORNER1>))>

Page 23: Semantics Interpretation

23

PP as a modifier of a verb phrase

• Appropriate rule is:(VP VAR ?v SEM ( x (& (?semvp x) (?sempp ?v))))

(VP VAR ?v SEM ?semvp) (PP SEM ?sempp)

• SEM of PP in the corner is:( y (IN-LOC1 y <THE c1 CORNER1>)),

• SEM of VP of cry is: (x (CRIES1 e1 x)),

• SEM of VP of cry in the corner will be:( a (& (CRIES1 e1 a) (IN-LOC1 e1

<THE c1 CORNER1>)))

Page 24: Semantics Interpretation

24

Parse tree of Cry in the corner

Page 25: Semantics Interpretation

25

PP as a sub constituent of a head word

• Jill decided on a couch is ambiguous:1. Jill made a decision while she was on a couch2. Jill made a decision about a couch

• The appropriate rule for the second one is:• VP V[_pp:on] NP PP[on]

– The desired logical form for this is( s (DECIDES-ON1 d1 s <A c1 (COUCH1 c1)>))

• Here the preposition “on” doesn’t have any semantic contribution (it is only a term)

• A new binary feature PRED is needed to determine the role of preposition (i.e., a predicate (+) or a term(-))

Page 26: Semantics Interpretation

26

Rules for handling PPs

Page 27: Semantics Interpretation

27

Different parse trees for the VP decide on the coach

Page 28: Semantics Interpretation

28

Lexicalized semantic interpretation

• So far all the complexity of the semantic interpretation has been encoded in grammar rules

• A different approach is to encode complexities in lexical entries and have simpler rules

• Verb decide has two senses, with two different SEM values:

1. Intransitive sense: ( y (DECIDES1 e1 y))

2. Transitive sense: ( o ( y (DECIDE-ON1 e1 y o)))

Page 29: Semantics Interpretation

29

Lexicalized semantic interpretation

• There is a trade off between the complexity of the grammatical rules and the complexity of the lexical entries

• The more complex that logical forms are, the more attractive the second approach becomes

• For instance, to handle thematic roles with simple lexical entries, we need extra rules to classify different type of verbs

Page 30: Semantics Interpretation

30

Lexicalized semantic interpretation

• See and eat both have transitive forms where the subject fills the AGENT role and the object fills the THEME role

• Break also has a transitive sense where the subject fills the INSTR role and the object fills the THEME role

• We need a new feature ROLES to be added to the lexical entries to identify the appropriate form, and a separate rule for each

Page 31: Semantics Interpretation

31

Lexicalized semantic interpretation

(VP VAR ?v SEM ( a (?semv ?v [AGENT a] [THEME ?semnp]))) (V ROLES AGENT-THEME SEM ?semv) (NP SEM ?semnp)

(VP VAR ?v SEM ( a (?semv ?v [INSTR a] [THEME ?semnp]))) (V ROLES INSTR-THEME SEM ?semv) (NP SEM ?semnp)

• Additional rules must be added for all combinations of roles that can be used by the verbs

• This approach is cumbersome, because it requires adding thematic role information to lexicon anyway (using the ROLES feature)

Page 32: Semantics Interpretation

32

Lexicalized semantic interpretation

• It is simpler to enter the appropriate form only in the lexicon:

See: (V VAR ?v SEM ( o ( a (SEES1 ?v [AGENT a] [THEME o]))))Break: (V VAR ?v SEM ( o ( a (BREAKS1 ?v [INSTR a] [THEME

o]))))

• In this case, just a single rule is needed:(VP SEM (?semv ?semnp))

(V SEM ?semv) (NP SEM ?semnp)

• SEM of VP See the book is represented as:( o ( a (SEES1 b1 [AGENT a] [THEME o]))))<THE b1 (BOOK1 b1)> =

( a (SEES1 b1 [AGENT a] [THEME <THE b1 (BOOK1 b1)>]))))

• SEM of VP Break the book is represented as:( o ( a (BREAKS1 b1 [INSTR a] [THEME o]))))<THE b1 (BOOK1 b1)> =

( a (BREAKS1 b1 [INSTR a] [THEME <THE b1 (BOOK1 b1)>]))))

Page 33: Semantics Interpretation

33

Hierarchical lexicons

• The problem of making the lexicon more complex is that there are too many words

• Verb give allows:– I gave the money– I gave John the money– I gave the money to John

• The lexical entries for give includes:– (V SUBCAT _np

SEM o a (GIVE * [AGENT a] [THEME o]))– (V SUBCAT _np_np

SEM r o a (GIVE * [AGENT a] [THEME o] [TO-POSS r))– (V SUBCAT _np_pp:to

SEM r o a (GIVE * [AGENT a] [THEME o] [TO-POSS r))

Page 34: Semantics Interpretation

34

Hierarchical lexicons

• This cause a lot of redundancies in the lexicon

• A better idea is to exploit regularities of verbs in English

• A very large class of transitive verbs (give, take, see, find, paint, etc.), which describe an action, use the same semantic interpretation rule for SUBCAT _np

Page 35: Semantics Interpretation

35

Hierarchical lexicons

• The idea of hierarchical lexicon is to organize verb senses in such a way that their shared properties can be used (via inheritance)– INTRANS-ACT defines verbs with SUBCAT _none

s (?PREDN * [AGENT s])

– TRANS-ACT for verbs with SUBCAT _np o a (?PREDN * [AGENT a] [THEME o])

• Give: (V ROOT give PREDN GIVE1SUP {BITRANS-TO-ACT TRANS-ACT})

Page 36: Semantics Interpretation

36

Lexical hierarchy

Page 37: Semantics Interpretation

37

Handling simple questions

• To handle simple questions, we only need to extend the appropriate grammar rules with the SEM feature

• The lexical entry for the wh-words are extended with the SEM feature:who: (PRO WH {Q R} SEM WHO1 AGR {3s 3p})

• But, how SEM and GAP features interact?

Page 38: Semantics Interpretation

38

Who did Jill see?

Page 39: Semantics Interpretation

39

Prepositional phrase Wh-Questions

• Questions can begin with prepositional phrases:– In which box did you put the book?

– Where did Jill go?

• The semantic interpretation of these questions depend on the type of PPs

(S INV – SEM (WH-query ?sems)) (PP WH Q PRED ?p PTYPE ?pt SEM ?sempp)(S INV + SEM ?sems GAP (PP PRED ?p PTYPE ?pt SEM ?sempp))

Page 40: Semantics Interpretation

40

Prepositional phrase Wh-Questions

• To handle wh-terms like where the following rule is also needed:(PP PRED ?p PTYPE ?pt SEM ?sempp)

(PP-WRD PRED ?p PTYPE ?pt SEM ?sempp)

• There would also be two lexical entries:(PP-WRD PTYPE {LOC MOT} PRED – VAR ?v

SEM <WH ?v (LOC1 ?v)>)

(PP PRED + VAR ?v SEM ( x (AT-LOC x <WH ?v (LOC1 ?v)>)))

Page 41: Semantics Interpretation

41

Where did Jill go?

• But, Wh-questions starting with a PP with + PRED cannot be handled, yet• The difficulty comes from the restriction that in rule VP VP PP, the

GAP is passed only into the VP sub constituent• Thus there is still no way to create a PP gap that modifies a verb phrase

Page 42: Semantics Interpretation

42

Relative Clauses

• The similarity between relative clauses and wh-questions continues at the semantic level, too.

• The logical form of a relative clause is a unary predicate, which is produced by

(CNP SEM ( ?x (& (?semcnp ?x) (?semrel ?x))) (CNP SEM ?semcnp) (REL SEM ?semrel)

• When the embedded sentence is completed, a lambda is wrapped around it to form the relative clause

Page 43: Semantics Interpretation

43

Who Jill saw

• The wh-term variable is specified in a feature called RVAR in the rule

(REL SEM ( ?v ?sems)) (NP WH R RVAR ?v AGR ?a SEM ?semnp)(S [fin] INV – GAP (NP AGR ?a SEM ?semnp) SEM ?sems)

Page 44: Semantics Interpretation

44

Semantic Interpretation by Features Unificationversus Lambda reductions

• Semantic interpretation can be performed by just using feature values and variables

• The basic idea is to introduce new features for the argument positions that would have been filled with lambda reductions

Page 45: Semantics Interpretation

45

Jill saw the dog

Page 46: Semantics Interpretation

46

Features versus Lambda expressions

Page 47: Semantics Interpretation

47

Features versus Lambda expressions

• One advantage of using features is that no special mechanism (e.g., Lambda Reduction) is needed for semantic interpretation

• Another significant advantage is that the grammar in this form is reversible (can be used to generate sentences)

• But, not all lambda expressions can be eliminated using this technique

• Handling conjoined subject phrases as in sue and Sam saw Jack, SUBJ variable need to unify with both Sue and Sam, which is not possible

Page 48: Semantics Interpretation

48

Generating sentences from Logical Forms

• Intuitively, it should be easy to reverse a grammar and use it for generation:

• Decompose the logical form of each constituent into a series of lexical constituents with the appropriate meaning

• But, not all grammars are reversible: e.g., a grammar with Lambda reduction

Page 49: Semantics Interpretation

49

Generating sentences from Logical Forms

• Consider (<PAST SEES1> s1 (NAME j1 “Jill”) <THE d1 (DOG1 d1)>)

• Rule S with SEM (?semvp ?semnp) cannot be unified with the above logical form

• The problem is that lambda reduction was used to convert the original logical form, which was

((a (<PAST SEES1> d1 a <THE d1 (DOG1 d1)>)) (NAME j1 “Jill”)

• Lambda abstraction can be used to find a match, but the problem is that there are three possible lambda abstractions– (e (<PAST SEES1> e (NAME j1 “Jill”)

<THE d1 (DOG1 d1)>– (a (<PAST SEES1> s1 a <THE d1 (DOG1 d1)>))– (o (<PAST SEES1> s1(NAME j1 “Jill”) o))

Page 50: Semantics Interpretation

50

Realization versus Parsing

• Parsing and realization both can be viewed as building a syntactic tree

• A parser starts with the words and tries to find a tree that accounts for them and hence determine the logical form of the sentence

• A realizer starts with a logical form and tries to find a tree to account for it and hence determine the words to realize it

Page 51: Semantics Interpretation

51

Realization and Parsing

• Using standard top-down parsing algorithm is extremely inefficientConsider again (<PAST SEES1> s1

(NAME j1 “Jill”) <THE d1 (DOG1 d1)>)(NP SEM ?semsubj)(VP SUBJ ?semsubj

SEM (<PAST SEES1> s1 (NAME j1 “Jill”) <THE d1 (DOG1 d1)>))

• The problem is that the SEM of NP is unconstrained• Solution: expand the constituents in a different order (e.g.,

choosing the subject based on decisions about the verb and the structure of the verb phrase)

Page 52: Semantics Interpretation

52

Head Driven Realization algorithm

Page 53: Semantics Interpretation

53

Head driven Realization

• Using rule 3 of grammar 9.14 ?semv = <PAST SEES1>,

?v = s1,?semsubj = (NAME j1 “Jill”),?semnp = <THE d1 <DOG1 d1)>

• After rewriting the VP, the following list of constituents is obtained:

(NP SEM (NAME j1 “Jill”))(V [_np] SEM <PAST SEES1>)(NP SEM <THE d1 (DOG1 d1)>)

Page 54: Semantics Interpretation

54

Head driven Realization

• Since there is no non-lexical head, the algorithm picks any non-lexical constituent with a bound SEM (e.g., the first NP)

• Using rule 5, it yields to (NAME SEM “Jill”)

• Selecting the remaining NP, and using rules 6, and then 7, produce the following list:

(NAME SEM “Jill”), (V [_np] SEM <PAST SEES1>),(ART SEM THE), (N SEM DOG1)

• Now it is easy to produce Jill saw the dog

Page 55: Semantics Interpretation

55

Ambiguities in Realization

• With large grammars, a wider range of forms would be possible

• If only the SEM feature is specified, the realization program would randomly pick between active and passive sentences when allowed by the verb

• It might also pick between different sub categorization structures – Jill gave the dog to Jack– Jill gave Jack the dog

• Different word senses may be picked– Jack gave the money to the Humane Society– Jack donated the money to the Humane Society

• To force particular realization, other features in addition to SEM may be necessary (e.g., VOICE to force active voice)


Top Related