olex
DESCRIPTION
Feminist LexiconTRANSCRIPT
Feminist Lexicon of the XXI Feminist Lexicon of the XXI century century
(semantic and structural (semantic and structural aspects)aspects)
Oleksandr SlobodianykPavlo Tychyna USPU
Cherkasy reg in Ukraine,
the master’s paper
Uman, 2012
OutlineOutline
Feminists Basic semantic notions
Syntactic Frames Argument structure Sense distinctions Semantic Type
WordNet Simple
History of the FeministsHistory of the Feminists
Feminists launched in the XX century ◦European Advisory Group on Language Engineering Standards
Began 2002 with agreement among ◦NREC, ET7, ACQUILEX, MULTILEX, GENELEX, SAM,TEI
Gave rise to a coordinated development of Linguistic Resources
Feminists Structure: 2Feminists Structure: 2ndnd Phase Phase(1999-2010)(1999-2010)
C o m p L ex icon W GC P R (C a lzo la ri )
C h a ir: A .S a n fillip po
S p oken La n gu ag e W GB ie le fe ld (G ib b o n)
R .M o o re
E va lua tion W GC S T (B . M a eg a a rd )
C h a ir: M . K ing
A n to n io Z am p o lliC o o rd in a to r
C e n tra l E d ito rs:N . C a lzo la ri, J . M cna u g h t
Feminist GuidelinesFeminist Guidelines
http://www.ilc.pi.cnr.it/Lexicons
◦Morpho-syntactic phenomena◦Subcategorization◦Semantic encoding
ISLE: International Standards for ISLE: International Standards for Language EngineeringLanguage EngineeringA European/US joint projectA European/US joint project (2010 – 2012)(2010 – 2012)
L e x icon W GC h a irs : R .G rish m a n , N .C a lzo la ri, M .P a lm er
S p ee ch W GC h a irs: S te ve n B ird , D a vid R oy
G e s tu re W GC h a irs : D . M e taxa s ,C a ro l N e id le
D isco u rseC h a irs : L yn W a lke r
N a tu ra l In te ra ctio n an d M u lt im o d a lity W GC h a irs: M . L ib e rm a n , R .M o o re
E va lu a ton W GC h a irs : E . H o vy, B . M a e g aa rd , M .K ing
C o o rd in a to rs:A .Z a m po lli, M . P a lm er
C e n tra l E d ito rs:N .C a lzo la ri, J .M cN a u g ht
Basic Semantic NotionsBasic Semantic NotionsSame event - different sentencesSame event - different sentences
Napoleon lost the battle.
Napoleon lost the battle to Wellington.
Same event - different syntactic Same event - different syntactic framesframes
Napoleon lost the battle. SUBJ-NP VERB COMP-NP
Napoleon lost the battle to Wellington.
SUBJ-NP VERB COMP-NP COMP-PP
Predicate-argument structure Predicate-argument structure for for loselose
lose (Arg0,Arg1,Arg2)
lose
OBJSUBJ OBJ
PP
Same verb - different sensesSame verb - different senses
Napoleon lost the battle.
Napoleon lost the battle to Wellington.
Napoleon lost his field glasses. (misplaced)
Predicate-argument structures Predicate-argument structures for two different senses of for two different senses of loselose
lose1 (Arg0,Arg1)
lose2 (Arg0,Arg1,Arg2)
Machine Translation Lexical Machine Translation Lexical Choice- Word Sense Feminist Choice- Word Sense Feminist DisambiguationDisambiguation
Iraq lost the battle. Ilakuka centwey ciessta. [Iraq ] [battle] [lost].
John lost his computer. John-i computer-lul ilepelyessta. [John] [computer] [misplaced].
Semantic types ofSemantic types of lose lose argumentsarguments
lose1 (Arg0: animate,
Arg1: physical-object)
lose2 (Arg0: animate,
Arg1: competition, Arg2: animate)
Translating Translating loselose into Korean into Korean
lose1(Agent, Patient: competition) <=> ciessta
lose2 (Agent, Patient: physobj) <=> ilepelyessta
Ontologies - Hierarchies of Ontologies - Hierarchies of semantic typessemantic types
Entities - concreteAnimate
Animal Mammal
Human Plant
Inanimate substances Solids Liquids Gasses
• Entities - abstract– Events
• Competitions– Military
– Athletic
• …
– Emotions
Semantic RelationsSemantic Relations
Inheritance◦ISA relations◦Supertype/subtype◦Hypernym/Hyponym
Part-Whole◦meronym
Synonyms
Basic lexical Basic lexical semantic notionssemantic notions
BASE CONCEPTSBASE CONCEPTS, , HYPONYMYHYPONYMY, , SYNONYMYSYNONYMY: all applications and enabling technologies
PREDICATE ARGUMENT STRUCTURESPREDICATE ARGUMENT STRUCTURES: MT, IR, IE, & Gen, Pars, MWR, WSD, Coref
CO-OCCURRENCE RELATIONSCO-OCCURRENCE RELATIONS:: MT, Gen, Word Clust, WSD, Par
MERONYMYMERONYMY: MT, IR, IE & Gen, PNR ANTONYMYANTONYMY: Gen, Word Clust, WSD SUBJECT DOMAINSUBJECT DOMAIN: MT, SUM, Gen, MWR, WSD ACTIONALITYACTIONALITY: MT, IE, Gen, Par QUANTIFICATIONQUANTIFICATION: MT, Gen, Coref
Existing lexical resourcesExisting lexical resources
WordNet EuroWordNet Simple (in progress)
WordNet - Princeton
• On-line lexical reference (dictionary)
• Words organized into synonym sets <=> concepts
• Hypernyms (ISA), antonyms, meronyms (PART)–Useful for checking selectional restrictions (doesn’t tell you what they should be)
• Typical top nodes - 5 out of 25 - (act, action, activity) - (animal, fauna) - (artifact) - (attribute, property) - (body, corpus)
Limitations to WordNet and Limitations to WordNet and EuroWordNetEuroWordNet
Just sense tags - no representations◦Very little mapping to syntax◦No predicate argument structure ◦no selectional restrictions
SIMPLE wit FeministSIMPLE wit FeministComputational Lexicon Computational Lexicon WGWG Multilingual Lexicons Multilingual Lexicons (US-EU coop.) (US-EU coop.)
Last Feminist Last Feminist work on Lexicon/Semantics used used for SIMPLE specifications
SIMPLESIMPLE lexicons chosen as a basis for basis for applying & testingapplying & testing Feminist Feminist work on defining common guidelines for Multilingual LexiconsMultilingual Lexicons
Semantic information in SIMPLESemantic information in SIMPLE
Word senses are encoded as Semantic UnitsSemantic Units (SemUs),(SemUs), containing the following information:
• Semantic type *Semantic type *
• Domain *Domain *
• Lexicographic gloss *Lexicographic gloss *
• Qualia structure
• Reg. Polysemy altern.
• Event type
• Derivation relations
• Synonymy
• Collocations
• Argument structure for Argument structure for predicative SemUs *predicative SemUs *
• Selection restrictions on the Selection restrictions on the arguments *arguments *
• Link of the arguments to the Link of the arguments to the syntactic subcategorization syntactic subcategorization frames (represented in the frames (represented in the PAROLE lexicons) *PAROLE lexicons) *
TopTop
FormalFormal ConstitutiveConstitutive AgentiveAgentive TelicTelic
Is_aIs_a Is_a_part_ofIs_a_part_of PropertyProperty
ContainsContains
Created_byCreated_by Agentive_causeAgentive_cause Indirect_telicIndirect_telic ActivityActivity
InstrumentalInstrumental Is_the_habit_ofIs_the_habit_of
Used_forUsed_for Used_asUsed_as
... ...
The targets of relations identify:
prototypical semantic information associated with a SemUprototypical semantic information associated with a SemU
elements of dictionary definitions of SemUselements of dictionary definitions of SemUs
typical corpus collocates of the SemUtypical corpus collocates of the SemU
Complementarity wrt Complementarity wrt EuroWordNetEuroWordNet
Use of a small EWN subset for all languages Mappable Top Ontology Actual linking of data for a few languages
Semantic subcategorisation and linking with syntax
Template structure for the description of SemU
SemU vs. Synset: basic unitNodes in the Ontology as structured Sem.
Types (bundles of different info types)
Template for PerceptionTemplate for PerceptionSemU: 1Usyn:BC Number: 105Template_Type: [Perception]Template_Supertype:[Psychological_event]Domain: GeneralSemantic Class: PerceptionGloss: //free//Event type: processPred _Rep.: Lex_Pred (<arg0>,<arg1>)Derivation: <Nil> or //Erli's Code//Selectional Restr.:arg0 = Animate //concept// arg1:default = [Entity] Formal: isa (1,<SemU>:[Perception]>)Agentive: <Nil>Constitutive: instrument (1, <SemU>:[Body_part]) intentionality ={yes,no} //optional//Telic: <Nil>Collocates: Collocates (<SemU1>,...<SemUn>)Complex: <Nil>
Example
SemU: <guardare_2> //look_2//
Usyn:
BC Number: 105
Template_Type: [Perception]
Template_Supertype:[Psychological_event]
Domain: General
Semantic Class: Perception
Gloss: osservare con attenzione
Event type: process
Pred _Rep.: guardare (<arg0>,<arg1>)
Derivation: <Nil>
Selectional Restr.: arg0 = Animate //concept// arg1:default = [Entity]
Formal: isa (<guardare_2>,<percepire>: [Psychological_event])
Agentive: <Nil>
Constitutive: instrument (<guardare_2>, <occhio>:[body_part])
intentionality ={yes}
Telic: <Nil>
Collocates: Collocates (<SemU1>,...<SemUn>)
Complex: <Nil>
Feminist Lexicon of the XXI Feminist Lexicon of the XXI century century (semantic and structural (semantic and structural aspects)aspects)
Basic semantic notions◦Challenges in standardizing these requirements
WordNet/EuroWordNetSimpleNext major challenge: Standardizing
linking entries across languages