entity and aspect extraction for organizing news comments · analysis and lexical database search....
TRANSCRIPT
![Page 1: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/1.jpg)
EntityandAspectExtractionforOrganizingNewsComments
RadityoEkoPrasojo,MounaKacimi&WernerNuttEMCLWorkshop,11–12February2016
![Page 2: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/2.jpg)
CommentsinNewsWebsiteTypically, commentsarelistedbasedondate-timeandreplyrelation.
Problem:difficultytocatchtheflowofthediscussionsandtounderstandtheirmainpointsofagreementanddisagreement.
Example:whyisindependencegood/bad fortheScots?Willtheireconomy beaffected?
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
2
![Page 3: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/3.jpg)
Thereisaneedfororganizingcomments
to help users to:(1) have a better understandingof the viewpoints related to each topic(2) facilitate the participation in discussions and thus increase the
chance of acquiring new viewpoints
by clustering comments containing similar discussions:
• they talk about the same entities
• they argue about the same aspects of those entities.EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionfor
OrganizingNewsComments3
![Page 4: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/4.jpg)
Contributions
• Improvementsonstate-of-the-artunsupervisedentityextractiontools(Zemanta,NERD,AIDAYago)• Addressedissues:noisesandlowcoverage(duetocoreferences)
• Introduced aspectextractioninnewsdomain• Previously:aspectsonlyonproductreviewdomain(Zhang&Liu,2014)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
4
![Page 5: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/5.jpg)
EntityextractionBaseline:UnsupervisedTools
• Improvedbyapplying:• Entityfiltering• Namenormalization• EntitysearchonKB• Coreference resolution
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
5
Tools:DbpediaStanfordCoreNLP
“Don't be afraid of Rasmussen or NATO, this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
<http://dbpedia.org/resource/NATO>
<http://dbpedia.org/resource/Scotland><http://dbpedia.org/resource/USS_Ronald_Reagan>
“Don't be afraid of Rasmussen or NATO,this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
<http://dbpedia.org/resource/Crimea>
<http://dbpedia.org/resource/Aircraft_Carrier>
![Page 6: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/6.jpg)
“Don't be afraid of Rasmussen or NATO,this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
6
rdf:type ofNATO?à←dbo:PopulatedPlace
Anentity isaninstanceofsomewell-definedclass
rdf:type?à
←dbo:Shiprdf:type?à←dbo:Location
rdf:type?à←owl:Thing
EntityFiltering
![Page 7: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/7.jpg)
“Don't be afraid of Rasmussen or NATO,this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
7
Weremoveentities thatdon’thaverdf:type otherthanowl:Thing andowl:Class
rdf:type ofNATO?à←dbo:PopulatedPlace
Anentity isaninstanceofsomewell-definedclass
rdf:type?à
←dbo:Shiprdf:type?à←dbo:Location
Risk:lowerrecall
rdf:type?à←owl:Thing
EntityFiltering
![Page 8: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/8.jpg)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
8
Anentitymayappearusingnon-propernames(alias)
“Don't be afraid of Rasmussen or NATO, this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
Lorem ips um dolor s it amet, consecte turadipisc ing el i t. Ut s usc ipi t lacus ac bibendum mol lis. Pe llenteas dsadas que t incitUnited States
Lorem ips um dolor s it amet, consecte turadipisc ing el i t. Ut s usc ipi t lacus ac bibendum mol lis. Pe llentesquetinc idunt vu lputate ligula a e ffic itur. Aliquam era tv olutpat. Al iquam sedenim et tortor
fringi l la asdasdasdasdas das dqwrqweqweqwr w qeqw eAnders Fogh Rasmussen.
Maecenas vehicula urnaeget m etus imperdie t com modo.
ScotlandRejectsIndependenceinReferendumLorem ipsumdolorsitamet,consectetur adipiscing elit.Ut suscipit lacusacbibendum mollis.Pellentesque tincidunt vulputate ligulaaefficitur.Aliquam erat
…………foaf:givenName,foaf:surname?à
←{‘Anders’, ‘Fogh’, ‘Rasmussen’}
dbpedia-owl:wikiPageRedirects of?à←{‘US’,‘USA’, ‘U.S.’,‘U.S.A.’,…}
NameNormalization
![Page 9: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/9.jpg)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
9
Anentitymayappearusingnon-propernames(alias)
“Don't be afraid of Rasmussen or NATO, this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
Lorem ips um dolor s it amet, consecte turadipisc ing el i t. Ut s usc ipi t lacus ac bibendum mol lis. Pe llenteas dsadas que t incitUnited States
Lorem ips um dolor s it amet, consecte turadipisc ing el i t. Ut s usc ipi t lacus ac bibendum mol lis. Pe llentesquetinc idunt vu lputate ligula a e ffic itur. Aliquam era tv olutpat. Al iquam sedenim et tortor
fringi l la asdasdasdasdas das dqwrqweqweqwr wqeqwe Anders Fogh Rasmussen. Maecenas
v eh icu la urna ege tm etus im perdie t com modo.
ScotlandRejectsIndependenceinReferendumLorem ipsumdolorsitamet,consectetur adipiscing elit.Ut suscipit lacusacbibendum mollis.Pellentesque tincidunt vulputate ligulaaefficitur.Aliquam erat
…………foaf:givenName,foaf:surname?à
←{‘Anders’, ‘Fogh’, ‘Rasmussen’}
dbpedia-owl:wikiPageRedirects of?à←{‘US’,‘USA’, ‘U.S.’,‘U.S.A.’,…}
Risk:lowerprecision
NameNormalization
![Page 10: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/10.jpg)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
10
Sometimes,mappingforanaliascannotbefound
AllrelatedentitiesofNATOandtheiraliasesà
←dbprop:leaderName dbpedia:Anders_Fogh_Rasmussen,{‘AndersFogh’, ‘Rasmussen’}
Stringcomparison“Don't be afraid of Rasmussen or NATO,
Context-RelatedEntitySearch
![Page 11: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/11.jpg)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
11
Sometimes,mappingforanaliascannotbefound
AllrelatedentitiesofNATOandtheiraliasesà
←dbprop:leaderName dbpedia:Anders_Fogh_Rasmussen,{‘AndersFogh’, ‘Rasmussen’}
Stringcomparison
Risk:lowerprecision
“Don't be afraid of Rasmussen or NATO,
Context-RelatedEntitySearch
![Page 12: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/12.jpg)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
12
“Don't be afraid of Rasmussen or NATO, this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
StanfordCoreNLP
Coreference Resolution
![Page 13: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/13.jpg)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
13
“Don't be afraid of Rasmussen or NATO, this is none of their business. If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
StanfordCoreNLP
Risk:lowerprecision
Coreference resolution
Coreference Resolution
![Page 14: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/14.jpg)
EntityExtraction– ExperimentSetup
• 10newsarticlesthatuseDISQUS• 100commentshavingthehighestwordcounts• 5studentsasentity andaspect annotators• Annotateddataasgroundtruth
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
14
![Page 15: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/15.jpg)
EntityExtraction– ExperimentResults
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
15
![Page 16: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/16.jpg)
Aspects
• ofproductentities• Theaspectsofanentitye arethecomponents andattributes ofe.(Zhang&Liu,2014)
• ofentitiesonnews• “MoreScots woulddefinitelyhavevoted nothanyes”(voting - action)• “OrlandoBloomisagoodactor”(acting - skill)• “…it’stherightthe rightofScottishPeople”(right - possession)• Other:components,attributes,andmoods
Anaspect isallwhatisarguableaboutanentityEMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionfor
OrganizingNewsComments16
![Page 17: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/17.jpg)
TypesofAspects
• “…it’stherightthe rightofScottishPeople.”(explicit)
• “Tesco is large.”(implicit)
• “Scotlandisavery beautiful country.”(semi-implicit)
• “TheScotscan vote howevertheywant.” (semi-implicit)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
17
aspect::right
aspect::employer
aspect::beauty
aspect::voting
![Page 18: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/18.jpg)
ExtractionofExplicitAspects:ExploitingDependency
“…it’stherightthe rightofScottishPeople.”
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
18
StanfordCoreNLP Annotation
Extractallnounphrasesthathaveannmod:of,nmod:in,nmod:on,ornmod:atrelationtowardstheentity inthesentence.
PrepositionalDependency
![Page 19: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/19.jpg)
• Specifically,thecoreference resolutionpart
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
19
ExtractionofExplicitAspects,CombinedwithEntityExtraction
“Don't be afraid of Rasmussen or NATO, this is none of their business.”
possessivedependency
![Page 20: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/20.jpg)
ExtractionofImplicitAspects:Adjective-to-aspectMappingIdeafrom(Zhang&Liu,2014)
“Tesco islarge”
Inothercomments,wefoundasaspectsofTesco,qualifiedas“large”:• employer(2x)• backoffice(1x)• callcenteroperation(1x)
Weconclude:mostprobably,theemployer aspectofTesco wasmeant.Resultisfurtherimprovedby(1)takingintoaccountfrequentcontextwordsand(2)lexicalrelationsofadjective.
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
20
![Page 21: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/21.jpg)
ExtractionofSemi-ImplicitAspects(1)
• Semi-implicitaspects:implicitaspectsthatdon’thavemapping.• “Scotlandisavery beautiful country.”
beautiful beautyWesearchforanounphrasethatisconnectedtotheentityandthewordindicatingtheaspectusingalexicalrelation.EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionfor
OrganizingNewsComments21
WordNet:attribute
StanfordCoreNLP Annotation
Lexicaldatabasesearch Weconsiderfollowinglexicalrelations:attribute>pertainym>participleofverb>derivationallyrelatedform(drf)>seealso
![Page 22: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/22.jpg)
ExtractionofSemi-ImplicitAspects(2)
• Generallycanbeusedtoidentifyaspects fromverb,adjective,ornoun.• Otherexamples:
beautiful beautyinstrumental instrumentvote votinginexcusable excusable justifiable
justification justify• Iftherearemultiplepossibleaspects forasingleword,weuseWordNet::Similaritytodecideforthebestone.
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
22
WordNet:attribute
WordNet:pertainym
WordNet:derivationally relatedform
WordNet:antonym WordNet:similar to
WordNet:drfWordNet:drf
![Page 23: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/23.jpg)
AspectExtraction–ExperimentandResults
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
23
![Page 24: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/24.jpg)
Visualization
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
24
REPrasojo,MKacimi,FDarari.IEEEInfoVis.Chicago,25-30October2015.
demoisnowavailableatorcaestra.inf.unibz.it
![Page 25: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/25.jpg)
Conclusion• Generalcontribution:aframeworkfororganizingnewscomment usingentityextractionandaspectextraction.
• Ourentityextraction: unsupervisedtools+entityfiltering,namenormalization,entitysearch,andcoreference resolution.
• Weextractexplicit,implicit,andsemi-implicit aspectsusinggrammaranalysisandlexicaldatabasesearch.
• Experimentshowsimprovementonbothentity andaspect extractioncomparedtobaselinetechnique.
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
25
![Page 26: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/26.jpg)
FutureWorks
• Addressinglimitations:• Conceptextractions• Idiomsandothermetaphoricalexpressions• Difficultcoreference (e.g.demonstrativepronouns)• Experimentsonmore,variousdata
• Completethemissingpieces:• Sentimentanalysisfornewscomments
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
26
![Page 27: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/27.jpg)
• EuropeanMaster’sProgramme inComputationalLogic
• To-KnowProject
• SIGIRStudentTravelGrant
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
27
Acknowledgements
![Page 28: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/28.jpg)
Thankyou!
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
28
![Page 29: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/29.jpg)
Extraslides
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
29
![Page 30: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/30.jpg)
Anentity canbe…
aperson,alocation,anorganization,oranywell-definedconcept suchasnationalities,languages,orwars.
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
30
![Page 31: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/31.jpg)
Entity Extraction Tasks
1. Recognition– throughpropernames/rigiddesignators(Coates-Stephens,1992)(Thielen,1995)(Nadeau&Sekine,2006)
2. Disambiguation – bymappingtoarepresentationinaKB
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
31
“Scotland can vote however it wants, it's the Scottish peoples right.”
“If he ends up sending USS Ronald Reagan aircraft carrier to the coast of Scotland, then he should have done the same to Crimea.”
<http://dbpedia.org/resource/USS_Ronald_Reagan_(CVN-76)>
<http://dbpedia.org/resource/Crimea><http://dbpedia.org/resource/Scotland>
![Page 32: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/32.jpg)
SupervisedvsUnsupervisedApproachestoEntityExtraction
AspectofEE
Prominent tools
Recognitionability
Disambiguation ability
Running time
Supervised
StanfordNLP
dependsonthetrainingset
limited
fast
UnsupervisedZemanta,AlchemyAPI,NERD,AidaYAGO
domain-independent
providedbyKBs
slow
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
32
![Page 33: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/33.jpg)
Thereisaneedfororganizingcommentsto help users to:(1) have a better understandingof the viewpoints related to each topic(2) facilitate the participation in discussions and thus increase the
chance of acquiring new viewpoints
Our contribution is a comment organization framework:
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
33
Entityextraction Aspectextraction
Sentimentanalysis
Visualizationandorganization
![Page 34: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/34.jpg)
ExtractionofImplicitAspects:Adjective-to-aspectMappingIdeafrom(Zhang&Liu,2014)
“Tesco islarge”
Inothercomments,wefoundasaspectsofTesco,qualifiedas“large”:• employer(2x)• backoffice(1x)• callcenteroperation(1x)
Weconclude:mostprobably,theemployer aspectofTesco wasmeant.EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionfor
OrganizingNewsComments34
![Page 35: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/35.jpg)
ExtractionofImplicitAspects:ContextWordsSupposenowweneedatiebreaker!
“Tesco islargeandthepayisgood” contextwords:{large,pay,company}
Inothercomments,wefoundasaspectsofTesco,qualifiedas“large”:• employer(2x)contextwords:{employer,large,salary,job,people}• backoffice(2x)contextwords:{back,office,large,headquarter,building}• callcenteroperation(1x)
Weusecontext-words difference,computedusingWordNet:Similarity.Weconclude:mostprobably,theemployer aspectofTesco wasmeant.EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionfor
OrganizingNewsComments35
rdf:type
contextwords:{employer,large,salary,job,people}contextwords:{back,office,large,area,building}
contextwords:{large,pay,company},company
![Page 36: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/36.jpg)
ExtractionofImplicitAspects:ExperimentandResult(1)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
36
Adjective-to-aspectmappingData Precision RecallAllNews 76.87 26.12
withcontextwordsPrecision Recall88.65 30.03
![Page 37: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/37.jpg)
ExtractionofImplicitAspects:LexicalMapping
“Tesco islarge”
Inadditionto“large”,countalsoaspectsqualifiedwithsimilaradjectives(“huge”,“big”,…)
Similarityismeasuredusing1. lexicalrelationship<synonym>and<similarto>inWordNet2. WordNet::Similarity
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
37
![Page 38: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/38.jpg)
ExtractionofImplicitAspects:ExperimentandResult(2)
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
38
Lexicalmapping(1-step)Data Precision RecallAllNews 87.07 32.72
2-stepsPrecision Recall83.33 32.76
![Page 39: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/39.jpg)
ExtractionofSemi-ImplicitAspects(2)
• Ifwedon’tfindtheaspectwithin1stepoflexicalsearch:• “OrlandoBloomisagoodactor”
act actoracting
Wesearchfortheclosestnountothepseudo-aspect.EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionfor
OrganizingNewsComments39
WordNet:derivationally relatedformLexicaldatabasesearch Tofind intermediatesynsets,weconsider
following lexicalrelations:synonym>antonym>similarto>derivationallyrelatedform(drf) >seealsoWordNet:derivationally relatedform
StanfordCoreNLP Annotation
![Page 40: Entity and Aspect Extraction for Organizing News Comments · analysis and lexical database search. • Experiment shows improvement on both entity and aspectextraction compared to](https://reader036.vdocuments.mx/reader036/viewer/2022062507/5fdcae86fbea8724777ba3e0/html5/thumbnails/40.jpg)
FindingAspects
1. Findwordsthatrepresenttheaspectjob,beautiful,large,acted<noun>,<adjective>,or<verb>
2. Identifytheaspectjob,beauty,employer,acting
EMCLWorkshop2016 Prasojo,Kacimi,&Nutt- EntityandAspectExtractionforOrganizingNewsComments
40
Technique:grammaranalysisTool:StanfordCoreNLP
Technique:frequency-basedmapping (implicit)lexicaldatabasesearch(semi-implicit)
Tools:WordNet,DBpedia