practical sentiment analysis

Download Practical sentiment analysis

Post on 26-Jan-2015

7.301 views

Category:

Technology

4 download

Embed Size (px)

DESCRIPTION

Practical Sentiment Analysis tutorial at Sentiment Symposium, 29 Oct San Francisco

TRANSCRIPT

  • 1. Practical Sentiment AnalysisDr. Diana Maynard University of Sheffield, UK The University of Sheffield, 1995-2012This work is licenced under the Creative Commons Attribution-NonCommercial-ShareAlike LicenceSentment Analysis Symposium, San Francisco, October 2012

2. Some of the things you can learn todaySentment Analysis Symposium, San Francisco, October 2012 3. Which is more accurate?Ask the audience?Or Phone a Friend?Sentment Analysis Symposium, San Francisco, October 2012 4. How did the Royal Wedding influencethe UKs health? Sentment Analysis Symposium, San Francisco, October 2012 5. Can Twitter help us predict the stock market?Hey Jon, Derek in Scunthorpes having a bacon and egg sandwich. Isthat good for wheat futures? Sentment Analysis Symposium, San Francisco, October 2012 6. Can Twitter predict earthquakes? Sentment Analysis Symposium, San Francisco, October 2012 7. Why do deaths confuse sentimentanalysis tools?Sentment Analysis Symposium, San Francisco, October 2012 8. Outline Introduction to Opinion Mining concepts and motivation, strengths and weaknesses ofcurrent systems subtasks of an opinion mining system and the majorchallenges Introduction to GATE main components Why use GATE for opinion mining? Applications examples of developing various real applications in GATE machine learning and rule-based approaches Sentment Analysis Symposium, San Francisco, October 2012 9. Part 1: Introduction to Opinion MiningSentment Analysis Symposium, San Francisco, October 2012 10. Introduction to Opinion Mining:Concepts and MotivationSentment Analysis Symposium, San Francisco, October 2012 11. The Social WebInformation, thoughtsand opinions are sharedprolifically these days onthe social webSentment Analysis Symposium, San Francisco, October 2012 12. Drowning in information It can be difficult to get therelevant information out of suchlarge volumes of data in a usefulway Social web analysis is all about theusers who are actively engagedand generate content Social networks are pools of awide range of articulationmethods, from simple "I like it"buttons to complete articlesSentment Analysis Symposium, San Francisco, October 2012 13. Opinion Mining Along with entity, topic andevent recognition, opinionmining forms thecornerstone for social webanalysis Sentment Analysis Symposium, San Francisco, October 2012 14. Opinion mining is not just about product reviews Much opinion mining research has been focused aroundreviews of films, books, electronics etc. But there are many other uses companies want to know what people think finding out political and social opinions and moods investigating how public mood influences the stock market investigating and preserving community memories drawing inferences from social analytics Sentment Analysis Symposium, San Francisco, October 2012 15. Analysing Public Mood Closely related to opinion mining, is theanalysis of sentiment and mood Mood of the Nation project at BristolUniversityhttp://geopatterns.enm.bris.ac.uk/mood/ Mood has proved more useful thansentiment for things like stock marketprediction (fluctuations are driven mainlyby fear rather than by things likehappiness or sadness)Sentment Analysis Symposium, San Francisco, October 2012 16. But there are lots of tools thatanalyse social media already.... Here are some examples: Sentiment140: http://www.sentiment140.com/ Twends: http://twendz.waggeneredstrom.com/ Twittratr: http://twitrratr.com/ SocialMention: http://socialmention.com/ TipTop: http://feeltiptop.com/ TweetFeel: http://www.tweetfeel.com/Sentment Analysis Symposium, San Francisco, October 2012 17. Why not use existing online sentiment apps? Easy to search for opinions about famous people, brands andso on Hard to search for more abstract concepts, perform a non-keyword based string search e.g. to find opinions about Lady Gagas dress, you can often only search on Lady Gaga to get hits Theyre suitable for a quick sanity check of social media, butnot really for business needs And the opinion finding they do isnt very good... Sentment Analysis Symposium, San Francisco, October 2012 18. Whitney Houston wasnt very popular...Sentment Analysis Symposium, San Francisco, October 2012 19. Or was she?Sentment Analysis Symposium, San Francisco, October 2012 20. What did people think of the Olympics?Sentment Analysis Symposium, San Francisco, October 2012 21. Twittraters view of the OlympicsA keyword search for Olympics shows exactly how existingsystems fail to cut the mustard Lookup of sentiment words is not enough if theyre part of longer words theyre used in different contexts the tweet itself isnt relevant theyre used in a negative or sarcastic sentence theyre ambiguousSentment Analysis Symposium, San Francisco, October 2012 22. Accuracy of twitter sentiment apps Mine the social media sentiment apps and youll find a huge difference ofopinions After the Royal Wedding, a search for opinions on Pippa Middletongave the following: TweetFeel: 25% positive, 75% negative Twendz: no results TipTop: 42% positive, 11% negative Twitter Sentiment: 62% positive, 38% negative Sentment Analysis Symposium, San Francisco, October 2012 23. Tracking opinions over time and space Opinions can be extracted with a time stamp and/or a geo-location We can then analyse changes to opinions about the sameentity/event over time, and other statistics We can also measure the impact of an entity or event on the overallsentiment about an entity or another event, over the course of time(e.g. in politics) Also possible to incorporate statistical (non-linguistic) techniques toinvestigate dynamics of opinions, e.g. find statistical correlationsbetween interest in certain topics or entities/events andnumber/impact/influence of tweets etc. Twitter acitivity over 24 hours plotted on a world maphttp://bit.ly/SgGhIJ Sentment Analysis Symposium, San Francisco, October 2012 24. Measuring impact over timeWe can measure the impact of a political entity or event on the overallsentiment about another entity or event, over the course of time.Aggregation of opinions over entities and events to cover sentences anddocumentsCombined with time information and/or geo-locations, we can thenanalyse changes to opinions about the same entity/event over time, andother statistical correlationsSentment Analysis Symposium, San Francisco, October 2012 25. Social networks can trigger new events Not only can online social networks provide asnapshot of current or past situations, butthey can actually trigger chains of reactionsand events Ultimately these events might led to societal,political or administrative changes Since the Royal Wedding, Pilates classes havebecome incredibly popular in the UK solely asa result of social media. Pippa Middletons bottom has its ownFacebook page and twitter account, andpictures of her bottom are worth more thanthose of her face!Sentment Analysis Symposium, San Francisco, October 2012 26. Predicting the futureSentment Analysis Symposium, San Francisco, October 2012 27. Predicting other peoples decisions It would be useful to predict what products people will buy,what films they want to see, or what political party theyllsupport Sentment Analysis Symposium, San Francisco, October 2012 28. Predicting Presidential Candidates Michael Wu from Lithium did a study of sentiment data onvarious social web apps about presidential candidates inMarch 2012 http://lithosphere.lithium.com/t5/Building-Community-the-Platform/Big-Data-Big-Prediction-Looking-through-the-Predictive-Window/ba-p/41068 His analysis involved taking the positive sentiments minus thenegative sentiments, over a 2 week period, and also includingthe neutral sentiments Neutral sentiments were weighted at 1/10 and added to thenet sentiment He saw a close correlation between his analysis and theGallup polls, but he warns us to be cautious... Sentment Analysis Symposium, San Francisco, October 2012 29. Predictive Analysis Windows Predictive analytics is about trying to look into the future through thepredictive window of your data. If you try to look outside this window, your future will look very blurry. Its like weather forecasting the smaller the window, the moreaccurate youll be The important question is not whether social media data can predictelection outcome, but how far ahead can it be predicted? For something that changes very quickly like the financial market, thepredictive window will be very short. For things that do not change as fast, the predictive window will belonger. For social media sentiment data, the window for election forecasting isabout 1.5 to 2 weeks, (1 to be conservative).Sentment Analysis Symposium, San Francisco, October 2012 30. Aggregate sentiment finding Aggregate sentiment finding (e.g. OConnor et al 2010) typicallyuses shallow techniques based on sentiment word counting.Idea is that if youre only trying to find aggregates then suchtechniques are sufficient, even though theyre far from perfect. Although the error rate can be high, with a fairly large number ofmeasurements, these errors will cancel out relative to the quantitywe are interested in estimating (aggregate public opinion). The claim is that using standard text analytics techniques on suchdata can actually be harmful, because theyre designed to optimiseper-document classification accuracy rather than assessingaggregate population proportions. Their method shows some correlation with public sentiment pollsbut they conclude that better opinion mining would be beneficial.Sentment Analysis Symposium, San Francisco, October 2012 31. Social media and politics Twitter provides real-time feedback on political debates thats muchfaster than traditional polling. Social media chatter can gauge how a candidates message is beingreceived or even warn of a popularity dive. Campaigns that closely monitor the Twittersphere have a better feel ofvoter sentiment, allowing candidate

Recommended

View more >