bottari: location based social media analysis with semantic web

BOTTARI: Location based Social Media Analysis with Semantic Web. Emanuele Della Valle. Joint work with: CEFRIEL: Irene Celino, Daniele Dell’Aglio, Marco Balduini; SALTLUX: Tony Lee, Seonho Kim; SIEMENS: Volker Tresp, Yi Huang

Upload: emanuele-della-valle

Post on 27-Jan-2015

DESCRIPTION

Bottari is a LarKC application (http://www.larkc.eu/). It offers a real-time personalized recommendation service for restaurants in Insa-dong (Seoul) by listening to the reputation of the restaurants on social media. Social media analytics is powered by LarKC's inductive and deductive stream reasoning solution. Learn more at http://larkc.cefriel.it/lbsma/bottari/.

TRANSCRIPT

Page 1: BOTTARI: Location based Social Media Analysis with Semantic Web

BOTTARI: Location based Social Media Analysis with Semantic Web

Emanuele Della Valle

Joint work with:

CEFRIEL: Irene Celino, Daniele Dell’Aglio, Marco Balduini

SALTLUX: Tony Lee, Seonho Kim

SIEMENS: Volker Tresp, Yi Huang

Page 2: BOTTARI: Location based Social Media Analysis with Semantic Web

Watch this first :-)

26.10.2011 - SW Challenge 2011, ISWC 2011, Bonn, Germany

http://www.youtube.com/watch?v=c1FmZUz5BOo

Page 3: BOTTARI: Location based Social Media Analysis with Semantic Web

What have you seen?

• An augmented reality application for personalized recommendation of restaurants in Seoul

Page 4: BOTTARI: Location based Social Media Analysis with Semantic Web

Yet another ?

• Yes and no!

• Same use case, more “democratic”

• We do “reality mining” by listening to the social media

Page 5: BOTTARI: Location based Social Media Analysis with Semantic Web

Architecture

[Architecture diagram. Component labels: Query Rewriter, Query Evaluator, RDF2Matrix Plug-in, Streaming Linked Data Server, SOR Invoker, SOR geo-spatial KB, Social Media Crawler and Sentiment Miner. Other labels: out, HTTP, SPARQL, androjena. Interaction modes: PULL: Query Initiated; PUSH: Data Initiated.]

Page 6: BOTTARI: Location based Social Media Analysis with Semantic Web

Sentiment Mining

[Flowchart: Micropost message → Morphologically analyzable? Yes → Rule based Analysis (auto generated rules). No → SVMs with Syllable Kernel (learned documents). Both branches → Sentiment of the tweet.]

• Precision tests:

– Auto-generated rules ≈ 70%

– Manually-coded rules ≈ 90%

– Syllable kernel ≈ 50~60%

• Our target > 85%
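The routing shown in the flowchart can be sketched as a two-branch dispatcher. This is only an illustration of the control flow, not the BOTTARI implementation: `is_morphologically_analyzable`, `rule_based_sentiment`, `fallback_sentiment`, and the toy word lists are hypothetical stand-ins for the morphological analyzer, the rule sets, and the SVM with a syllable kernel named on the slide.

```python
# Toy lexicons: assumptions for illustration, not taken from the slides.
POSITIVE_WORDS = {"good", "great", "delicious"}
NEGATIVE_WORDS = {"bad", "awful", "bland"}


def is_morphologically_analyzable(text: str) -> bool:
    """Hypothetical check: analyzable if every token is purely alphabetic."""
    tokens = text.split()
    return bool(tokens) and all(tok.isalpha() for tok in tokens)


def rule_based_sentiment(text: str) -> str:
    """Stand-in for the rule-based branch: count lexicon hits."""
    tokens = [tok.lower() for tok in text.split()]
    score = sum(t in POSITIVE_WORDS for t in tokens) - sum(
        t in NEGATIVE_WORDS for t in tokens
    )
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def fallback_sentiment(text: str) -> str:
    """Stand-in for the SVM branch; a trained classifier would be called here."""
    return "neutral"


def classify(text: str) -> str:
    """Route a micropost to the rule-based branch or to the fallback branch."""
    if is_morphologically_analyzable(text):
        return rule_based_sentiment(text)
    return fallback_sentiment(text)
```

Keeping the two branches behind one `classify` function makes it possible to measure the precision of each branch separately, as the slide does.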

Page 7: BOTTARI: Location based Social Media Analysis with Semantic Web

SOR - Geo-Spatial KB

Page 8: BOTTARI: Location based Social Media Analysis with Semantic Web

C-SPARQL and Streaming Linked Data Server

Page 9: BOTTARI: Location based Social Media Analysis with Semantic Web

SUNS (Statistical Unit Node Sets)

• A machine learning framework for inductive materialization

– Detects interesting data patterns

– Predicts RDF triples

• i.e., which restaurant a user will tweet positively about

• Characteristics

– Capability to deal with sparse, high-dimensional and incomplete data

– Multivariate latent space based approach

– Modularized approach for easily integrating contextual information
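To make the idea of a latent space based prediction concrete, here is a minimal sketch that scores unseen (user, restaurant) pairs by low-rank reconstruction of a small user-by-restaurant matrix. The slide does not specify SUNS's actual model, so truncated SVD via NumPy is swapped in here as one generic latent-space technique; the matrix values, the function names, and the chosen rank are all assumptions.

```python
import numpy as np

# Rows: users, columns: restaurants. 1 = a "tweeted positively about" triple
# was observed, 0 = nothing observed (unknown, not necessarily negative).
# The data is invented for illustration.
observed = np.array(
    [
        [1, 1, 0, 0],
        [1, 1, 1, 0],
        [0, 0, 1, 1],
        [0, 1, 1, 1],
    ],
    dtype=float,
)


def low_rank_scores(matrix: np.ndarray, rank: int) -> np.ndarray:
    """Reconstruct the matrix from its top-`rank` singular components."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    return (u[:, :rank] * s[:rank]) @ vt[:rank, :]


def recommend(matrix: np.ndarray, user: int, rank: int = 2, top_n: int = 1) -> list:
    """Return indices of the best-scoring restaurants the user has not tweeted about."""
    scores = low_rank_scores(matrix, rank)[user].copy()
    scores[matrix[user] > 0] = -np.inf  # exclude already observed triples
    return [int(i) for i in np.argsort(-scores)[:top_n]]
```

Calling `recommend(observed, user=0)` returns one restaurant index drawn from the columns that user 0 has no observed triple for; the reconstructed score plays the role of the predicted likelihood of a new triple.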

Page 10: BOTTARI: Location based Social Media Analysis with Semantic Web

Query Processing

SELECT DISTINCT ?poi ?name ?lat ?long ?numPos ?prob
WHERE {
  ?poi a ns:NamedPlace ;
       ns:name ?name ;
       geo:lat ?lat ;
       geo:long ?long .
  FILTER (f:within_distance(37.5, 126.9, ?lat, ?long, 200))
  FILTER (f:dest_point_viewing(37.5, 126.9, ?lat, ?long, 90, 200))
  {
    :someUser sioc:creator_of ?tweet .
    ?tweet twd:talksAboutPositively ?poi .
    WITH PROBABILITY ?prob
    ENSURE PROBABILITY [0.5..1)
  }
  ?poi twd:numberOfPositiveTweets ?numPos .
}
ORDER BY DESC(?numPos), ?prob, f:distance(37.5, 126.9, ?lat, ?long)
LIMIT 10

[Slide callouts label parts of the query as GEO-SPATIAL, PROBABILISTIC, and STREAMING.]
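The geo-spatial filter and the final ordering of the query can be illustrated outside SPARQL. The sketch below assumes that `f:within_distance(lat1, long1, lat2, long2, d)` means "the two points are at most d metres apart" and computes that with the haversine formula; the slide does not define the function, so this reading, the helper names, and the POI dictionaries are assumptions. `f:dest_point_viewing` is left out because its semantics cannot be recovered from the slide.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in metres


def distance_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle (haversine) distance in metres between two points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def within_distance(lat1, lon1, lat2, lon2, max_m) -> bool:
    """Assumed reading of f:within_distance: points are at most max_m metres apart."""
    return distance_m(lat1, lon1, lat2, lon2) <= max_m


def rank_pois(pois, user_lat, user_lon, max_m=200, limit=10):
    """Filter POIs by distance, then order them like the query's ORDER BY clause."""
    nearby = [
        p for p in pois
        if within_distance(user_lat, user_lon, p["lat"], p["long"], max_m)
    ]
    # Mirrors ORDER BY DESC(?numPos), ?prob, f:distance(...) as written.
    nearby.sort(
        key=lambda p: (
            -p["numPos"],
            p["prob"],
            distance_m(user_lat, user_lon, p["lat"], p["long"]),
        )
    )
    return nearby[:limit]


# Invented sample data around the coordinates used in the query.
sample_pois = [
    {"name": "A", "lat": 37.5005, "long": 126.9, "numPos": 3, "prob": 0.7},
    {"name": "B", "lat": 37.501, "long": 126.9, "numPos": 5, "prob": 0.6},
    {"name": "C", "lat": 37.51, "long": 126.9, "numPos": 9, "prob": 0.9},
]
```

With the sample data, `rank_pois(sample_pois, 37.5, 126.9)` keeps A and B, which lie within 200 metres, and drops C, which is roughly a kilometre away.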

Page 11: BOTTARI: Location based Social Media Analysis with Semantic Web

LarKC At Work

[Same architecture diagram as on page 5, annotated with the following callouts.]

• Input user query is split

• Geo-Spatial part of the query to get POIs closer to user location

• Probabilistic part of the query to get personalized recommendations (the “for me” button in BOTTARI)

• Streaming part of the query to get trends in users' sentiment (the “emerging” button in BOTTARI)

• Results of the different computations are joined
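The final step, joining the results of the different computations, can be pictured as an inner join on the POI identifier. This is a minimal sketch under assumptions: the three dictionaries stand in for the partial answers of the geo-spatial, probabilistic, and streaming parts, and their contents, the key format, and the function name `join_on_poi` are invented for illustration.

```python
def join_on_poi(geo: dict, probabilistic: dict, streaming: dict) -> list:
    """Inner-join three partial results keyed by POI identifier."""
    common = geo.keys() & probabilistic.keys() & streaming.keys()
    return [
        {
            "poi": poi,
            **geo[poi],                   # ?lat, ?long from the geo-spatial part
            "prob": probabilistic[poi],   # ?prob from the probabilistic part
            "numPos": streaming[poi],     # ?numPos from the streaming part
        }
        for poi in sorted(common)
    ]


# Invented partial results for three POIs.
geo_part = {
    "poi:1": {"lat": 37.5005, "long": 126.9},
    "poi:2": {"lat": 37.501, "long": 126.9},
}
probabilistic_part = {"poi:1": 0.8, "poi:3": 0.6}
streaming_part = {"poi:1": 12, "poi:2": 4}

joined = join_on_poi(geo_part, probabilistic_part, streaming_part)
```

Only `poi:1` appears in all three partial results, so it is the only row in `joined`; a POI missing from any one part is dropped, which matches a join over required (non-optional) query patterns.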

Page 12: BOTTARI: Location based Social Media Analysis with Semantic Web

Evaluation - Efficacy

[Plot comparing five series: random, knnItem, emerging (C-SPARQL), for me (SUNS), and SUNS + C-SPARQL. Horizontal axis ticks from 5 to 30; vertical axis ticks from 0,1 to 0,7. Axis names are not recoverable from the transcript.]

Page 13: BOTTARI: Location based Social Media Analysis with Semantic Web

Evaluation - Efficiency

Hardware: 2.66 GHz Intel Core 2 Duo with 8 GB RAM

Page 14: BOTTARI: Location based Social Media Analysis with Semantic Web

Evaluation – Scalability

[Plot: Query Latency (sec) against Number of concurrent users.]

Page 15: BOTTARI: Location based Social Media Analysis with Semantic Web

Conclusions

• End-user application

• Attractive and functional interface

• Real-world dynamic data

• Fully based on Semantic Web technologies

– RDF as common data format between heterogeneous components

– SPARQL as query language

• Rigorously evaluated

– Effective

– High throughput for handling dynamic data

– Scalable in number of concurrent users

• Commercial potential

Page 16: BOTTARI: Location based Social Media Analysis with Semantic Web

Emanuele Della Valle

Joint work with:

CEFRIEL: Irene Celino, Daniele Dell’Aglio, Marco Balduini

SALTLUX: Tony Lee, Seonho Kim

SIEMENS: Volker Tresp, Yi Huang

Any questions?