data science in social media
DESCRIPTION
Data Science in Social Media. Naveen Kumar Sridhar. Data Reliability Trust Relation between Reliability and Trust Ontologies SPARQL Future work. Agenda. One of my friends posted a status message on Facebook, No classes today. Thank you Irene. First question that came to my mind, - PowerPoint PPT PresentationTRANSCRIPT
Data Science in Social MediaData Science in Social Media
Naveen Kumar Sridhar
AgendaAgenda
Data ReliabilityTrustRelation between Reliability and TrustOntologiesSPARQLFuture work
MotivationMotivation
One of my friends posted a status message on Facebook,
No classes today. Thank you Irene.
First question that came to my mind,
Is this person reliable?
Data ReliabilityData Reliability
Definition:Reliable data is evidence you can trust.
Example:If two people are doing the experiment and the result is almost same most of the time, then we consider the data to be reliable.
In social network like Facebook, data can be classified as reliable data based on the trust on a person who posts the data.
EvidenceEvidence
Definition‘Evidence’ is data which is judged to be relevant.
Example:When you investigate cooling, temperature data is relevant, so it is evidence. The length of the thermometer is not relevant, so it is not evidence.
TrustTrust
Definition:Trust in a person is a commitment to an action based on a belief that the future actions of that person will lead to good outcome. (Golbeck and Hendler)Example:A believes in B expecting that B provides certain information which are correct to the context.
Trust TypesTrust Types
Direct Trust
Relational Trust
System Trust
Direct TrustDirect Trust
Scenarios:You know a mechanic who has already repaired your car. You know a friend who lives in Chennai. He posts a status message
Weather is pretty cool
Do not forget that trust is contextual
Relational TrustRelational Trust
ScenariosYou are new to Troy. You have to buy some apples. Your friend says
‘Price Chopper is the right place’ Your friend likes a status message of his friend in Facebook
‘There is a DJ tonite @ Brown’s’
System TrustSystem Trust
ScenariosYou are going to a small town which has a small grocery store. They have only nameless toothpaste. You can see a FDA approval over the label.You see a message posted by some person you don’t know. You can see that he is from RPI or Stanford.
Facebook DataFacebook Data
OntologyOntology
DefinitionAn ontology is an explicit specification of a conceptualizationTools:Protégé 4.2.0Link:https://dl-web.dropbox.com/get/Independent%20Study/OntologyTrustFacebook.owl?w=3db9b219
SPARQLSPARQL
SPARQL stands for SPARQL Protocol and RDF Query Language.
Used SPARQL to query the data obtained from Facebook which was in the RDF form.
This can be integrated later with the ontology so that we can infer trust values directly.
Ontology ExplanationOntology Explanation
Trustor is someone who must choose whether, and how much, to trust.
Trustee is someone or something that is to be trustedThe ontology is designed to find the different forms
of trusted Trustees.
Ontology ExplanationOntology Explanation
Trustee class is divided into five sub-classes
Completely TrustedSomewhat TrustedSomewhat UntrustedCompletely UntrustedDon’t know
Ontology ExplanationOntology Explanation
Trust classes areAssociation – To find out whether there is an association between trustor and trusteePriorKnowledge – To find out whether the trustee has prior knowledgeMetadata Source – To find out whether Trustee has valid IP addressSourceOfData – To find out whether trustee has mentioned the links of the source
Ontology ExplanationOntology ExplanationCompletely Trusted – Trustee is a professor/has a
professional relationship and has provided the source link of data and has prior knowledge.
Somewhat Trusted – Trustee is a professor/ has a professional relationship/has provided the source and also has prior knowledge
Somewhat Untrusted – Trustee is not having any one these:priorknowledge/link/association
Ontology ExplanationOntology ExplanationCompletely Untrusted – Trustee has no trust factors
associated with himDon’t know – If the trustee does not fall in any of the
four categories mentioned above categories.
SPARQL End PointSPARQL End PointSample RDF is placed in an existing triple storeAs data is sensitive, I pulled data for one person(any
amount of data is possible)Data in Facebook which I can see when I am online,
is pulled from graph API in RDF format.
Link : http://bit.ly/t67iav
Future WorkFuture WorkOntology will be used to infer trust of the data from
Facebook(complete model)PML Trust Ontology can be created for Social
Networks.
ReferencesReferences
http://www.ci.austin.tx.us/auditor/downloads/testing.pdfhttp://dig.csail.mit.edu/2011/Papers/ruleml/paper.pdfhttp://www.cs.helsinki.fi/group/cinco/publications/pdfs/viljanen05towards.pdfhttp://www.ai.sri.com/daml/services/owl-s/security/context/ToiDen04.pdf http://www.cs.st-andrews.ac.uk/~tristan/pubs/sn2011.pdfhttp://www.cypherpunks.ca/~iang/pubs/tep.pdfhttp://www.justice.gov/ag/annualreports/pr2009/sect1/data.pdfhttp://www.justice.gov/ag/annualreports/pr2009/sect1/data.pdfhttp://www-ksl.stanford.edu/kst/what-is-an-ontology.htmlhttp://owl.cs.manchester.ac.uk/tutorials/protegeowltutorial/resources/ProtegeOWLTutorialP4_v1_3.pdf
ReferencesReferences
http://www.seco.tkk.fi/publications/2011/laurenne-et-al-envirofi-2011.pdfhttp://www.hipertext.net/english/pag1031.htm http://www.iti.illinois.edu/sites/www.iti.illinois.edu/files/docs/profile-files/trust-icec06.pdfhttp://www-ksl.stanford.edu/kst/what-is-an-ontology.html