privacy and security in online social media : misinformation on social media

21
Privacy and Security in Online Social Media Course on NPTEL NOC-CS07 Week 3.1 Ponnurangam Kumaraguru (“PK”) Associate Professor ACM Distinguished Speaker fb/ponnurangam.kumaraguru, @ponguru

Upload: precog

Post on 09-Jan-2017

129 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Privacy and Security in Online Social Media : Misinformation on Social Media

Privacy and Security in Online Social Media

Course on NPTELNOC-CS07

Week 3.1

Ponnurangam Kumaraguru (“PK”)Associate Professor

ACM Distinguished Speakerfb/ponnurangam.kumaraguru, @ponguru

Page 2: Privacy and Security in Online Social Media : Misinformation on Social Media

Frameworks / Platforms to know

⚫APIs of OSM (e.g. Facebook / Twitter API)

⚫A programming language to write code to extract data (e.g. Python / RoR)

⚫A database to store data (e.g. MySQL / MongoDB)

⚫A visualization tool to query and analyze data (e.g. PhpMyAdmin / RoboMongo)

2

Page 3: Privacy and Security in Online Social Media : Misinformation on Social Media

Tutorials for this week

⚫Facebook API

3

Page 4: Privacy and Security in Online Social Media : Misinformation on Social Media

Temporal Patterns

4

Fake content / rumors becomes viral in first 7-8 hours just after the event.

Page 5: Privacy and Security in Online Social Media : Misinformation on Social Media

Misinformation Tweets

FAKE

RUMORS

5

$

Page 6: Privacy and Security in Online Social Media : Misinformation on Social Media

Fake Image Tweets

6

Page 7: Privacy and Security in Online Social Media : Misinformation on Social Media

Analysis

⚫Who

⚫When

⚫Where

⚫What

⚫Why

⚫How

7

Page 8: Privacy and Security in Online Social Media : Misinformation on Social Media

Classification

8

Tweet Features [F2]

Length of TweetNumber of Words

Contains Question Mark?

Contains Exclamation Mark?

Number of Question Marks

Number of Exclamation Marks

Contains Happy Emoticon

Contains Sad Emoticon

Contains First Order Pronoun

Contains Second Order Pronoun

Contains Third Order Pronoun

Number of uppercase characters

Number of negative sentiment words

Number of positive sentiment words

Number of mentionsNumber of hashtags

Number of URLsRetweet count

User Features [F1]

Number of Friends

Number of Followers

Follower-Friend Ratio

Number of times listed

User has a URL

User is a verified user

Age of user account

Page 9: Privacy and Security in Online Social Media : Misinformation on Social Media

Sample Fake Tweets

9

> 50,000 RTs

> 30,000 RTs

Page 10: Privacy and Security in Online Social Media : Misinformation on Social Media

Data Description

Total tweets 7,888,374Total users 3,677,531Tweets with URLs 3,420,228Tweets with Geo-tag 62,629Retweets 4,464,201Replies 260,627Time of the blast Mon Apr 15 18:50 2013Time of first tweet Mon Apr 15 18:53 2013Time of first image Mon Apr 15 18:54 2013Time of last tweet Thu Apr 25 01:23 2013

10

Page 11: Privacy and Security in Online Social Media : Misinformation on Social Media

Data Description

11

Page 12: Privacy and Security in Online Social Media : Misinformation on Social Media

Geo-Located Tweets

12

Page 13: Privacy and Security in Online Social Media : Misinformation on Social Media

Network Analysis of Fake Accounts

13

Closed community

Page 14: Privacy and Security in Online Social Media : Misinformation on Social Media

Architecture

14

Page 15: Privacy and Security in Online Social Media : Misinformation on Social Media

TweetCred

⚫Available as a Chrome Extension

Page 16: Privacy and Security in Online Social Media : Misinformation on Social Media

Facebook

⚫Features are different

⚫Different network structure - Friendship

Page 17: Privacy and Security in Online Social Media : Misinformation on Social Media

FBI: Methodology

17

Facebook Graph API

Ground truth extraction

Generating feature vectors

Supervised learningRESTful API

Page 18: Privacy and Security in Online Social Media : Misinformation on Social Media

Web of Trust scores

18

Reputation: Unsatisfactory / Poor / Very poor (less than 60)Confidence: High (greater than 10)

ORCategory: Negative

Malicious

http://www.domain.com

Page 20: Privacy and Security in Online Social Media : Misinformation on Social Media

Demo

20

Page 21: Privacy and Security in Online Social Media : Misinformation on Social Media

Thank [email protected]

precog.iiitd.edu.in fb/ponnurangam.kumaraguru