big data: a survey of technical and sociotechnical concepts
DESCRIPTION
Lecture presentation given at Bard College in October 2014. The content covers concepts related to Big Data, Semantic Web, ethics, politics, etc.TRANSCRIPT
BIG DATA:A SURVEY OF TECHNICAL & SOCIOTECHNICAL CONCEPTS
PRESENTED AT BARD COLLEGE !
BY KRISTINE GLORIA !
OCTOBER 1, 2014
(BIG) DATA:increased volume, distribution, and complexity
Office of Science Technology Policy: “Big Data: Seizing Opportunities and Preserving Values”. May 2014. <http://1.usa.gov/1ky0reK>
Inferencing
Internet v. Web Semantic Web{early} Web
AlgorithmsOntologies
Biases & EthicsPoliticsSurveillance
INTERNET V. THE WEB
Photo Credit: “Hierarchical Structure of the Internet” by Lanet-vi program of I. Alvarez-Hamelin et al (2007)
WEB ARCHITECTURE• A standard system for identifying resources
• Standard formats for representing resources
• A standard protocol for exchanging resources
• Relevant core standards:
• URIs (URLs): Universal Resource Identifiers
• HTML: Hypertext Markup Language
• HTTP: Hypertext Transfer Protocol
=
Architecture of the World Wide Web, Volume One http://www.w3.org/TR/webarch/
resource
resource resource
resourceresource
resource
href
hrefhref
href
href
Adapted from W3C, Marja-Riitta Koivunen and Eric Miller
Early Web
Links to other nodes as a "vote" of quality and/
or relevance = PageRank
SEMANTIC WEB
Kristine | Knows | Jane Doesubject predicate object
uri://peoplepeople#KristineGloria http://xmlns.com/foaf/0.1/knows uri://people#JaneDoe
machine readable: resource description framework (RDF)
also known as a “triple”
resource
resource resource
resourceresource
resource
software
softwaredocument
document
document
person
place
href
hrefhref
href
href
generates dependsOn
creator
locatedIn
hrefhref
IsVersionOf
Adapted from W3C, Marja-Riitta Koivunen and Eric Miller
Early Web Semantic Web
<item> blog </item>
EARLY WEB MARKUP
SEMANTIC WEB MARKUP<item rdf:about=“http:// example.org/semantic-web”> Semantic Web </item>
Diagram is maintained by Richard Cyganiak (Insight Centre for Data Analytics at NUI Galway) and Anja Jentzsch (HPI). < http://lod-cloud.net/>
INFERENCE:“act or process of deriving logical conclusions from premises known or assumed to be true”
software
softwaredocument
document
document
person
place
generates dependsOn
creator
locatedIn
hrefhref
IsVersionOf
Ontologies
Inferencing
AlgorithmsOntologies
Biases & EthicsPoliticsSurveillance
Data MiningNLP Machine Learning
WEB SCIENCE METHOD
Issues
Idea
Social Technical
Micro Macro
Values creativity
design
complexity
analyze
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(10)
BIASES & ETHICSWhat are the variations in how we collect, evaluate, interpret, and share those data/evidence?
Kate Crawford, Principle Researcher with Microsoft Research