![Page 1: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/1.jpg)
Kalev Leetaru, Eric Shook, and Shaowen Wang
CyberInfrastructure and Geospatial Information Laboratory (CIGI)
Department of Geography and Geographic Information Science
School of Earth, Society, and Environment
National Center for Supercomputing Applications (NCSA)
University of Illinois at Urbana-Champaign
CyberGIS ’12, Urbana IL, August 8, 2012
A CyberGIS Approach to Digital Humanities and Social Sciences: The World of Textual Geography and a Case Study of Wikipedia’s History of the World
![Page 2: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/2.jpg)
![Page 3: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/3.jpg)
![Page 4: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/4.jpg)
![Page 5: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/5.jpg)
![Page 6: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/6.jpg)
![Page 7: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/7.jpg)
![Page 8: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/8.jpg)
![Page 9: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/9.jpg)
![Page 10: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/10.jpg)
![Page 11: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/11.jpg)
![Page 12: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/12.jpg)
![Page 13: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/13.jpg)
![Page 14: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/14.jpg)
http://www.sgi.com/go/wikipedia
![Page 15: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/15.jpg)
![Page 16: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/16.jpg)
![Page 17: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/17.jpg)
![Page 18: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/18.jpg)
![Page 19: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/19.jpg)
![Page 20: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/20.jpg)
Workflow
CyberGIS
Sentiment Mining
Fulltext Geocoding
![Page 21: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/21.jpg)
Inside the CyberGIS “black box”
[Diagram labels: GISolve Middleware; Security; Domain Decomposition; Data & Viz; Resource Selection; Task Scheduling; Workflow Management Services; Open Service API; CI; XSEDE; OSG; Clouds; Emotional Heatmap]
![Page 22: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/22.jpg)
Data Input for a Topic
A set of locations with 3 attributes
Latitude, longitude point location
1. Number of articles mentioning this location
2. Number of articles mentioning both this location and topic
3. Average tone of articles mentioning both this location and topic
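As a minimal sketch of how one such input record might be held in code, the Python dataclass below stores the point location and its three attributes. The class name, field names, and sample values are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LocationRecord:
    """One geocoded location for a single topic (all names are hypothetical)."""
    lat: float             # latitude in decimal degrees
    lon: float             # longitude in decimal degrees
    n_articles: int        # 1. articles mentioning this location
    n_topic_articles: int  # 2. articles mentioning both this location and the topic
    avg_tone: float        # 3. average tone of the articles counted in attribute 2

    def __post_init__(self):
        # Basic sanity checks on the coordinates and the two counts.
        if not (-90.0 <= self.lat <= 90.0 and -180.0 <= self.lon <= 180.0):
            raise ValueError("coordinates out of range")
        if not (0 <= self.n_topic_articles <= self.n_articles):
            raise ValueError("topic articles must be a subset of all articles")


# Made-up values for illustration only, not data from the study.
records = [
    LocationRecord(lat=40.11, lon=-88.21, n_articles=120, n_topic_articles=30, avg_tone=-2.5),
    LocationRecord(lat=51.51, lon=-0.13, n_articles=900, n_topic_articles=45, avg_tone=1.0),
]
```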
![Page 23: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/23.jpg)
Data Input for a Topic
?
![Page 24: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/24.jpg)
Spatializing Emotion
3 important elements
1. Importance of location
2. Prevalence of topic
3. Emotion toward topic
Goal: Capture 3 elements on a single map
![Page 25: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/25.jpg)
1) Importance of Location
Every mention of a location increases its importance
Generate a density map of the number of times a location is mentioned in text using Kernel Density Estimation (KDE) based on k nearest neighbor search
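The slide does not say how the k nearest neighbor search enters the KDE. The sketch below takes one plausible reading, which is an assumption: each location gets a Gaussian kernel whose bandwidth is the distance to its k-th nearest neighboring location, weighted by its mention count. The function name, the Gaussian kernel, and the use of planar coordinates are also assumptions.

```python
import numpy as np


def knn_adaptive_kde(points, weights, grid_xy, k=3):
    """Weighted Gaussian KDE with a per-point bandwidth equal to the
    distance from that point to its k-th nearest neighbor (assumed scheme)."""
    points = np.asarray(points, dtype=float)    # shape (n, 2), planar coordinates
    weights = np.asarray(weights, dtype=float)  # shape (n,), mention counts
    grid_xy = np.asarray(grid_xy, dtype=float)  # shape (m, 2), evaluation locations
    if k >= len(points):
        raise ValueError("k must be smaller than the number of points")

    # Pairwise distances between data points. After sorting each row, column 0
    # is the zero distance of a point to itself, so column k is the k-th neighbor.
    d_pp = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    h = np.maximum(np.sort(d_pp, axis=1)[:, k], 1e-9)  # guard coincident points

    # Distances from every grid cell to every data point, shape (m, n).
    d_gp = np.linalg.norm(grid_xy[:, None, :] - points[None, :, :], axis=2)
    kernel = np.exp(-0.5 * (d_gp / h) ** 2) / (2.0 * np.pi * h ** 2)
    return (kernel * weights).sum(axis=1) / weights.sum()


# Made-up locations and mention counts for illustration only.
pts = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0]])
mentions = np.array([10, 3, 2, 1])
xs, ys = np.meshgrid(np.linspace(-1, 6, 8), np.linspace(-1, 6, 8))
grid = np.column_stack([xs.ravel(), ys.ravel()])
density = knn_adaptive_kde(pts, mentions, grid, k=2).reshape(xs.shape)
```

With this choice, isolated locations get wide, flat kernels and clustered locations get narrow, tall ones, so the surface adapts to how densely locations are sampled.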
![Page 26: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/26.jpg)
1) Importance of Location
![Page 27: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/27.jpg)
2) Prevalence of Topic
We use the term topic intensity for the prevalence of a topic relative to other topics, and adopt a method commonly used in epidemiological studies to estimate it
Relative risk is the ratio of the KDE of disease infection locations to the KDE of control locations (points without disease)
![Page 28: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/28.jpg)
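Topic Intensity
$$\text{Topic Intensity} = \frac{\mathrm{KDE}(\text{articles that mention a topic})}{\mathrm{KDE}(\text{articles that do not mention the topic})}$$
$$\text{Relative Risk} = \frac{\mathrm{KDE}(\text{points with disease})}{\mathrm{KDE}(\text{points without disease})}$$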
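The ratio can be sketched as two weighted density surfaces divided cell by cell. This is an illustrative sketch, not the authors' implementation: the fixed-bandwidth Gaussian kernel, the function names, the small `eps` guard against division by zero, and the choice to leave both surfaces unnormalized are all assumptions. The count of articles that do not mention the topic is taken as the total count minus the topic count at each location.

```python
import numpy as np


def gaussian_kde_grid(points, weights, grid_xy, bandwidth):
    """Fixed-bandwidth weighted Gaussian kernel sum evaluated at grid_xy."""
    points = np.asarray(points, dtype=float)
    weights = np.asarray(weights, dtype=float)
    grid_xy = np.asarray(grid_xy, dtype=float)
    d = np.linalg.norm(grid_xy[:, None, :] - points[None, :, :], axis=2)
    kernel = np.exp(-0.5 * (d / bandwidth) ** 2) / (2.0 * np.pi * bandwidth ** 2)
    return kernel @ weights


def topic_intensity(points, n_articles, n_topic_articles, grid_xy,
                    bandwidth=1.0, eps=1e-12):
    """KDE(articles with topic) / KDE(articles without topic) on a grid."""
    n_articles = np.asarray(n_articles, dtype=float)
    n_topic = np.asarray(n_topic_articles, dtype=float)
    with_topic = gaussian_kde_grid(points, n_topic, grid_xy, bandwidth)
    without_topic = gaussian_kde_grid(points, n_articles - n_topic, grid_xy, bandwidth)
    return with_topic / (without_topic + eps)  # eps avoids division by zero


# Made-up counts: half of the articles at the first location mention the
# topic, but only one in ten at the second location does.
pts = np.array([[0.0, 0.0], [10.0, 0.0]])
intensity = topic_intensity(pts, n_articles=[100, 100], n_topic_articles=[50, 10],
                            grid_xy=pts, bandwidth=1.0)
```

Near the first location the ratio is close to 50/50, and near the second it is close to 10/90, so the surface highlights where the topic is over-represented relative to everything else written about a place.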
![Page 29: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/29.jpg)
Topic Intensity
![Page 30: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/30.jpg)
3) Emotion Toward a Topic
Challenging question: Is the emotional measure, tone, discrete or continuous?
– Is tone "countable" like trees or does it exist as a continuum like air temperature?
Tone is a continuum:
– Cannot have "number of tones"
![Page 31: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/31.jpg)
3) Emotion Toward a Topic
A different method is used because tone is continuous, not discrete
Inverse distance weighted (IDW) interpolation is used to estimate tone across space, creating a tone map
The tone map captures positive and negative tone toward a particular topic across space
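For readers unfamiliar with IDW, the following is a minimal sketch of the standard technique: each grid cell takes a weighted average of the sampled tone values, with weights proportional to 1 / distance^power. The function name, the default power of 2, and the planar distance are assumptions; the slides do not state the parameters that were used.

```python
import numpy as np


def idw_interpolate(points, values, grid_xy, power=2.0):
    """Inverse distance weighted estimate of `values` at each grid location."""
    points = np.asarray(points, dtype=float)    # shape (n, 2)
    values = np.asarray(values, dtype=float)    # shape (n,), e.g. average tone
    grid_xy = np.asarray(grid_xy, dtype=float)  # shape (m, 2)

    d = np.linalg.norm(grid_xy[:, None, :] - points[None, :, :], axis=2)
    exact = d < 1e-12                            # grid cell sits on a sample
    w = 1.0 / np.maximum(d, 1e-12) ** power
    est = (w @ values) / w.sum(axis=1)

    # Where a grid cell coincides with a sample, return that sample's value.
    hit_rows = exact.any(axis=1)
    est[hit_rows] = values[exact[hit_rows].argmax(axis=1)]
    return est


# Made-up tone samples: negative at x = 0, positive at x = 2.
samples = np.array([[0.0, 0.0], [2.0, 0.0]])
tones = np.array([-4.0, 4.0])
targets = np.array([[0.0, 0.0], [0.5, 0.0], [1.0, 0.0], [2.0, 0.0]])
tone_map = idw_interpolate(samples, tones, targets)
```

Unlike a density estimate, the result stays within the range of the observed tone values and does not grow with the number of samples, which matches the treatment of tone as a continuous field.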
![Page 32: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/32.jpg)
3) Emotion Toward a Topic
![Page 33: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/33.jpg)
Overview – 3 layers
1) Article density - Proxy: Importance of location
2) Topic intensity - Proxy: Prevalence of topic relative to other topics
3) Tone - Proxy: Emotion toward a topic
![Page 34: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/34.jpg)
Overview – 3 layers
1) Article density - Proxy: Importance of location (value range: 0 to 1)
2) Topic intensity - Proxy: Prevalence of topic relative to other topics (value range: 0 to 100)
3) Tone - Proxy: Emotion toward a topic (value range: -100 to 100)
The first two layers represent scaling factors for tone
![Page 35: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/35.jpg)
Emotional Heatmap
$$\text{Emotional Heatmap} = \text{Article Density} \times \text{Topic Intensity} \times \text{Tone}$$
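A minimal sketch of this combination step, assuming the three layers have already been computed on the same grid: the product is taken cell by cell, so the sign comes from tone and the magnitude is scaled by the other two layers. The function name and sample grids are hypothetical.

```python
import numpy as np


def emotional_heatmap(article_density, topic_intensity, tone):
    """Cell-by-cell product of the three layers on a shared grid."""
    density = np.asarray(article_density, dtype=float)
    intensity = np.asarray(topic_intensity, dtype=float)
    tone = np.asarray(tone, dtype=float)
    if not (density.shape == intensity.shape == tone.shape):
        raise ValueError("all three layers must share one grid shape")
    return density * intensity * tone


# Made-up 2 x 2 layers for illustration only.
density = np.array([[1.0, 0.5], [0.0, 0.2]])
intensity = np.array([[2.0, 1.0], [5.0, 10.0]])
tone = np.array([[-10.0, 4.0], [50.0, -1.0]])
heatmap = emotional_heatmap(density, intensity, tone)
```

A cell with zero article density contributes nothing to the heatmap regardless of how strong its interpolated tone is, which is the sense in which the first two layers act as scaling factors.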
![Page 36: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/36.jpg)
Emotional Heatmap of Armed Conflict in 2003 (Wikipedia)
![Page 37: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/37.jpg)
Summary
First steps, but started the dialogue
Balance
– Managing the complexity of cyberinfrastructure access
– Simplifying the workflow for chaining spatial analytics
– Making sense of what’s involved
Scientific rigor
![Page 38: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/38.jpg)
Ongoing Work
Translate spatial knowledge to domain knowledge by answering a basic question: why is this here and not there?
Tackle spatial aggregation issues
– Represent locations as areas, not points
– Areal interpolation
![Page 39: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/39.jpg)
Acknowledgments
Guofeng Cao, Anand Padmanabhan
National Science Foundation
– BCS-0846655
– OCI-1047916
– Open Science Grid
– XSEDE SES070004N
![Page 40: Kalev Leetaru, Eric Shook, and Shaowen Wang](https://reader035.vdocuments.mx/reader035/viewer/2022062323/568160d8550346895dd00a4c/html5/thumbnails/40.jpg)
Thanks!