potential for crowdsourced data for population

21
Potential for Crowdsourced Data for Crowdsourced Data for Population Distribution Models and Databases Presented at UN-SPIDER Expert Meeting on Crowdsourcing for Disaster Management and Emergency Response and Emergency Response Budhendra Bhaduri Budhendra Bhaduri Corporate Research Fellow December 05, 2012 Vienna, Austria www.ornl.gov/gist

Upload: others

Post on 10-Jan-2022

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Potential for Crowdsourced Data for Population

Potential for Crowdsourced Data for Crowdsourced Data for Population Distribution Models and Databases

Presented atUN-SPIDER Expert Meeting on Crowdsourcing for Disaster Management and Emergency Responseand Emergency Response

Budhendra BhaduriBudhendra BhaduriCorporate Research Fellow

December 05, 2012Vienna, Austria

www.ornl.gov/gist

Page 2: Potential for Crowdsourced Data for Population

Crowdsourcing: Points to ponder

Crowdsourced information clearly augment space-based data– Increase density and resolution of data (Gap filling) e.g. NetQuakesIncrease density and resolution of data (Gap filling) e.g. NetQuakes– Enhance currency and quality of observation and model data

(incidence report, damage qualification, and local knowledge)– The media makes great use of it (CNN iReport Weather Channel)– The media makes great use of it (CNN iReport, Weather Channel)

Traditional top-down spatial data quality standard doesn’t work– When’s good is good enough (user defined and fit for purpose)

When does crowdsourcing make the system vulnerable?– Reliability of the crowd and crowd fatigue (are there disaster

magnitude and frequency thresholds similar to relief funds)– Digital divide, victim crowd, and system overuse

Managed by UT-Battellefor the Department of Energy

g , , y– Social, legal, and ethical concerns

Page 3: Potential for Crowdsourced Data for Population

LandScan Population Distribution and Dynamics Model and Database

CensusLandScan Global

Census

Gridded

DayNight

LandScan USA

As the finest population distribution data available for the ld d th US L dS Gl b l d L dS USA

Managed by UT-Battellefor the Department of Energy

world and the US, LandScan Global and LandScan USA are the community standard for estimating population at risk

Page 4: Potential for Crowdsourced Data for Population

Managed by UT-Battellefor the Department of Energy

Page 5: Potential for Crowdsourced Data for Population

Managed by UT-Battellefor the Department of Energynightday

Page 6: Potential for Crowdsourced Data for Population

Disasters make population data obsolete

f Loss and dispersion of population

Capturing population redistribution is critical at many time scalesis critical at many time scales– Earthquake aftershocks (minutes-

days)– Hurricanes (weeks to months)– Sea level rise (years to decades)

Space based observation only Space based observation only interprets land cover– Flood damage is often difficult to

d t t f t tdetect for structures

Is crowdsourcing a strategy?– Active (including self disclosure)

Managed by UT-Battellefor the Department of Energy

– Active (including self disclosure)– Passive (social media)

Page 7: Potential for Crowdsourced Data for Population

Spatial refinement of LandScan Global

Managed by UT-Battellefor the Department of Energy

Page 8: Potential for Crowdsourced Data for Population

Addis Ababa, Ethiopia

2 Xeon Quad core 2.4GHz CPUs + 4 Tesla GPUs +

2 Xeon Quad core 2.4GHz CPUs + 4 Tesla GPUs +CPUs + 4 Tesla GPUs + 48GB

Image analyzed (0.3m) 800 sq. Km

RGB b d

CPUs + 4 Tesla GPUs + 48GB

Image analyzed (0.3m) 800 sq. Km

RGB b d RGB bands

Overall accuracy 93% Settlement class 89% Non-settlement class

RGB bands

Overall accuracy 93% Settlement class 89% Non-settlement class

Managed by UT-Battellefor the Department of Energy

Non settlement class 94%

Total processing time 27 seconds

Non settlement class 94%

Total processing time 27 seconds

Page 9: Potential for Crowdsourced Data for Population

Managed by UT-Battellefor the Department of Energy

Page 10: Potential for Crowdsourced Data for Population

Managed by UT-Battellefor the Department of Energy

Page 11: Potential for Crowdsourced Data for Population

Syria: LandScan 2011

Managed by UT-Battellefor the Department of Energy

Page 12: Potential for Crowdsourced Data for Population

Syria: LandScan Aug.31, 2012

Managed by UT-Battellefor the Department of Energy

Page 13: Potential for Crowdsourced Data for Population

Population Distribution Changes (net)

Syrian Refugees and IDP’s*Green to Blue – LossesGreen to Blue LossesYellow to Red – GainsIDP movement - modeled

Managed by UT-Battellefor the Department of Energy

Page 14: Potential for Crowdsourced Data for Population

Assessing Population Dynamics

Dynamic tracking of people and vehicle fleet movement from streaming multisensor data– Video, cell phones, social media

Sociocultural input– Accounting for refugees through

remote sensing is often challenging

Migration can be captured but the challenge is circular migration (resettlement)migration (resettlement)– Information flow and media

coverage significantly drops with time

Managed by UT-Battellefor the Department of Energy

time – Where do they come back from?

Page 15: Potential for Crowdsourced Data for Population

Managed by UT-Battellefor the Department of Energy

Page 16: Potential for Crowdsourced Data for Population

Real Time Rome: MIT Senseable City Lab

“……Ratti's team obtains its data anonymously from ll h GPS d i b d t i

“……Ratti's team obtains its data anonymously from ll h GPS d i b d t icell phones, GPS devices on buses and taxis,

and other wireless mobile devices, using advanced algorithms developed by Telecom Italia, the principal sponsor of the project. These algorithms are able to discern the difference

cell phones, GPS devices on buses and taxis, and other wireless mobile devices, using advanced algorithms developed by Telecom Italia, the principal sponsor of the project. These algorithms are able to discern the difference gbetween, say, a mobile phone signal from a user who is stuck in traffic and one that is sitting in the pocket of a pedestrian wandering down the street. Data are made anonymous and aggregated from the beginning, so there are no implications for

gbetween, say, a mobile phone signal from a user who is stuck in traffic and one that is sitting in the pocket of a pedestrian wandering down the street. Data are made anonymous and aggregated from the beginning, so there are no implications for

Managed by UT-Battellefor the Department of Energy

the beginning, so there are no implications for individual privacy.”the beginning, so there are no implications for individual privacy.”

http://radar.oreilly.com/2007/07/real-time-rome-using-cellphone.html

Page 17: Potential for Crowdsourced Data for Population

Social networking and self disclosure

"Latitude" is being marketed as a tool that could help parents keep tabs on their children's locations, b t it b d f t fi d l but it can be used for anyone to find anyone else, assuming permission is given.”

“…allow you to share that location with friends and f il b d lik i b bl t family members, and likewise be able to see friends and family members' locations"

"To protect privacy, Google specifically requires people to sign up for the service. People can share their precise location, the city they're in, or nothing at all."

Managed by UT-Battellefor the Department of Energy http://www.cbsnews.com/stories/2009/02/04/earlyshow/leisure/gamesgadgetsgizmos/main4774320.shtml

Page 18: Potential for Crowdsourced Data for Population

Social networking and self disclosure

“Foursquare just snagged its six millionth member…”

“Foursquare, the social network that allows members to communicate with acquaintances by “checking in” to q y glocations they patronize, is breaking with its own traditions by allowing users to “check in” to the Super Bowl even if they’re not attending the game in person ”they re not attending the game in person.

Managed by UT-Battellefor the Department of Energy

http://blogs.wsj.com/digits/2011/02/06/foursquare-changes-rules-for-super-bowl-tie-in/

Page 19: Potential for Crowdsourced Data for Population

Challenges and Opportunities

• Recruiting the crowd could benefit from high-profile volunteer catalysts Th d t b f t t itiSocialSocial • The crowd may not be aware of engagement opportunities

• Success may be locally variable because of cultural differencesSocialSocial

• Expectation of privacy is a variable standard• Legal standards are not clearly defined and understood• Self disclosure could be an effective way to address privacy

LegalLegal• Self disclosure could be an effective way to address privacy

D hi i l d i i i l (i i i l• Does this involve deceptive principles (instrumenting national parks, GPS and battery life)?

• Should we promote the crowd as only volunteers?• Self disclosure may come with expectations of service guarantee

EthicalEthical

Managed by UT-Battellefor the Department of Energy

y p g

Page 20: Potential for Crowdsourced Data for Population

We must have a strategy for data curation

Managed by UT-Battellefor the Department of Energy

Page 21: Potential for Crowdsourced Data for Population

Additional Resources

Role of Volunteer Geographic Information in Advancing Science: Quality and CredibilityScience: Quality and Credibility– GIScience 2010 and 2012 workshops– http://www ornl gov/sci/gist/workshops/2012/index shtml– http://www.ornl.gov/sci/gist/workshops/2012/index.shtml– Springer eBook forthcoming in 2013

Hurricane Sandy image courtesy: Eric Young (Penn State) Hurricane Sandy image courtesy: Eric Young (Penn State)

Contact: [email protected]

Managed by UT-Battellefor the Department of Energy