why is digital curation important for workforce and economic development? alan blatecky
DESCRIPTION
Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in the Era of Big Data; Career Opportunities and Education Requirements NRC July 19, 2012. 1. - PowerPoint PPT PresentationTRANSCRIPT
1
Why is Digital Curation Important for Workforce and Economic Development?
Alan BlateckyOffice of Cyberinfrastructure
Symposium on Digital Curation in the Era of Big Data;Career Opportunities and Education Requirements
NRCJuly 19, 2012
1
Framing the Challenge:Science and Society Transformed by
Data Modern science
Data- and compute-intensive
Integrative, multiscale Multi-disciplinary
Collaborations for Complexity Individuals, groups,
teams, communities Sea of Data
Age of Observation Distributed, central
repositories, sensor- driven, diverse, etc 2
Data as a transforming agent
Enormous amounts of data are being generated by modern experiments, sensors, observations and social networks
New analysis tools including automatic extraction of new knowledge continues to accelerate
Infusion of data-intensive computation into science, engineering and education is revolutionizing research
Research in science and education is an essential pathway to prosperity and competitiveness and thrives in an environment of shared data
Scientific Data Challenges
4
Byt
es p
er d
ay
2012 2020
Genomics
LHC
TeraGrid, BlueWaters
SquareKilometer
Array
Genomics
LHC
Climate, Environment
LSST
ExaBytes
PetaBytes
TeraBytes
GigaBytes
Climate, Environment
Volume/Growth
Useful
Lifetime
Distribution
Data Access
Many smaller datasets…
NSF Data strategy
Establish a national data infrastructure to support science, engineering and education
Ensure that this infrastructure stays at the most advanced state of sophistication and is sustainable
Expand the development of the next generation of compute and data intensive workforce
Develop of a suite of policies to support the full data life cycle (data access, curation, object identifiers, etc)
5
Data Web Forum
Linking top-down governance model with bottom-up IETF model to catalyze this community-based activity Top-down focus on policy, permission . . . Bottom-up focus on operations, services . . .
Timeliness an important factor Ability to respond quickly essential
The DWF will facilitate the exchange and interoperabilityof data across disciplines and national boundaries by producing high quality, relevant technical documents thatinfluence the way people store, use, and manage data
Balanced representation of stakeholder communities
Community-based; not a government organization, a regulatory body or a commercial organization
Products are free and open source Meetings are public, progress through
consensus and practice Focus on harmonization across standards,
policies, technologies, tools, and other data infrastructure elements
DWF Principles
Proposed Timeline: 2012
Initial Government Agency funding in process Awards to Non Government Organizations in
Aug/Sep Teams flesh out organization and structure –
Sep/Dec Initial working groups identified and charged –
Sep/Dec groups already sharing data across global boundaries Identify candidates for early deliverables, best practices Secure time-commitments to undertake proposed
activities