Research Data Alliance Data Sharing Dr. Kathleen Fontaine Managing Director, RDA/US GEO Data Sharing Working Group May 2015 Geneva, Switzerland

Research Data Alliance

Data Sharing

Dr. Kathleen Fontaine

Managing Director, RDA/US

GEO Data Sharing Working Group

May 2015

Geneva, Switzerland

Topics

▪ Why We Are

▪ Who We Are

▪ Examples of Data Sharing

▪ Data Challenge

RDA/US Work Funded by

Many Infrastructure Building Blocks Needed to Accelerate Progress

Institutional Data Sharing Practice

Data Access and Distribution Policy

Data Discovery Tools

Common Metadata Standards

Digital Object Identifiers

Data Citation Standards

Data Analytics Algorithms

Data Preservation Practice

Data Scientists and Expert Support

Sustainable Economic Models

Curation Practice and Policy

Auditing, Certification and Reporting Practice

Research Data Alliance

▪ Created to accelerate the development of research data sharing infrastructure worldwide through

▪ building social, organizational and technical infrastructure based on need

▪ Discipline, technology, and culture agnostic

RDA Governance

Features Openness, Transparency, and Consensus

▪ through high-level governance bodies

▪ through community-based groups

▪ through individual and organizational contributions

▪ through emphasis on real outputs that solve real barriers to research data sharing for real users

RDA Organizational Structure

RDA Council: Responsible for overarching mission, vision, impact of RDA

Technical Advisory Board: Responsible for technical roadmap and interactions

Secretary-General and Secretariat: Responsible for administration and operations

Organizational Advisory Board and Organizational Assembly: Responsible for organizational and strategic advice

Working Groups: Responsible for impactful, outcome-oriented efforts

Interest Groups: Responsible for defining and refining common issues

RDA Membership

RDA Colloquium (Research Funders)

Operational and community sponsorship

* heavy user/community involvement

Create -> Adopt -> Use

▪ Birds-of-a-Feather – groups meeting at Plenaries to gauge broader interest in a topic

▪ Interest Groups – longer-lived discussion forums that spawn Working Groups as specific pieces of needed infrastructure are identified.

▪ Working Groups – 12-18 month efforts to build, adopt, and use specific pieces of infrastructure

▪ Code, policy, infrastructure, standards, or best practices that are adopted and used by communities to enable data sharing

▪ “Harvestable” efforts for which 12-18 months of work can eliminate a roadblock

▪ Efforts that have substantive applicability to groups within the data community, but may not apply to everyone

▪ Efforts for which working scientists and researchers can start today

RDA Working Groups

1. Brokering Governance

2. Data Citation **

3. Data Description Registry Interoperability **

4. Data Foundation and Terminology †

5. Data Type Registries †

6. Metadata Standards Directory **

7. PID Information Types †

8. Practical Policy †

9. RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World

10. RDA/WDS Publishing Data Bibliometrics

11. RDA/WDS Publishing Data Services

12. RDA/WDS Publishing Data Workflows

13. Repository Audit and Certification DSA–WDS Partnership

14. The BioSharing Registry: connecting data policies, standards & databases in life sciences *

15. Wheat Data Interoperability WG **

† in Output review

** in final phases

* in review

RDA Interest Groups

1. Agricultural Data Interoperability

2. Active Data Management Plans *

3. Big Data *

4. Biodiversity Data Integration

5. Brokering

6. Community Capability Model

7. Data Fabric

8. Data for Development

9. Data Foundations and Terminology *

10. Data in Context

11. Data Rescue *

12. Defining Urban Data Exchange for Science IG

13. Development of cloud computing capacity and education in developing world research

14. Digital Practices in History and Ethnography

15. Domain Repositories Interest Group

16. Education and Training on handling of research data

17. ELIXIR Bridging Force

18. Engagement

19. Ethics and Social Aspects of Data *

20. Federated Identity Management

21. Geospatial *

22. Libraries for Research Data

23. Long tail of research data

24. Marine Data Harmonization

25. Metabolomics

26. Metadata

27. National Data Services *

28. PID

29. Preservation e-Infrastructure

30. Quality of Urban Life

31. RDA/CODATA Legal Interoperability

32. RDA/CODATA Materials Data, Infrastructure & Interoperability

33. RDA/WDS Certification of Digital Repositories

34. RDA/WDS Publishing Data Cost Recovery for Data Centres

35. RDA/WDS Publishing Data

36. Repository Platforms for Research *

37. Reproducibility

38. Research data needs of the Photon and Neutron Science community

39. Research Data Provenance

40. Service Management

41. Structural Biology

42. Toxicogenomics Interoperability

43. Vocabulary Services *

* in review

Sharing Data - Through Output Solutions

Data Type Registries

Deliverables: System of data type registries, formal model for describing types, working model of a registry.

Initial Adopters and Users: CNRI, International DOI Foundation, Deep Carbon Observatory

Practical Policy

Deliverables: Survey of policies in production use, test bed of machine actionable policies, deployment of 5 policy sets, policy starter kits

Initial Adopters and Users: RENCI, DataNet Federation Consortium, CESNET, Odum Institute, EUDAT

Persistent Identifier Information Types

Deliverables: Minimal set of PID types, API

Initial Adopters and Users: Data Conservancy, DKRZ

Data Foundations and Terminology

Deliverables: Common vocabulary for data terms, formal definitions and open registry for data terms

Initial Adopters and Users: EUDAT, DKRZ, Deep Carbon Observatory, CLARIN, EPOS

Sharing Data - Through First Generation Adoptions

Materials Genome Initiative, Laura Bartolo, Kent State University

Our RDA Adoption Demonstration Project focuses on the Data Type Registry and PID, with guidance and feedback from the National Institute of Standards and Technology, a lead Federal agency developing the key models, tools, standards, and data for the MGI.

Platform for Experimental Collaborative Ethnography, Luis Felipe Rosado Murillo, Rensselaer Polytechnic Institute

In this presentation, we will describe the design and the practical guidelines for the Platform for Experimental, Collaborative Ethnography (PECE), and our efforts to implement comprehensive data management policies based on recommendations from RDA's Working Group on Practical Policies.

Early outcomes: Implementation of RDA DFT Recommendations for DataFed.net, Aaron Addison, Washington University St. Louis

The DataFed.net data catalog lists numerous datasets related to atmospheric and air-quality data over time. The metadata surrounding this data catalog is being examined in the context of the recently released RDA outcome published by the Data Foundation and Terminology (DFT) working group. This real-world examination of an RDA outcome is working to adopt a common terminology for both the community of practice and machine-driven applications.

Deep Carbon Observatory, Stephan Zednik, Rensselaer Polytechnic Institute

The Deep Carbon Observatory (DCO) community is building a cyber-enabled platform for linked science, made available to the community by a multi-institutional data portal. Persistent identifiers and domain specific data types have been identified as key technological issues the portal must address. This presentation focuses on the DCO portal’s planned adoption of RDA DTR and PID methodologies and technologies as a means to address the DCO community's need for persistently identifiable and understandable data type information.

Sharing Data - Through Joint Efforts - A Sample

▪ Although GEO and RDA are not formally organizational partners just yet, several GEO-related data sharing efforts are underway within RDA:

• Brokering Governance Working Group

• RDA/CODATA Legal Interoperability Working Group

• RDA/CODATA Materials Data, Infrastructure & Interoperability Interest Group

• RDA/WDS Certification of Digital Repositories Interest Group

• RDA/WDS Publishing Data Cost Recovery for Data Centres Interest Group

• RDA/WDS Publishing Data Interest Group

• RDA/WDS Publishing Data Services Working Group

• And many, many more

Brokering Governance Working Group

The goal is to address the governance of the brokering framework middleware and interconnect existing international e-infrastructures. The Working Group will address the following:

1. Brokering configuration and strategies;

2. Brokering governance and agreements;

3. Publications and transparency;

4. Community adoption and sustainability.

Effective middleware governance has the potential to support longer-term development under a variety of funding models, to simplify and standardize access models, and to establish a basis for the continued value of brokered systems. To ensure sustainable, stable development and effectiveness in an operational environment of brokering systems reliant on middleware service architectures, an effective model for the governance and reuse of that middleware must be agreed upon.

We propose to consider and recommend a set of best practices for governing and managing brokering middleware. These practices will work to ensure future interoperability of, access to, and use of brokering middleware, independent of or in light of various development and funding models, to support long-term planning of brokered, integrated systems. These will be of value not only to interoperability architects and to developers (who can plan integrated systems assuming the continued use and support of brokering middleware) but also to system managers and end users.

Work packages

▪ Business Models

▪ (Brokering) Service Agreements

▪ Use-cases

Use Cases

▪ ICSU/WDS (leader: Michael Diepenbroek)

▪ Danube Data Infrastructure (Max Craglia)

▪ GEO-BON (Wim Hugo)

▪ SAION (Bente Lilja Bye)

RDA/CODATA Legal Interoperability

The proposed Research Data Alliance – CODATA Working Group on Legal Interoperability of Research Data (RDA-CODATA WG) will be established to address and promote the following objectives:

▪ Define legal interoperability of research data and articulate why it is important for data interoperability and reuse.

▪ Document and analyze up to four case studies in the areas of geoscience, biodiversity research, social sciences, and humanities of legal interoperability solutions in interdisciplinary and international contexts.

▪ Develop and publish core principles and guidelines of best practices through which legal interoperability can be achieved, and link to related information resources online.

▪ Work with key stakeholder groups to get the core principles and guidelines of best practices adopted.

▪ Generally promote better understanding and greater use by the stakeholder groups in the research community of the agreed approaches to legal interoperability of research data, focused on highlighting and enabling better integration and reuse of such data.

RDA/WDS Certification of Digital Repositories

In order to guarantee data sharing, the long-term preservation of these data in sustainable digital repositories is a sine qua non. Data that are created and used by science and scholarship need to be managed, curated and archived, making sure that the substantial investments in preparing and presenting the content and tools will not be lost. Researchers need to be sure that the resources the repositories offer remain meaningful and usable over time. Moreover, the repositories themselves need to have sustainable business models.

Preservation and sustainability raise challenges in many areas. The main issues related to long term preservation and sustainability remain basically unresolved, as many organizational, technical, financial and legal aspects remain open. Certification is therefore fundamental in guaranteeing the trustworthiness of digital repositories and thus in sustaining the opportunities for long-term data sharing. The Interest Group will build on previous work in the area of certification. It will deliver the global overview and the necessary recommendations and requirements that allow the effective implementation of certification of digital repositories on a national, European and even global level.

Sharing Data - Through Challenges

▪ The 6th Plenary RDA Meeting, to be hosted in Paris from 23-25 September 2015, will feature a special focus on research data for climate change, leveraging the UN Climate Change Conference (COP21) to be held in Paris in December 2015.

▪ As a part of this special focus, Cap Digital & RDA have created a special Challenge designed to connect climate change related data sets with startups, SMEs and larger organizations that have practical applications for these data.

▪ In this phase we are interested in receiving applications from holders of datasets on:

▪ Air quality;

▪ Energy;

▪ Urban Activity.

▪ In addition to a description of the datasets, we would also like to receive indications on the possible applications that might constitute a challenge goal. Submissions must be made on-line and are accepted until midnight CET 21st May 2015. For any queries please contact [email protected]

▪ This challenge intends to demonstrate how the work of RDA is key to solving numerous climate change related issues through enterprise engagement. The course of this challenge will be as follows:

▪ Selection of relevant data sets for inclusion in the challenge (21st May)

▪ Open challenge to enterprises to propose practical application pilots (22nd May – 22nd June)

▪ Challenge announcement and publication on a dedicated web platform during Futur en Seine, the international digital festival in Paris in June.

▪ Identification of 3 application pilots to be taken forward (30th June)

▪ Presentation of results & announcement of the challenge winner during the 6th Plenary Climate Change Data Challenge Day (24th Sept)

▪ Demo of the winner’s solution and data set during the UN Climate Change Conference (COP21) in December (to be confirmed).

Upcoming RDA Meetings

▪ Plenaries every six months, in September and March

• Sept 2015 (P6) - Paris

• March 2016 (P7) - Japan

• Sept 2016 (P8) - US (DC area), International Data Week hosted by RDA/US, CODATA US and ICSU; joint with SciDataCon 2016

▪ Check the RDA website for global meetings, and the RDA/Europe and the [soon to be available] RDA/US sites for regional meetings

Thank You and Questions

[email protected]

[email protected]

rd-alliance.org
