open research data: the future of science - forum rolex learning center (epfl campus) christopher...

32
Open Research Data: The Future of Science - Forum Rolex Learning Center (EPFL Campus) Christopher Brown 28/10/2014 Open Research Data – A Jisc Perspective

Upload: maud-taylor

Post on 18-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Open Research Data: The Future of Science - Forum Rolex Learning Center (EPFL Campus)

Christopher Brown

28/10/2014 Open Research Data – A Jisc Perspective

2Open Research Data – A Jisc perspective

MissionTo enable people in higher education, further education and skills in the UK to perform at the forefront of international practice by exploiting fully the possibilities of modern digital empowerment, content and connectivity

VisionTo make the UK the most digitally advanced education and research nation in the world23 Oct 2014 - The Higher Education Policy Institute (HEPI) today publishes a new report entitled Jisc: a hidden advantage for higher education http://jisc.ac.uk/news/new-hepi-report-highlights-the-benefits-of-shared-infrastructure-23-oct-2014

3

What Jisc delivers…Open Research Data – A Jisc perspective

Digital conten

t

Network &IT

services

Advice Research & developme

ntWe provide institutions, their students and researchers secure, cost-effective access to the UK’s richest collection of digital resources.

We provide the HE sector with Janet, the Jisc network, advanced infrastructure services and collaboration platforms.

We listen to our members to ensure we offer them quality support, guidance and tools - estimated to save the sector £122 million in cost efficiencies each year.

Identifying emerging technologies and developing them around our members’ needs. Testing and learning on their behalf to ensure they are ready to take advantage of new technologies as they arrive.

4

Jisc’s Strategic Framework Impact AreasOpen Research Data – A Jisc perspective

5

“Open Research Data”Open Research Data – A Jisc perspective

»EPSRC definition: "Research data is defined as recorded factual material commonly retained by and accepted in the scientific community as necessary to validate research findings; although the majority of such data is created in digital format, all research data is included irrespective of the format in which it is created."

»Creative Commons: “we believe scientific data should be freely available to everyone. We call this idea Open Data.”

Definitions

6

»Loss

»Destruction

»Pride

»Gluttony

»Ineptitude

»Concealment

»Bureaucracy

»Complexity

»Procrastination

»Lack of potential

Open Research Data – A Jisc perspective

» “People will ask questions”› So use a data centre or repository

» “It will be misinterpreted”› Stuff happens. Also, openness encourages

correction» “It’s not interesting”

› Let others be the judge – your noise is my signal» “I might get another paper out of it”

› Up to a point. We might get more research out of it» “I don’t have permission”

› A real problem. But solvable at senior level» “It’s too bad/complicated” –see above» “It’s not a priority”

› Unfortunately, funders are making it so. But if you looked at the evidence, it would be your priority as well

http://datapub.cdlib.org/2013/04/24/closed-data-excuses-excuses/

With thanks to Kevin Ashley (DCC) for responses

What stops research data being open?

8

» Final Report of the High Level Expert Group on Scientific Data - A submission to the European Commission – October 2010

» Calls for a collaborative data infrastructure that will enable researchers and other stakeholders from education, society and business to use, re-use and exploit research data to the maximum benefit of science and society.

» “To collect, curate, preserve and make available ever increasing amounts of scientific data, new types of infrastructures will be needed. The potential benefits are enormous but the same is true for the costs. We therefore need to lay the right foundations and the sooner we start the better.” - Neelie Kroes, VP European Commission

Unlocking the full value of Scientific DataOpen Research Data – A Jisc perspective

“Riding the wave – How Europe can gain from the rising tide of scientific data”

http://cordis.europa.eu/fp7/ict/e-infrastructure/docs/hlg-sdi-report.pdf

9

» The 5 Knowledge Exchange partner organisations:› CSC - IT centre for Science in Finland› Denmark’s Electronic Research Library (DEFF)› German Research Foundation (DFG)› Jisc in the United Kingdom› SURF in the Netherlands

» Builds on the report and presents an overview of the present situation with regard to research data in Denmark,  Germany, the Netherlands and the United Kingdom and offers broad outlines for a possible action programme for the four countries in realising the envisaged collaborative data infrastructure.

» Identified 4 key drivers:› Incentives for researchers to publish and share their

datasets› Training in new skills needed for creating, handling,

manipulating, analysing, and making available large amounts of data for re-use by others

› Infrastructure (to make datasets discoverable and accessible)

› Funding of the infrastructure

Knowledge Exchange responseOpen Research Data – A Jisc perspective

“A Surfboard for Riding the Wave”

http://www.knowledge-exchange.info/surfboard

10

»“The conduct and communication of science needs to adapt to this new era of information technology”

»“As a first step towards this intelligent openness, data that underpin a journal article should be made concurrently available in an accessible database. We are now on the brink of an achievable aim: for all science literature to be online, for all of the data to be online and for the two to be interoperable.”

Science as an Open Enterprise Report, 2012

Open Research Data – A Jisc perspective

The Royal Society, UK

http://royalsociety.org/policy/projects/science-public-enterprise/report/

11

» A shift away from a research culture where data is viewed as a private preserve

» Expanding the criteria used to evaluate research to give credit for useful data communication and novel ways of collaborating

» The development of common standards for communicating data» Mandating intelligent openness for data relevant to published

scientific papers» Strengthening the cohort of data scientists needed to manage and

support the use of digital data (which will also be crucial to the success of private sector data analysis and the government’s Open Data strategy)

» The development and use of new software tools to automate and simplify the creation and exploitation of datasets

Science as an Open Enterprise ReportOpen Research Data – A Jisc perspective

Six key challenges

12

» Undertaken on behalf of the Research Councils UK in partnership with Jisc, the Royal Society and Sciencewise-ERC, this public dialogue explored views on open data, data reuse and data management policies within research

» The public dialogue was designed to:› Provide insight on the business issues that

the dialogue will support, at the research councils and Jisc

› Build on prior work in the area and account for the wider policy framework

› Engage people meaningfully around this complex area, enabling the public to frame issues and test out any principles emerging across a range of research contexts..’

RCUK’s Open Data Dialogue Report, 2012Open Research Data – A Jisc perspective

http://www.rcuk.ac.uk/Publications/policy/OpenData/

13

» Public good: Publicly funded research data are produced in the public interest should be made openly available with as few restrictions as possible

» Planning for preservation: Institutional and project specific data management policies and plans needed to ensure valued data remains usable

» Discovery: Metadata should be available and discoverable; Published results should indicate how to access supporting data

» Confidentiality: Research organisation policies and practices to ensure legal, ethical and commercial constraints assessed; research process should not be damaged by inappropriate release

» First use: Provision for a period of exclusive use, to enable research teams to publish results

» Recognition: Data users should acknowledge data sources and terms & conditions of access

» Public funding: Use of public funds for RDM infrastructure is appropriate and must be efficient and cost-effective

Research Funder PoliciesOpen Research Data – A Jisc perspective

http://www.rcuk.ac.uk/research/datapolicy/

RCUK Common Principles on Data Policy

14Open Research Data – A Jisc perspective

15

» ROs will promote internal awareness of these principles

» Published research papers should indicate how supporting research data can be accessed

» ROs will have policies and processes in place to maintain awareness of and access to their publicly-funded research data holdings

» Non-digital publicly-funded research data will be stored in a manner to facilitate it being shared on request

» ROs will ensure that appropriately structured metadata describing the research data is published and freely accessible

» Where access to the data is restricted the published metadata should also give the reason and summarise the conditions which must be satisfied for access to be granted

» ROs will ensure research data is securely preserved for a minimum of 10 years from last access

» ROs will ensure effective data curation is provided throughout the full data lifecycle

» ROs will ensure adequate resources are provided to support the curation of publicly-funded research data

» Roadmap in place by 1 May 2012

» Compliance by 1 May 2015

Research Funder PoliciesOpen Research Data – A Jisc perspective

http://www.epsrc.ac.uk/index.cfm/about/standards/researchdata/

EPSRC’s Expectations for funded Research Organisations (ROs)

16

» Data, metadata and computer code (used to evidence published paper or equivalent) must be:› discoverable – readily found to exist by online search› accessible – when discovered they can be interrogated› intelligible - they can be understood› assessable – e.g. the provenance and reliability of data› re-useable - they can be re‐used and re‐combined with other data as a private preserve

» Data generated by publicly/charitably funded research not used in evidence should be made open after a pre-specified period

» Re‐use of data must formally acknowledge the originators» Cost of creating open data is an intrinsic cost of research» Openness is limited for commercially sensitivity, privacy of personal information, safety

and security but do not provide justification for a blanket ban» Existing processes, reward structures and behaviour that inhibit or prevent data sharing

should be reformed to encourage/facilitate/reward data sharing and collaboration.

Principles of Open DataOpen Research Data – A Jisc perspective

From a US/UK Open Data Summit – Washington, April 2014

http://www.researchinfonet.org/wp-content/uploads/2014/07/Note-of-Washington-meeting.pdf

17

»Preliminary qualitative analysis of the outcomes of a questionnaire sent to Science Europe Members, examining the funding landscape relating to research data infrastructure and research data management within Europe.

»Analysis will help inform and direct subsequent stages of the research to provide knowledge, eventually leading to a set of recommendations for policy-makers both on the national and European level

»Sent to 57 SE members. »22 responses, mostly from Research Funding Organisations

Funding Research Data InfrastructuresOpen Research Data – A Jisc perspective

Preliminary report of responses to SE members questionnaire – Sept 2014

18

Where are HEIs in the UK?Open Research Data – A Jisc perspective

Poll of 30 institutions

Blue = exploring through to Red = implementing

19

»Research Excellence & Impact - data will be cited; used by others including peers, other disciplines, the public, industry, in learning – ability to meet global challenges; innovate & create new research areas

»Research integrity - replication, verification of research, improvement of methods & results

»Efficiency - save duplication of research effort, data creation & therefore costs; ease of access & re-use

»Managing risks - ability to meet FOI requests; protect reputation.

Why Research Data Management?Open Research Data – A Jisc perspective

Key area for Jisc

20

Promoting and Supporting good RDMOpen Research Data – A Jisc perspective

First MRD Programme, 2009-2011

5 Strands1. Research Data Management Infrastructure (RDMI) Projects2. Research Data Management Planning (RDMP) Projects 3. Support and Tools Projects4. Citing, Linking, Integrating and Publishing Research Data

(CLIP) Projects5. Research Data Management Training Materials Projects

21

Building Institutional CapacityOpen Research Data – A Jisc perspective

Second MRD Programme, 2011-2013

Institutional RDM

Infrastructure

Services17 Projects

RDM Training

5 projects

Data Publicati

on3

projects

RDM Planning

10 projects

Encouraged to reuse outputs from first programme and elsewhere.Mix of pilot projects and embedding projects.Holistic institutional approach to RDM.

Ownership: High level ownership of the problem, senior manager on steering committee.Sustainability: Large institutional contributions.Develop business cases to sustain work.

22

» Digital Curation Centre – advice & guidance; DMP online; Cardio, institutional engagements, training (http://dcc.ac.uk/)

» Training materials available in Jorum (free OER repository) http://www.jorum.ac.uk/

» Sherpa Juliet – Research funders’ OA policies (http://www.sherpa.ac.uk/juliet/)

» Research data registry – discovery of data

» Journal research data policies

» Software for medical research data management and sharing – BRISSKit

» Preservation & storage – Arkivum (http://arkivum.com/) framework agreement

» Cloud services – guidance, shared purchasing & due diligence (https://www.ja.net/products-services/janet-cloud-services)

» Access & identity management

» Standards & protocols – Sword; Cerif; metadata for discovery; citation

» Data deposit & re-use – experiments & prototypes

» Foresight/clarification – incentives; funding; costs; value.

Summarised on the Research Data Management blog - Where are we now? http://bit.ly/1v6nn7v

What is Jisc doing?Open Research Data – A Jisc perspective

23

R@R:Support take up

of citation

R@R: DMP

registry

DMP OnLine

R@R: business case & costs

Standards; policies;

coordination &

cooperation.EASY ACCESS

Skills &

train

ing

Data

sharing and

Discovery

Data

storage &

archiving

Rese

arch

data

man

agem

ent

plan

ning

Data identifiers

Access & security

Researcher/organisational identifiers

Funders policies

Deposit protocols

Advice & guidance/goo

d practice

R@R:UK Research

Data Discover

y (metadat

a)

R@R: metrics & usage data

service

Cardio planning

tool

R@R: RD Experiments

& prototypes

UKDS/Institutional repositories

R@R: shared PreservationRepositories (metadata)

Digital Curation Centre

Open Training

Materials in Jorum

Shared data centre

Jisc Research Data Infrastructure

R@R: comprehensive

tool-kit; case studies

Sherpa Juliet Funder policies

R@R: Journal Policy

registry

R@R: EPSRC suppor

t

R@R: RD Experiments &

Prototypes, &BRISSkit

24

»Helping to build capacity, capability and skills in research data management and curation across the UK’s higher education research community.

Digital Curation CentreOpen Research Data – A Jisc perspective

Established by Jisc

Support offered by the DCC

http://www.dcc.ac.uk/

25

Digital Curation CentreOpen Research Data – A Jisc perspective

Understanding Data Requirements – Data Management Plans

http://www.dcc.ac.uk/

26Open Research Data – A Jisc perspective

27

» Research data roadmap & a toolkit for UK universities› Make it easy as possible for universities to access tools

» Filling the research data infrastructure gaps› Storage, preservation, archiving – spearhead development of service

» Supporting universities in meeting research funder compliance› EPSRC Mandate – May 2015. Building on what we have.

» Making data count› Helping to change culture on sharing data and making it searchable.

Promoting good RDM practice (ref MANTRA and RDMRose)» Shared research data tools & services

› Big challenge – a clear need for shared services

Research @ Risk Vision and ActivitiesOpen Research Data – A Jisc perspective

Realising a robust and sustainable research data management infrastructure and services to enrich UK research

28

» BRISSKIT› Biomedial research data software kit (https://www.brisskit.le.ac.uk/) › Developing into a sustainable service that offers open source software

as an effective and competitive solution for the sharing of sensitive data between university medical groups and other collaborators such as the NHS

» Journal research data policy bank › Feasibility study shows it meets a requirement› Developing a central registry that helps to make clear the policies and

expectations for data as it underpins research articles› Helping to meeting funders requirements, and to significantly reduce the

time for researchers and their supporting staff in making decisions with regards to their research data and its publication

Research Data ProjectsOpen Research Data – A Jisc perspective

Supporting research and science

29

» DCC pilot / technical evaluation (Oct 2013-Mar 2014)› Modelled on Research Data Australia developed by ANDS› In order to be re-used, data must be discoverable› Harvests simple, but textually rich, metadata records for research data assets › Piloted with 9 HEIs and UK Data Archive, Archaeology Data Centre and NERC data

centres» From pilot to production (Nov 2014 – Oct 2016)

› Develop business case and key use cases› Further evaluate potential software solutions› Collaborate closely with the HEIs and Data Centres› Identify and finalise the agreement on the metadata schema that is appropriate

for a successful cross disciplinary service› Identify the architecture that a UK service could operate on› Produce toolkits and advice/guidance on implementation

Research Data DiscoveryOpen Research Data – A Jisc perspective

From pilot to service

30

» Jisc is kick-starting a tranche of research data experiments and prototypes, based around research data creation, deposit and re-use, with the view of finding solutions, proposed by the research community & research support stakeholders, which will enhance their workflows and make research data creation, deposit and re-use easier

» Priority areas:› Research data deposit and sharing protocols & tools › Data creation, deposit and re-use by discipline› Research data systems/interoperability standards› Research data analytics › Shared services for research

Research Data Experiments & PrototypesOpen Research Data – A Jisc perspective

Creating innovative partnerships and solutions

31

»First shared data centre for medical and academic research in the UK

»First large scale example of HPC environments being placed in an outsourced co-location facility

»Launched Sept 2014 – supporting 6 of UK’s most successful scientific and academic organisations

»Consolidates sensitive data in safe, off-site centre

» Increases collaboration and reduces costs

»Connected to the Janet network core, meeting the bandwidth requirements of large data sets

Open Research Data – A Jisc perspective

Jisc Data CentreSupporting the requirements for academic research

http://www.jisc.ac.uk/news/uk-research-secures-new-national-data-centre-04-sep-2014

32Open Research Data – A Jisc perspective

Christopher BrownSenior Co-design Manager

[email protected]

One CastleparkTower HillBristolBS2 0JAT 020 3697 5800

[email protected]

Except where otherwise noted, this work is licensed under CC-BY-NC-ND