introduction to the dryad digital repository

16
Introduction to the Dryad Digital Repository A nonprofit repository for data underlying the international scientific and medical literature. April 2013 DataDryad.org 1

Upload: nicole

Post on 04-Feb-2016

27 views

Category:

Documents


0 download

DESCRIPTION

Introduction to the Dryad Digital Repository. A nonprofit repository for data underlying the international scientific and medical literature. April 2013. 1. The End To make data archiving and reuse standard within scientific communication. The Means - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Introduction to the  Dryad Digital Repository

DataDryad.org

Introduction to the Dryad Digital Repository

A nonprofit repository for data underlying the international scientific and medical literature.

April 2013

1

Page 2: Introduction to the  Dryad Digital Repository

DataDryad.org 2

• The End– To make data archiving and reuse standard within scientific communication.

• The Means– Enable low-burden data archiving at the time of manuscript submission.– Promote researcher benefits from data archiving.– Promote responsible data reuse.– Empower journals, societies & publishers in shared governance.– Ensure sustainability and long-term preservation.

• The Scope– Research data in science and medicine– Primarily data underlying findings in peer-reviewed articles – Also data from some non-peer reviewed publications (e.g. dissertations)– And some non-data content (e.g. software scripts, figures)

Page 3: Introduction to the  Dryad Digital Repository

DataDryad.org 3

The value proposition

• For authors and researchers, Dryad…– increases the impact of, and citations to, published research– preserves and makes available others’ data– frees researchers from the burden of data preservation and access

• For journals, publishers, and societies, Dryad…– frees journals from the burden of maintaining supplemental data– supports all varieties of data archiving policies

• For libraries and institutions, Dryad…– makes data available at no cost, under clear terms of use– helps fulfill their research data management mandates

• For funders, Dryad…– provides a cost-effective mechanism to make research more accessible

Page 4: Introduction to the  Dryad Digital Repository

DataDryad.org

Data archiving has many benefits

Modified from Beagrie et al. (2009) Keeping Research Data Safe 2

DirectVerification of published researchPreserving accessibility to dataAllowing reuse and repurposing of dataDiscoverability of data

Indirect (costs avoided)Redundant data collectionInefficient legacy data curation Burden of sharing-upon-requestOpportunity cost of science not done

Near termProtection against personnel turnoverAvailability for review and validation

Long termSecure long-term stewardshipIncreased impact per publication

PrivateIncreased citationsNew collaborations New research opportunitiesFulfilling funding mandates

PublicMore efficient use of research dollarsPublic trust in scienceEducational opportunitiesImproved methodologiesMore informed policy

4

Page 5: Introduction to the  Dryad Digital Repository

DataDryad.org

Dryad focuses on the long tail of orphan dataVolu

me

Rank frequency of datatype

Specialized repositories(e.g. GenBank, GBIF)

Orphan data

After Heidorn (2008) http://hdl.handle.net/2142/9127

Many datasets belong to the long tail. Though less standardized, they can be rich in information content and have unique value

5

Page 6: Introduction to the  Dryad Digital Repository

Why use Dryad rather than Supplementary Online Materials?

Dryad SOM

Discoverable: indexed and exposed to both web and bibliographic search engines ✔ ✗

Identifiable: DataCite DOIs within articles serve as permanent, resolvable identifiers ✔ ✗*

Permanent: processes in place to promote preservation (incl. format migration) ✔ ✔/✗**

Curated: quality control by both automated processes and human inspection ✔ ✗*

Ease of deposit: streamlined deposit, allowance for large and complex datasets ✔ ✔/✗**

Formatted for reuse: support for non-PDF file formats ✔ ✔/✗**

Updatable: new versions of data files can be added, metadata can be enhanced ✔ ✗

Support for embargoes: can delay release of data in accordance with journal policy ✔ ✗

Free reuse: no paywall, clear terms of reuse (all data released under CC Zero) ✔ ✔/✗**

Economy of scale: cost efficiency from shared infrastructure ✔ ✔/✗**

Alignment to organizational mission: focus on archiving and reuse of scientific data ✔ ✗

* A few publisher SOM sites are exceptions to the general rule** Practices differ among publishers, see Smit (2011), doi:10.1045/january2011-smit

6DataDryad.org

Page 7: Introduction to the  Dryad Digital Repository

DataDryad.org 7

Researchers and journals are using Dryad for archiving

Page 8: Introduction to the  Dryad Digital Repository

DataDryad.org 8

…and using the data for research

Page 9: Introduction to the  Dryad Digital Repository

DataDryad.org 9

Page 10: Introduction to the  Dryad Digital Repository

DataDryad.org 10

Dryad integrates article and data submissionDryad works with the manuscript workflow of journals to:

– Simplify the process of data submission for authors,– Allow authors to deposit, to a single repository, gigabytes of

data files in their original formats,– Ensure permanent bidirectional links between the article and

the data, and increased visibility for both, – Ensure that the data is accessible once the article becomes

available,– Offer the option of making data available for editorial or peer

review, via secure access for editors and reviewers,– Give authors the option to embargo public access to data for a

limited time after publication, if permitted by the journal's data policy.

Options are customized to meet the requirements of each journal.

Page 11: Introduction to the  Dryad Digital Repository

DataDryad.org 11

Over 30 integrated partner journals

The American NaturalistBiology Letters BMJ Open Biological Journal of the

Linnean SocietyEcological MonographseLifeEvolutionary ApplicationsEvolutionFunctional Ecology gms German Medical ScienceHeredityJournal of Animal Ecology Journal of Evolutionary BiologyJournal of Fish and Wildlife

Management

Journal of HeredityJournal of Open Public Health

DataJournal of PaleontologyMethods in Ecology and

EvolutionMolecular Ecology and M.E.

ResourcesPaleobiologyPLoS Biology, PLOS Genetics Systematic Biology ZooKeys & 7 other Pensoft

journals

.. and more being added regularly

Page 12: Introduction to the  Dryad Digital Repository

DataDryad.org 12

Trustworthy repository infrastructure• Making data available is the primary mission of the organization

– No pay-walls or restrictive licenses (all released under CCZero)– The same data may be hosted by other services (non-exclusivity)

• Built on the DSpace repository platform, an open source framework used by hundreds of institutional repositories

• Multiple machine and human interfaces for discovery and access– Dublin Core metadata, harvestable through OAI-PMH– DOIs registered through DataCite– Curators add metadata to enhance keyword searching

• Assurance of data integrity and permanent availability– Service mirroring and backup– File migration and bit-level integrity assurance– Organizational failover through DataONE and CLOCKSS

Page 13: Introduction to the  Dryad Digital Repository

DataDryad.org 13

Dryad as an organization• Governed by an interim Board 2009- 2011.• Now a nonprofit organization incorporated in North

Carolina, USA.• Membership open to all stakeholder organizations,

including scientific societies, publishers, funding agencies, universities & institutes.– Nominal annual fee - no more than $1000 USD

• Governed by an elected 12-member Board of Directors – Nominated and elected by the Membership

• First Annual membership meeting 24 May 2013 in Oxford.

Page 14: Introduction to the  Dryad Digital Repository

DataDryad.org 14

Dryad’s business plan

• Deposit fees are the primary source of revenue, for several reasons:– The time of deposit is when the majority of costs are

incurred– Revenue scales with costs (i.e. volume of deposits)– The costs are distributed both fairly and widely– This enables Dryad to make access to the data free in

perpetuity

• Membership fees will cover costs of annual membership meetings

• Project grants will supplement the operational budget for R&D activities

Page 15: Introduction to the  Dryad Digital Repository

DataDryad.org 15

Payment plans Plan Contract? Paid by Cost1

1. Voucher plan

no Any organization, in advance $65 per data package (members)$70 per data package (non-members)

2. Deferred payment plan

yes, 1 yr. Any organization, in advance $70 per data package (members)$75 per data package (non-members)

3. Subscrip-tion plan

yes, 2 yrs. Journal or journals, fee based on total # of research articles published by the journal/s in the prior year

Unlimited number of submissions for a fixed fee; base fee of $25 per research article for members, $30 for non-members

Individual deposit

no Author, at time of deposit $80/data package, with waivers for submissions from low-income economies

1 Up to a fixed deposit size (currently 10GB). Additional charges for larger deposits.

Page 16: Introduction to the  Dryad Digital Repository

DataDryad.org 16

To learn more

• Repository home: http://datadryad.org• News: http://blog.datadryad.org• Project documentation: http://wiki.datadryad.org• Twitter: @datadryad• Code: http://code.google.com/p/dryad

or contact us: • http://datadryad.org/feedback • Todd Vision, Director, [email protected]• Laura Wendell, Dryad Executive Director, [email protected]