theo andrew, edinburgh university library choosing suitable open-source repository software choosing...

17
Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Sour Repository Software Theo Andrew SPARC IR Workshop 18 th -19 th November

Upload: nicole-pearson

Post on 28-Mar-2015

230 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Choosing Suitable Open SourceRepository Software

Theo Andrew

SPARC IR Workshop 18th-19th November

Page 2: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Talk Outline

• Common Open-Source Packages

• Generic Software Issues

• Specific Repository Software Issues

• Case study: ERA

Page 3: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Common Popular Open-Source Packages

• DSpace (http://www.dspace.org/)– MIT, HP, DSpace Federation

• EPrints.org (http://www.eprints.org/)– University of Southampton

• Fedora (http://www.fedora.info/)– University of Virginia, Cornell Univeristy

Page 4: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Support & Development

• Support from authors– Documentation essential, mailing lists etc.

• Continued development– Bug fixes, feature requests, minimal local

development

Page 5: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

System Architecture

• Modular architecture– Easy to upgrade, develop and customise

• Appropriate programming languages• Stable and appropriate database system

– PostgreSQL (open-source), Oracle (proprietary)

• Easy to integrate into current web services– Templates and styles, using language standards (e.g.

HTML/CSS, XML/XSLT)

Page 6: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

System Security• Authentication methods

– Most importantly: the one you use at your institution, with the option to insert your own

• Authorisation methods– Able to integrate into current institutional information systems such

as staff, student or course lists

• Authenticate-able content– Provenance metadata, paper-trails, data checksums (e.g. MD5)

• Secure supporting systems– Well-known, open security systems and coherent standard

architectures

Page 7: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

System Administration• Coherent user administration

– Different types of user and user groups

• Granular, distributable administration– Delegate areas of the system to different administrators

• Licensing System– Related to access policies, with separate submitter, institution

and user licences, ideally with a time-dependent facility

• Access policies– Possible requirements: domain restrictions, time-dependent

restrictions, partial restrictions

Page 8: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Record Handling (1)

• Metadata Capture– What metadata do you need? Flexible, appropriate schema

(e.g. Qualified DC, ETD-MS (E-Theses), MARC21)

• Customisable Submission System– Collects relevant metadata, and can be modified

conditionally on the fly

• Ingest Methods– Standard submission, batch import, harvesting (e.g. OAI-

PMH (metadata only)), customised insert using native API

Page 9: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Record Handling (2)

• Extract Methods– Native viewing system, batch export, metadata cross-walk,

harvest (e.g. OAI-PMH (metadata only)), customised extract using API

• Item Wrappers– Multiple files, multiple metadata records/schemas, internal

structure mapping (e.g. METS, DIDL)

Page 10: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Digital Preservation

• Persistent Identifiers– Some available systems: Handle, PURL, URN, DOI, ARK

• Migration– On Ingest (migrate submission to open format), or on

request (preserve migration tool)

• Viewers– Tools to render the format are preserved

• Emulation– The original viewer is emulated in the new system

Page 11: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Additional Functionality

• Coherent data structuring– An internal structure that can represent your institution in one or more

overlaying schemas

• Native Browse– Hierarchical browsing, filtering by structure and metadata; aids indexing

by search engines

• Native Search– Constrained search locations, using browse functionality to display results

• Full Text Indexing• Public API (Application Programming Interface)

– Creating Portal-like services within the institution

Page 12: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Theses Alive Plugin for Institutional Repositories

Case Study: ERA

Page 13: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Development Box Production Box

Hardware: Hardware:- Dell 1.3GHz PIII - Dell Poweredge 1750 - 2x 36Gb HDD - 3x73Gb RAID 5- 2x36Gb Mirrored - 2x73Gb Mirrored

Software: Software:- Red Hat 8 - Red Hat Enterprise 3- Apache 1.3 - Tomcat 4.1.30- Tomcat 4.0.6 - Java 1.4.2- Java 1.4.1 - Postgres 7.4.2- Postgres 7.3.2 - DSpace 1.1.1- DSpace 1.2 - Tapir 0.3- Tapir 0.3

Page 14: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Metadata Capture

Submit a Thesis or Dissertation

Main Metadata Page

Abstract & Keywords

Page 15: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

Multi-Part Licensing

Page 16: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

• No systems yet deal with all issues

• Some good development work ongoing with the various packages

• Not all issues need to be solved:

– To provide an Institutional Repository

– For your institution

• Choose a package and get going!

Summary

Page 17: Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew

Theo Andrew, Edinburgh University LibraryChoosing Suitable Open-Source Repository Software

www.era.lib.ed.ac.uk

This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License.

Questions?