perseus’ archiving needs

Post on 11-Jan-2016

46 Views

Category:

Documents

5 Downloads

Preview:

Click to see full reader

DESCRIPTION

Perseus’ Archiving Needs. And What They Mean For Librarians. Preserving Perseus. Data and Behaviors. What does Perseus have to lose? Data If lost, we cannot do anything. The primary text is primary. Behavior We lose the ability to make associations. Structure of the Talk. - PowerPoint PPT Presentation

TRANSCRIPT

Perseus’ Archiving Needs

And What They Mean For Librarians

Preserving Perseus

Data and Behaviors

• What does Perseus have to lose?

• Data– If lost, we cannot do anything.– The primary text is primary.

• Behavior– We lose the ability to make associations

Structure of the Talk

• Perseus’ current and future options for archiving/preserving its data and behaviors

• Use this to motivate new skills required by and emerging new roles for librarians

Perseus’ Preservation Options…

• Be Open– Hard to maintain a black box

• Distribute for Redundancy– Library of Alexandria: Don’t put all your

eggs in one basket.

• Use Institutions for Reliability/Quality– Library of Alexandria: Lots of quality

content

Be Open

Be Open: Data

• Data formats – Non-binary for text

• Images are different

– Application-independent– Easily transformable when possible

• XML

• Licensing– Can other people use this data?– Are other people able to create derivative works?

Be Open: Behaviors

• Protocol Specifications– What does Perseus mean? (semantics)– Defining behaviors

• Browsing by logical citation scheme: CTS protocol

• Perseus’ APIs– Open source implementations– Let people download these

implementations

Distribute For Redundancy

Distributing Data

• Leveraging Geographic Distribution– SRB/iRods

• Desktop/Web-based GUI

• The more copies, the safer our data will be– Perseus lets people download raw data

• Creative commons

Distribute Your Behaviors

• Mirror sites– Enables distribution of behaviors

• Distributed computing power– Performance gain

• For Perseus’ mission: the more copies, the better!– Let people download your specs and

implementations.• GPL license

Use Institutions For Reliability & Quality

Give Institutions Your Data

• Quality– Policies for ingest ensure a standard for

the data and metadata

• Leverage Expertise– Their job is to archive and preserve data

Give Institutions Your Behaviors

• Institutional repositories can preserve behaviors– Fedora

• Forces documentation – Specification – Implementation

• If using a different implementation– Is the specification really implementation-

independent?

Skills Perseus Needs from Future Librarians

• Data formats:– XML

• Manipulating the data– XSLT– Basic Scripting: Perl, Python, Groovy

• Licensing agreements– Creative Commons– GPL

• Grid/Distributed Computing• Investigate Institutional Repositories

– Fedora

top related