perseus’ archiving needs

16
Perseus’ Archiving Needs And What They Mean For Librarians

Upload: rhonda

Post on 11-Jan-2016

46 views

Category:

Documents


5 download

DESCRIPTION

Perseus’ Archiving Needs. And What They Mean For Librarians. Preserving Perseus. Data and Behaviors. What does Perseus have to lose? Data If lost, we cannot do anything. The primary text is primary. Behavior We lose the ability to make associations. Structure of the Talk. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Perseus’ Archiving Needs

Perseus’ Archiving Needs

And What They Mean For Librarians

Page 2: Perseus’ Archiving Needs

Preserving Perseus

Page 3: Perseus’ Archiving Needs
Page 4: Perseus’ Archiving Needs

Data and Behaviors

• What does Perseus have to lose?

• Data– If lost, we cannot do anything.– The primary text is primary.

• Behavior– We lose the ability to make associations

Page 5: Perseus’ Archiving Needs

Structure of the Talk

• Perseus’ current and future options for archiving/preserving its data and behaviors

• Use this to motivate new skills required by and emerging new roles for librarians

Page 6: Perseus’ Archiving Needs

Perseus’ Preservation Options…

• Be Open– Hard to maintain a black box

• Distribute for Redundancy– Library of Alexandria: Don’t put all your

eggs in one basket.

• Use Institutions for Reliability/Quality– Library of Alexandria: Lots of quality

content

Page 7: Perseus’ Archiving Needs

Be Open

Page 8: Perseus’ Archiving Needs

Be Open: Data

• Data formats – Non-binary for text

• Images are different

– Application-independent– Easily transformable when possible

• XML

• Licensing– Can other people use this data?– Are other people able to create derivative works?

Page 9: Perseus’ Archiving Needs

Be Open: Behaviors

• Protocol Specifications– What does Perseus mean? (semantics)– Defining behaviors

• Browsing by logical citation scheme: CTS protocol

• Perseus’ APIs– Open source implementations– Let people download these

implementations

Page 10: Perseus’ Archiving Needs

Distribute For Redundancy

Page 11: Perseus’ Archiving Needs

Distributing Data

• Leveraging Geographic Distribution– SRB/iRods

• Desktop/Web-based GUI

• The more copies, the safer our data will be– Perseus lets people download raw data

• Creative commons

Page 12: Perseus’ Archiving Needs

Distribute Your Behaviors

• Mirror sites– Enables distribution of behaviors

• Distributed computing power– Performance gain

• For Perseus’ mission: the more copies, the better!– Let people download your specs and

implementations.• GPL license

Page 13: Perseus’ Archiving Needs

Use Institutions For Reliability & Quality

Page 14: Perseus’ Archiving Needs

Give Institutions Your Data

• Quality– Policies for ingest ensure a standard for

the data and metadata

• Leverage Expertise– Their job is to archive and preserve data

Page 15: Perseus’ Archiving Needs

Give Institutions Your Behaviors

• Institutional repositories can preserve behaviors– Fedora

• Forces documentation – Specification – Implementation

• If using a different implementation– Is the specification really implementation-

independent?

Page 16: Perseus’ Archiving Needs

Skills Perseus Needs from Future Librarians

• Data formats:– XML

• Manipulating the data– XSLT– Basic Scripting: Perl, Python, Groovy

• Licensing agreements– Creative Commons– GPL

• Grid/Distributed Computing• Investigate Institutional Repositories

– Fedora