create your own digital repository

Post on 12-May-2015

1.616 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Create Your Own Create Your Own

Digital Digital RepositoryRepository

Leah PrescottLeah PrescottGetty Research InstituteGetty Research Institute

October 18, 2007October 18, 2007Master of St. Bartholomew,Master of St. Bartholomew, The Meeting of the Three The Meeting of the Three Kings (detail), Kings (detail), ca. 1480 J. Paul Getty Museumca. 1480 J. Paul Getty Museum

DIGITIZING IN A MATERIAL DIGITIZING IN A MATERIAL WORLDWORLD

What is a Digital Repository?

Digital Library, Digital Repository, Institutional Repository, Trusted Digital Repository, Digital Asset Management, Content Management System, Digital Commons...

Are they all the same thing?

What is a Digital Repository?

“A managed collection of digital objects... with consistent data and metadata structures for similar objects, enabling resource discovery by the ‘Communities of Practice’ for whom the objects are of interest”

Geoff Payne

What is a Digital Repository?

OAIS Reference ModelOpen Archival Information System

What is a Digital Repository?

Access Preservation

Institu

tion

al

Rep

osito

ry

Asset M

an

ag

em

en

t S

yste

m

Colle

ctio

ns M

an

ag

em

en

t S

yste

m

Tru

ste

d D

igita

l R

ep

osito

ry

Emblem (“Emblem (“Premitur, non opprimiturPremitur, non opprimitur”) from Claude Paradin, ”) from Claude Paradin, The heroicall devises of M. Claudius Paradin (London: W. The heroicall devises of M. Claudius Paradin (London: W. Kearney, 1591) Kearney, 1591) http://emblem.libraries.psu.edu/parad176.htmhttp://emblem.libraries.psu.edu/parad176.htm

Planning for a Repository

Planning for a Repository

What is the mission?What kinds of content will you

provide?Who are the key users?Who are the key stakeholders?What can you afford?What are your top service priorities?

Planning for a Repository

What Users Want from Digital Image Collections

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users tend to be impatient with long waits

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users tend to be impatient with long waits◊ Users need both image and text versions

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users tend to be impatient with long waits◊ Users need both image and text versions◊ Users are generally content with current

image quality

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users tend to be impatient with long waits◊ Users need both image and text versions◊ Users are generally content with current

image quality◊ Completeness and legibility of pages with

minimal scrolling is the primary user requirement for text- based documents.

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users tend to be impatient with long waits◊ Users need both image and text versions◊ Users are generally content with current

image quality◊ Completeness and legibility of pages with

minimal scrolling is the primary user requirement for text- based documents.

◊ Users want multiple views to support their different research needs and some are interested in tonal and color fidelity

Planning for a Repository

What Users Want from Digital Image Collections

◊ Large format graphical images such as maps are difficult to display and fully comprehend online – need tools that provide zoom, pan, and peripheral- view capabilities

Planning for a Repository

What Users Want from Digital Image Collections

◊ Large format graphical images such as maps are difficult to display and fully comprehend online – need tools that provide zoom, pan, and peripheral- view capabilities

◊ Users prefer a simple interface

Planning for a Repository

What Users Want from Digital Image Collections

◊ Large format graphical images such as maps are difficult to display and fully comprehend online – need tools that provide zoom, pan, and peripheral- view capabilities

◊ Users prefer a simple interface◊ Users want to navigate the structural and

intellectual content of image collections – backward and forward, jump to specific pages, table of contents, index

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by

side

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by

side Save search result sets

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by

side Save search result sets Sort search result sets

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by

side Save search result sets Sort search result sets Export images into other software

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by

side Save search result sets Sort search result sets Export images into other software

Produce high-quality printouts

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by side Save search result sets Sort search result sets Export images into other software

Produce high-quality printouts Annotate images with comments and

save them to a notebook

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users want to manipulate graphic images Zoom to view detail Examine Two or more images side by side Save search result sets Sort search result sets Export images into other software

Produce high-quality printouts Annotate images with comments and save

them to a notebook Use image-editing tools

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users need a variety of search functions

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users need a variety of search functions◊ Preferences for access points vary

according to the collection and the type of image Some perform searches for known items Other pose queries that contain a

preponderance of subject terms

Planning for a Repository

What Users Want from Digital Image Collections

◊ Users need a variety of search functions◊ Preferences for access points vary

according to the collection and the type of image Some perform searches for known items Other pose queries that contain a

preponderance of subject terms

◊ Search engines, query formulation and database indexing affect search results

Planning for a Repository

System Requirements – Technical

File formats supportedMetadata standardsInteroperability: OAI compliance, Z39.50 etc.Persistent URLSearch/Browse of metadataFull-text searchWorkflow, submission for content approvalUser authenticationCustomization features

Planning for a Repository

System Requirements – Technical

Free vs. commercial software (licence, subscription fees

Open Source vs. Proprietary

Technical Support availablefor fee vs. freeby phoneby emailvia online forms

Planning for a RepositorySystem Requirements - Metadata

““data about data” data about data” — — data data categoriescategories

data describing a discrete data data describing a discrete data object or objectsobject or objects

cataloging or indexing information cataloging or indexing information created to arrange, describe, and created to arrange, describe, and otherwise enhance access to an otherwise enhance access to an information objectinformation object

Planning for a RepositorySystem Requirements – Metadata

What is it?

Why is metadata Why is metadata important?important?

for enhanced for enhanced accessibilityaccessibility

for retention of contextfor retention of context for expanding usefor expanding use for multi-versioningfor multi-versioning for legal issuesfor legal issues for preservation of datafor preservation of data

Types of MetadataTypes of Metadata

AdministrativeAdministrative DescriptiveDescriptive PreservationPreservation TechnicalTechnical StructuralStructural UseUse

Dublin Core-Dublin Core- http://dublincore.org/

METS:METS:A Metadata “Wrapper” A Metadata “Wrapper” for Digital Information for Digital Information

ObjectsObjects

sample METS Objectsample METS Object

MODSMODS(Metadata Object Description (Metadata Object Description

Schema)Schema)An initiative of Network An initiative of Network Development and MARC Standards Development and MARC Standards Office at the Library of Congress Office at the Library of Congress http://www.loc.gov/standards/mods/ Uses XML schema Uses XML schema Originally designed for library Originally designed for library applications, but may be used for applications, but may be used for othersothers A derivative of MARCA derivative of MARC

Title InfoTitle InfoNameNameType of resourceType of resourceGenreGenreOrigin InfoOrigin InfoLanguageLanguagePhysical Physical descriptiondescriptionAbstractAbstractTable of contentsTable of contentsTarget audienceTarget audience

NoteNote

SubjectSubject

ClassificationClassification

Related itemRelated item

IdentifierIdentifier

LocationLocation

Access Access conditionsconditions

ExtensionExtension

Record InfoRecord Info

MODS high-level MODS high-level elementselements

Why do you need Why do you need standards?standards?

DATA STANDARDS: DATA STANDARDS: A BRIEF TYPOLOGYA BRIEF TYPOLOGY

Data structureData structure standards (metadata element standards (metadata element sets):sets): MARC, EAD, Dublin Core, METS, CDWA, VRA MARC, EAD, Dublin Core, METS, CDWA, VRA

CoreCore Data contentData content standards (cataloging rules): standards (cataloging rules):

AACR (AACR (RDA), ISBD, CCO, DA:CSRDA), ISBD, CCO, DA:CS Data valueData value standards (vocabularies): standards (vocabularies):

LCSH, LCNAF, TGM, AAT, TGN, ULANLCSH, LCNAF, TGM, AAT, TGN, ULAN Data formatData format standards (standards expressed in standards (standards expressed in

machine-readable form):machine-readable form): MARC, MARCXML, EAD, CDWA Lite XML, MARC, MARCXML, EAD, CDWA Lite XML,

Dublin Core Simple XML schema, DC Dublin Core Simple XML schema, DC Qualified XML schema, VRA Core XML Qualified XML schema, VRA Core XML schemaschema

Implementing a digital repository

Content SelectionDoes the item or collection have sufficient value to and demand from a current audience to justify digitization?

Does the proposed item or collection have active current users?Is there greater demand than can be served by the original or a traditional type of copy?Does it support high priority activities such as teaching of core courses that have large enrollments?Is it marketable to a group of specialists widely dispersed who all need access?

Implementing a digital repository

Content SelectionDo limitations on handling of fragile or valuable originals create a source of demand for high quality surrogates?

How does it fit with other materials on the same subject?Does it help build a distributed online collection?Do we have the legal right to create a digital version?Do we have the legal right to disseminate it?

Implementing a digital repository

Content SelectionCan the materials be digitized successfully?

Does or can digitization add something beyond simply creating a copy?

Can and should images be manipulated to make them more legible than the original items?OCRing for searchable text

Is the cost appropriate?

Issues with the Issues with the web...web...

Are your collections Are your collections “reachable” by commercial “reachable” by commercial search engines?search engines?

If yes, how will you If yes, how will you “contextualize” individual “contextualize” individual collection objects?collection objects?

If not, what is your strategy If not, what is your strategy to lead web users to your to lead web users to your search page?search page?

The “Visible web” The “Visible web” versus the “Deep versus the “Deep

web”web” The Visible web is what you see in the The Visible web is what you see in the results pages from general web search results pages from general web search engines & subject directories (static engines & subject directories (static web pages)web pages)

The Invisible or Deep web consists of The Invisible or Deep web consists of data from dynamically searchable data from dynamically searchable databases that cannot be indexed by databases that cannot be indexed by search engines, because they aren’t search engines, because they aren’t “stored” anywhere.“stored” anywhere.

PreservationPreservationEnduring CareBitstream CopyingDurable, Persistent MediaMigrationEmulationEncapsulationTechnology PreservationReinterpretation

EvaluationEvaluation Use metadata and usability Use metadata and usability

analysis should be a routine analysis should be a routine part of digital library work.part of digital library work.

Study end-user behavior Study end-user behavior (including your own)(including your own)

Joseph Ducreux, Joseph Ducreux, Yawning (Self-Yawning (Self-Portrait)Portrait), before 1783, before 1783

J. Paul Getty MuseumJ. Paul Getty Museum

Thank you Thank you for your for your

attention!attention!

lprescott@getty.elprescott@getty.edudu

top related