the taro project t exas a rchival r esources o nline

14
The TARO Project T exas A rchival R esources O nline Fred Gilmore Sr Operating Systems Specialist UT Austin General Libraries [email protected] April 30, 2004

Upload: borka

Post on 11-Feb-2016

49 views

Category:

Documents


0 download

DESCRIPTION

The TARO Project T exas A rchival R esources O nline. Fred Gilmore Sr Operating Systems Specialist UT Austin General Libraries [email protected]. What It Is . . . A project to make Texas archive and manuscript collection finding aids available through the Web. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The TARO Project T exas  A rchival  R esources  O nline

The TARO ProjectTexas Archival Resources Online

Fred GilmoreSr Operating Systems SpecialistUT Austin General [email protected]

April 30, 2004

Page 2: The TARO Project T exas  A rchival  R esources  O nline

What It Is . . .

A project to make Texas archive and manuscript collection finding aids available through the Web.

“finding aid”: descriptive summary and inventory of a material collection housed at a specific archive; not the materials themselves.

Currently: 1500+ searchable, browsable finding aids, 5000+ hits / day

Page 3: The TARO Project T exas  A rchival  R esources  O nline

How it came to be . . .

Two grant funded phases:– Outsourced scanning, OCR, XML tagging of

existing paper finding aids– Training/hardware/software for creation of

new finding aids – Phase I (2000 – 2001) : 14 participating

repositories– Phase II (2002 – 2003) : additional 11

repositories

Page 4: The TARO Project T exas  A rchival  R esources  O nline

Participating Repositories Alexander Architectural Archive

(UT Austin) Center For American History

(UT Austin) Benson Latin America

Collection (UT Austin) Ransom Humanities Research

Center (UT Austin) Texas State Library Texas Tech Southwest

Collection/University Archives University of Houston Special

Collections/University Archives Rice University Texas A&M

Houston Public Library Austin History Center UT San Antonio Texas State University Southern Methodist University UT Medical Branch – Galveston MD Anderson UT El Paso UT Pan American UT Arlington

Page 5: The TARO Project T exas  A rchival  R esources  O nline

How It Came To Be . . .

Why XML?– Compose once, format many– XML and related standards make data

exchange/reuse, description easier through separation.

Page 6: The TARO Project T exas  A rchival  R esources  O nline

Creating content for TARO

Archives staff:– Edit or compose XML tagged electronic

version of finding aid (new finding aids are created using text/XML editor such as Corel XMetaL)

– Submit file to UT Austin server

Page 7: The TARO Project T exas  A rchival  R esources  O nline

.

.<unittitle label="Title:" encodinganalog="245$a">Thomas J. Rollins Papers,<unitdate type="inclusive" encodinganalog="245$f" label="Dates:"

era="ce" calendar="gregorian">1875-1997 and undated</unitdate></unittitle><abstract label="Abstract:" encodinganalog="520$a">The personal papers of Thomas J. Rollins from 1875-1997 and

undated.</abstract><unitid countrycode="us" repositorycode="TxLT-SW"

encodinganalog="099" label="Collection #">S 1261.1</unitid><repository label="Repository:" encodinganalog="852$a"><corpname><subarea>Southwest Collection/Special Collections

Library,</subarea>..

Page 8: The TARO Project T exas  A rchival  R esources  O nline

Creating Content For TARO

UT Austin technical staff:– XML file is moved into production, error

checked, translated into three HTML varieties for viewing.

– HTML content is indexed for searching (keyword and fielded), sorted into repository lists for browsing

Page 9: The TARO Project T exas  A rchival  R esources  O nline

http://www.lib.utexas.edu/taro/ttusw/00054/tsw-00054.html

Page 10: The TARO Project T exas  A rchival  R esources  O nline

http://www.lib.utexas.edu/taro/ttusw/00054/tsw-00054.html

Page 11: The TARO Project T exas  A rchival  R esources  O nline

Advantages

Pages picked up by Google and give content higher visibility.

Multiple views of content including ability to customize view by running the XML document against a personal stylesheet.

Processing fully automated. HTML translated files can be available within hours.

DC metadata and OAI records provide additional access points.

Page 12: The TARO Project T exas  A rchival  R esources  O nline

Challenges

Relationships– Mediating local needs with federated site

requirements.– Encouraging supplemental metadata creation.

Resources– Introducing improvements without dedicated

staff on either end.

Page 13: The TARO Project T exas  A rchival  R esources  O nline

Challenges

Realities of the Web– User education. Practically a meta-site.

Content expectations not met.– Finding aids can be large. Load times a

problem.– XML Unicode requirements make special

characters tricky.

Page 14: The TARO Project T exas  A rchival  R esources  O nline

Future Plans

Searching: search XML directly Content: fund the creation, serving of

pictures, sound, video Participation: more repositories = more

content Access: Open Archives, RDF metadata Flexibility: provide stylesheet for direct XML

browsing, PDF creation for hardcopy