experiences from digitization and digital preservation at the swedish national archives presented at...

41
Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra Sept 19, 2007 by Göran Kristiansson

Upload: diana-jacklin

Post on 30-Mar-2015

219 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Experiences from digitization and digital preservation at the Swedish

National Archives

Presented at the Digital Futures International Forum in CanberraSept 19, 2007 by Göran Kristiansson

Page 2: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

A Swedish National Archival Database (NAD)

• Background– Database for private papers

– Tool for producing archival guides

– A Government Commission 1988

• Project at the National Archives 1990– Coordination of existing databases

Page 3: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

A Swedish National Archival Database (NAD)

– Strategy • Descriptive standards. ISAD(G) and ISAAR(CPF)

• Data elements.

• Exchange format. MARC-AMC

• Distribution on CD-ROM

– Content from Archives, Libraries and Museums

Page 4: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

A Swedish National Archival Database (NAD)

• Development of an Archival information system (ARKIS) for the National and the Regional Archives.– top level archival descriptions

– "container lists"

– registration of cartographic material

– repository control

Page 5: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

A Swedish National Archival Database (NAD)

• Unemployment projects– Finding aids, census records and Cabinets cause-lists.

• The present edition of NAD– 170 000 fonds– 40 000 inventories– Digital and microfilmed copies of records– Information about administrative boarder changes

• NAV – a tool for updating NAD

Page 6: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Next generation - ARKIS II

• Merge three systems– ARKIS I– Microfilm Database– Database for magnetic tapes

• Web based system

• Full implementation of multilevel description

Page 7: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 8: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

• Used a Generic datamodel

• Support for XML and the Encoded Archival Description (EAD) and Encoded Archival Context (EAC)

Demo NAD, bildexempel, ARKIS och XML1, XML2

Next generation - ARKIS II

Page 9: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 10: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

• Used a Generic datamodel

• Support for XML and the Encoded Archival Description (EAD) and Encoded Archival Context (EAC)

Demo NAD, bildexempel, ARKIS och XML1, XML2

Next generation - ARKIS II

Page 11: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 12: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 13: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 14: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 15: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 16: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra
Page 17: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Linking and Exploring Authority Files (LEAF)

• Develop a model architecture for a distributed search system

• A common European name authority file

• Encoded Archival Context (EAC)

• Context information from Archives, Libraries and Museums

Page 18: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Large digitization projects

• Reproduction of church records 1895-1991– 57 million images (total 110 million)– 40 Tb/year

• Digitization of microfilm 1600-1905– 15 million images

• National Land Survey of Sweden 1600 -– 70 million images

• Scanning on demand

Page 19: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Large digitization projects

Archival description

Scanning TIFF-editing

Converting image to DjVu

HSM

Image database

ARKIS

Scanning process

Grayscale, 8 bit-color, 300 dpi and TIFF 6.0

Page 20: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

A Government Commission 2002 ”Arkiv för alla – nu och i framtiden”• Task: Not to solve the problem, but to give advise

to the government on the best way to support archival authorities in their work with long term preservation of digital records.

• The commission was run by the governor of northern Sweden, Kari Marklund, with support of a group of experts

• IT-reference group

Page 21: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Vision

• Open Archival Information System (OAIS)

• Format for digital deliveries – XML and emulation

– Specification of information package for ingest

– Delivery control

• Storage and administration of digital records in archival institutions

• Access to digital information

Page 22: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Project on long term preservation of digital Information (LDB)

• Cooperation between the Luleå University of Technology, the National Archives and the municipality of Boden• Putting the vision into practice• Records management/archives management – a key question for a successful e-government

Page 23: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

LDB test case

National Social Insurance Board

• Electronic Record Management System

• Digital Archive• Common development project• Test transfer summer 2006

Page 24: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Arrival archive

Reference model: Authority

Dis

trib

ute

Dis

trib

ute

ProcessProcess

Arc

hive

Arc

hive

Store document

Rec

eive

Del

iver

Search/FAQSearch/FAQRegisterRegisterOpenOpen

Pub

lish

Pub

lish

Edi

tE

dit

Documents in files

Administrative documentMet

adat

a

For

ms

Tem

plat

esViewViewCreate/modifyCreate/modify

DescribeSearch

DescribeSearch

Citizens

Employers

Local and public applications

Complementary

information

?

Caseworker Archivist

Oth

er s

yste

ms

Oth

er s

yste

ms

National Archives

Citizens

Citizens

Provide helpProvide help

Call center

CreateCreate

ViewView

Page 25: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

LDB test case

Agresso• Accounting System

• specifications from preceding project

• export test spring 2006

Page 26: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Open Archival Information System (OAIS) – A conceptual model

SIP = Submission Information PackageAIP = Archival Information PackageDIP = Dissemination Information Package

OAIS

Administration

Preservation planning

Digitalarchive

Archivaldescription

Access

retrievalanswer

orderdelivery

Incheckningnegotiation

deliverySIP

Arkiv-beskr.

AIP

Archivaldescr.

AIP

DIPDIP

Information Information packagepackage

Container list(sökbar)

Data object (analog, digital)

Representation information

(structure, semantic)

Preservation Information

(referensinfo, provenance,

context,äkthet)

Package identification

Page 27: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

General architecture of SIP

DigitalData object

DigitalData object

DigitalData object

Archival Description

EADArchival Description

EADAuthority records

EACAuthority records

EAC

Records managementsystems

ERMS

Records managementsystems

ERMS

Package level

Archival structure level

System structure level

Object level

Economical systems

SIE-XMLEconomical systems

SIE-XMLData bases[ ADDML]Data bases[ ADDML]

(Ex: TIFF, XML, PDF/A, ASCII, …)

PhysicalData object

SIP/(AIP)

METSSIP/(AIP)

METS

Other systems[to be defined]

Other systems[to be defined]

Technical metadata

PREMISTechnical metadata

PREMIS

Context metadataEx Manual

Specific metadataEx Style sheet

Specific metadataEx Style sheet

(Ex: XSLT / XSL-FO) (Ex: PDF/A)

OROR OROROROR OROROROR

Page 28: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

EAD

Encoded Archival Description

description of archival collections

based on ISAD(G) International Standard Archival Description (General)

for encoding archival finding aids

multi-level description

used by archival repositories

Page 29: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

EAC

Encoded Archival Context

description of authoritiesorganizations, persons and families

based on ISAAR(CPF)International Standard Archival Authority Record

describing the circumstances under which

records have been created and used

Page 30: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

ERMSElectronic Records Management Systems

metadata standard from UK

based on Functional Requirements for Electronic Records Management Systems

exchange of records metadata and interoperability between ERMS systems

Page 31: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

ERMS levels

ERMS

Class

Folder

Component

Folder

Record Record Record Record

Component Component

Component Component

ComponentComponent

XML-file

XML-files

XML-files

XML-files

Page 32: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

ADDML

Archives Data Description Markup Language

describes transferred database files

technical, structural and general descriptive metadata

developed by National Archives of Norway

Arkadukt (editing) and Arkade (testing and conversion)

Page 33: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

SIE-XML

Standard import/export - XML

describes and contain accounting informationexchange between economic administrative

systemsaggregated and transaction level

XBRL only aggregated level in Sweden today

based on Swedish regulations developed from classic SIE

implemented on 95% of products on market

Page 34: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

PREMIS

PREservation Metadata: Implementation Strategies

expanded conceptual structure for the OAIS model,

a set of metadata elements reflecting requirements OAIS

developed by a working group initiated by OCLC/RLG

Page 35: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

DATA MODEL

PREMIS

Page 36: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

METS

Metadata Encoding and Transmission Standard

expressing the structure or structures of a digital entity linking Descriptive metadata with digital content linking Administrative metadata with digital content linking behavior definitions and program code with digital

content and with associated descriptive and administrative metadata

wrapping digital content, and associated descriptive and administrative metadata as binary data

self-defined profiles developed by Digital Library Federation

Page 37: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Presentations of records in FKs ERMS

Page 38: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

Presentations of FKs transferred SIP

Page 39: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

From opportunity to reality

LDB

Administration

Preservation planning

Archiving

Archivaldescription

AccessValidation

Riksarkivet Producer Researcher etc

ARKIS

Tape robot

NATIONALFRAMEWORK

NATIONALFRAMEWORK

Till

gänglig

-

görande

XML

DIP

ISAD(G) ISAAR(CPF)

EAD EAC

Archival description

???

XML

SIP

Physical Archives

Page 40: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

PROTAGEPROTAGE

• Preservation Organizations Testing Agent Environment

• EU-funded project

• Build and validate flexible software agents for long-term digital preservation and access

Page 41: Experiences from digitization and digital preservation at the Swedish National Archives Presented at the Digital Futures International Forum in Canberra

The future

National competence center in Boden

CompaniesGovernments, municipalities Citizens, organizations

Society

Luleå University of Technology

Competence development

Long term accessibility

Nationaland RegionalArchives