1 joining it up making our cultural heritage visible online paul miller interoperability focus uk...

34
1 Joining it up making our cultural heritage visible online Paul Miller Interoperability Focus UK Office for Library & Information Networking (UKOLN) [email protected] http:// www.ukoln.ac.uk/ UKOLN is funded by Resource: the Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the Further and Higher Education Funding Councils, as well as by project funding from JISC and the EU. UKOLN also receives support from the Universities of Bath and Hull where staff are based.

Upload: mervyn-stafford

Post on 30-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

1

Joining it upmaking our cultural heritage visible online

Paul Miller

Interoperability FocusUK Office for Library & Information Networking (UKOLN)

[email protected] http://www.ukoln.ac.uk/

UKOLN is funded by Resource: the Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the Further and Higher Education Funding Councils, as well as by project funding from JISC and the EU. UKOLN also receives support from the Universities of Bath and Hull where staff are based.

2

“No man is an island”Donne, John, 1572–1631

3

“No citizen is an island”

Community Information Services (CIS)

A Netful of Jewels

The People’s Network

ARCHSearch

e–Government strategy

The National Grid for Learning

SCRAN

Images of England

NOF–Digitise

HEIRNET

100% of Government services available by 200X

ukonline.gov.

NGDF Metadata Project/ UKSGB

4

Standard solutions

The nice thing about standards…

…is that there are so many to choose from!

5

Standard solutions

6

So… why use standards?

• Benefit from the expertise of others• Enforce rigour in internal practices• Facilitate interoperability (and access)

– Archives and monuments are held ‘in trust’– Considered deployment of standard solutions makes

access to those resources feasible for many– An enhanced Excavation Index, linking to the

distributed excavation archives and the site report?.

7

What do standards do?

• Help identify what’s important– CIMI’s “Access Points”– Mandatory fields

• Allow for consistent use of terminology– Name Authority Files– Thesauri– Look–up tables

• Enable internal and external data exchange or access

• Reduce duplication of effort• Minimise (hopefully!) wasted effort• Reflect consensus.

8

What types of standard are there?

• Terminology– ‘Roma’, not ‘Rome’– ‘Roma’ is preferred to ‘Rome’

• Format– ‘Miller, A.P. 1971–’, not ‘Paul Miller’

• ‘Semantics’– A gross simplification, and a very big bucket– ‘Creator’, ‘Subject’, ‘Title’, ‘Description’…

• Syntax– <RDF xmlns = “http://www.w3.org/TR/WD-rdf-syntax#”>

• Transfer– ftp://ftp.niso.org/… .

9

What is ‘Interoperability’?

“to be interoperable, one should actively be engaged in the ongoing

process of ensuring that the systems, procedures and culture of an organisation are managed in such a way as to maximise opportunities for exchange and re-use of information, whether internally or externally.”

See http://www.ariadne.ac.uk/issue24/interoperability/See http://www.ariadne.ac.uk/issue24/interoperability/

10

What is ‘Metadata’?

– meaningless jargon

– ora fashionable, and terribly misused, term for what we’ve always done

– or“a means of turning data into information”

– and“data about data”

– andthe name of a monument (‘South Cadbury’)

– andthe title of a book (‘Principles of Archaeological Excavation’).

11

Metadata Standards

“Paul Miller gave a really interesting talk about Dublin Core at the National Monuments Record in Swindon”

• In Swindon, or in Dublin?

• About monuments and about milling?

• But what was it?

12

Metadata Standards

<speaker>“Paul Miller</speaker> gave a <value judgement>really interesting</value judgement> talk about <subject>Dublin Core</subject> at the <location>National Monuments Record in Swindon”</location>.

13

Challenges

Many flavours of metadatawhich one do I use?

Managing changenew varieties, and evolution of

existing forms

Tension between functionality and simplicity, extensibility and interoperability

Functions, features, and cool stuff Simplicity and interoperability

Opportunities

14

Introducing the Dublin Core

• An attempt to improve resource discovery on the Web

– now adopted more broadly

• Building an interdisciplinary consensus about a core element set for resource discovery

– simple and intuitive– cross–disciplinary — not just libraries!!– international– open and consensual– flexible.

See http://purl.org/dc/See http://purl.org/dc/

15

• 15 elements of descriptive metadata• All elements optional• All elements repeatable• The whole is extensible

– offers a starting point for semantically richer descriptions

• Interdisciplinary– libraries, government, museums,

archives…

• International– available in more than 20 languages, with

more on the way...

Introducing the Dublin Core

16

• Title• Creator• Subject• Description• Publisher• Contributor• Date• Type

• Format• Identifier• Source• Language• Relation• Coverage• Rights

http://purl.org/dc/

Introducing the Dublin Core

17

Introducing XML

• eXtensible Markup Language• World Wide Web Consortium

recommendation• Simplified subset of SGML for use on Web• Addresses HTML’s lack of evolvability• Easily extended• Supported by major vendors• Increasingly used as a transfer syntax, but

capable of far more….

See http://www.w3.org/XML/See http://www.w3.org/XML/

18

Introducing RDF

• Resource Description Framework• W3C Recommendation• Fully compliant application of XML• Improves upon XML, HTML, PICS…• Machine understandable metadata!• Supports structure• Increasing interest

See http://www.ukoln.ac.uk/metadata/resources/dc/datamodel/WD–dc–rdf/

See http://www.ukoln.ac.uk/metadata/resources/dc/datamodel/WD–dc–rdf/

See http://www.w3.org/RDF/See http://www.w3.org/RDF/

19

Introducing Z39.50

• North American Standard (ANSI/NISO Z39.50–1995 [version 3])• International Standard (ISO 23950)

• Originally library–centric

• Permits remote searching of databases• Access via Z client or over web

• Relies upon ‘Profiles’• CIMI profile for cultural heritage• GEO profile for Geospatial data.

See http://www.ariadne.ac.uk/issue21/z3950/See http://www.ariadne.ac.uk/issue21/z3950/

20

Z39.50 Challenges• Profiles for each discipline

• Defeats interoperability?

• Vendor interpretation of the standard

• Bib–1 bloat

• Largely invisible to the user

• Seen as complicated and expensive

• Seen as old–fashioned

• Surely no match for XML/RDF/ whatever.

See http://www.ariadne.ac.uk/issue21/z3950/See http://www.ariadne.ac.uk/issue21/z3950/

21

Some Joined up working:

The Bath Profile• Vendors and systems implement areas of the Z39.50 standard differently

• Regional, National, and disciplinary Profiles have appeared over previous years, many of which have basic functions in common

• Users wish to search across national/regional boundaries, and between vendors.

See http://www.ariadne.ac.uk/issue21/z3950/See http://www.ariadne.ac.uk/issue21/z3950/

22

Learning from the past

• The Bath Profile is heavily influenced by• ATS–1• CENL• DanZIG• MODELS• ONE• Z Texas• vCUC• CIMI/Aquarelle

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

23

Learning from the past

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

24

Doing the work

• ZIP–PIZ–L mailing list, hosted by National Library of Canada

• Meeting face–to–face• JISC supported a face–to–face meeting in Bath

(UK) over the summer of 1999

• A draft was widely circulated for comment• Discussion and feedback world–wide

• Profile presented at DC7 in Frankfurt• Open Concertation day in the UK• etc.

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

25

Makx DekkersPricewaterhouseCoopers/ EC

Janifer GatenbyGEAC

Juha HakalaNational Library of Finland

Poul Henrik JoergensenDanish Library Centre

Carrol LunauNational Library of Canada

Paul MillerUKOLN

Slavko ManojlovichSIRSI/ Memorial University of Newfoundland

Bill MoenUniversity of North Texas

Judith PearceNational Library of Australia

Joe ZeemanCGI.

Doing the work

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

26

What we proposed

• Minimisation of ‘defaults’• Where possible, every attribute is defined in the Profile

(Use, Relation, Position, Structure, Truncation, Completeness)

• Three Functional Areas• Basic Bibliographic Search & Retrieval• Bibliographic Holdings Search & Retrieval• Cross–Domain Search & Retrieval

• Three Levels of Conformance in each Area.

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

27

What we proposed

• SUTRS or XML and UNIMARC or MARC21 for Bibliographic Search results

• SUTRS and Dublin Core (in XML) for Cross–Domain results

• Other record syntaxes also permitted, but conformant tools must support at least these.

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

28

Finishing it off…

• Bath Profile 1.1 accredited as ISO Internationally Registered Profile (IRP)• National Library of Canada as Maintenance Agency

• Direct approaches to international vendors• Already begun in North America. Europe still to do.

• User testing in Europe and North America• Does the Profile do what it’s meant to?

• Revision of CIMI Profile and others to include Bath as a core subset.

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

29

Finishing it off…

• Inclusion of explicit Bath Profile requirements in RFPs across North America and Europe already.• Bath Editorial Group working on stock text

• Addition of Functional Areas and Levels of Conformance as required• Community Information?

• Next open meeting in Newfoundland, 24–25 September.

See http://www.ukoln.ac.uk/interop–focus/bath/See http://www.ukoln.ac.uk/interop–focus/bath/

30

Relevance to EH…?

• Users want content• Good, quality controlled content, but not only

from EH

• EH sits on a wealth of information• But how much of it is really accessible to the

world?

• Joined–up everything• HEIRNET• Government agendas• ADS…

• Portalitis.

31

Users want content…

• Welcome to the information economy…• But if you’re ‘selling’, then they’re ‘customers’• People only ‘buy’ what they can see

• Educational agendas• Every school online by 2002, but where’s the

stuff?– National Grid for Learning– Learning for Life/ Social Inclusion– e–University– 24 Hour Museum.

See http://www.ukoln.ac.uk/metadata/education/See http://www.ukoln.ac.uk/metadata/education/

“Gimme stuff!”

32

Joined–up everything…

• Modernising Government/ Best Value• Government metadata/ interoperability rules

• Digital Scotland• The shape of things to come?

• DNER• Regional Broadband Consortia• HEIRNET/ ADS Gateway• NOF–DIGI

• 2 EH bids through to second stage?

• SOCITM.

33

Portalitis• Making your stuff portable and visible,

makes it reusable and valuable.• ukonline.gov/• Ask Giraffe• DNER• People’s Network/ NOF–DIGI• A(H)DS• Heritage Gateway?• 24 Hour Museum• National Grid for Learning• Paul’s portal• etc.

34

See www.ukoln.ac.uk/interop-focus/See www.ukoln.ac.uk/interop-focus/

Mail [email protected] [email protected]

Join www.mailbase.ac.uk/lists/interoperability/Join www.mailbase.ac.uk/lists/interoperability/