our standards vs their standards

20
Our Standards vs. Their Standards: Development and Re- Use of Non-Library Standards in the Cultural Heritage Domain Lars G. Svensson | 20 | Our Standards vs. Their Standards | August 19, 2015 1

Upload: lars-g-svensson

Post on 24-Jan-2017

371 views

Category:

Technology


0 download

TRANSCRIPT

Our Standards vs. Their Standards: Development and Re-Use of Non-Library Standards in the Cultural Heritage Domain

Lars G. Svensson

| 20 | Our Standards vs. Their Standards | August 19, 20151

2

Information exchange in libraries has gone a long way during the last 50 years

| 20 | Our Standards vs. Their Standards | August 19, 2015

Phot

o by

Bea

trice

Mur

ch (C

C BY

-SA)

: http

://fli

ckr.c

om/p

hoto

s/bl

mur

ch/4

6562

3933

/

3 | 20 | Our Standards vs. Their Standards | August 19, 2015

Libraries increasingly exchange information with organisations that are not libraries (or at least they should!)

Search engine

Library Re-searcher

Archive,museum

Foto

von

Doc

Sea

rls (C

C BY

): ht

tp://

www.

flick

r.com

/pho

tos/

docs

earls

/550

0714

140/

4 | 20 | Our Standards vs. Their Standards | August 19, 2015

But how do we do that in an interoperable way?

ILS

MARC only beyond this point

5 | 20 | Our Standards vs. Their Standards | August 19, 2015

By using common data standards!

ISOOpen Geospatial (OGC)

W3C

IEEE

IETF

6 | 20 | Our Standards vs. Their Standards | August 19, 2015

What we want to share is knowledge, but the best we can do is to share data

Knowledge

DataData

Information Information

Knowledge

Sender Receiver

7 | 20 | Our Standards vs. Their Standards | August 19, 2015

That data, however, should be as machine-interpretable as possible

Phot

o by

KIU

I Sta

ff (C

C BY

): ht

tp://

www.

flick

r.com

/pho

tos/

kiui

/369

3823

005/

Library data

Otherdata

Somedata

8 | 20 | Our Standards vs. Their Standards | August 19, 2015

1) 123 < 123.456 < 124

2) 123 < 123,456 < 124

Example 1: Floating point numbers

9 | 20 | Our Standards vs. Their Standards | August 19, 2015

1) 123 < 123.456 < 124

2) 123 < 123,456 < 124

float f = Float.valueOf( “123,456” );(Gives you a NumberFormatException)

Example 1: Floating point numbers

10 | 20 | Our Standards vs. Their Standards | August 19, 2015

$c — Confidence valueDescribes the confidence of the agency using the process/activity identified in subfield $a to generate the linked field. The subfield contains a floating point value between 0 and 1. Either a comma or a point may be used as a decimal marker. 0 means no confidence and and [sic!] 1 means full confidence.(http://www.loc.gov/marc/bibliographic/bd883.html)

At some places, MARC 21 isn’t quite clear about this...

11 | 20 | Our Standards vs. Their Standards | August 19, 2015

The most important date format is ISO 8601– 20150819 – 2015-08-19– 2015-08-19T07:30:00+02:00and its derivatives, e. g. EDTF– ~2015 (approximately 2015)– 201u (one of the years 2010-2019, but we don’t know which)

Example 2: (Birth-)Dates in Authority Data

12 | 20 | Our Standards vs. Their Standards | August 19, 2015

MARC 21:– 100 $d 1240 or 41-ca. 1316 (for people)– 046 $f [1240,1241] $g ~1316 $2 EDTF (for machines)UNIMARC– 200 $f 1240 or 41-ca. 1316 (for people)– 640 $f #1240____? $i #1316____? (for machines)

In MARC, dates are expressed differently at different places

13 | 20 | Our Standards vs. Their Standards | August 19, 2015

Example 3: Geographic coordinates

14 | 20 | Our Standards vs. Their Standards | August 19, 2015

Library formats support bounding boxes and polygons

Source: https://en.wikipedia.org/wiki/File:Western_Cape_in_South_Africa.svg

Type Examples

Point POINT (30 10)

LineString LINESTRING (30 10, 10 30, 40 40)

Polygon

POLYGON ((30 10, 40 40, 20 40, 10 20, 30 10))

POLYGON ( (35 10, 45 45, 15 40, 10 20, 35 10), (20 30, 35 35, 30 20, 20 30))

15 | 20 | Our Standards vs. Their Standards | August 19, 2015

A very important non-library format for coordinates is WKT (well-known text)

Exam

ple

take

n fro

m h

ttps:

//en.

wiki

pedi

a.or

g/wi

ki/W

ell-k

nown

_tex

t

16 | 20 | Our Standards vs. Their Standards | August 19, 2015

Record the points in the polygon as long/lat and “list coordinate pairs in clockwise order, starting with the most south-eastern vertex of the polygon. [...] The first and last coordinate pairs are the same. [...] If an area or areas within a given polygon are excluded, list the coordinate pairs for any excluded area in counterclockwise order.“ (RDA §7.4.3.3)

The upcoming content standard RDA has instructions for coordinates too, but different ones…

| 20 | Our Standards vs. Their Standards | August 19, 2015

“Record dates in terms of the calendar preferred by the agency creating the data” and to “record a date associated with a person by giving the year.” An option used by PCC, BL and D-A-CH is to “add the month or month and day in the form [year] [month] [day] or [year] [month]. Record the month in a language and script preferred by the agency creating the data.” (RDA §9.3.1.3)

Dates in RDA are not really aligned with ISO 8601, either

17

| 20 | Our Standards vs. Their Standards | August 19, 2015

So what we really need, is a system that mediates between the cataloguing code and the exchange format(s)

18

13 ነሐሴ 2007

Décade I Doudi,

Fructidor an CCXXIII

2015-08-19

Date

con

vers

ion

thro

ugh

http

://ww

w.fu

naba

.org

/cc

אלול 4 ,רביעי5775

19 | 20 | Our Standards vs. Their Standards | August 19, 2015

And we must ensure that we export data using well-known, widely adopted standards

My dad’s standard is better than your dad’s!

Picture provided by e r j k p r u n c z y k (CC BY-SA): https://www.flickr.com/photos/24842486@N07/6162798898/ Picture by Pascal (CC BY): https://www.flickr.com/photos/pasukaru76/12144956203/

Who cares about your standard, anyway?

20 | 20 | Our Standards vs Their Standards | August 19, 2015

Common standards for future data exchange

http

://ww

w.ge

dank

enko

nsum

.de/

wp-c

onte

nt/u

ploa

ds/2

011/

03/th

e-fu

ture

.jpg