getting triples from records: the role of isbd

25
Getting triples from records: the role of ISBD Gordon Dunsire Presented at Centar zu Stalno Stručno Usavršavanje (CSSU), Zagreb 21 Nov 2011

Upload: maxime

Post on 12-Feb-2016

47 views

Category:

Documents


0 download

DESCRIPTION

Getting triples from records: the role of ISBD. Gordon Dunsire Presented at Centar zu Stalno Stručno Usavršavanje (CSSU), Zagreb 21 Nov 2011 . Linking Open Data cloud (LOD) September 2010. Diagram by Richard Cyganiak and Anja Jentzsch . http://lod-cloud.net/. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Getting triples from records: the role of ISBD

Getting triples from records: the role of ISBD

Gordon DunsirePresented at Centar zu Stalno Stručno

Usavršavanje (CSSU), Zagreb21 Nov 2011

Page 2: Getting triples from records: the role of ISBD

Linking Open Data cloud (LOD) September 2010

Diagram by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 3: Getting triples from records: the role of ISBD

Linking Open Data cloud (LOD) September 2011

Diagram by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 4: Getting triples from records: the role of ISBD

LOD: “Library” corner

Page 5: Getting triples from records: the role of ISBD

Why get involved?

To share our dataWe work for “society”

To share our expertise and experience150 + years

To promote the power of libraries (and archives and museums)

To survive

Page 6: Getting triples from records: the role of ISBD

From record to triples (in 9 stages)Very large numbers of records

Catalogue records, finding aids, etc.300 million; 1 billion?

High quality metadataIn comparison with other communities

Each record may generate many triples50-200 “raw” triples (no inferences) per MARC record?

Very, very large numbers of triplesBillions? Trillions?

Page 7: Getting triples from records: the role of ISBD

1. Take a recordField/attribute ValueRecord ID 54321Title Museum archives: an introductionAuthor Wythe, DeborahDate 2004LCSH Museum archivesMedia/GMD ElectronicContent form Text

Page 8: Getting triples from records: the role of ISBD

2. Disaggregate to single statementsRecord Attribute Value54321 (has) title Museum archives: an

introduction54321 (has) author Wythe, Deborah54321 (has) date 200454321 (has) LCSH Museum archives54321 (has) media type Electronic54321 (has) content form Text

Page 9: Getting triples from records: the role of ISBD

3. Create URI for record

Must be unique, so 54321 no good on its ownhttp URIs are a good thing (W3C)So add record ID to a unique http domain

E.g. http://MyLibraryX.com (unique to the library)+ 54321

http://MyLibraryX.com/54321(or http://MyLibraryX.com#54321)

This is not a URL!

Page 10: Getting triples from records: the role of ISBD

4. Replace record ID with URIURI Attribute Valuemlx:54321 (has) title Museum archives:

an introductionmlx:54321 (has) author Wythe, Deborahmlx:54321 (has) date 2004mlx:54321 (has) LCSH Museum archivesmlx:54321 (has) media type Electronicmlx:54321 (has) content form Text

“mlx” = qname (xmlns) = shorthand for “http://MyLibraryX.com/”

Page 11: Getting triples from records: the role of ISBD

5. Find URIs for attributesAttributes are modelled as RDF properties (predicates) in

“element set” namespacesE.g. Dublin Core terms (dct); ISBD (isbd); FRBR (frbrer); RDA

(rdaxxx); Bibliographic Ontology (bibo); etc.Choose a namespace, find property with same (or closest)

“meaning” (e.g. definition) as attributeNearest property minimises loss of information

Get URI for property If no suitable property, choose another namespace

Properties do not have to come from single namespaceMatch and mix!

Page 12: Getting triples from records: the role of ISBD

5 (cont). Find URI for titlehttp://purl.org/dc/terms/title (dct:title)http://iflastandards.info/ns/isbd/elements/

P1014 (isbd:P1014)hasTitleProper

http://RDVocab.info/Elements/titleProper (rdaGR1:titleProper)

Page 13: Getting triples from records: the role of ISBD

5 (cont). Find URI for authordct:creatorrdarole:author(isbd does not cover “headings”)

Page 14: Getting triples from records: the role of ISBD

5 (cont). Find URI for datedct:dateisbd:P1018

hasDateOfPublicationProductionDistributionrdaGr1:dateOfPublication

Page 15: Getting triples from records: the role of ISBD

5 (cont). Find URI for LCSHLCSH is a subject vocabulary

Controlled termsSo attribute is really “subject”

And the term itself is the valuedct:subject

Page 16: Getting triples from records: the role of ISBD

5 (cont). Find URI for media typeAssuming record uses new ISBD Area 0 ...isbd:P1003

hasMediaType

Page 17: Getting triples from records: the role of ISBD

5 (cont). Find URI for content formAssuming record uses new ISBD Area 0 ...isbd: P1001

hasContentForm

Page 18: Getting triples from records: the role of ISBD

6. Replace attributes with URIsURI URI Valuemlx:54321 isbd:P1014 Museum archives:

an introductionmlx:54321 rdarole:author Wythe, Deborahmlx:54321 isbd:P1018 2004mlx:54321 dct:subject Museum archivesmlx:54321 isbd:P1003 Electronicmlx:54321 isbd:P1001 Text

Page 19: Getting triples from records: the role of ISBD

7. Find URIs for values If object of a triple is a URI, it can link to the subject of

another triple with the same URILinked data!

Values from controlled vocabularies may have URIsPossible vocabularies: author, subject, ISBD Area 0NOT: title, date

For author: Virtual International Authority File (VIAF)For LCSH: Library of Congress Authorities &

VocabulariesFor ISBD Area 0: Open Metadata Registry

Page 20: Getting triples from records: the role of ISBD

7 (cont). Find URI for authorAuthor: Wythe, DeborahVIAF: http://www.viaf.org/

viaf:31899419/#Wythe,+Deborah

Page 21: Getting triples from records: the role of ISBD

7 (cont). Find URI for subject (LCSH)LCSH: Museum archivesLoC: http://id.loc.gov/authorities/

lcsh:/sh85088707#concept

Page 22: Getting triples from records: the role of ISBD

7 (cont). Find URIs for ISBD Area 0

Media type: ElectronicISBD media type

isbdmt:T1002Content form: TextISBD Content form

isbdcf:T1009

Page 23: Getting triples from records: the role of ISBD

8. Replace values with URIssubject predicate objectmlx:54321 isbd:P1014 “Museum archives: an

introduction”mlx:54321 rdarole:author viaf:31899419/#Wythe,

+Deborahmlx:54321 isbd:P1018 “2004”mlx:54321 dct:subject lcsh:/

sh85088707#concept mlx:54321 isbd:P1003 isbdmt:T1002mlx:54321 isbd:P1001 isbdcf:T1009

Page 24: Getting triples from records: the role of ISBD

9. Publish triples (linked data)mlx:54321 | isbd:P1014 | “Museum archives: an

introduction” mlx:54321 | rdarole:author | viaf:31899419/#Wythe,

+Deborahmlx:54321 | isbd:P1018 | “2004”

mlx:54321 | dct:subject | lcsh:/sh85088707#concept

mlx:54321 | isbd:P1003 | isbdmt:T1002

mlx:54321 | isbd:P1001 | isbdcf:T1009

isbdcf:T1009 | skos:prefLabel |”text”@en

isbdcf:T1009 | skos:prefLabel |”tekst”@hr

Page 25: Getting triples from records: the role of ISBD

Thank you

[email protected] Open Data cloud diagram

http://richard.cyganiak.de/2007/10/lod/