kathryn cassidy - dri training series: 4. metadata and xml

29
Kathryn Cassidy Software Engineer, Digital Repository of Ireland (Trinity College Dublin) Metadata & XML

Upload: driireland

Post on 13-Apr-2017

291 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Kathryn Cassidy

Software Engineer, Digital Repository of Ireland (Trinity College Dublin)

Metadata & XML

Page 2: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Digital Repository of IrelandDRI is a trusted digital repository for Humanities and Social Sciences Data in Ireland, launched June 2015

• Provides preservation and access to digital collections

• Born digital and digitised collections including maps, photographs, letters,

audio-visual, sound, books, oral histories, paintings..

Page 3: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
Page 4: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

https://repository.dri.ie/

Page 5: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Metadata & XML

• Define Metadata• Identify benefits of using standards-compliant metadata• Familiarise ourselves with XML• Create XML Metadata by Interactive Example

By the end of this morning’s session we will have be able to do the following…

Page 6: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

What is Metadata?

Baby Em at 3 months !First house in Ranelagh

Page 7: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

What is Metadata?

Technical metadata – hardware, software, file formats, resolution, size

Preservation metadata – provenance, authenticity, preservation actions, responsibility (eg. PREMIS)

Structural metadata – physical/logical structure of digital resources (eg. METS)

Descriptive metadata – describes the digital resource; catalogue records/finding aids

Page 8: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Human readable metadataA handwritten or typewritten listing or finding aid

Can be easily read and understood

Can be accessible in physical or digital medium

Can be free-text searched

Page 9: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Machine readable metadataIn a format that can be understood by computers

Structured representation of information

Described using particular standards (eg. XML, HTML, RDF)

Allows processing, exchange and analysis

Page 10: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

The importance of Structure & standards

K. Cassidy

Page 11: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Why use standard metadata?

Using standardised descriptive metadata means adhering to the best practices in your domain.

Standardised metadata allows you to control how records are described within your organisation too.

Enforcing standards allows greater searchability of your records.

Metadata sharing and interoperability is only possible when a standard is used.

Quality metadata enables analysis, manipulation and “value-added services”

Page 12: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Seeing Standards: A Visualization of the Metadata Universe, Jenn Riley & Devin Becker, http://www.dlib.indiana.edu/~jenlrile/metadatamap/

Page 13: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Digital Archiving in Ireland:

National Survey of the Humanities and Social Sciences

Page 14: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

DRI metadata guidelines

•Dublin core and Qualified Dublin Core

• MODS

• EAD

• MARC

Page 15: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Simple Dublin Core Metadata Element Set1. Title

2. Creator3. Subject4. Description5. Publisher6. Contributor7. Date8. Type

9. Format10. Identifier11. Source12. Language13. Relation14. Coverage15. Rights

Page 16: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

DRI metadata guidelines

Page 17: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

1. Title Ulysses

2. Creator James Joyce

3. Subject Stream of consciousness;Modern novel;Turn of century Dublin;Book covers

4. Description Traces the character Leopold Bloom as he walks around Dublin on 16 June, 1904

Or

Scan of first edition, hard cover5. Publisher Shakespeare and Company

6. Contributor

7. Date 1922

Page 18: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Metadata Quality Control – DRI guidewww.dri.ie/publications

Page 19: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

“Extensible Markup Language (XML) is a markup language that defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. It is defined by the W3C's XML 1.0 Specification and by several other related specifications, all of which are free open standards.”

- https://en.wikipedia.org/wiki/XML

What is XML?“Extensible Markup Language (XML) is a markup language that defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. It is defined by the W3C's XML 1.0 Specification and by several other related specifications, all of which are free open standards.”

- https://en.wikipedia.org/wiki/XML

Page 20: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Machine readable metadataIn a format that can be understood by computers

Structured representation of information

Described using particular standards (eg. XML, HTML, RDF)

Allows processing, exchange and analysis

Page 21: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

“Extensible Markup Language (XML) is a markup language that defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. It is defined by the W3C's XML 1.0 Specification and by several other related specifications, all of which are free open standards.”

- https://en.wikipedia.org/wiki/XML

What is XML?

Page 22: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
Page 23: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
Page 24: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
Page 25: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

<participants> <person role="teacher"> <id>1</id> <name> <personalname>Kathryn</personalname> <familyname>Cassidy</familyname> </name> <affiliation>DRI</affiliation> <affiliation>TCD</affiliation> </person> <person role="learner"> <id>2</id> <name> <personalname>Clare</personalname> <familyname>Lanigan</familyname> </name> <affiliation>DRI</affiliation> <affiliation>RIA</afiliation> </person> .... </participants>

XML Structure

An XML document consists of a set of elements, which can be nested within eachother

All elements must have an opening and closing tag of the form <tagname>… </tagname>

Elements may also have attributes

Page 26: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

<tramTicket><type>return</type><from>Central 1</from><to>Red 2</to><validUntil>Last Tram</validUntil><date>31 Jul 06</date><for>Adult</for><on>Luas only</on><timeIssued>21:15</timeissued><price>2.90</price><number>6004375019</number>

</tramTicket>

XML does not DO anything

Page 27: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Useful Links

W3 schools xml tutorial http://www.w3schools.com/xml/default.asp

W3 schools xslt tutorial http://www.w3schools.com/xsl/default.asp

Page 28: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Examples

Page 29: Kathryn Cassidy - DRI Training Series: 4. Metadata and XML