metadata, vocabularies and licensing managing research data in repositories workshop, 11 nov 2015...

38
Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Upload: amie-wright

Post on 18-Jan-2016

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Metadata, vocabularies and licensing

Managing research data in repositories workshop, 11 Nov 2015Kathryn Unsworth

Page 2: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

What I’ll cover today

A little context

What is metadata?● Definitions● Types of metadata

You say, I say, we say – metadata huh?!

Repository managers/staff helping researchers to:● Improve their metadata for discovery/interoperability - a brief

look at controlled vocabularies● Improve their metadata for reuse - a brief look at licensing

Page 3: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Can’t be done without metadata!

Page 4: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Data without metadata is wasted effort!

Copyrighted image – Wasting time - http://julettemillien.com/3-top-ways-we-waste-time-what-to-do-about-it/

Page 5: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

What is metadata?Definitions, types of metadata and examples

Page 6: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Metadata is “structured data about data”. It typically provides detailed information about a specific data object or file.DATUM in Action https://www.northumbria.ac.uk/static/5007/ceispdf/metaguide.pdf

What is metadata?

“Metadata is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource. Metadata is often called data about data or information about information.”--National Information Standards Organisation (NISO)http://www.niso.org/publications/press/UnderstandingMetadata.pdf

Metadata provides information enabling us to make sense of data (e.g. documents, images, datasets), concepts (e.g. classification schemes) and real-world entities (e.g. people, organisations, places).Open Data Support - https://joinup.ec.europa.eu/sites/default/files/d2.1.2_training_module_1.4_introduction_to_metadata_management_v1.00_en_0.pdf

Page 7: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

You say, I say, we say – are we talking

about the same thing?

Page 8: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

• Labels in a spreadsheet; accompanying data dictionary • Digital image attributes coming off photographic

equipment: image size, color depth, image resolution, when the image was created, etc.

• ReadMe file accompanying software code• File/Folder names• Meta tags for web pages• Meteorological measurements: location of readings

(latitude, longitude, and height), instrumentation used to collect data, units, processing actions

• Metadata for research dataset records

For example:

Page 9: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

The metadata “accompanying your data set should be written for a user 20 years into the future. Therefore, you should consider what that investigator needs to know to use your data. Write the documentation for a user who is unfamiliar with your project, sites, methods, or observations.“Oak Ridge National Laboratory, Distributed Active Archive Center (2010)

A message to your researchers:

Cea+. (2012). Metadata is a love not to the future https://flic.kr/p/digHTN CC By 2.0

Page 10: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Data without

metadata is a missed

opportunity!

Quote by Thomas Edison

“Opportunity is missed by most people because it is dressed in overalls and looks like work.”

Assumed copyrighted image from: https://www.pinterest.com/melindagordon22/citations-thomas-edison/

Page 11: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Common types of metadata:Descriptive metadata - describes a dataset for the purposes of discovery and identification

Structural metadata – models content types and attributes (records, elements, attributes) and also indicates how a dataset may form part of a multi-layered and/or complex data object (data collection)

Administrative metadata - provides information to help manage a dataset and ensure its authenticity (versions, ownership (IP), licensing)

Page 12: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Administrative metadata

Page 13: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Structural metadata

{A}

Repository managers have responsibility for some of the structural metadata that shapes their data collection records, i.e. what are the most appropriate elements/fields to include.

Page 14: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Structural metadata

Page 15: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Descriptive metadata

Page 16: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Data without metadata is only half the story!

So why are we telling half the story?

Assumed Copyrighted image from Google: https://www.google.com/search?q=telling+only+half+the+story+quotes&espv=2&biw=1680&bih=921&tbm=isch&tbo=u&source=univ&sa=X&ved=0CFgQ7AlqFQoTCICVz-7zk8kCFeXfpgodVZUGlg&dpr=1#tbm=isch&q=telling+only+half+the+story&imgrc=YnRV0S_VAPsZiM%3A

Page 17: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Aiding data discovery

Raising a researcher’s profile

Page 18: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

What is a controlled vocabulary?

Using thesauri, taxonomies and standardised lists of terms for assigning values to metadata properties.

Page 19: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Vocabularies in everyday life

Page 20: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Aiding data reuse

Vocabularies

Page 21: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Vocabularies & research

Page 22: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Vocabularies & data reuse {A}Data

Data dictionary

Page 23: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Vocabularies & data linking

{A}

Page 24: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

ANDS Vocabulary Service - RVA

Page 25: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

http://www.ands.org.au/ ANDS Vocabulary Service © Australian National Data Service 2015,

is licenced under a Creative Commons Attribution 4.0 International Licence (http://creativecommons.org/licenses/by/4.0/)

vocabs.ands.org.au

RVA Documentation Home:https://documentation.ands.org.au/display/DOC/Research+Vocabularies

Contact us to get started: [email protected]

Page 26: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Data reuse and the role of licensing

Page 27: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Aiding data reuse

Making Ts & Cs around data reuse explicit

Page 28: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Rights and data reuseFront facing in RDA

Registry view in RDA

Recent Release 18 changes to RIF-CS vocabulary - definitionsOpen: Data is publicly accessible onlineConditional: Data is publicly accessible online, subject to certain conditions.  For example: an embargo period; a fee applies.Restricted: Data access is limited.  For example: to a particular group of users; where formal permission is granted; the data may only be accessed at a specific physical location.

Page 29: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

AusGOAL licensing framework

AusGOAL contains eight licensing options:• Six Australian Creative Commons (CC) Version

4.0 licences • Restrictive Licence Template (RLT)• BSD 3-Clause Software Licence

ANDS endorses AusGOAL

Wide support by Federal and State Governments

Page 30: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth
Page 32: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

https://learn.canvas.net/courses/4/pages/compatibility-of-creative-commons-licenses?module_item_id=52575

Creative Commons Licenses compatibility

Page 33: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

The Australian Code for the Responsible Conduct of Research says "Researchers have a responsibility to their colleagues and the wider community to disseminate a full account of their research as broadly as possible". In terms of research data, the best way to achieve this objective is to license the data (using AusGOAL) and to place it in a publicly-accessible repository (along with appropriate metadata etc).

If you don't license the data, no-one else can use it; it's that simple!

A message for researchers

ANDS - Copyright, data and licensinghttp://ands.org.au/guides/copyright-and-data-awareness.html

Page 34: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

“Metadata has value for data users, data developers, and organizations. No dataset should be considered complete without accompanying metadata.

Data without metadata is useless”.

Source: U.S. Geological Survey - Core Science Analytics and Synthesis – Metadata

Page 35: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Dublin Core Element Set - Emphasis on web resources, publicationshttp://dublincore.org/documents/dces/

FGDC Content Standard for Digital Geospatial Metadata (CSDGM) - Emphasis on geospatial datahttp://www.fgdc.gov/metadata/geospatial-metadata-standards

ISO 19115/19139 Geographic information: Metadata - Emphasis on geospatial data and serviceshttp://www.fgdc.gov/metadata/geospatial-metadata-standards#fgdcendorsedisostandards

Metadata Standards Examples

Page 36: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

Metadata Standards Examples continued…

Ecological Metadata Language (EML) - Focus on ecological datahttp://knb.ecoinformatics.org/eml_metadata_guide.html

Darwin Core - Emphasis on museum specimenshttp://rs.tdwg.org/dwc/index.htm

Geography Markup Language (GML) - Emphasis on geographic features (roads, highways, bridges)http://www.opengeospatial.org/standards/gml

Page 37: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

In summary• Well-managed, well-connected, discoverable and

reusable data depend on metadata!

• Minimal metadata is acceptable for discovery, but can be useless for the reuse of data

• When talking with researchers and other data stakeholders, either be explicit about what metadata is in scope or better still refer to it in another way

• Controlled vocabs are highly useful for data discovery, interoperability, and reuse – check out RVA

• Data without an appropriate license is essentially a reuse nightmare

Page 38: Metadata, vocabularies and licensing Managing research data in repositories workshop, 11 Nov 2015 Kathryn Unsworth

This work is licensed under a Creative Commons Attribution 3.0 Australia License

ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy (NCRIS).