© brigitte jörg june 4th, 2008 in maribor, slovenia 1 tutorial: cerif 2008 release cerif 2008...

45
© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Brigitte Jörg, M.A. (Information Science) Language Technology Lab, Language Technology Lab, German Research Center for German Research Center for Artificial Intelligence (DFKI) Artificial Intelligence (DFKI) Saarbrücken, Germany Saarbrücken, Germany

Upload: chloe-charles

Post on 13-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

1

Tutorial: CERIF 2008 Release

CERIF 2008 TutorialCERIF 2008 Tutorial

Brigitte Jörg, M.A. (Information Science)Brigitte Jörg, M.A. (Information Science)

Language Technology Lab,Language Technology Lab,

German Research Center for German Research Center for Artificial Intelligence (DFKI)Artificial Intelligence (DFKI)

Saarbrücken, GermanySaarbrücken, Germany

Page 2: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

2

Tutorial: CERIF 2008 Release

Outline

Active People

What is CERIF?

Explanations (Metadata, data-centric, Model)

The CERIF Model (Entities, Relationships, Structure)

The CERIF Semantic Layer in some Detail

The CERIF XML Interchange Format

Related Activities

The CERIF Evolution

The CERIF Aim and Current Activities

Page 3: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

3

Tutorial: CERIF 2008 Release

Active People

Active participation in current release (2008): Brigitte Jörg, (German Res Center for AI) TG Leader Keith G. Jeffery (UK Science and Techn Facilities Council) Geert van Grootel (Flemish Ministry) Anne Asserson (University Bergen) Henrik Rasmussen (atira A/S) Adrian Price (University Copenhagen) Thomas Vestam (atira A/S)

Active participation in past release (2006): Ojars Krast (uniCRIS AG) Edward Grabczewski (UK Science and Techn Facil Council)

Page 4: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

4

Tutorial: CERIF 2008 Release

What is CERIF ?

Common European Research Information Format

(1) data model (data-centric focus)

(2) allows for a (metadata) representation of – research entities – their activities / interconnections (research)– their output (results)

(3) enables quality maintenance, archiving, access and interchange of research information

(4) supports knowledge transfer to researchers / research managers / research strategists / publication editors / media / brokers / the general public

Page 5: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

5

Tutorial: CERIF 2008 Release

Metadata ?

Book: Title: The Hitchhiker‘s Guide to the GalaxyDate of Publication: 1979

Game Cover Image: The Hitchhiker‘s Guide to the Galaxy Source: http://egotron.com/Retrieved: May 30, 2008

Radio Series: Title: The Hitchhiker‘s Guide to the GalaxyDescription: is a science fiction comedy series created by Douglas Adams. Originally a radio comedy broadcast on BBC Radio 4 in 1978, […] Source: WikipediaDate of Query: May 30, 2008

Series of five Books: Title: The Hitchhiker‘s Guide to the Galaxy.Between: 1979 - 1982

TV Series: Title: The Hitchhiker‘s Guide to the GalaxyScreened: 1981

Computer Game: Title: The Hitchhiker‘s Guide to the GalaxyReleased: 1984

Comic Book Adaptions: Title: The Hitchhiker‘s Guide to the GalaxyBetween: 1993 – 1996

Links: http://www.bbc.co.uk/cult/hitchhikers/HTML-Title: Cult – The Hitchhiker‘s Guide to the Galaxyhttp://en.wikipedia.org/wiki/The_Hitchhiker's_Guide_to_the_GalaxyHTML-Title: The Hitchhiker's Guide to the Galaxy

Data about D

ata

Structure: • Type of Resource• Title• Description• Source• Date• Author, Creator, …

Page 6: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

6

Tutorial: CERIF 2008 Release

What is Metadata ?

„Metadata is structured data which describes the characteristics of a resource.”

An Introduction to Metadata, by Chris Taylor, University of Queensland

“Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation.” Metadata in a Nutshell, by Michael Day, UKOLN

Page 7: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

7

Tutorial: CERIF 2008 Release

What is Metadata for ?

„Metadata is structured data which describes the characteristics of a resource.”

An Introduction to Metadata, by Chris Taylor, University of Queensland

“Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation.” Metadata in a Nutshell, by Michael Day, UKOLN

Support a Wide Range of Operations

Page 8: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

8

Tutorial: CERIF 2008 Release

What is data-centric ?

CitationTypesType:

Description:

PublicationURI:Type:Title:PartOf:PublDate:

Article Requests 2007Journal X = 4Journal Y = 0Journal Z = 15

Ends in 2010Journals: Y, Z

OrganisationURI:

Name:Abbreviation:Publications:

Academic Staff:

Journal Publications 2007Institute A = 4Institute B = 10Institute C = 9

OrganisationURI:Name:hasAccess:EndOfAccessContactPerson:

Journal SubscriptionsJournal X = 1990 - 2000Journal Y = 2005 - 2010Journal Z = 2001 - 2010

PhD Students 2008Computer Science = 200

Physics = 50Social Sciences = 9

First Author / No of Papers Person H = 10/35Person I = 4/12Person J = 1/10

Citations in 2007Paper M (publish 2007) = 20Paper N (publish 2004) = 100 Paper O (publish 2001) = 0

DataMetadata

Page 9: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

9

Tutorial: CERIF 2008 Release

What is data-centric ?

– Data / Metadata in the center – Data Maintenance, Curation, Preservation

and Quality a major interest – Enabling added-value Services based on

quality data

– Enabling requested views for various stakeholders based on quality data

Page 10: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

10

Tutorial: CERIF 2008 Release

What is a model ?

A model is a simplified view to describe a particular area of interest

It allows for a better communication between interested parties

It supports mutual understanding

It supports (re-)design decisions

It supports documentation

It can be exchanged, re-used, iterated, extended

A Binforms

Page 11: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

11

Tutorial: CERIF 2008 Release

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

Common European Research Information Format

Page 12: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

12

Tutorial: CERIF 2008 Release

Common European Research Information Format

• A model to manage Research Information

• Research Entities• Project, Person, Organisation, Publication• Funding Programme, Service, Equipment, • Patent, Product, …

• Activities / Interconnections in the Research Context• Relationships• Semantics / Roles / Types

-> to exchange research information -> to enable interoperability-> to build CRISs

Page 13: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

13

Tutorial: CERIF 2008 Release

CERIF Structure

Core Entities

2nd Level Entities

Link Entities

Language-related Entities

Classification Entities (Semantic Layer)

Page 14: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

14

Tutorial: CERIF 2008 Release

Core Entities

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

Page 15: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

15

Tutorial: CERIF 2008 Release

Core Entities

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

PublicationIDURITitleSubtitleAbstractKeywordsBibl. NotePublicationDateTotalPagesStartPageEndPageClassifications

PersonIDURISexFirstNamesOtherNamesFamilyNamesNameVariantsResearchInterestKeywordsClassifications

ProjectIDURIAcronymStartDateEndDateTitleAbstractKeywordsClassifications

OrganisationIDURIAcronymNameHeadCountCurrencyCodeTurnoverResearchActivityKeywordsClassifications

Page 16: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

16

Tutorial: CERIF 2008 Release

2nd Level Entities

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

Page 17: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

17

Tutorial: CERIF 2008 Release

2nd Level Entities

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

FacilityIDURINameDescriptionKeywordsClassifications

FundingProgrammeIDURINameCurrencyCodeBudgetStartDateEndDateDescriptionKeywordsClassifications

EventIDURINameFeeOrFreeStartDateEndDateCityTownCountryCodeDescriptionKeywordsClassifications

ResultPatentIDURIPatentNumberTitleCountryCodeRegistrationDateApprovalDateDescriptionKeywordsClassifications

ServiceIDURINameDescriptionKeywordsClassifications

Page 18: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

18

Tutorial: CERIF 2008 Release

Link Entities

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublication

OrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnitOrgUnit

ProjectProject

ResultPublication

Project_OrganisationUnitProject_Person

Person_ResultPublication

OrganisationUnitPerson_OrganisationUnit

ResultPublication

Project_ResultPublication

Page 19: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

19

Tutorial: CERIF 2008 Release

Link Entities

Person_PublicationpersIDpublIDClassificationsStartDate; EndDate

Project_PersonprojIDperslIDClassificationsStartDate; EndDate

Organisation_PublicationorgIDpublIDClassificationsStartDate; EndDate

Project_PublicationpersIDpublIDClassificationsStartDate; EndDate

Project_OrganisationprojIDorgIDClassificationsStartDate; EndDate

Person_OrganisationpersIDorgIDClassificationsStartDate; EndDate

Project_PublicationprojIDpublIDClassificationsStartDate; EndDate

Page 20: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

20

Tutorial: CERIF 2008 Release

Language-related Entities

Page 21: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

21

Tutorial: CERIF 2008 Release

Language-related Entities

PublicationTitle [language]Abstract [languange]Keywords [language]

OrganisationName [language]ResearchActivity [languange]Keywords [language]

ProjectTitle [language]Abstract [languange]Keywords [language]

PersonResearchInterest [language]Keywords [language]

FacilityName [language]Description [languange]Keywords [language]

ServiceName [language]Description [languange]Keywords [language]

PatentName [language]Description [languange]Keywords [language]

ProductName [language]Description [languange]Keywords [language]

Page 22: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

22

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer)

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

Page 23: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

23

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer)

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

ClassificationClassIDClassSchemeIDTerm [language]Description [language]StartDate, EndDateURI

ClassificationSchemeClassSchemeIDDescription [language]URI

Classification_ClassificationClassID1 (Term1)ClassID2 (Term2)ClassSchemeID1 (Schema1)ClassSchemeID2 (Schema2)ClassId (Role)ClassSchemeID (RoleSchema)StartDate, EndDate

ClassScheme_ClassSchemeClassSchemeID1ClassSchemeID2ClassID (Role)ClassSchemeID (RoleSchema)StartDate, EndDate

Page 24: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

24

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer)

Publication_ClassificationPublicationType=JournalPublication_Classification

Publication_ClassificationReviewType=peer-reviewedPublication_Classification

PersonPersonPersonPersonPersonClassificationClassification

Publication_ClassificationAccessType=openAccessPublication_Classification

Publication

Publication_ClassificationImpactFactorType=diametricPublication_Classification

Publication_ClassificationCategory=commissionedPublication_Classification

Page 25: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

25

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer)

personIDLastnameOtherNamesFirstNameSex

publicationIDTitleAbstractKeywordsDate

BookArticleTechnical ReportThesis……isAuthorisEditorisReviewer…

Relationship

personIDpublicationID

Relationship

personIDpublicationID

PersonPersonPersonPersonPersonClassificationPerson

PersonPersonPersonPersonPersonClassificationPublicationPersonPersonPersonPersonPersonClassificationClassification

Page 26: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

26

Tutorial: CERIF 2008 Release

Semantic LayerSome CERIF Types

Page 27: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

27

Tutorial: CERIF 2008 Release

Semantic LayerSome CERIF Relationship Roles

Page 28: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

28

Tutorial: CERIF 2008 Release

Semantic LayerMany Schemas (publicly) available

For Publication Records:– Dublin Core– Marc Code– Digital Item Declaration Language (DIDL)– Metadata Object Description Schema (MODS)– …

For Audio/Video Files:– Metadata Encoding and Transmission Standard (METS)– …

For Subject Headings:– Ortelius Thesaurus– …

Page 29: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

29

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer)

Allows to capture any Schema or Structure• Flat Lists• Taxonomies• Ontologies

Open / Extensible in all directions• New Schemas• New Concepts / Terms• New Relationships

Enables to manage• Roles / Types Semantics• Subject Headings • Archiving (Time component)

Allows for simple Mappings between Schemas Allows for a efficient (independent) Maintenance

Page 30: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

30

Tutorial: CERIF 2008 Release

XML Interchange Format

According to W3C Standards Refers to XML Schemas for Validation XML files corresponding to Entities / Separation of Relationships

<XML><PERSON> <ID>1</ID> <FirstName>Anne</FirstName> <LastName>Asserson</LastName> <URI>http://www.linkedin.com1</URI> <Sex>female</Sex></PERSON><PERSON> <ID>2</ID> <FirstName>Keith</FirstName> <LastName>Jeffery</LastName> <OtherNames>G.</OtherNames> <URI>http://www.linkedin.com2</URI> <Sex>male</Sex></PERSON>---</XML>

<XML><PUBLICATION> <ID>1</ID> <Title language=„EN“>Grey in the R&D Process</Title> <Date>2006</Date> <URI>http://www.epubs.org/ID1</URI></PUBLICATION><PUBLICATION> <ID>2</ID> <Title language=„EN“>What‘s new in Grey Literature …</Title> <Date>2005</Date> <URI>http://www.greynet.org/thegrey journal.html?ID2</URI></PUBLICATION>---</XML>

Page 31: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

31

Tutorial: CERIF 2008 Release

CERIF 2008 Release

Model Introduction and Specification Document

Full Data Model, SQL Database Scripts

XML Data Exchange Specification Document

XML Example Files

XML Schemas for XML Validation

CERIF Types / Roles / Semantics as XML

http://www.eurocris.org/

June / July 2008

Page 32: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

32

Tutorial: CERIF 2008 Release

What is going on ?

Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/

Page 33: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

33

Tutorial: CERIF 2008 Release

What is going on ? JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”

by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

Many available Schemas (DC, METS, MODS, …)

Each schema was singularly developed and not designed as an overal architecture to cover integrated object entities

JISC recommends therefore to overcome the problem by best practise guidelines and pragmatic application

Issues of duplicate information (overlap in sections of metadata) need rules and are currently being addressed by the library community in good practise guidelines

Page 34: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

34

Tutorial: CERIF 2008 Release

What is going on ? JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”

by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

– Descriptive Metadata (intellectual contents)

– Administrative Metadata (technical metadata [file formats], rights management, provenance [info on creation, subsequent treatment, responsibility, …])

– Structural Metadata (internal structure of items: e.g.: page order, …)

• METS • DIDL• …

Page 35: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

35

Tutorial: CERIF 2008 Release

What is going on ? JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”

by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

XML is of great importance to embed and make use of namespaces Combining Metadata standards, even a limited such as described

above, will always be messier than utilising a single standard that combines their taxonomic powers and resolves any potential clashes or duplications between them.

Integration by itself would of course be of little consequence if the standards themselves failed to address the metadata needs of the digital library community. In this respect, the provenance of each standard is of some importance. All have been constructed by authoritative standard setters within their communities.

Most of the mentioned standards have proved their ability to meet the requirements of major and highly complex digital collections.

Page 36: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

36

Tutorial: CERIF 2008 Release

What CERIF aims for

Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

Page 37: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

37

Tutorial: CERIF 2008 Release

What CERIF aims for

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

Enabling the ERA eInfrastructure

Standardization / Integration / Interchange

Added-Value Services

Middle (Interoperability)-Layer for EU Research Information

Page 38: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

38

Tutorial: CERIF 2008 Release

What CERIF aims for

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

The ultimate answer to Life, the Universe, and Everything.

from “The Hitchhiker’s Guide to the Galaxy” by Douglas Adams

Page 39: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

39

Tutorial: CERIF 2008 Release

Example: GeneratingPublication Reference Records

BibTexEndnote...

@article{615182, author = {Veda C. Storey}, title = {Understanding semantic relationships}, journal = {The VLDB Journal}, volume = {2}, number = {4}, year = {1993}, issn = {1066-8888}, pages = {455--488}, doi = {http://dx.doi.org/10.1007/BF01263048}, publisher = {Springer-Verlag New York, Inc.}, address = {Secaucus, NJ, USA}, }

Best Practice Guide

Page 40: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

40

Tutorial: CERIF 2008 Release

The CERIF Evolution

EU Working Group

on Research DatabasesWorkshop

1987 1991

CERIF 91CERIF 91

PROJECT

Similar IdeasUN/UNESCOOECDCODATA

Acronym: ERGOParticipant: Keith Jeffery, Anne Asser son, many moreOrganisations: Rutherford Appleton, Uni- versity of Bergen, …

Acronym: ERGOParticipant: Keith Jeffery, Anne Asser son, many moreOrganisations: Rutherford Appleton, Uni- versity of Bergen, …

2000

CLASSIFICATIONCLASSIFICATION

RESULTSRESULTS EQUIPMENTEQUIPMENT

PROJECTPROJECT

OrgUnitOrgUnit PERSONPERSON

EXPERTISERoles Roles

CERIF 2000 CERIF 2000 ModelModel

- Networking of DBs- Exchange of Records

- Recommendation to Member States

- Data Model (RDBMS, OO, IR)- Multilinguality- Controlled Vocabulary- Roles / Types- User-driven

- EC Recommendation to Member States

ProjectProject OrganisationOrganisation

Service

Funding Programme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

PublicationEquipment

2ndLevel

CORE

Language

SemanticsLink

CERIF 2006 / CERIF 2006 / 2008 2008 ModelModel

- Data Model (RDBMS, OO, IR)- Model Normalization - Robust Structure - Extensible Structure - Consistent Structure - Semantic Layer - XML Exchange Specification

- Connectivity to Repositories (Elaboration on Publication)

2006 2008

Page 41: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

41

Tutorial: CERIF 2008 Release

CERIF 91

– published in a first release – recommended to Member States

• to harmonise databases on research projects• ease exchange of comparable information• guidelines for building research databases

– only dealt with research project records– demonstrated in the ERGO pilot project

• access to more than 80.000 project records• from more than 20 national information services

– demonstrated the feasability of exchange – identified the need for more detailed guidelines– confirmed the need to revise CERIF and extend it to other types of

research information, not only projects– revision activities started in 1997 co-ordinated by the EC – led to CERIF 2000

Page 42: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

42

Tutorial: CERIF 2008 Release

CERIF 2000

– a full CRIS data model with flexibility to accomodate many database structures

– a base framework for data exchange– multilingual subject indexing (Ortelius Thesaurus)– recommendations for controlled attribute values– reflection on user groups and requirements– types of research information – metadata environment as a uniform summary view– extensions to

• Organisations• Persons• Results: Products, Patent, Publication• Expertise• Equipment and Facilities

Page 43: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

43

Tutorial: CERIF 2008 Release

Major Current Activities

• UK: Research Councils specified to use CERIF as the format for IT processes and MM information

• UK: STFC (Corporate Data Repository)

• BE: Flanders – CERIF as Standard Interchange Format• DK: Danish Universities PURE -> CERIF

• EUROPEESF: CERIF for IS under discussionCORDIS, EC R&D Service: Asked for CERIF presentation EuroHORCS: Recommendation for CERIF; join as a euroCRIS member, from its taskgroup

Page 44: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

44

Tutorial: CERIF 2008 Release

Major Current Activities • Videolectures.net (Teaching Videos)• ICERIS (IS): Access to Information on Icelandic Research Projects & R&D

Results• AURIS-MM (AT): Provides access to Austrian University Research extended

with multimedia• SICRIS (SI): Access to University Research in Slovenia• HUNCRIS (HU): Access to R&D in Hungary• SRIS (UK): Scottish Research Information Systems, public research in

Scotland • CRIS-MER (EC): Research information on Migration and ethnic Relations

(planned)• STFC (UK): Corporate Data Repository• METIS (NL): currently used by Dutch Universities• Fdok (NO): University of Bergen, results• FRIDA (NO): Joint university CRIS• IWETO (BE): Integrating Flemish Research Information• BioDiversa ERANET (project)• IST World SSA (project)

Page 45: © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia

45

Tutorial: CERIF 2008 Release

CERIF TG Activity

Regular CERIF TG meetings and Discussions Tests and major bugfixes before Releases Strong Relation to ongoing implementation activities

(Geert van Grootel, EWI, Flanders; atira A/S, Aalborg, Denmark)

Exchange with Best Practice (Ales Bosniak, IZUM, Slovenia)

Collaborate with new TG Institutional Repositories (IR-CERIF)(Anna Clements, University of St. Andrews, UK)

Next Steps: Extension of Semantic Layer with Content Check Tools for Managing the Semantics Mappings of major Schemas (Standards) Check OAI Wrapping CERIF Ontology