cerif 2008 tutorial

55
© Brigitte Jörg October 8th, 2008 Moscow, Russia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Brigitte Jörg, M.A. (Information Science) Language Technology Lab, Language Technology Lab, German Research Center for German Research Center for Artificial Intelligence (DFKI) Artificial Intelligence (DFKI) Saarbrücken, Germany Saarbrücken, Germany

Upload: sancha

Post on 23-Jan-2016

43 views

Category:

Documents


0 download

DESCRIPTION

CERIF 2008 Tutorial. Brigitte Jörg, M.A. (Information Science) Language Technology Lab, German Research Center for Artificial Intelligence (DFKI) Saarbrücken, Germany. Outline. Introduction of Speaker What is CERIF? Grounding Explanations Model Metadata Data-centric - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

1

Tutorial: CERIF 2008 Release

CERIF 2008 TutorialCERIF 2008 Tutorial

Brigitte Jörg, M.A. (Information Science)Brigitte Jörg, M.A. (Information Science)

Language Technology Lab,Language Technology Lab,

German Research Center for German Research Center for Artificial Intelligence (DFKI)Artificial Intelligence (DFKI)

Saarbrücken, GermanySaarbrücken, Germany

Page 2: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

2

Tutorial: CERIF 2008 Release

Outline

Introduction of Speaker

What is CERIF?

Grounding Explanations Model Metadata Data-centric Research Information

The Conceptual CERIF Model Entities Relationships Structure

The CERIF Semantic Layer in some Detail

The CERIF Evolution, Aim and Ongoing Activities

The CERIF 2008 Release

Page 3: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

3

Tutorial: CERIF 2008 Release

Introduction of Speaker

Brigitte Jörg M.A. Information ScienceInformation Systems, Business Administration

Project Manager, Researcher DFKI GmbH, Language Technology Lab Saarbrücken

CERIF TG Leader, Board Member euroCRIS

Contact: brigitte.joerg @ dfki.de http://www.dfki.de/~brigitte/

Page 4: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

4

Tutorial: CERIF 2008 Release

Equipment

ProjectProject OrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

What is CERIF ?

Common European Research Information Format

Page 5: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

5

Tutorial: CERIF 2008 Release

What is CERIF ?

Common European Research Information Format

– A Concept about Research Entities and their Relationships Specification (Conceptual Level)

– A Description of Research Entities and their Relationships Model (Logical Level)

– A Formalization of Research Entities and their Relationships

Database Scripts (Physical Level)

SQL Script-----------------------CREATE Table PersonCREATE Table ProjectCREATE Table OrgUnit

Page 6: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

6

Tutorial: CERIF 2008 Release

What is CERIF ?

Common European Research Information Format

(1) data model (data-centric)

(2) allows for a (metadata) representation of – research entities – their activities / interconnections (research)– their output (results)

(3) offers high flexibility with (semantic) relationships

(4) enables quality maintenance, archiving, access and interchange of research information

(5) supports knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors, the general public

Page 7: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

7

Tutorial: CERIF 2008 Release

What is a model ?

… is a simplified view to describe a particular area of interest

… allows for a better communication between parties (mutual understanding)

… supports (re-)design decisions

… supports workflow identification

… supports documentation

… can be exchanged, re-used, iterated, extended

A Binforms

C Dis part of

X Zdepends on

F Gwaits for

Page 8: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

8

Tutorial: CERIF 2008 Release

What is Metadata ?

„Metadata is structured data which describes the characteristics of a resource.”

An Introduction to Metadata, by Chris Taylor, University of Queensland

“Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation.” Metadata in a Nutshell, by Michael Day, UKOLN

Support a Wide Range of Operations

Page 9: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

9

Tutorial: CERIF 2008 Release

What is Metadata ?

Book: Title: The Hitchhiker‘s Guide to the GalaxyDate of Publication: 1979

Game Cover Image: The Hitchhiker‘s Guide to the Galaxy Source: http://egotron.com/Retrieved: May 30, 2008

Radio Series: Title: The Hitchhiker‘s Guide to the GalaxyDescription: is a science fiction comedy series created by Douglas Adams. Originally a radio comedy broadcast on BBC Radio 4 in 1978, […] Source: WikipediaDate of Query: May 30, 2008

Series of five Books: Title: The Hitchhiker‘s Guide to the Galaxy.Between: 1979 - 1982

TV Series: Title: The Hitchhiker‘s Guide to the GalaxyScreened: 1981

Computer Game: Title: The Hitchhiker‘s Guide to the GalaxyReleased: 1984

Comic Book Adaptions: Title: The Hitchhiker‘s Guide to the GalaxyBetween: 1993 – 1996

Links: http://www.bbc.co.uk/cult/hitchhikers/HTML-Title: Cult – The Hitchhiker‘s Guide to the Galaxyhttp://en.wikipedia.org/wiki/The_Hitchhiker's_Guide_to_the_GalaxyHTML-Title: The Hitchhiker's Guide to the Galaxy

Data about D

ata

Structure: • Type of Resource• Title• Description• Source• Date• Author, Creator, …

Metadata

Metadata

Metadata

Metadata

Metadata

Metadata

Metadata

Metadata

Page 10: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

10

Tutorial: CERIF 2008 Release

What is Data-centric ?

CitationTypesType:

Description:

PublicationURI:Type:Title:PartOf:PublDate:

Article Requests 2007Journal X = 4Journal Y = 0Journal Z = 15

Ends in 2010Journals: Y, Z

OrganisationURI:

Name:Abbreviation:Publications:

Academic Staff:

Journal Publications 2007Institute A = 4Institute B = 10Institute C = 9

OrganisationURI:Name:hasAccess:EndOfAccessContactPerson:

Journal SubscriptionsJournal X = 1990 - 2000Journal Y = 2005 - 2010Journal Z = 2001 - 2010

PhD Students 2008Computer Science = 200

Physics = 50Social Sciences = 9

First Author / No of Papers Person H = 10/35Person I = 4/12Person J = 1/10

Citations in 2007Paper M (publish 2007) = 20Paper N (publish 2004) = 100 Paper O (publish 2001) = 0

DataMetadata

Page 11: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

11

Tutorial: CERIF 2008 Release

What is Data-centric ?

– Data / Metadata in the center – Data Maintenance, Curation, Preservation

and Quality a major Interest – Enabling added-value Services based on

qualitative Data

– Enabling requested views for various stakeholders based on qualitative Data

Page 12: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

12

Tutorial: CERIF 2008 Release

What is Research Information ?

Data/Metadata or Information about:• Scientists• Project Managers• Ongoing and Completed Projects• Research Departments• Funding Organisations and Programmes• Research Results• Publications• Equipment• …

• their timely Relationships (Semantics)

Page 13: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

13

Tutorial: CERIF 2008 Release

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

Common European Research Information Format

Page 14: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

14

Tutorial: CERIF 2008 Release

Common European Research Information Format

• A model to manage Research Information

• Research Entities• Project, Person, Organisation, Publication• Funding Programme, Service, Equipment, • Patent, Product, …

• Activities / Interconnections in their Context• Relationships• Semantics / Roles / Types

-> for Exchange -> for Interoperability-> for Implementation of CRISs

(Current Research Information Systems)

Page 15: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

15

Tutorial: CERIF 2008 Release

CERIF Structure

Core Entities

2nd Level Entities

Language-related Entities

Link Entities

Classification Entities (Semantic Layer)

Page 16: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

16

Tutorial: CERIF 2008 Release

Core Entities

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

Page 17: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

17

Tutorial: CERIF 2008 Release

Core Entities

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

PublicationIDURITitleSubtitleAbstractBibl. NotePublicationDateTotalPagesStartPageEndPageKeywords

PersonIDURISexFirstNamesOtherNamesFamilyNamesNameVariantsResearchInterestKeywords

ProjectIDURIAcronymStartDateEndDateTitleAbstractKeywords

OrganisationIDURIAcronymNameHeadCountCurrencyCodeTurnoverResearchActivityKeywords

Page 18: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

18

Tutorial: CERIF 2008 Release

2nd Level Entities

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

Page 19: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

19

Tutorial: CERIF 2008 Release

2nd Level Entities

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

ResultPatent

ResultProduct

Service

Equipment

FundingProgramme

Facility

CV

Event

Skills

Person OrganisationUnit

Project

ResultPublication

Person OrganisationUnit

Project

ResultPublication

FacilityIDURINameDescriptionKeywords

FundingProgrammeIDURINameCurrencyCodeBudgetStartDateEndDateDescriptionKeywords

EventIDURINameFeeOrFreeStartDateEndDateCityTownCountryCodeDescriptionKeywords

ResultPatentIDURIPatentNumberTitleCountryCodeRegistrationDateApprovalDateDescriptionKeywords

ServiceIDURINameDescriptionKeywords

Page 20: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

20

Tutorial: CERIF 2008 Release

Language-related Entities

Page 21: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

21

Tutorial: CERIF 2008 Release

Language-related Entities

PublicationTitle [language]Abstract [languange]Keywords [language]

OrganisationName [language]ResearchActivity [languange]Keywords [language]

ProjectTitle [language]Abstract [languange]Keywords [language]

PersonResearchInterest [language]Keywords [language]

FacilityName [language]Description [languange]Keywords [language]

ServiceName [language]Description [languange]Keywords [language]

PatentName [language]Description [languange]Keywords [language]

ProductName [language]Description [languange]Keywords [language]

Page 22: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

22

Tutorial: CERIF 2008 Release

Link Entities

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublication

OrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnitOrgUnit

ProjectProject

ResultPublication

Project_OrganisationUnitProject_Person

Person_ResultPublication

OrganisationUnitPerson_OrganisationUnit

ResultPublication

Project_ResultPublication

Page 23: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

23

Tutorial: CERIF 2008 Release

Link Entities

Person_PublicationpersIDpublIDClassificationClassificationSchemeStartDate; EndDate

Project_PersonprojIDperslIDClassificationClassificationSchemeStartDate; EndDate

Person_OrganisationpersIDorgIDClassificationClassificationSchemeStartDate; EndDate

Person Publication

Person Organisation

Project Person

Semantics

Semantics

Semantics

Page 24: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

24

Tutorial: CERIF 2008 Release

Link Entities

Person_PublicationpersIDpublIDClassificationClassificationSchemeStartDate; EndDate

Project_PersonprojIDperslIDClassificationClassificationSchemeStartDate; EndDate

Organisation_PublicationorgIDpublIDClassificationClassificationSchemeStartDate; EndDate

Project_PublicationpersIDpublIDClassificationClassificationSchemeStartDate; EndDate

Project_OrganisationprojIDorgIDClassificationClassificationSchemeStartDate; EndDate

Person_OrganisationpersIDorgIDClassificationClassificationSchemeStartDate; EndDate

Project_PublicationprojIDpublIDClassificationClassificationSchemeStartDate; EndDate

Page 25: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

25

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer)

Formal Semantics / Values for Link Entities

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

Page 26: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

26

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Abstract)

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

ClassificationClassIDClassSchemeIDTerm [language]Description [language]StartDate, EndDateURI

ClassificationSchemeClassSchemeIDDescription [language]URI

Classification_ClassificationClassID1 (Term1)ClassID2 (Term2)ClassSchemeID1 (Schema1)ClassSchemeID2 (Schema1)ClassId (Role)ClassSchemeID (RoleSchema)StartDate, EndDate

ClassScheme_ClassSchemeClassSchemeID1ClassSchemeID2ClassID (Role)ClassSchemeID (RoleSchema)StartDate, EndDate

Page 27: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

27

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Example)

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

ClassificationClassID AE (Answer Extraction)ClassSchemeID LT (Language Technology)Term [EN] Answer ExtractionDescription [EN] AE is the method … StartDate, EndDate 2008-10-08, openURI http://www.lt-world.org/Technologies/IE/AE

Classification_ClassificationClassID1 AE (Answer Extraction)ClassID2 IE (Information Extraction)ClassSchemeID1 LT (Language Technology)ClassSchemeID2 LT (Language Technology)ClassId isAClassSchemeID Taxonomic RelationshipsStartDate, EndDate 2008-10-08, open

ClassScheme_ClassSchemeClassSchemeID1 LT (Language Technology)ClassSchemeID2 ONT (Ontology)ClassID isAClassSchemeID Taxonomic RelationshipsStartDate, EndDate 2008-10-08,open

ClassificationSchemeClassSchemeID LT (Language Technology) Description [EN] The Language Technology Schema is an ontology …URI http://www.lt-world.org/

Subject Headings

Page 28: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

28

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Example)

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

ClassificationClassID PM (is manager of)ClassSchemeID PPR (Person-Project-Roles)Term [EN] is manager ofDescription [EN] A project manager is respon- sible for the successful … StartDate, EndDate 2008-10-08, openURI i.e.:PPR=PM

Classification_ClassificationClassID1 PM (is manager of)ClassID2 pMM (project management)ClassSchemeID1 PPR-Roles (Org1-Roles)ClassSchemeID2 pMM-Roles (Org2-Roles)ClassId isSimilarClassSchemeID SimilarityRelationshipsStartDate, EndDate 2008-10-08, open

ClassScheme_ClassSchemeClassSchemeID1 PPR-Roles (Org1-Roles)ClassSchemeID2 pMM-Roles (Org2-Roles)ClassID isMappedToClassSchemeID Project MM Mappings StartDate, EndDate 2008-10-08,open

ClassificationSchemeClassSchemeID PPR-Roles Description [EN] The PPR-Roles Scheme collects the Person-Project Roles in the LT World SystemURI http://www.lt-world.org/internal/PPR-Roles

Role Schemes

Page 29: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

29

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Example)

Person OrgUnit

Project

ResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnitPersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Project_OrgUnitProject_Person

Person_ResultPublicationOrgUnit_ResultPublication

Project_ResultPublication

PersonPerson OrgUnitOrgUnit

ProjectProject

ResultPublicationResultPublication

Person_OrganisationUnitRole=CEO

Project_OrganisationUnitRole=Organiser

Project_PersonRole=Co-ordinator

Person_ResultPublicationRole=Author

OrgUnit_ResultPublicationRole=Publisher

Project_ResultPublicationRole=TechnicalReport

OrganisationUnit

ClassificationClassID cfART (Article)ClassSchemeID cfPT (Publication Types)Term [EN] Description [EN] An article is usually published in …StartDate, EndDate 2008-10-08, openURI http://www.eurocris.org/CERIF/cfPT=cfART

Classification_ClassificationClassID1 cfART (Article)ClassID2 btART (Article)ClassSchemeID1 cfPT (Publication Types)ClassSchemeID2 btPT (Publication Types)ClassId isEqualToClassSchemeID EquationRelationshipsStartDate, EndDate 2008-10-08, open

ClassScheme_ClassSchemeClassSchemeID1 cfPT (Publication Types)

ClassSchemeID2 btPT (Publication Types)ClassID isMappingOfClassSchemeID CERIF-BibTex MappingStartDate, EndDate 2008-10-08,open

ClassificationSchemeClassSchemeID cfPTDescription [EN] The CERIF Scheme for thePublication Types has been developped …URI http://www.eurocris.org/CERIF/cfPT

Type Schemes

Page 30: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

30

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Types)

BookReview

BookChapter

BookChapter Abstract

Inbook

BookChapter Review

Anthology MonographReference

Book

Textbook

Encyclopedia

Otherbook

Journal

JournalArticle

JournalArticle Abstract

JournalArticle Review

ConferenceProceedings Article

Letter toEditor

PhD Thesis

Doctoral Thesis

Poster Presentation

BookManual

ConferenceProceedings Letter

Report

ShortCommunication

Commentary

Annotation

NewsClipping

PublicationTypes

Page 31: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

31

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Roles)

is author (numbered)

of

is author

of

isreviewer

of

is author (percentage)

of

is editor (numbered)

of

is editor

of

is subject

of

is translator

of

is publisher

of

Person_PublicationScheme

Page 32: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

32

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer: Roles)

number ofauthors

number ofincomingcitations

number of requests

number ofexternal institutes

number ofdownloads

number ofaccess

is ofpublication type

ISIImpact Factor

claimsIPRof

Publication_MetricsRoles

receivedBest Paper

Award

number ofself

citations

area/typeof research

number ofcitations

Page 33: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

33

Tutorial: CERIF 2008 Release

Classification Entities (Semantic Layer added Value)

Allows to capture any Schema or Structure• Flat Lists• Taxonomies• Ontologies

Open / Extensible in all directions• New Schemas• New Concepts / Terms• New Relationships

Enables to manage• Roles / Types Semantics• Subject Headings • Archiving (Time component)

Allows for simple Mappings between Schemas Allows for a efficient (independent) Maintenance

Page 34: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

34

Tutorial: CERIF 2008 Release

What for ?

CitationTypesType:

Description:

PublicationURI:Type:Title:PartOf:PublDate:

Article Requests 2007Journal X = 4Journal Y = 0Journal Z = 15

Ends in 2010Journals: Y, Z

OrganisationURI:

Name:Abbreviation:Publications:

Academic Staff:

Journal Publications 2007Institute A = 4Institute B = 10Institute C = 9

OrganisationURI:Name:hasAccess:EndOfAccessContactPerson:

Journal SubscriptionsJournal X = 1990 - 2000Journal Y = 2005 - 2010Journal Z = 2001 - 2010

PhD Students 2008Computer Science = 200

Physics = 50Social Sciences = 9

First Author / No of Papers Person H = 10/35Person I = 4/12Person J = 1/10

Citations in 2007Paper M (publish 2007) = 20Paper N (publish 2004) = 100 Paper O (publish 2001) = 0

DeductionInferencingReasoning

2007 -> 2008Computer Science =-20Physics = -5Social Science = +2

Most RequestedJournal: Z

Page 35: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

35

Tutorial: CERIF 2008 Release

What for ?

http://www.ist-world.org/

Aim: investigate the thematic range of SSA projects in FP6

Thematic Areas (Blue Clouds):SEMANTICHEALTHLEGALCHANGINGROADMAPSOFTWARE

Projects (Red Dots)Linked with Full Record in Repository

Page 36: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

36

Tutorial: CERIF 2008 Release

What for ?

http://www.ist-world.org/

Aim: investigate the thematic range of SSA projects in FP6

Goals

Themes

Page 37: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

37

Tutorial: CERIF 2008 Release

What for ?

http://www.ist-world.org/

Aim: investigate the collaboration of SSA partners in FP6

Number of joint partners

Project

Page 38: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

38

Tutorial: CERIF 2008 Release

What for ?

What questions do we expect to answer with CERIF?

How many articles has author X published in 2007 as a first author?

How often have articles by author X been cited? Did author X publish with institutionally external authors? In how many FP7 projects does organisation Z participate? How many publications have resulted from project Y? How many people have been employed in the course of

FP6 projects from the 1st call in the NMS? How many PhD students have participated in FP6 projects? How many women have been involved in FP6 projects? How often have articles from journal A been requested in 2007? How many articles have been published in the field of B? …

Page 39: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

39

Tutorial: CERIF 2008 Release

The CERIF Evolution

EU Working Group

on Research DatabasesWorkshop

1987 1991

CERIF 91CERIF 91

PROJECT

Similar IdeasUN/UNESCOOECDCODATA

Acronym: ERGOParticipant: Keith Jeffery, Anne Asser son, many moreOrganisations: Rutherford Appleton, Uni- versity of Bergen, …

Acronym: ERGOParticipant: Keith Jeffery, Anne Asser son, many moreOrganisations: Rutherford Appleton, Uni- versity of Bergen, …

2000

CLASSIFICATIONCLASSIFICATION

RESULTSRESULTS EQUIPMENTEQUIPMENT

PROJECTPROJECT

OrgUnitOrgUnit PERSONPERSON

EXPERTISERoles Roles

CERIF 2000 CERIF 2000 ModelModel

- Networking of DBs- Exchange of Records

- Recommendation to Member States

- Data Model (RDBMS, OO, IR)- Multilinguality- Controlled Vocabulary- Roles / Types- User-driven

- EC Recommendation to Member States

ProjectProject OrganisationOrganisation

Service

Funding Programme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

PublicationEquipment

2ndLevel

CORE

Language

SemanticsLink

CERIF 2006 / CERIF 2006 / 2008 2008 ModelModel

- Data Model (RDBMS, OO, IR)- Model Normalization - Robust Structure - Extensible Structure - Consistent Structure - Semantic Layer - XML Exchange Specification

- Connectivity to Repositories (Elaboration on Publication)

2006 2008

Page 40: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

40

Tutorial: CERIF 2008 Release

CERIF 91

– published in a first release – recommended to Member States

• to harmonise databases on research projects• ease exchange of comparable information• guidelines for building research databases

– only dealt with research project records– demonstrated in the ERGO pilot project

• access to more than 80.000 project records• from more than 20 national information services

– demonstrated the feasability of exchange – identified the need for more detailed guidelines– confirmed the need to revise CERIF and extend it to other types of

research information, not only projects– revision activities started in 1997 co-ordinated by the EC – led to CERIF 2000

Page 41: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

41

Tutorial: CERIF 2008 Release

CERIF 2000

– a full CRIS data model with flexibility to accomodate many database structures

– a base framework for data exchange– multilingual subject indexing (Ortelius Thesaurus)– recommendations for controlled attribute values– reflection on user groups and requirements– types of research information – metadata environment as a uniform summary view– extensions to

• Organisations• Persons• Results: Products, Patent, Publication• Expertise• Equipment and Facilities

Page 42: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

42

Tutorial: CERIF 2008 Release

What is going on ?

JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”

by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

Many available Schemas (DC, METS, MODS, …)

Each schema was singularly developed and not designed as an overal architecture to cover integrated object entities

JISC recommends therefore to overcome the problem by best practise guidelines and pragmatic application

Issues of duplicate information (overlap in sections of metadata) need rules and are currently being addressed by the library community in good practise guidelines

Page 43: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

43

Tutorial: CERIF 2008 Release

What is going on ?

JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”

by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

– Descriptive Metadata (intellectual contents)

– Administrative Metadata (technical metadata [file formats], rights management, provenance [info on creation, subsequent treatment, responsibility, …])

– Structural Metadata (internal structure of items: e.g.: page order, …)

• METS • DIDL• …

Page 44: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

44

Tutorial: CERIF 2008 Release

What is going on ?

JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”

by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

XML is of great importance to embed and make use of namespaces Combining Metadata standards, even a limited such as described

above, will always be messier than utilising a single standard that combines their taxonomic powers and resolves any potential clashes or duplications between them.

Integration by itself would of course be of little consequence if the standards themselves failed to address the metadata needs of the digital library community. In this respect, the provenance of each standard is of some importance. All have been constructed by authoritative standard setters within their communities.

Most of the mentioned standards have proved their ability to meet the requirements of major and highly complex digital collections.

Page 45: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

45

Tutorial: CERIF 2008 Release

What is going on ?

Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/

Page 46: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

46

Tutorial: CERIF 2008 Release

What CERIF aims for

Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

Page 47: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

47

Tutorial: CERIF 2008 Release

What CERIF aims for

Equipment

ProjectProjectOrganisationOrganisation

Service

FundingProgramme

Patent

Skills

CV

Product

Event

PersonPerson

Classification(Semantics)

Classification(Semantics)

Publication

Enabling the ERA eInfrastructure

Standardization / Integration / Interchange

Added-Value Services

Middle (Interoperability)-Layer for EU Research Information

Page 48: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

48

Tutorial: CERIF 2008 Release

Activities

• UK: Research Councils specified to use CERIF as the format for IT processes and MM information

• UK: STFC (Corporate Data Repository)

• BE: Flanders – CERIF as Standard Interchange Format• DK: Danish Universities PURE -> CERIF

• EUROPEESF: CERIF for IS under discussionCORDIS, EC R&D Service: Asked for CERIF presentation EuroHORCS: Recommendation for CERIF; ESF joined as a euroCRIS member

Page 49: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

49

Tutorial: CERIF 2008 Release

Activities

• IST World SSA (project)• Videolectures.net (Teaching Videos)• BioDiversa ERANET (project)• IWETO (BE): Integrating Flemish Research Information• FRIDA (NO): Joint university CRIS• Fdok (NO): University of Bergen, results• METIS (NL): currently used by Dutch Universities• STFC (UK): Corporate Data Repository• HUNCRIS (HU): Access to R&D in Hungary• SICRIS (SI): Access to University Research in Slovenia• SRIS (UK): Scottish Research Information Systems, public research in

Scotland • AURIS-MM (AT): Provides access to Austrian University Research extended

with multimedia• ICERIS (IS): Access to Information on Icelandic Research Projects & R&D

Results• CRIS-MER (EC): Research information on Migration and ethnic Relations

(planned)

Page 50: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

50

Tutorial: CERIF 2008 Release

CERIF TG Activity

Regular CERIF TG meetings and Discussions Tests and major bugfixes before Releases Strong Relation to ongoing implementation activities

(Geert van Grootel, EWI, Flanders; atira A/S, Aalborg, Denmark)

Exchange with TG Best Practice (Ales Bosniak, IZUM, Slovenia)

Collaborate with TG Institutional Repositories (IR-CERIF)(Anna Clements, University of St. Andrews, UK)

Next Steps: Extension of Semantic Layer with Content Check Tools for Managing the Semantics Mappings of major Schemas (Standards) Check OAI Wrapping CERIF Ontology

Page 51: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

51

Tutorial: CERIF 2008 Release

Active People

Active participation in current release (2008): Brigitte Jörg, (German Res Center for AI) TG Leader Keith G. Jeffery (UK Science and Techn Facilities Council) Geert van Grootel (Flemish Ministry) Anne Asserson (University Bergen) Henrik Rasmussen (atira A/S) Adrian Price (University Copenhagen) Thomas Vestam (atira A/S)

Active participation in past release (2006): Ojars Krast (uniCRIS AG) Edward Grabczewski (UK Science and Tech Facil Council)

Page 52: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

52

Tutorial: CERIF 2008 Release

CERIF 2008 Release

Model Introduction and Specification Document

Full Data Model, SQL Database Scripts

XML Data Exchange Specification Document

XML Example Files

XML Schemas for XML Validation

CERIF Types / Roles / Semantics

http://www.eurocris.org/

Page 53: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

53

Tutorial: CERIF 2008 Release

XML Interchange Format

According to W3C Standards Refers to XML Schemas for Validation XML files corresponding to Entities / Separation of Relationships

<XML><PERSON> <ID>1</ID> <FirstName>Anne</FirstName> <LastName>Asserson</LastName> <URI>http://www.linkedin.com1</URI> <Sex>female</Sex></PERSON><PERSON> <ID>2</ID> <FirstName>Keith</FirstName> <LastName>Jeffery</LastName> <OtherNames>G.</OtherNames> <URI>http://www.linkedin.com2</URI> <Sex>male</Sex></PERSON>---</XML>

<XML><PUBLICATION> <ID>1</ID> <Title language=„EN“>Grey in the R&D Process</Title> <Date>2006</Date> <URI>http://www.epubs.org/ID1</URI></PUBLICATION><PUBLICATION> <ID>2</ID> <Title language=„EN“>What‘s new in Grey Literature …</Title> <Date>2005</Date> <URI>http://www.greynet.org/thegrey journal.html?ID2</URI></PUBLICATION>---</XML>

Page 54: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

54

Tutorial: CERIF 2008 Release

CERIF 2006 Implementation Thesaurus / Semantics @ EWI

Page 55: CERIF 2008 Tutorial

© Brigitte Jörg October 8th, 2008 Moscow, Russia

55

Tutorial: CERIF 2008 Release

Example: GeneratingPublication Reference Records

CERIF attributes CERIF entities CERIF types / Comment

cfResultPublicationId cfResultPublication attribute of Core entity

cfResultPublicationDate cfResultPublication attribute of Core entity

cfVolume cfResultPublicaion attribute of Core entity

cfEdition cfResultPublication attribute of Core entity

cfSeries cfResultPublication attribute of Core entity

cfIssue cfResultPublication attribute of Core entity

cfStartPage cfResultPublication attribute of Core entity

cfEndPage cfResultPublication attribute of Core entity

cfTotalPages cfResultPublication attribute of Core entity

cfISBN cfResultPublication attribute of Core entity

cfISSN cfResultPublication attribute of Core entity

cfUniformResourceIdentifier cfResultPublication attribute of Core entity

cfNameAbbreviation cfResultPublicationNameAbbreviation attribute of LanguageRelated entity

cfTitle cfResultPublicationTitle attribute of LanguageRelated entity

cfSubtitle cfResultPublicationSubtitle attribute of LanguageRelated entity

cfAbstract cfResultPublicationAbstract attribute of LanguageRelated entity

cfKeywords cfResultPublicationKeywords attribute of LanguageRelated entity

cfBibliographicNote cfResultPublicationBibliographicNote attribute of LanguageRelated entity cfPersonId

cfPerson_ResultPublicationReference to Person in Link entity with assigned Role [Semantic Layer: i.e. isAuthorOf]

cfOrgUnitIdcfOrgUnit_ResultPublication

Reference to OrgUnit in Link entity with assigned Role [Semantic Layer: i.e. isPublisherOf]

cfCopyright cfPerson_ResultPublication, cfOrgUnit_ResultPublication

attribute of Link entities [Statement about property rights]

cfCityTown cfPostAddress attribute of 2ndLevel entity [For location of conference / address of publisher]

cfCountryCode cPostAddress attribute of 2ndLevel entity [For location of conference / address of publisher]

cfStartDate cfEvent attribute of 2ndLevel entity [For startDate of conference]

cfEndDate cfEvent attribute of 2ndLevel entity [For startDate of conference]

cfSourceId only used in CERIFXML for identification of the dataset - not in the model

CERIF Classification Values CERIF Link Entities (Semantics) Types / Comment

cfIsAuthorOf cfPerson_ResultPublication value of PersonPublicationRoleSchema (Semantic Layer)

cfIsPublisherOf cfOrgUnit_ResultPublication value of OrgUnitPublicationRoleSchema (Semantic Layer)

cfPublicationType cfResultPublication_ResultPublication value of publicationTypeSchema (Semantic Layer)

CERIF ResultPublication Entity for Reference Building

@article{615182, author = {Veda C. Storey}, title = {Understanding semantic relationships}, journal = {The VLDB Journal}, volume = {2}, number = {4}, year = {1993}, issn = {1066-8888}, pages = {455--488}, doi = {http://dx.doi.org/10.1007/BF01263048}, publisher = {Springer-Verlag New York, Inc.}, address = {Secaucus, NJ, USA}, }