© brigitte jörg october 8th, 2008 moscow, russia 1 tutorial: cerif 2008 release cerif 2008...
TRANSCRIPT
© Brigitte Jörg October 8th, 2008 Moscow, Russia
1
Tutorial: CERIF 2008 Release
CERIF 2008 TutorialCERIF 2008 Tutorial
Brigitte Jörg, M.A. (Information Science)Brigitte Jörg, M.A. (Information Science)
Language Technology Lab,Language Technology Lab,
German Research Center for German Research Center for Artificial Intelligence (DFKI)Artificial Intelligence (DFKI)
Saarbrücken, GermanySaarbrücken, Germany
© Brigitte Jörg October 8th, 2008 Moscow, Russia
2
Tutorial: CERIF 2008 Release
Outline
Introduction of Speaker
What is CERIF?
Grounding Explanations Model Metadata Data-centric Research Information
The Conceptual CERIF Model Entities Relationships Structure
The CERIF Semantic Layer in some Detail
The CERIF Evolution, Aim and Ongoing Activities
The CERIF 2008 Release
© Brigitte Jörg October 8th, 2008 Moscow, Russia
3
Tutorial: CERIF 2008 Release
Introduction of Speaker
Brigitte Jörg M.A. Information ScienceInformation Systems, Business Administration
Project Manager, Researcher DFKI GmbH, Language Technology Lab Saarbrücken
CERIF TG Leader, Board Member euroCRIS
Contact: brigitte.joerg @ dfki.de http://www.dfki.de/~brigitte/
© Brigitte Jörg October 8th, 2008 Moscow, Russia
4
Tutorial: CERIF 2008 Release
Equipment
ProjectProject OrganisationOrganisation
Service
FundingProgramme
Patent
Skills
CV
Product
Event
PersonPerson
Classification(Semantics)
Classification(Semantics)
Publication
What is CERIF ?
Common European Research Information Format
© Brigitte Jörg October 8th, 2008 Moscow, Russia
5
Tutorial: CERIF 2008 Release
What is CERIF ?
Common European Research Information Format
– A Concept about Research Entities and their Relationships Specification (Conceptual Level)
– A Description of Research Entities and their Relationships Model (Logical Level)
– A Formalization of Research Entities and their Relationships
Database Scripts (Physical Level)
SQL Script-----------------------CREATE Table PersonCREATE Table ProjectCREATE Table OrgUnit
© Brigitte Jörg October 8th, 2008 Moscow, Russia
6
Tutorial: CERIF 2008 Release
What is CERIF ?
Common European Research Information Format
(1) data model (data-centric)
(2) allows for a (metadata) representation of – research entities – their activities / interconnections (research)– their output (results)
(3) offers high flexibility with (semantic) relationships
(4) enables quality maintenance, archiving, access and interchange of research information
(5) supports knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors, the general public
© Brigitte Jörg October 8th, 2008 Moscow, Russia
7
Tutorial: CERIF 2008 Release
What is a model ?
… is a simplified view to describe a particular area of interest
… allows for a better communication between parties (mutual understanding)
… supports (re-)design decisions
… supports workflow identification
… supports documentation
… can be exchanged, re-used, iterated, extended
A Binforms
C Dis part of
X Zdepends on
F Gwaits for
© Brigitte Jörg October 8th, 2008 Moscow, Russia
8
Tutorial: CERIF 2008 Release
What is Metadata ?
„Metadata is structured data which describes the characteristics of a resource.”
An Introduction to Metadata, by Chris Taylor, University of Queensland
“Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation.” Metadata in a Nutshell, by Michael Day, UKOLN
Support a Wide Range of Operations
© Brigitte Jörg October 8th, 2008 Moscow, Russia
9
Tutorial: CERIF 2008 Release
What is Metadata ?
Book: Title: The Hitchhiker‘s Guide to the GalaxyDate of Publication: 1979
Game Cover Image: The Hitchhiker‘s Guide to the Galaxy Source: http://egotron.com/Retrieved: May 30, 2008
Radio Series: Title: The Hitchhiker‘s Guide to the GalaxyDescription: is a science fiction comedy series created by Douglas Adams. Originally a radio comedy broadcast on BBC Radio 4 in 1978, […] Source: WikipediaDate of Query: May 30, 2008
Series of five Books: Title: The Hitchhiker‘s Guide to the Galaxy.Between: 1979 - 1982
TV Series: Title: The Hitchhiker‘s Guide to the GalaxyScreened: 1981
Computer Game: Title: The Hitchhiker‘s Guide to the GalaxyReleased: 1984
Comic Book Adaptions: Title: The Hitchhiker‘s Guide to the GalaxyBetween: 1993 – 1996
Links: http://www.bbc.co.uk/cult/hitchhikers/HTML-Title: Cult – The Hitchhiker‘s Guide to the Galaxyhttp://en.wikipedia.org/wiki/The_Hitchhiker's_Guide_to_the_GalaxyHTML-Title: The Hitchhiker's Guide to the Galaxy
Data about D
ata
Structure: • Type of Resource• Title• Description• Source• Date• Author, Creator, …
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
Metadata
© Brigitte Jörg October 8th, 2008 Moscow, Russia
10
Tutorial: CERIF 2008 Release
What is Data-centric ?
CitationTypesType:
Description:
PublicationURI:Type:Title:PartOf:PublDate:
Article Requests 2007Journal X = 4Journal Y = 0Journal Z = 15
Ends in 2010Journals: Y, Z
OrganisationURI:
Name:Abbreviation:Publications:
Academic Staff:
Journal Publications 2007Institute A = 4Institute B = 10Institute C = 9
OrganisationURI:Name:hasAccess:EndOfAccessContactPerson:
Journal SubscriptionsJournal X = 1990 - 2000Journal Y = 2005 - 2010Journal Z = 2001 - 2010
PhD Students 2008Computer Science = 200
Physics = 50Social Sciences = 9
First Author / No of Papers Person H = 10/35Person I = 4/12Person J = 1/10
Citations in 2007Paper M (publish 2007) = 20Paper N (publish 2004) = 100 Paper O (publish 2001) = 0
DataMetadata
© Brigitte Jörg October 8th, 2008 Moscow, Russia
11
Tutorial: CERIF 2008 Release
What is Data-centric ?
– Data / Metadata in the center – Data Maintenance, Curation, Preservation
and Quality a major Interest – Enabling added-value Services based on
qualitative Data
– Enabling requested views for various stakeholders based on qualitative Data
© Brigitte Jörg October 8th, 2008 Moscow, Russia
12
Tutorial: CERIF 2008 Release
What is Research Information ?
Data/Metadata or Information about:• Scientists• Project Managers• Ongoing and Completed Projects• Research Departments• Funding Organisations and Programmes• Research Results• Publications• Equipment• …
• their timely Relationships (Semantics)
© Brigitte Jörg October 8th, 2008 Moscow, Russia
13
Tutorial: CERIF 2008 Release
Equipment
ProjectProjectOrganisationOrganisation
Service
FundingProgramme
Patent
Skills
CV
Product
Event
PersonPerson
Classification(Semantics)
Classification(Semantics)
Publication
Common European Research Information Format
© Brigitte Jörg October 8th, 2008 Moscow, Russia
14
Tutorial: CERIF 2008 Release
Common European Research Information Format
• A model to manage Research Information
• Research Entities• Project, Person, Organisation, Publication• Funding Programme, Service, Equipment, • Patent, Product, …
• Activities / Interconnections in their Context• Relationships• Semantics / Roles / Types
-> for Exchange -> for Interoperability-> for Implementation of CRISs
(Current Research Information Systems)
© Brigitte Jörg October 8th, 2008 Moscow, Russia
15
Tutorial: CERIF 2008 Release
CERIF Structure
Core Entities
2nd Level Entities
Language-related Entities
Link Entities
Classification Entities (Semantic Layer)
© Brigitte Jörg October 8th, 2008 Moscow, Russia
16
Tutorial: CERIF 2008 Release
Core Entities
Person OrganisationUnit
Project
ResultPublication
Person OrganisationUnit
Project
ResultPublication
© Brigitte Jörg October 8th, 2008 Moscow, Russia
17
Tutorial: CERIF 2008 Release
Core Entities
Person OrganisationUnit
Project
ResultPublication
Person OrganisationUnit
Project
ResultPublication
PublicationIDURITitleSubtitleAbstractBibl. NotePublicationDateTotalPagesStartPageEndPageKeywords
PersonIDURISexFirstNamesOtherNamesFamilyNamesNameVariantsResearchInterestKeywords
ProjectIDURIAcronymStartDateEndDateTitleAbstractKeywords
OrganisationIDURIAcronymNameHeadCountCurrencyCodeTurnoverResearchActivityKeywords
© Brigitte Jörg October 8th, 2008 Moscow, Russia
18
Tutorial: CERIF 2008 Release
2nd Level Entities
ResultPatent
ResultProduct
Service
Equipment
FundingProgramme
Facility
CV
Event
Skills
Person OrganisationUnit
Project
ResultPublication
ResultPatent
ResultProduct
Service
Equipment
FundingProgramme
Facility
CV
Event
Skills
Person OrganisationUnit
Project
ResultPublication
Person OrganisationUnit
Project
ResultPublication
© Brigitte Jörg October 8th, 2008 Moscow, Russia
19
Tutorial: CERIF 2008 Release
2nd Level Entities
ResultPatent
ResultProduct
Service
Equipment
FundingProgramme
Facility
CV
Event
Skills
Person OrganisationUnit
Project
ResultPublication
ResultPatent
ResultProduct
Service
Equipment
FundingProgramme
Facility
CV
Event
Skills
Person OrganisationUnit
Project
ResultPublication
Person OrganisationUnit
Project
ResultPublication
FacilityIDURINameDescriptionKeywords
FundingProgrammeIDURINameCurrencyCodeBudgetStartDateEndDateDescriptionKeywords
EventIDURINameFeeOrFreeStartDateEndDateCityTownCountryCodeDescriptionKeywords
ResultPatentIDURIPatentNumberTitleCountryCodeRegistrationDateApprovalDateDescriptionKeywords
ServiceIDURINameDescriptionKeywords
© Brigitte Jörg October 8th, 2008 Moscow, Russia
20
Tutorial: CERIF 2008 Release
Language-related Entities
© Brigitte Jörg October 8th, 2008 Moscow, Russia
21
Tutorial: CERIF 2008 Release
Language-related Entities
PublicationTitle [language]Abstract [languange]Keywords [language]
OrganisationName [language]ResearchActivity [languange]Keywords [language]
ProjectTitle [language]Abstract [languange]Keywords [language]
PersonResearchInterest [language]Keywords [language]
FacilityName [language]Description [languange]Keywords [language]
ServiceName [language]Description [languange]Keywords [language]
PatentName [language]Description [languange]Keywords [language]
ProductName [language]Description [languange]Keywords [language]
© Brigitte Jörg October 8th, 2008 Moscow, Russia
22
Tutorial: CERIF 2008 Release
Link Entities
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Project_OrgUnitProject_Person
Person_ResultPublication
OrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnitOrgUnit
ProjectProject
ResultPublication
Project_OrganisationUnitProject_Person
Person_ResultPublication
OrganisationUnitPerson_OrganisationUnit
ResultPublication
Project_ResultPublication
© Brigitte Jörg October 8th, 2008 Moscow, Russia
23
Tutorial: CERIF 2008 Release
Link Entities
Person_PublicationpersIDpublIDClassificationClassificationSchemeStartDate; EndDate
Project_PersonprojIDperslIDClassificationClassificationSchemeStartDate; EndDate
Person_OrganisationpersIDorgIDClassificationClassificationSchemeStartDate; EndDate
Person Publication
Person Organisation
Project Person
Semantics
Semantics
Semantics
© Brigitte Jörg October 8th, 2008 Moscow, Russia
24
Tutorial: CERIF 2008 Release
Link Entities
Person_PublicationpersIDpublIDClassificationClassificationSchemeStartDate; EndDate
Project_PersonprojIDperslIDClassificationClassificationSchemeStartDate; EndDate
Organisation_PublicationorgIDpublIDClassificationClassificationSchemeStartDate; EndDate
Project_PublicationpersIDpublIDClassificationClassificationSchemeStartDate; EndDate
Project_OrganisationprojIDorgIDClassificationClassificationSchemeStartDate; EndDate
Person_OrganisationpersIDorgIDClassificationClassificationSchemeStartDate; EndDate
Project_PublicationprojIDpublIDClassificationClassificationSchemeStartDate; EndDate
© Brigitte Jörg October 8th, 2008 Moscow, Russia
25
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer)
Formal Semantics / Values for Link Entities
Person OrgUnit
Project
ResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnitPersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnit
© Brigitte Jörg October 8th, 2008 Moscow, Russia
26
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Abstract)
Person OrgUnit
Project
ResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnitPersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnit
ClassificationClassIDClassSchemeIDTerm [language]Description [language]StartDate, EndDateURI
ClassificationSchemeClassSchemeIDDescription [language]URI
Classification_ClassificationClassID1 (Term1)ClassID2 (Term2)ClassSchemeID1 (Schema1)ClassSchemeID2 (Schema1)ClassId (Role)ClassSchemeID (RoleSchema)StartDate, EndDate
ClassScheme_ClassSchemeClassSchemeID1ClassSchemeID2ClassID (Role)ClassSchemeID (RoleSchema)StartDate, EndDate
© Brigitte Jörg October 8th, 2008 Moscow, Russia
27
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Example)
Person OrgUnit
Project
ResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnitPersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnit
ClassificationClassID AE (Answer Extraction)ClassSchemeID LT (Language Technology)Term [EN] Answer ExtractionDescription [EN] AE is the method … StartDate, EndDate 2008-10-08, openURI http://www.lt-world.org/Technologies/IE/AE
Classification_ClassificationClassID1 AE (Answer Extraction)ClassID2 IE (Information Extraction)ClassSchemeID1 LT (Language Technology)ClassSchemeID2 LT (Language Technology)ClassId isAClassSchemeID Taxonomic RelationshipsStartDate, EndDate 2008-10-08, open
ClassScheme_ClassSchemeClassSchemeID1 LT (Language Technology)ClassSchemeID2 ONT (Ontology)ClassID isAClassSchemeID Taxonomic RelationshipsStartDate, EndDate 2008-10-08,open
ClassificationSchemeClassSchemeID LT (Language Technology) Description [EN] The Language Technology Schema is an ontology …URI http://www.lt-world.org/
Subject Headings
© Brigitte Jörg October 8th, 2008 Moscow, Russia
28
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Example)
Person OrgUnit
Project
ResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnitPersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnit
ClassificationClassID PM (is manager of)ClassSchemeID PPR (Person-Project-Roles)Term [EN] is manager ofDescription [EN] A project manager is respon- sible for the successful … StartDate, EndDate 2008-10-08, openURI i.e.:PPR=PM
Classification_ClassificationClassID1 PM (is manager of)ClassID2 pMM (project management)ClassSchemeID1 PPR-Roles (Org1-Roles)ClassSchemeID2 pMM-Roles (Org2-Roles)ClassId isSimilarClassSchemeID SimilarityRelationshipsStartDate, EndDate 2008-10-08, open
ClassScheme_ClassSchemeClassSchemeID1 PPR-Roles (Org1-Roles)ClassSchemeID2 pMM-Roles (Org2-Roles)ClassID isMappedToClassSchemeID Project MM Mappings StartDate, EndDate 2008-10-08,open
ClassificationSchemeClassSchemeID PPR-Roles Description [EN] The PPR-Roles Scheme collects the Person-Project Roles in the LT World SystemURI http://www.lt-world.org/internal/PPR-Roles
Role Schemes
© Brigitte Jörg October 8th, 2008 Moscow, Russia
29
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Example)
Person OrgUnit
Project
ResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnitPersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Project_OrgUnitProject_Person
Person_ResultPublicationOrgUnit_ResultPublication
Project_ResultPublication
PersonPerson OrgUnitOrgUnit
ProjectProject
ResultPublicationResultPublication
Person_OrganisationUnitRole=CEO
Project_OrganisationUnitRole=Organiser
Project_PersonRole=Co-ordinator
Person_ResultPublicationRole=Author
OrgUnit_ResultPublicationRole=Publisher
Project_ResultPublicationRole=TechnicalReport
OrganisationUnit
ClassificationClassID cfART (Article)ClassSchemeID cfPT (Publication Types)Term [EN] Description [EN] An article is usually published in …StartDate, EndDate 2008-10-08, openURI http://www.eurocris.org/CERIF/cfPT=cfART
Classification_ClassificationClassID1 cfART (Article)ClassID2 btART (Article)ClassSchemeID1 cfPT (Publication Types)ClassSchemeID2 btPT (Publication Types)ClassId isEqualToClassSchemeID EquationRelationshipsStartDate, EndDate 2008-10-08, open
ClassScheme_ClassSchemeClassSchemeID1 cfPT (Publication Types)
ClassSchemeID2 btPT (Publication Types)ClassID isMappingOfClassSchemeID CERIF-BibTex MappingStartDate, EndDate 2008-10-08,open
ClassificationSchemeClassSchemeID cfPTDescription [EN] The CERIF Scheme for thePublication Types has been developped …URI http://www.eurocris.org/CERIF/cfPT
Type Schemes
© Brigitte Jörg October 8th, 2008 Moscow, Russia
30
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Types)
BookReview
BookChapter
BookChapter Abstract
Inbook
BookChapter Review
Anthology MonographReference
Book
Textbook
Encyclopedia
Otherbook
Journal
JournalArticle
JournalArticle Abstract
JournalArticle Review
ConferenceProceedings Article
Letter toEditor
PhD Thesis
Doctoral Thesis
Poster Presentation
BookManual
ConferenceProceedings Letter
Report
ShortCommunication
Commentary
Annotation
NewsClipping
PublicationTypes
© Brigitte Jörg October 8th, 2008 Moscow, Russia
31
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Roles)
is author (numbered)
of
is author
of
isreviewer
of
is author (percentage)
of
is editor (numbered)
of
is editor
of
is subject
of
is translator
of
is publisher
of
Person_PublicationScheme
© Brigitte Jörg October 8th, 2008 Moscow, Russia
32
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer: Roles)
number ofauthors
number ofincomingcitations
number of requests
number ofexternal institutes
number ofdownloads
number ofaccess
is ofpublication type
ISIImpact Factor
claimsIPRof
Publication_MetricsRoles
receivedBest Paper
Award
number ofself
citations
area/typeof research
number ofcitations
© Brigitte Jörg October 8th, 2008 Moscow, Russia
33
Tutorial: CERIF 2008 Release
Classification Entities (Semantic Layer added Value)
Allows to capture any Schema or Structure• Flat Lists• Taxonomies• Ontologies
Open / Extensible in all directions• New Schemas• New Concepts / Terms• New Relationships
Enables to manage• Roles / Types Semantics• Subject Headings • Archiving (Time component)
Allows for simple Mappings between Schemas Allows for a efficient (independent) Maintenance
© Brigitte Jörg October 8th, 2008 Moscow, Russia
34
Tutorial: CERIF 2008 Release
What for ?
CitationTypesType:
Description:
PublicationURI:Type:Title:PartOf:PublDate:
Article Requests 2007Journal X = 4Journal Y = 0Journal Z = 15
Ends in 2010Journals: Y, Z
OrganisationURI:
Name:Abbreviation:Publications:
Academic Staff:
Journal Publications 2007Institute A = 4Institute B = 10Institute C = 9
OrganisationURI:Name:hasAccess:EndOfAccessContactPerson:
Journal SubscriptionsJournal X = 1990 - 2000Journal Y = 2005 - 2010Journal Z = 2001 - 2010
PhD Students 2008Computer Science = 200
Physics = 50Social Sciences = 9
First Author / No of Papers Person H = 10/35Person I = 4/12Person J = 1/10
Citations in 2007Paper M (publish 2007) = 20Paper N (publish 2004) = 100 Paper O (publish 2001) = 0
DeductionInferencingReasoning
2007 -> 2008Computer Science =-20Physics = -5Social Science = +2
Most RequestedJournal: Z
© Brigitte Jörg October 8th, 2008 Moscow, Russia
35
Tutorial: CERIF 2008 Release
What for ?
http://www.ist-world.org/
Aim: investigate the thematic range of SSA projects in FP6
Thematic Areas (Blue Clouds):SEMANTICHEALTHLEGALCHANGINGROADMAPSOFTWARE
Projects (Red Dots)Linked with Full Record in Repository
© Brigitte Jörg October 8th, 2008 Moscow, Russia
36
Tutorial: CERIF 2008 Release
What for ?
http://www.ist-world.org/
Aim: investigate the thematic range of SSA projects in FP6
Goals
Themes
© Brigitte Jörg October 8th, 2008 Moscow, Russia
37
Tutorial: CERIF 2008 Release
What for ?
http://www.ist-world.org/
Aim: investigate the collaboration of SSA partners in FP6
Number of joint partners
Project
© Brigitte Jörg October 8th, 2008 Moscow, Russia
38
Tutorial: CERIF 2008 Release
What for ?
What questions do we expect to answer with CERIF?
How many articles has author X published in 2007 as a first author?
How often have articles by author X been cited? Did author X publish with institutionally external authors? In how many FP7 projects does organisation Z participate? How many publications have resulted from project Y? How many people have been employed in the course of
FP6 projects from the 1st call in the NMS? How many PhD students have participated in FP6 projects? How many women have been involved in FP6 projects? How often have articles from journal A been requested in 2007? How many articles have been published in the field of B? …
© Brigitte Jörg October 8th, 2008 Moscow, Russia
39
Tutorial: CERIF 2008 Release
The CERIF Evolution
EU Working Group
on Research DatabasesWorkshop
1987 1991
CERIF 91CERIF 91
PROJECT
Similar IdeasUN/UNESCOOECDCODATA
Acronym: ERGOParticipant: Keith Jeffery, Anne Asser son, many moreOrganisations: Rutherford Appleton, Uni- versity of Bergen, …
Acronym: ERGOParticipant: Keith Jeffery, Anne Asser son, many moreOrganisations: Rutherford Appleton, Uni- versity of Bergen, …
2000
CLASSIFICATIONCLASSIFICATION
RESULTSRESULTS EQUIPMENTEQUIPMENT
PROJECTPROJECT
OrgUnitOrgUnit PERSONPERSON
EXPERTISERoles Roles
CERIF 2000 CERIF 2000 ModelModel
- Networking of DBs- Exchange of Records
- Recommendation to Member States
- Data Model (RDBMS, OO, IR)- Multilinguality- Controlled Vocabulary- Roles / Types- User-driven
- EC Recommendation to Member States
ProjectProject OrganisationOrganisation
Service
Funding Programme
Patent
Skills
CV
Product
Event
PersonPerson
Classification(Semantics)
Classification(Semantics)
PublicationEquipment
2ndLevel
CORE
Language
SemanticsLink
CERIF 2006 / CERIF 2006 / 2008 2008 ModelModel
- Data Model (RDBMS, OO, IR)- Model Normalization - Robust Structure - Extensible Structure - Consistent Structure - Semantic Layer - XML Exchange Specification
- Connectivity to Repositories (Elaboration on Publication)
2006 2008
© Brigitte Jörg October 8th, 2008 Moscow, Russia
40
Tutorial: CERIF 2008 Release
CERIF 91
– published in a first release – recommended to Member States
• to harmonise databases on research projects• ease exchange of comparable information• guidelines for building research databases
– only dealt with research project records– demonstrated in the ERGO pilot project
• access to more than 80.000 project records• from more than 20 national information services
– demonstrated the feasability of exchange – identified the need for more detailed guidelines– confirmed the need to revise CERIF and extend it to other types of
research information, not only projects– revision activities started in 1997 co-ordinated by the EC – led to CERIF 2000
© Brigitte Jörg October 8th, 2008 Moscow, Russia
41
Tutorial: CERIF 2008 Release
CERIF 2000
– a full CRIS data model with flexibility to accomodate many database structures
– a base framework for data exchange– multilingual subject indexing (Ortelius Thesaurus)– recommendations for controlled attribute values– reflection on user groups and requirements– types of research information – metadata environment as a uniform summary view– extensions to
• Organisations• Persons• Results: Products, Patent, Publication• Expertise• Equipment and Facilities
© Brigitte Jörg October 8th, 2008 Moscow, Russia
42
Tutorial: CERIF 2008 Release
What is going on ?
JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”
by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf
Many available Schemas (DC, METS, MODS, …)
Each schema was singularly developed and not designed as an overal architecture to cover integrated object entities
JISC recommends therefore to overcome the problem by best practise guidelines and pragmatic application
Issues of duplicate information (overlap in sections of metadata) need rules and are currently being addressed by the library community in good practise guidelines
© Brigitte Jörg October 8th, 2008 Moscow, Russia
43
Tutorial: CERIF 2008 Release
What is going on ?
JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”
by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf
– Descriptive Metadata (intellectual contents)
– Administrative Metadata (technical metadata [file formats], rights management, provenance [info on creation, subsequent treatment, responsibility, …])
– Structural Metadata (internal structure of items: e.g.: page order, …)
• METS • DIDL• …
© Brigitte Jörg October 8th, 2008 Moscow, Russia
44
Tutorial: CERIF 2008 Release
What is going on ?
JISC Report from April 2008“Metadata for digital libraries: state of the art and future directions”
by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf
XML is of great importance to embed and make use of namespaces Combining Metadata standards, even a limited such as described
above, will always be messier than utilising a single standard that combines their taxonomic powers and resolves any potential clashes or duplications between them.
Integration by itself would of course be of little consequence if the standards themselves failed to address the metadata needs of the digital library community. In this respect, the provenance of each standard is of some importance. All have been constructed by authoritative standard setters within their communities.
Most of the mentioned standards have proved their ability to meet the requirements of major and highly complex digital collections.
© Brigitte Jörg October 8th, 2008 Moscow, Russia
45
Tutorial: CERIF 2008 Release
What is going on ?
Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/
© Brigitte Jörg October 8th, 2008 Moscow, Russia
46
Tutorial: CERIF 2008 Release
What CERIF aims for
Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/
Equipment
ProjectProjectOrganisationOrganisation
Service
FundingProgramme
Patent
Skills
CV
Product
Event
PersonPerson
Classification(Semantics)
Classification(Semantics)
Publication
© Brigitte Jörg October 8th, 2008 Moscow, Russia
47
Tutorial: CERIF 2008 Release
What CERIF aims for
Equipment
ProjectProjectOrganisationOrganisation
Service
FundingProgramme
Patent
Skills
CV
Product
Event
PersonPerson
Classification(Semantics)
Classification(Semantics)
Publication
Enabling the ERA eInfrastructure
Standardization / Integration / Interchange
Added-Value Services
Middle (Interoperability)-Layer for EU Research Information
© Brigitte Jörg October 8th, 2008 Moscow, Russia
48
Tutorial: CERIF 2008 Release
Activities
• UK: Research Councils specified to use CERIF as the format for IT processes and MM information
• UK: STFC (Corporate Data Repository)
• BE: Flanders – CERIF as Standard Interchange Format• DK: Danish Universities PURE -> CERIF
• EUROPEESF: CERIF for IS under discussionCORDIS, EC R&D Service: Asked for CERIF presentation EuroHORCS: Recommendation for CERIF; ESF joined as a euroCRIS member
© Brigitte Jörg October 8th, 2008 Moscow, Russia
49
Tutorial: CERIF 2008 Release
Activities
• IST World SSA (project)• Videolectures.net (Teaching Videos)• BioDiversa ERANET (project)• IWETO (BE): Integrating Flemish Research Information• FRIDA (NO): Joint university CRIS• Fdok (NO): University of Bergen, results• METIS (NL): currently used by Dutch Universities• STFC (UK): Corporate Data Repository• HUNCRIS (HU): Access to R&D in Hungary• SICRIS (SI): Access to University Research in Slovenia• SRIS (UK): Scottish Research Information Systems, public research in
Scotland • AURIS-MM (AT): Provides access to Austrian University Research extended
with multimedia• ICERIS (IS): Access to Information on Icelandic Research Projects & R&D
Results• CRIS-MER (EC): Research information on Migration and ethnic Relations
(planned)
© Brigitte Jörg October 8th, 2008 Moscow, Russia
50
Tutorial: CERIF 2008 Release
CERIF TG Activity
Regular CERIF TG meetings and Discussions Tests and major bugfixes before Releases Strong Relation to ongoing implementation activities
(Geert van Grootel, EWI, Flanders; atira A/S, Aalborg, Denmark)
Exchange with TG Best Practice (Ales Bosniak, IZUM, Slovenia)
Collaborate with TG Institutional Repositories (IR-CERIF)(Anna Clements, University of St. Andrews, UK)
Next Steps: Extension of Semantic Layer with Content Check Tools for Managing the Semantics Mappings of major Schemas (Standards) Check OAI Wrapping CERIF Ontology
© Brigitte Jörg October 8th, 2008 Moscow, Russia
51
Tutorial: CERIF 2008 Release
Active People
Active participation in current release (2008): Brigitte Jörg, (German Res Center for AI) TG Leader Keith G. Jeffery (UK Science and Techn Facilities Council) Geert van Grootel (Flemish Ministry) Anne Asserson (University Bergen) Henrik Rasmussen (atira A/S) Adrian Price (University Copenhagen) Thomas Vestam (atira A/S)
Active participation in past release (2006): Ojars Krast (uniCRIS AG) Edward Grabczewski (UK Science and Tech Facil Council)
© Brigitte Jörg October 8th, 2008 Moscow, Russia
52
Tutorial: CERIF 2008 Release
CERIF 2008 Release
Model Introduction and Specification Document
Full Data Model, SQL Database Scripts
XML Data Exchange Specification Document
XML Example Files
XML Schemas for XML Validation
CERIF Types / Roles / Semantics
http://www.eurocris.org/
© Brigitte Jörg October 8th, 2008 Moscow, Russia
53
Tutorial: CERIF 2008 Release
XML Interchange Format
According to W3C Standards Refers to XML Schemas for Validation XML files corresponding to Entities / Separation of Relationships
<XML><PERSON> <ID>1</ID> <FirstName>Anne</FirstName> <LastName>Asserson</LastName> <URI>http://www.linkedin.com1</URI> <Sex>female</Sex></PERSON><PERSON> <ID>2</ID> <FirstName>Keith</FirstName> <LastName>Jeffery</LastName> <OtherNames>G.</OtherNames> <URI>http://www.linkedin.com2</URI> <Sex>male</Sex></PERSON>---</XML>
<XML><PUBLICATION> <ID>1</ID> <Title language=„EN“>Grey in the R&D Process</Title> <Date>2006</Date> <URI>http://www.epubs.org/ID1</URI></PUBLICATION><PUBLICATION> <ID>2</ID> <Title language=„EN“>What‘s new in Grey Literature …</Title> <Date>2005</Date> <URI>http://www.greynet.org/thegrey journal.html?ID2</URI></PUBLICATION>---</XML>
© Brigitte Jörg October 8th, 2008 Moscow, Russia
54
Tutorial: CERIF 2008 Release
CERIF 2006 Implementation Thesaurus / Semantics @ EWI
© Brigitte Jörg October 8th, 2008 Moscow, Russia
55
Tutorial: CERIF 2008 Release
Example: GeneratingPublication Reference Records
CERIF attributes CERIF entities CERIF types / Comment
cfResultPublicationId cfResultPublication attribute of Core entity
cfResultPublicationDate cfResultPublication attribute of Core entity
cfVolume cfResultPublicaion attribute of Core entity
cfEdition cfResultPublication attribute of Core entity
cfSeries cfResultPublication attribute of Core entity
cfIssue cfResultPublication attribute of Core entity
cfStartPage cfResultPublication attribute of Core entity
cfEndPage cfResultPublication attribute of Core entity
cfTotalPages cfResultPublication attribute of Core entity
cfISBN cfResultPublication attribute of Core entity
cfISSN cfResultPublication attribute of Core entity
cfUniformResourceIdentifier cfResultPublication attribute of Core entity
cfNameAbbreviation cfResultPublicationNameAbbreviation attribute of LanguageRelated entity
cfTitle cfResultPublicationTitle attribute of LanguageRelated entity
cfSubtitle cfResultPublicationSubtitle attribute of LanguageRelated entity
cfAbstract cfResultPublicationAbstract attribute of LanguageRelated entity
cfKeywords cfResultPublicationKeywords attribute of LanguageRelated entity
cfBibliographicNote cfResultPublicationBibliographicNote attribute of LanguageRelated entity cfPersonId
cfPerson_ResultPublicationReference to Person in Link entity with assigned Role [Semantic Layer: i.e. isAuthorOf]
cfOrgUnitIdcfOrgUnit_ResultPublication
Reference to OrgUnit in Link entity with assigned Role [Semantic Layer: i.e. isPublisherOf]
cfCopyright cfPerson_ResultPublication, cfOrgUnit_ResultPublication
attribute of Link entities [Statement about property rights]
cfCityTown cfPostAddress attribute of 2ndLevel entity [For location of conference / address of publisher]
cfCountryCode cPostAddress attribute of 2ndLevel entity [For location of conference / address of publisher]
cfStartDate cfEvent attribute of 2ndLevel entity [For startDate of conference]
cfEndDate cfEvent attribute of 2ndLevel entity [For startDate of conference]
cfSourceId only used in CERIFXML for identification of the dataset - not in the model
CERIF Classification Values CERIF Link Entities (Semantics) Types / Comment
cfIsAuthorOf cfPerson_ResultPublication value of PersonPublicationRoleSchema (Semantic Layer)
cfIsPublisherOf cfOrgUnit_ResultPublication value of OrgUnitPublicationRoleSchema (Semantic Layer)
cfPublicationType cfResultPublication_ResultPublication value of publicationTypeSchema (Semantic Layer)
CERIF ResultPublication Entity for Reference Building
@article{615182, author = {Veda C. Storey}, title = {Understanding semantic relationships}, journal = {The VLDB Journal}, volume = {2}, number = {4}, year = {1993}, issn = {1066-8888}, pages = {455--488}, doi = {http://dx.doi.org/10.1007/BF01263048}, publisher = {Springer-Verlag New York, Inc.}, address = {Secaucus, NJ, USA}, }