semantic web technologies for digital libraries

27
SEMANTIC WEB TECHNOLOGIES FOR DIGITAL LIBRARIES By Nikesh .N International School of Information Management, Mysore

Upload: nikesh-narayanan

Post on 11-May-2015

6.092 views

Category:

Education


2 download

DESCRIPTION

Semantic Web Technologies For Digital Libraries

TRANSCRIPT

Page 1: Semantic Web Technologies For Digital Libraries

SEMANTIC WEB TECHNOLOGIES FOR DIGITAL LIBRARIES

By Nikesh .N

International School of Information Management, Mysore

Page 2: Semantic Web Technologies For Digital Libraries

PRESENTATION OVERVIEW

Digital Library Library standards & Tools Semantic Digital Library :Expectations Semantic Web Technologies Bibliographic Ontology (MarcOnt) Case Study ( Jerome DL) Conclusions

Page 3: Semantic Web Technologies For Digital Libraries

DIGITAL LIBRARY

Information System which deals with collection, organization, storage and retrieval of digital documents

Page 4: Semantic Web Technologies For Digital Libraries

WHY LIBRARY STANDARDS ?WHAT ARE TOOLS & TECHNIQUES ?

Bibliographic Descriptions Standards Library Classification Scheme Subject Headings Subject Indexing Retrieval Techniques

Page 5: Semantic Web Technologies For Digital Libraries

BIBLIOGRAPHIC DESCRIPTIONS

A bibliographic description format is formal definition of syntax, meaning and rules of describing resources collected by library or other similar entity. Bibliographic description formats are descendants of sets of rules for describing library resources used back in the XIXth century and are still used today.

Eg: ISBD , AACR, Bibtex, MARC 21,Doubline Core

Page 6: Semantic Web Technologies For Digital Libraries

LIBRARY CLASSIFICATION SCHEMES

Library classification systems help categorize library resources according to domain of interest of their content.

Eg: Dewey’s Decimal Classification

Universal Decimal Classification

Colon classification

Page 7: Semantic Web Technologies For Digital Libraries

SUBJECT HEADINGSFOR VOCABULARY CONTROL

Library of Congress subject Headings Sears list of subject headings Mesh ( Medical Subject Headings)

Page 8: Semantic Web Technologies For Digital Libraries

SUBJECT INDEXING

Pre-coordinate Indexing (Coordination by Indexer) PRESIS, POPSI, KWIC, Chain Indexing

Post coordinate Indexing (Coordination by user) Keyword indexing

Page 9: Semantic Web Technologies For Digital Libraries

INFORMATION SEARCH AND RETRIEVAL

Bibliographic field based search Full text search Boolean Search Proximity Search Truncation Search etc.

Page 10: Semantic Web Technologies For Digital Libraries

SoorajCreation-Creator/Role

ISIM LibraryCurrent Location-Repository Name

irises, nature, soilSubject-Matter

2009Creation-Date

IrisesTitle

paintingsObject/Work type

PaintingsClassification • Full-text search– “Paintings” AND

“Sooraj” AND “flowers” no result

• Semantic query– if the knowledge that

“irises” are “flowers” is modeled in an ontology (e.g. subclass-hierarchy)

– we can query for all “Paintings” by “Sooraj” with subject “flowers” and retrieve also the picture with subject “irises”

WHAT LACKS- SEMANTICS ( EXAMPLE)

Page 11: Semantic Web Technologies For Digital Libraries

SEMANTIC DIGITAL LIBRARY- EXPECTATIONS

A digital library system which is capable of Integrating information based on different metadata, e.g.: resources, user profiles, bookmarks, taxonomies

Providing interoperability with other systems (not only digital libraries)

Delivering more robust, user friendly search and browsing interfaces empowered by semantics

Page 12: Semantic Web Technologies For Digital Libraries

SEMANTIC WEB TECHNOLOGIES

Semantic Web is becoming reality by applications that support it and are based on it Web Ontology Languages : OWL, RDFS Ontology editors : Protégé, Onto Edit etc. RDF, RDF Schema RDF Storages: Sesame, Jena, YARS Reasoners: KAON, Racer Annotation tools ( Annotea, Onto Annotizer) Topic Maps Thesauri & Controlled Vocabularies

( eg: Wordnet)

Page 13: Semantic Web Technologies For Digital Libraries

SEMANTIC WEB TECHNOLOGIES FOR DIGITAL LIBRARIES?

Metadata is the key concept Many digital libraries do have metadata in place Task is to make them available in a machine

understandable format How to uplift Legacy Metadata to Semantic Level RDF: Is a framework to model any kind of metadata It delivers certain level of technical

interoperability

Page 14: Semantic Web Technologies For Digital Libraries

BIBLIOGRAPHIC ONTOLOGIES Effort to Capture the Semantics of Metadata

MarcOnt Initiatives MarcOnt Initiative has grown from the experiences of

developing and evaluating the first semantic digital library, JeromeDL

Developed as a part of Master's thesis of Sebastian RyszardKruk at the Gdańsk University of Technology (GUT), Poland

MarcOnt Initiative goals: Create a framework for collaborative ontology improvement Incorporate existing metadata and Library classification

scheme Offer tools for data mediation between different data

formats

Page 15: Semantic Web Technologies For Digital Libraries

MARCONT ONTOLOGY AND PORTAL

MarcOnt Ontology: Central point of MarcOnt Initiative Translation and mediation format Continuos collaborative ontology improvement Knowledge from the domain experts

MarcOnt Portal (source of knowledge): Suggestions Annotations Versioning Ontology editor

Page 16: Semantic Web Technologies For Digital Libraries

CASE STUDY-JEROMEDL

Joint effort of DERI, National University of Ireland,

Galway and Gdansk University of Technology (GUT)

Distributed under BSD Open Source license

Page 17: Semantic Web Technologies For Digital Libraries

JEROMEDL - MOTIVATIONS

Support for different kinds of bibliographic medatata,

like: DublinCore, BibTeX and MARC21 at the same

time. Making use of existing rich sources of bibliographic

descriptions (like MARC21) created by human.

Supporting users and communities: users have control over their profile information;

community-aware profiles are integrated with bibliographic

descriptions

support for community generated knowledge

Page 18: Semantic Web Technologies For Digital Libraries

ONTOLOGIES IN JEROMEDL

Structure (system administrators): JeromeDL structure ontology

Bibliographic and legacy descriptions ( domain experts MarcOnt bibliographic ontology Extensible MarcOnt suggestions

Communities (normal users, expert users with restricted vocabulary FOAF and FOAFRealm identity management ontology

Page 19: Semantic Web Technologies For Digital Libraries

STRUCTURE ONTOLOGY IN JEROMEDL

Page 20: Semantic Web Technologies For Digital Libraries

BIBLIOGRAPHIC (MARCONT) ONTOLOGY IN JEROMEDL

Page 21: Semantic Web Technologies For Digital Libraries

COMMUNITY-AWARE (FOAFREALM) ONTOLOGY

Page 22: Semantic Web Technologies For Digital Libraries

MARCONT MEDIATION SERVICES FOR LEGACY METADATA

MarcOnt OntologyMarcOnt RDF

MARC21 RDF

MARC21 XML

MARC21

Dublin Core RDF

Dublin Core XML

Dublin Core

New format RDF

New format XML

New format

Format translation

RDF Translator

Format co-operation

MarcOnt Mediation Services

Page 23: Semantic Web Technologies For Digital Libraries

SEMANTIC INTEROPERABILITY IN GEROMDL

Providing semantic annotations during uploading process: open module (JOnto) for handling any

taxonomies keywords based on:

WordNet free tagging

defining structure of resources in the JeromeDL ontology

Lifting legacy metadata to MarcOnt ontology Community maintained annotations

social semantic collaborative filtering semantic descriptions based on the FOAF

metadata

Page 24: Semantic Web Technologies For Digital Libraries

FOAF - DESCRIBING SOCIAL

NETWORKS

FOAF - Stands for Friend-of-a-Friend

Defines properties for a person

Does not only have to contain one person per file

Can build a network of people with foaf:knows links

FOAF can be easily extended to meet requirements,

as in the case of FOAFRealm for identity

management…

Page 25: Semantic Web Technologies For Digital Libraries

JEROMEDL – SEMANTIC INFORMATION IN

USE

Searching: Keyword-based search with semantic query expansion Semantic search:

Direct RDF querying Natural language templates

Faceted Navigation:creators, types, keywords, Topics etc.

Sharing: Social Semantic Collaborative Filtering Semantically Interlinked Online Communities

Heterogeneous communication: OAI-PMH

Page 26: Semantic Web Technologies For Digital Libraries

REFERENCES Semantic Web – W3C, http://www.w3.org/2001/sw/ The Semantic Web Community Portal, http://semanticweb.org Dublin Core Metadata Initiative (DCMI) – http://www.dublincore.org/ Jerome Digital Library Homepage – http://www.jeromedl.org MarcOnt Initiative Portal – http://www.marcont.org Marcin Synak: MarcOnt Ontology – Semantic MARC21 Description for

L2L & L2C Communication, Masters Thesis, Faculty of Electronics, telecommunication and Informatics, National University of Ireland

Sebastian R. Kruk and Marcin Synak: Semantic Digital Libraries: BANNF conference proceedings, 2007

Page 27: Semantic Web Technologies For Digital Libraries

THANK YOU

Questions Please ?