donatella castelli istituto di elaborazione della informazione – cnr pisa (italy)

23
Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Upload: kayo

Post on 20-Mar-2016

41 views

Category:

Documents


0 download

DESCRIPTION

Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy). General information. Funding programme E U 5FP - Action III-Multimedia Content and Tools Effort: 205 p/m Duration: 30 months Commencement date : February 2001. General information. Participants - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Donatella CastelliIstituto di Elaborazione della Informazione

– CNRPisa (Italy)

Page 2: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

General information

Funding programme EU 5FP - Action III-Multimedia Content

and ToolsEffort: 205 p/mDuration: 30 monthsCommencement date: February 2001

Page 3: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

General information

Participants

CNR (Italy, scientific coordinator)FORTH (Greece)GMD (Germany)University of Dortmund (Germany)ERCIM (France, administrative coordinator)

Page 4: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Develop an open service archive environment composed by a set of interoperable services able tosupport scholars

in interacting with multi-disciplinary archives

as members of networked peer communities

Objective

Page 5: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Design decisions

Common Interface

Archive1

Common Interface

Archive2

ArchiveN

Common Interface

Cyclades environment

Built on top of the metadata harvesting layer established by the OAI protocol

Page 6: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Design decisions

Recommendation Service

Collaborative Work Service

Personalization Service

Query & Browse Mediator Service

Cyclades Mediator Service

Collection Service

Access Service

Independent services + Cyclades Mediator ServicesFor each service: - a well established access protocol - a precise service description

Page 7: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Cyclades end-user functionality

Page 8: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

The user perceives an information space structured into a set of collections

Virtual Collections

Collection descriptions

documents in “computer science” domain

documents in “physics” domain stored in ArXiv and PhysNet, published after 1990

…….

Collection names

Computer Science

Recent Physics

…….

Page 9: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Virtual Collections

A set of specific operations is available for each collection

European Computer Science attribute-based search, cross-language search, …

Computer Science

Collection names Operations

simple search,attribute-based search

simple-searchInput: “keywords in the abstract”Preconditions: keywords in EnglishOutput:set of <document.id,author,title>Effect:”returns the specified output for all the documents that contains the given keywords in their abstract”

Page 10: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Virtual Collections The set of collections is dynamic

d d / (C1, C2, …,Cn) cond(d)

Composition

Selection based on description

of the collection Restriction on the document

metadata fields

Time 0: one collection for each Cyclades harvested repository

create.collection( )

Page 11: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Information seeking Search

Collection names

Computer Science

Search fieldsauthor, title

Result

title= …. abstract =

European Computer Science

author,titlecountry,language

author.name =..English.title= …English.abstract = …local.language.abstract=

Page 12: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Information seekingAuthorAuthorAbstractTitle...

CastelliStracciaStracciaThanos...Straccia.02.03.2000

Title = …. PPublisherublisher = IEI-CNR

Straccia.15.03.2000Straccia.24.04.2000...

INRIAGMDIEI-CNRCCNUCE-CNRNUCE-CNR...

Multilevel Browse

Schema

Metadata records

Attribute values

Page 13: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

The user creates a personal, dynamic hierarchy of folders for the storage of the records that match his information needs

Donatella’s folder Digital

Libraries

Education

Tourism

Metadata

Personalisation

The system learns user’s information needs (user profiling)

by observing the content of the folders, their organisation and the user behaviour

Page 14: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Personalisation New data is classified into the right topic folder

automatically

The efficiency of the classifier may be improved by exploiting contextual information (e.g. the searching collection)

Donatella’s folder Digital

Libraries

Education

Tourism

Metadata

Page 15: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

By observing the behaviour of the users and their profiles the system identifies “similar users”

Recommender

Donatella:

(content based and collaborative)

Users get recommendations about new documents and new collections that the system judges relevant according to:

the specific user information needs the “similar users” information needs

Page 16: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Cooperative Work A shared working space can be created by groups

referencing:– user own documents– collections– recommendations, related links, textual annotations,

ratings, …

Shared working space

Very interesting docTo be read

Page 17: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Will we be able to build a quality service

on the OAI low barrier interoperability framework?

Page 18: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

An example

Collection description is used by:

the end-user to select the collection of interestthe collection administrator to select the

collections required to build a new collectionthe document classifier to increase efficiencythe recommender system to provide

information about new collections

Page 19: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

An example

What should a collection description contain:

content descripton subjectcoveragemetadata formatsmetadata languagecontent formatdigitalized content yes/no…

Page 20: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

An example

Collection content description: we need to rely on the document description

DC fields are optional

Collection subject: we need to know if the term in the subject field is a classification code, a free term, or term of a controlled vocabulary – the controlled vocabulary – the language of the terms

DC is unqualified

Page 21: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Will be Cyclades a service for a subset of the OAI registered archives?

Page 22: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Browse - Desire (University of Dortmund)

Personalisation - EUROgatherer (CNR)Cooperative work – CSCW, Cookpit

(GMD)

Background Tecnology

Page 23: Donatella Castelli Istituto di Elaborazione della Informazione – CNR Pisa (Italy)

Berlin, 26 February 2001 OAI Open Day for Europe

Cooperative Work

A group of users that often access similar documents may enter into a long term relationship and eventually evolve into a working group (if only they become aware of each other)