LITA Forum 2004 Mariella Di Giacomo Frances Knudson Los Alamos National Laboratory Research Library LA-UR-04-0170 MyLibrary @LANL, a Personalized and Collaborative Digital Library Portal for Facilitating Scientific Research


Page 1: Mariella Di Giacomo Frances Knudson Los Alamos National Laboratory  Research Library LA-UR-04-0170

LITA Forum 2004

Mariella Di Giacomo, Frances Knudson

Los Alamos National Laboratory Research Library

LA-UR-04-0170

MyLibrary @LANL, a Personalized and Collaborative Digital Library Portal for Facilitating Scientific Research

Page 2

Outline

Web-Based Digital Library Portals

Project, Application and Architecture Requirements

Realization/Architecture

Features of the system

Short Term Directions

Page 3

MyLibrary @LANL

MyLibrary at Los Alamos National Laboratory (@LANL) is the result of a project sponsored by the LANL Research Library.

Page 4

Web-Based Digital Library Portals

Digital Resources

Information Overload

Mutations in the nature of scientific research and web technologies

Personalization and Customization services offer the potential to improve the user experience when interacting with digital resources

Page 5

MyLibrary @LANL: Goals

Web-based Application

New Tool for the management of all information resources

Personalized and Cooperative service to facilitate scientific research and collaboration

Possibility to include external Internet resources

Page 6

MyLibrary Application Requirements

Easy-to-use interface

Integration of library resources, services and external links

Personalized and Cooperative application

Page 7

MyLibrary Application Solutions

Web-based Interface with central login

Personalized private web environments for digital library users

Direct Collaboration through shared web environments

Indirect Collaboration via recommendation systems

Page 8

MyLibrary Architecture Requirements

Storage for all stored links and data

A system that has as little service disruption as possible

A robust, fast, flexible, scalable and secure system

Page 9

MyLibrary Architecture Solution

Scalable, robust, fast and flexible. Redundant Arrays of Inexpensive Disks (RAID) have been used to mitigate data failure and provide storage capacity. Redundant components and MySQL have been used to provide system, application and data redundancy.

Secure environment. A secure Linux system, in conjunction with other security modules, has given us a trusted environment for the application.

Page 10

MyLibrary Hardware Architecture

The whole hardware architecture consists of:

1 Dell processing node running Linux

2 processors

4 GB of main memory

35 GB of disk storage

Page 11

MyLibrary Software Architecture

[Diagram of the software stack: Linux Operating System, MySQL Server, Perl, Apache, MySQL Connector, MyLibrary]

Page 12

[Diagram: Clients, MyLibrary Application, MySQL Server and Storage, with three databases: RecommenderDB, MyLibraryDB and UsersDB]

Page 13

MyLibrary Features

The system provides:

Personalized private and shared web environments for digital library users

Active Recommendation for MyLibrary content

Content upload

Web link checking mechanism

Locally stored database alerts

Access to patron circulation record

Drag-drop interface

Page 14

MyLibrary Organization

MyLibrary organizes information into Topics:

A Topic relates to a Discipline, such as Astronomy, Bioinformatics, Chemistry, Computer Science, Engineering, Physics, etc.

A Topic can be interpreted as a person’s or group’s role, view or digital library channel

A Topic organizes information in Folders

A Folder collects Links and sub-Folders

Page 15

MyLibrary Organization

User

Topic

Folder

Url
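The User, Topic, Folder and Url hierarchy can be sketched as a small data model. This is only an illustration: the class and field names below are assumptions made for the example and are not taken from the MyLibrary @LANL code or database schema.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Url:
    # A single stored link (hypothetical fields).
    title: str
    address: str


@dataclass
class Folder:
    # A Folder collects Links and sub-Folders.
    name: str
    links: list[Url] = field(default_factory=list)
    subfolders: list[Folder] = field(default_factory=list)

    def all_links(self) -> list[Url]:
        """Return the links in this folder and in every nested sub-folder."""
        found = list(self.links)
        for sub in self.subfolders:
            found.extend(sub.all_links())
        return found


@dataclass
class Topic:
    # A Topic relates to a Discipline and organizes information in Folders.
    name: str
    discipline: str
    folders: list[Folder] = field(default_factory=list)


@dataclass
class User:
    login: str
    topics: list[Topic] = field(default_factory=list)


# Usage: one user, one topic, a folder with a nested sub-folder.
alerts = Folder("Alerts", links=[Url("Example alert", "http://example.org/alert")])
journals = Folder(
    "Electronic Journals",
    links=[Url("Example journal", "http://example.org/journal")],
    subfolders=[alerts],
)
user = User("example_user", topics=[Topic("My Physics", "Physics", [journals])])
```

Keeping sub-folders inside the folder itself makes the recursive walk in `all_links` straightforward.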

Page 16

MyLibrary Organization

Page 17

Personalized Environments

A Personalized Environment presents:

One or more Topics with Links in the selected subject matter, divided into Folders according to the media types that could be selected (e.g. Databases, Electronic Journals, General Reference, Web Resources, Alerts or Personal Web Links)

Page 18

Personalized Environments

Page 19

MyLibrary Framework

[Diagram of the framework entities: Users, User Preferences, Topic, Folder, Url, Authorization, Discipline, Collaboration, MediaType, Properties, Url-ISSN, Recommender System]

Page 20

A Framework for Collaboration

Digital Libraries should support work and collaboration both within and between groups.

Collaborations can be synchronous or asynchronous, and they may involve people who are in the same location or in multiple locations

It is possible to distinguish Direct from Indirect Collaborations

Page 21

Direct and Indirect Collaboration

Direct Collaboration: a group of people agrees to work together, synchronously or asynchronously, in the same location or from multiple locations

Indirect Collaboration: the work and the content stored by a user or a group of users is used to provide recommendations in the future within the user community

Page 22

MyLibrary Framework for Collaboration

We have included some capabilities to support collaboration in libraries and knowledge management

Information environment where groups may keep retrieved documents

Capabilities to find other groups or people based on shared interests through a recommendation system

Page 23

MyLibrary Direct Collaboration

Direct Collaboration: environments where several users agree to work together as a defined team or group exploring and making use of digital library resources

This may be a laboratory group conducting research, or a team of students learning collaboratively for a class group project

Page 24

Shared Environments

Information spaces, Shared Topics, in which groups may store retrieved documents

Mechanism to allow different degrees of sharing for the content of group information (read-only, read/modify and all rights)

Capabilities to find other groups or people based on shared interests as indicated by overlapping content in information spaces

Page 25

Shared Environments

A shared environment/topic has the same structure as a private topic

A user sharing a topic with a group of people defines the members and their rights

Participants have read only or read/modify rights

The shared topic’s owner has read/modify/delete rights
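The rights described above can be sketched as a small access check. The role names, the `SharedTopic` class and its methods are hypothetical and only illustrate the rule that participants are read-only or read/modify while the owner holds read/modify/delete rights.

```python
from enum import Enum


class Right(Enum):
    READ = "read"
    MODIFY = "modify"
    DELETE = "delete"


# Rights granted by each role, following the slide text.
ROLE_RIGHTS = {
    "read_only": {Right.READ},
    "read_modify": {Right.READ, Right.MODIFY},
    "owner": {Right.READ, Right.MODIFY, Right.DELETE},
}


class SharedTopic:
    def __init__(self, name, owner):
        self.name = name
        self.roles = {owner: "owner"}

    def set_member(self, acting_user, member, role):
        """Only the owner defines the members and their rights."""
        if self.roles.get(acting_user) != "owner":
            raise PermissionError("only the owner defines members and rights")
        if role not in ("read_only", "read_modify"):
            raise ValueError("participants are read_only or read_modify")
        self.roles[member] = role

    def can(self, user, right):
        """Return True if the user holds the given right on this topic."""
        role = self.roles.get(user)
        return role is not None and right in ROLE_RIGHTS[role]


# Usage with made-up user names.
topic = SharedTopic("Group Physics", owner="alice")
topic.set_member("alice", "bob", "read_only")
topic.set_member("alice", "carol", "read_modify")
```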

Page 26

Creating a Shared Environment

In order to add people to a shared topic the owner needs to know their email addresses

A person who wants to be added to a shared topic must have a login

A message is sent to a person when added to a shared topic or when his/her rights change
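These three rules can be sketched as one function. The function name, the in-memory dictionaries and the `outbox` list are assumptions made for the example; the slides do not describe how MyLibrary @LANL stores members or sends messages.

```python
def add_member(topic_members, registered_logins, outbox, email, rights):
    """Add a person to a shared topic, or change their rights, and queue a message.

    topic_members: dict mapping e-mail address -> rights for one shared topic
    registered_logins: set of e-mail addresses that already have a login
    outbox: list collecting (recipient, text) messages to be sent
    """
    # A person who wants to be added must already have a login.
    if email not in registered_logins:
        raise LookupError(f"{email} has no login")
    previous = topic_members.get(email)
    topic_members[email] = rights
    # A message is sent when the person is added or when the rights change.
    if previous is None:
        outbox.append((email, f"You were added to a shared topic with {rights} rights"))
    elif previous != rights:
        outbox.append((email, f"Your rights on a shared topic changed to {rights}"))


# Usage with made-up addresses.
members, outbox = {}, []
logins = {"bob@example.org"}
add_member(members, logins, outbox, "bob@example.org", "read-only")
add_member(members, logins, outbox, "bob@example.org", "read/modify")
add_member(members, logins, outbox, "bob@example.org", "read/modify")  # no change, no message
```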

Page 27

Adding a Shared Topic

Page 28

MyLibrary Shared Topics

Page 29

MyLibrary Shared Topic Members

Page 30

MyLibrary Users and Shared Topics

[Diagram: MyLibrary Users A, B and C connected to one MyLibrary Shared Topic]

Page 31

Cloning Private and Shared Topics

This feature allows users to copy a topic in its entirety. The application prompts the user to choose among their existing topics and requires a name to be given to the copy

It is possible to make these copies private or shared

Page 32

Cloning Topics

Page 33

Cloning Folders

This feature allows users to copy a folder in its entirety. The user selects a folder from the list of existing folders, renames it, and specifies the destination topic.

It is possible to make these copies private or shared
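Cloning a topic or a folder amounts to a deep copy under a new name. The sketch below assumes topics are held in a plain dictionary; the function names and the dictionary layout are hypothetical and chosen only to show that the copy is independent of the original.

```python
import copy


def clone_topic(topics, source_name, new_name, shared=False):
    """Copy a whole topic under a new name; the copy can be private or shared."""
    if new_name in topics:
        raise ValueError(f"a topic named {new_name!r} already exists")
    clone = copy.deepcopy(topics[source_name])
    clone["shared"] = shared
    topics[new_name] = clone
    return clone


def clone_folder(topics, source_topic, folder_name, new_name, destination_topic):
    """Copy one folder, renamed, into the destination topic."""
    destination = topics[destination_topic]["folders"]
    if new_name in destination:
        raise ValueError(f"a folder named {new_name!r} already exists")
    destination[new_name] = copy.deepcopy(topics[source_topic]["folders"][folder_name])
    return destination[new_name]


# Usage with made-up content.
topics = {
    "Physics": {"shared": False, "folders": {"Journals": ["http://example.org/a"]}},
}
clone_topic(topics, "Physics", "Physics (group)", shared=True)
clone_folder(topics, "Physics", "Journals", "Journals copy", "Physics (group)")
```

A deep copy is used so that later edits to the clone never alter the original topic or folder.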

Page 34

Cloning Folders

Page 35

Direct and Indirect Collaboration

Direct Collaboration: a group of people agrees to work together, synchronously or asynchronously, in the same location or from multiple locations.

While the implementation of shared topics met the primary user requirement for shared documents and links, it did not address the problem of improving the digital library environment via learning and adapting mechanisms

Indirect Collaboration: the work and the content stored by a user or a group of users is used to provide recommendations in the future within the user community

Page 36

Indirect Collaboration

Indirect Collaboration: the work of one user may anonymously benefit other users in the future

Information stored by current users is captured and evaluated in order to guide future users with recommendations (via recommendation systems). This is a form of anonymous, asynchronous, indirect collaboration within the user community.

Page 37

Recommendation Systems

The recommendation feature compares the content and activities of individuals and various groups and suggests new material and new interactions

The objective of recommendation systems is to supply the user with relevant choices for content that are automatically inferred

In the case of MyLibrary @LANL the inference necessary to recommend new materials is done with the information extracted from the collection of links associated with a user

Page 38

Recommendation System in MyLibrary

The recommendation system has access to the content stored in the MyLibrary @LANL database, both private and shared

It can make comparisons between users in several ways

It can, when requested by users, notify them that others are working with similar materials

Page 39

Recommendation System in MyLibrary

The data extracted and fed to the recommendation system can be viewed as a set of three-dimensional vectors, where the first component is the user, the second is the topic, and the third is the link itself

[Diagram: three axes labeled User, Topic and Link]
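The (user, topic, link) view of the data can be sketched as a list of triples. The `StoredLink` type and the grouping helper are hypothetical names used only for illustration.

```python
from collections import defaultdict
from typing import NamedTuple


class StoredLink(NamedTuple):
    # One record fed to the recommendation system: who stored which link where.
    user: str
    topic: str
    link: str


def links_by_user_topic(records):
    """Group the triples into a mapping (user, topic) -> set of links."""
    grouped = defaultdict(set)
    for record in records:
        grouped[(record.user, record.topic)].add(record.link)
    return dict(grouped)


# Usage with made-up records.
records = [
    StoredLink("User A", "T1", "http://example.org/journal-1"),
    StoredLink("User A", "T1", "http://example.org/journal-2"),
    StoredLink("User B", "T2", "http://example.org/journal-1"),
]
grouped = links_by_user_topic(records)
```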

Page 40

Recommendation System in MyLibrary

International Standard Serial Numbers (ISSNs) have been chosen as a means of selection because our system could generate more metadata for a specific link if an ISSN is associated with it

All the links stored in the MyLibrary database are processed, and those from which it is possible to extract an ISSN are evaluated

[Diagram: a Link mapped to its ISSN]
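The slides do not say how an ISSN is obtained from a link. As a minimal sketch, assume the ISSN appears literally in the link text; the code below finds candidates with a regular expression and keeps the first one whose check digit is valid under the standard ISSN scheme (weights 8 down to 2, modulo 11, with X standing for 10). The function names are hypothetical.

```python
import re

# Four digits, an optional hyphen, three digits and a final digit or X,
# not embedded in a longer run of digits.
ISSN_PATTERN = re.compile(r"(?<!\d)(\d{4})-?(\d{3}[\dXx])(?!\d)")


def is_valid_issn(chars):
    """Check the ISSN check digit; chars is eight characters without a hyphen."""
    total = sum(int(ch) * weight for ch, weight in zip(chars[:7], range(8, 1, -1)))
    check = (11 - total % 11) % 11
    expected = "X" if check == 10 else str(check)
    return chars[7].upper() == expected


def extract_issn(link):
    """Return the first valid ISSN found in the link as NNNN-NNNC, or None."""
    for match in ISSN_PATTERN.finditer(link):
        candidate = (match.group(1) + match.group(2)).upper()
        if is_valid_issn(candidate):
            return f"{candidate[:4]}-{candidate[4:]}"
    return None


# Usage with a made-up URL.
found = extract_issn("http://example.org/journals/0028-0836/current")
```

Links for which `extract_issn` returns `None` would simply be left out of the ISSN-based analysis.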

Page 41

MyLibrary Framework

[Diagram of the framework entities: Users, User Preferences, Topic, Folder, Url, Authorization, Discipline, Collaboration, MediaType, Properties, Url-ISSN, Recommender System]

Page 42

MyLibrary and Recommendation Analysis

Four types of analysis or relationships have been extracted through MyLibrary and the Active Recommendation Project (ARP) System:

ISSN Topic Proximity (ITP)

ISSN Semi-metric Relation

Topic ISSN Proximity (TIP)

User ISSN Proximity (UIP)

Page 43

ISSN Topic Proximity (ITP)

The data can be seen as a collection of binary relations between two sets

In this scenario the two sets are the Topic and the ISSN set

The ITP measure is the probability of co-occurrence of pairs of ISSNs in a user’s topic

The probability of co-occurrence of a pair of ISSNs, also called the proximity between two ISSNs, Y and Z, is the probability that both Y and Z co-occur in the same topic

Page 44

ITP: Direct Co-Occurrence

Two ISSNs are near if they tend to co-occur in many topics

The co-occurrence probability for each pair of ISSNs is the value used to generate e-journal recommendations

This type of analysis can be thought of as “Users who retrieved your electronic journals also retrieved these”
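A co-occurrence proximity of this kind can be sketched in a few lines. The slides do not give the exact estimator, so this example assumes one common choice: the number of topics containing both ISSNs divided by the number of topics containing either. The function names and the sample data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def issn_topic_proximity(topics):
    """topics: mapping topic id -> set of ISSNs.

    Returns {(y, z): proximity} for every pair that co-occurs at least once,
    with y < z. Proximity is assumed to be |topics with both| / |topics with either|.
    """
    topics_with = defaultdict(set)
    for topic_id, issns in topics.items():
        for issn in issns:
            topics_with[issn].add(topic_id)
    proximity = {}
    for y, z in combinations(sorted(topics_with), 2):
        both = len(topics_with[y] & topics_with[z])
        if both:
            either = len(topics_with[y] | topics_with[z])
            proximity[(y, z)] = both / either
    return proximity


def recommend(topic_issns, proximity, limit=5):
    """Rank ISSNs absent from the topic by their best proximity to one it holds."""
    scores = defaultdict(float)
    for (y, z), p in proximity.items():
        if y in topic_issns and z not in topic_issns:
            scores[z] = max(scores[z], p)
        elif z in topic_issns and y not in topic_issns:
            scores[y] = max(scores[y], p)
    return sorted(scores, key=lambda issn: (-scores[issn], issn))[:limit]


# Usage with made-up topics.
topics = {
    "A/T1": {"ISSN10", "ISSN12", "ISSN17"},
    "B/T1": {"ISSN10", "ISSN17"},
    "B/T2": {"ISSN15", "ISSN20"},
    "C/T1": {"ISSN10", "ISSN17", "ISSN205"},
}
proximity = issn_topic_proximity(topics)
suggestions = recommend(topics["B/T1"], proximity)
```

Here ISSN10 and ISSN17 appear together in every topic that holds either of them, so their proximity is 1.0, and the topic B/T1 is offered the ISSNs its neighbours hold alongside them.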

Page 45

[Diagram: the MyLibrary collections of Users A, B and C, each with topics T1 and T2 holding ISSNs, feed a Topic-by-ISSN relation in the Recommender System, which returns ISSN recommendations for User A / T1, User B / T2 and User C / T1]

Page 46

MyLibrary ISSN Topic Proximity

Page 47

ITP Semi-metric

This relationship is a measure of potential association between ISSN pairs that do not tend to co-occur, but which are strongly associated indirectly through other ISSNs

This type of analysis can be thought of as “We think you may also be interested in these”
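The slides do not define the semi-metric computation. One common formulation, assumed here, converts each proximity p into a distance d = 1/p - 1 and then looks for pairs whose shortest indirect path is shorter than their direct distance (pairs that never co-occur have an infinite direct distance). The function name and data are hypothetical.

```python
import math


def semi_metric_pairs(proximity, items):
    """proximity: {(a, b): p} with 0 < p <= 1, each unordered pair listed once.

    Returns {(a, b): indirect distance} for pairs whose shortest path through
    other items is strictly shorter than their direct distance.
    """
    index = {item: i for i, item in enumerate(items)}
    n = len(items)
    direct = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        direct[i][i] = 0.0
    for (a, b), p in proximity.items():
        distance = 1.0 / p - 1.0  # assumed proximity-to-distance conversion
        i, j = index[a], index[b]
        direct[i][j] = direct[j][i] = distance
    # Floyd-Warshall all-pairs shortest paths.
    shortest = [row[:] for row in direct]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via = shortest[i][k] + shortest[k][j]
                if via < shortest[i][j]:
                    shortest[i][j] = via
    result = {}
    for i in range(n):
        for j in range(i + 1, n):
            if shortest[i][j] < direct[i][j]:
                result[(items[i], items[j])] = shortest[i][j]
    return result


# Usage: A and C never co-occur, but both co-occur with B.
pairs = semi_metric_pairs({("A", "B"): 0.5, ("B", "C"): 0.5}, ["A", "B", "C"])
```

In this example A and C are reported as indirectly associated through B, which is the kind of pair the slide describes.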

Page 48

[Diagram: the MyLibrary collections of Users A, B and C, each with topics T1 and T2 holding ISSNs, feed a Topic-by-ISSN relation in the Recommender System, which returns indirectly associated ISSNs as recommendations for User A / T1, User B / T2 and User C / T1]

Page 49

Topic ISSN Proximity (TIP)

This type of proximity analysis is useful to establish two-way closeness between topics or elements of a set

Two Topics are near if the ISSNs they contain are near
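A simplified version of topic proximity can be sketched by treating two topics as near when they share a large fraction of their ISSNs. This overlap ratio is an assumption made for the example: it reduces "near" ISSNs to identical ISSNs, and the slides do not state the measure actually used. The function names are hypothetical.

```python
def topic_issn_proximity(issns_a, issns_b):
    """Fraction of ISSNs shared by two topics (assumed overlap measure)."""
    union = issns_a | issns_b
    if not union:
        return 0.0
    return len(issns_a & issns_b) / len(union)


def nearest_topics(target, topics, limit=3):
    """Return the names of the topics closest to the target, nearest first."""
    scored = []
    for name, issns in topics.items():
        if name == target:
            continue
        score = topic_issn_proximity(topics[target], issns)
        if score > 0:
            scored.append((score, name))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored[:limit]]


# Usage with made-up topics.
topics = {
    "A/T1": {"ISSN10", "ISSN17", "ISSN19"},
    "B/T1": {"ISSN10", "ISSN15", "ISSN17", "ISSN19"},
    "C/T2": {"ISSN205"},
}
neighbours = nearest_topics("A/T1", topics)
```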

Page 50

[Diagram: the MyLibrary collections of Users A, B and C, with topics holding ISSNs and articles, feed a Topic-by-ISSN relation in the Recommender System, which returns recommendations for User A / T1, User B / T1 and User C / T1]

Page 51

User ISSN Proximity (UIP)

This type of proximity analysis is between Users and ISSNs

The relationship contains absolute occurrence values of specific ISSNs in the set of topics of a specific user

The proximity between user A and ISSN Y is the number of times ISSN Y occurs in user A's topics compared to the entire set
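This measure can be sketched as a ratio of counts. The normalization is an assumption: the example divides the number of user A's topics containing ISSN Y by the number of topics containing Y across all users, which is one reading of "compared to the entire set". The function name and data are hypothetical.

```python
def user_issn_proximity(collections, user, issn):
    """collections: mapping user -> {topic name -> set of ISSNs}.

    Returns occurrences of the ISSN in the user's topics divided by its
    occurrences in all users' topics (0.0 if it never occurs).
    """
    def occurrences(user_topics):
        return sum(1 for issns in user_topics.values() if issn in issns)

    total = sum(occurrences(user_topics) for user_topics in collections.values())
    if total == 0:
        return 0.0
    return occurrences(collections.get(user, {})) / total


# Usage with made-up collections.
collections = {
    "User A": {"T1": {"ISSN10", "ISSN17"}, "T2": {"ISSN10"}},
    "User B": {"T1": {"ISSN10", "ISSN20"}},
}
score = user_issn_proximity(collections, "User A", "ISSN10")
```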

Page 52

[Diagram: the MyLibrary collections of Users A, B and C, holding ISSNs and articles, feed a User-by-ISSN relation in the Recommender System, which returns recommendations for User A, User B and User C]

Page 53

Other MyLibrary Features

The system provides:

Personalized private and shared web environment to digital library users

Active Recommendation for MyLibrary content

Content upload

Web link checking mechanism

Locally stored databases alerts

Access to patron circulation record

Drag-drop interface

Page 54

Drag-Drop Interface

Using the Flash plug-in, users can have a GUI-type interface that allows them to drag and drop items between folders and within them

The MyLibrary @LANL system communicates through standard URLs with each Flash plug-in used by MyLibrary users

Page 55

Drag-Drop Interface

Page 56

Short Term Directions

Export/Import User Profile

Unified User Database for the Research Library services

Page 57

Resources

More information can be found at:

http://lib-www.lanl.gov/lww/add.htm

http://www.c3.lanl.gov/~rocha/talks/mylib/

LANL Research Library Web site:

http://lib-www.lanl.gov

MyLibrary @LANL Demo Access

http://mylibdemo.lanl.gov

Page 58

Questions?

Thanks