towards concise preservation by managed forgetting: research issues and case study

Post on 05-Aug-2015

377 Views

Category:

Presentations & Public Speaking

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

ForgetIT Project, GA 600826

Towards Concise Preservationby Managed Forgetting

iPRES-2013 ConferenceLisbon, Portugal5 September 2013

Nattiya Kanhabua, Claudia Niederée, and Wolf SiberskiL3S Research Center / Leibniz Universität Hannover

Hannover, Germany

2

Partners in the ForgetIT project

An interdisciplinary team of experts in:– Preservation, information management, information extraction– Multimedia analysis, storage computing, cognitive psychology

3

Outline

Motivation & VisionApproaches: First IdeasIntegration FrameworkPilot Applications: Overview

ForgetIT Project, GA 600826

4

Inspiration

           

 

A Computer that forgets ?Intentionally ??

And in context of preservation???

5

Inspiration

However we are facing– dramatic increase in content creation (e.g. digital photography)– information overload and changing professional + private lives– increasing storage costs for long-term storage (>10 years)– increasing use of mobile devices with restricted capacity– inadvertent forgetting in lack of systematic preservation

 

A Computer that forgets ?Intentionally ??

And in context of preservation???

6

Inspiration

However we are facing– dramatic increase in content creation (e.g. digital photography)– information overload and changing professional + private lives– increasing storage costs for long-term storage (>10 years)– increasing use of mobile devices with restricted capacity– inadvertent forgetting in lack of systematic preservation

And: Forgetting plays a crucial role for human remembering and life in general (focus, stress on important information, forgetting of details)

A Computer that forgets ?Intentionally ??

And in context of preservation???

7

Inspiration

However we are facing– dramatic increase in content creation (e.g. digital photography)– information overload and changing professional + private lives– increasing storage costs for long-term storage (>10 years)– increasing use of mobile devices with restricted capacity– inadvertent forgetting in lack of systematic preservation

And: Forgetting plays a crucial role for human remembering and life in general (focus, stress on important information, forgetting of details)

A Computer that forgets ?Intentionally ??

And in context of preservation???

So: “Shouldn’t there be something like forgetting in digital memories as well?

ForgetIT

8

Complementing Human Memory

V. Mayer-Schönberger. Delete - The Virtue of Forgetting in the Digital Age. Morgan Kaufmann Publishers, 2009.

9

Motivation

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

10

Motivation

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

11

Vision: Building a Bridge

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of

digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

12

Vision: Building a Bridge

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of

digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

Enabling smooth transition to preservation

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

13

Vision: Building a Bridge

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of

digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

Enabling smooth transition to preservation

Creating immediate benefit + reducing effort

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

14

Vision: Building a Bridge

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of

digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

ForgetIT

Enabling smooth transition to preservation

Creating immediate benefit + reducing effort

Opening alternatives to “keep it all” and “forgetting by accident”

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

15

Vision: Building a Bridge

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of

digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

ForgetIT

Enabling smooth transition to preservation

Creating immediate benefit + reducing effort

Opening alternatives to “keep it all” and “forgetting by accident”

Easing interpretation in the long run

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

16

Vision: Building a Bridge

major progress in preservation technology

maturing Information extractiontechnology

storage as service (e.g. clouds)

Opportunities increasing amount of

digital contenthandled over decades

more or less systematic backup strategies used

non-paper practices for long-term perspective required

Needs

ForgetIT

Enabling smooth transition to preservation

Creating immediate benefit + reducing effort

Opening alternatives to “keep it all” and “forgetting by accident”

Easing interpretation in the long run

taking inspiration from and complementing human memory

large gap for adoption high-up front cost no established

practices lack of understanding

of benefit reluctance to invest

Major Obstacles

17

Building the Bridge

Managed Forgetting

Synergetic Preservation

Contextualized

Remembering

18

Building the Bridge

Managed Forgetting

Synergetic Preservation

Contextualized

Remembering

• as opposed to the current “forgetting by accident”

• inspired by human forgetting

19

Building the Bridge

Managed Forgetting

Synergetic Preservation

Contextualized

Remembering

• bringing back information into active use in a meaningful way

• as opposed to the current “forgetting by accident”

• inspired by human forgetting

20

Building the Bridge

Managed Forgetting

Synergetic Preservation

Contextualized

Remembering

• bringing back information into active use in a meaningful way

• as opposed to the current “forgetting by accident”

• inspired by human forgetting

• couples information management and preservation management

21

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

22

• High awareness of trip details

• Showing of pictures

• Sorting out redundant pictures

• Sub-grouping and sorting

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

23

• High awareness of trip details

• Showing of pictures

• Sorting out redundant pictures

• Sub-grouping and sorting

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

• Life goes on• Pictures go

out of focus• Creation of a

small diverse subset for showing occasionally

24

• High awareness of trip details

• Showing of pictures

• Sorting out redundant pictures

• Sub-grouping and sorting

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

• Life goes on• Pictures go

out of focus• Creation of a

small diverse subset for showing occasionally

• Creation of summary page

• Addition of context info

• Further reduction of redundancy

• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom

25

• High awareness of trip details

• Showing of pictures

• Sorting out redundant pictures

• Sub-grouping and sorting

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

• Life goes on• Pictures go

out of focus• Creation of a

small diverse subset for showing occasionally

• Creation of summary page

• Addition of context info

• Further reduction of redundancy

• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom

• Changes in life (e.g. marriage)

• Addition/update of context information

• Dealing with preservation issues

girlfriend

26

• High awareness of trip details

• Showing of pictures

• Sorting out redundant pictures

• Sub-grouping and sorting

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

• Life goes on• Pictures go

out of focus• Creation of a

small diverse subset for showing occasionally

• Creation of summary page

• Addition of context info

• Further reduction of redundancy

• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom

• Changes in life (e.g. marriage)

• Addition/update of context information

• Dealing with preservation issues

girlfriendGirlfriendwife

27

• High awareness of trip details

• Showing of pictures

• Sorting out redundant pictures

• Sub-grouping and sorting

Simple Example: Holidays

+20 Years+5-10 Years+1 Yearsafter trip +1 month

• Trip to Paris with Friends

• Thousands of picures

• Life goes on• Pictures go

out of focus• Creation of a

small diverse subset for showing occasionally

• Creation of summary page

• Addition of context info

• Further reduction of redundancy

• Rest of pictures into archiveFebruary 2015ParisTeam: Me, Mary Christine, Tom

• Changes in life (e.g. marriage)

• Addition/update of context information

• Dealing with preservation issues

girlfriendGirlfriendwife

• Revisiting of Photo of trip photos

• Re-integration into overall photo collection (link into context)

Managed Forgetting

   

   

     

     

       

 

28

Automatic Deletion?

Managed Forgetting

inspired by central role of human forgetting

Aim: – help in identifying and focus on relevant information– supporting preservation content selection

will replace inadvertent forgetting

managed forgetting ≠ automatic deletion

instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy

       

 

29

Managed Forgetting

inspired by central role of human forgetting

Aim: – help in identifying and focus on relevant information– supporting preservation content selection

will replace inadvertent forgetting

managed forgetting ≠ automatic deletion

instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy

Based on:     

 

30

Managed Forgetting

inspired by central role of human forgetting

Aim: – help in identifying and focus on relevant information– supporting preservation content selection

will replace inadvertent forgetting

managed forgetting ≠ automatic deletion

instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy

Based on:

careful information value assessment   

 

31

decreasing memory buoyancy

Managed Forgetting

inspired by central role of human forgetting

Aim: – help in identifying and focus on relevant information– supporting preservation content selection

will replace inadvertent forgetting

managed forgetting ≠ automatic deletion

instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy

Based on:

careful information value assessment

forgetting strategies via policies 

 

32

decreasing memory buoyancy

Managed Forgetting

inspired by central role of human forgetting

Aim: – help in identifying and focus on relevant information– supporting preservation content selection

will replace inadvertent forgetting

managed forgetting ≠ automatic deletion

instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy

Based on:

careful information value assessment

forgetting strategies via policies

forgetting options to integrate final manual checking before deletion

 

33

decreasing memory buoyancy

Managed Forgetting

inspired by central role of human forgetting

Aim: – help in identifying and focus on relevant information– supporting preservation content selection

will replace inadvertent forgetting

managed forgetting ≠ automatic deletion

instead: range of forgetting options e.g. – resource condensation– change of indexing & ranking– reduction of redundancy

Based on:

careful information value assessment

forgetting strategies via policies

forgetting options to integrate final manual checking before deletion

combination with multi-tier storage solution possible

34

decreasing memory buoyancy

Use of tiers

35

Contextualized Remembering

Aim: – bringing back information into active use in a meaningful

way even if a lot of time has passed– aiming for semantic level of preservation

Based on:

taking into account relevant parts of context when moving to archiveincreasing contextualization of preserved contentconsidering context evolution over time (evolution-aware contextualization)

Evolution-aware Contextualization & Re-contextualization

36

Context of Interpretation

t

C

Archival InformationSystem

Information System

D

Evolution-aware Contextualization & Re-contextualization

37

Context of Interpretation

t

C C‘

Archival InformationSystem

Information System

Human ForgettingChange in focusStructural changes

D

Evolution-aware Contextualization & Re-contextualization

38

Context of Interpretation

t

C C‘

Archival InformationSystem

Information System

Human ForgettingChange in focusStructural changes

Contextualization

DD

Evolution-aware Contextualization & Re-contextualization

39

Context of Interpretation

t

C C‘

Archival InformationSystem

Pres(D‘)

Pres(C‘)

Information System

Human ForgettingChange in focusStructural changes

Contextualization

D

Context-awarePreservation

DD

Evolution-aware Contextualization & Re-contextualization

40

Context of Interpretation

t

C C‘

Archival InformationSystem

Pres(D‘)

Pres(C‘)

Information System

Human ForgettingChange in focusStructural changes

C‘‘

Semantic evolutionStructural evolutionTerminology evolution

Contextualization

D

Context-awarePreservation

DD

Evolution-aware Contextualization & Re-contextualization

41

Context of Interpretation

t

C C‘

Archival InformationSystem

Pres(D‘)

Pres(C‘)

Information System

Human ForgettingChange in focusStructural changes

C‘‘

Semantic evolutionStructural evolutionTerminology evolution

Contextualization

D

Context-awarePreservation

Semantic Evolution Detection

DD

Evolution-aware Contextualization & Re-contextualization

42

Context of Interpretation

t

C C‘

Archival InformationSystem

Pres(D‘)

Pres(C‘)

Information System

Human ForgettingChange in focusStructural changes

C‘‘

Evolution-awareContextualization

Pres(D‘)

Pres(C‘‘)

Semantic evolutionStructural evolutionTerminology evolution

Contextualization

D

Context-awarePreservation

Semantic Evolution Detection

DD

Evolution-aware Contextualization & Re-contextualization

43

Context of Interpretation

t

C C‘

Archival InformationSystem

Pres(D‘)

Pres(C‘)

Information System

Human ForgettingChange in focusStructural changes

C‘‘

Evolution-awareContextualization

Re-contextualization

Pres(D‘)

Pres(C‘‘)

Semantic evolutionStructural evolutionTerminology evolution

Pres(D‘)

Pres(C‘‘)

D

Contextualization

C‘‘‘

D

Context-awarePreservation

Semantic Evolution Detection

DD

ForgetIT Project, GA600826 - Kickoff Meeting, Hannover, February 2013

44

Synergetic Preservation

smooth and step-wise transition between active information use and preservation enables rich information flow in both directionssupports more informed preservation decisionseases preservation adoption

Data Management

Descr. Info.

Archival Storage

AIPs

Access

Ingest

Administration

Preservation Planning

Preserve-or-Forget Framework

Synergetic Preservation

Extraction & Contextualization

Re-Contextualization

Content Management

Access

Authoring

Administration

Adapter Layer

Managed Forgetting

Information Assessment Condensation

Arc

hiv

al In

form

atio

n S

yste

m

Info

rmat

ion

Man

agem

ent

Sys

tem

Integration Framework

45

Information Management System

• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)

Forgettor

Assessorcalculates:+ Memory Buoyancy+ Perservation Value

Analyzer1. Classification of resources

w.r.t. startegies2. Triggers forgetting actions

Strategies

ValuesStatistics

Resources Meta-Info

Resources Values + Decisions

Integration Framework

46

Information Management System

• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)

Forgettor

Assessorcalculates:+ Memory Buoyancy+ Perservation Value

Analyzer1. Classification of resources

w.r.t. startegies2. Triggers forgetting actions

Strategies

ValuesStatistics

Resources Meta-Info

Resources Values + Decisions

Input: strategy meta-infomation (content, context,

neigbours )previous values

Integration Framework

47

Information Management System

• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)

Forgettor

Assessorcalculates:+ Memory Buoyancy+ Perservation Value

Analyzer1. Classification of resources

w.r.t. startegies2. Triggers forgetting actions

Strategies

ValuesStatistics

Forgetting strategies for

different types of resources

Resources Meta-Info

Resources Values + Decisions

Input: strategy meta-infomation (content, context,

neigbours )previous values

Integration Framework

48

Information Management System

• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)

Forgettor

Assessorcalculates:+ Memory Buoyancy+ Perservation Value

Analyzer1. Classification of resources

w.r.t. startegies2. Triggers forgetting actions

Strategies

ValuesStatistics

Forgetting strategies for

different types of resources

Resources Meta-Info

Resources Values + Decisions

Input: strategy meta-infomation (content, context,

neigbours )previous values

Processing Resources based on stategies and

information values

Integration Framework

49

Information Management System

• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)

Forgettor

Assessorcalculates:+ Memory Buoyancy+ Perservation Value

Analyzer1. Classification of resources

w.r.t. startegies2. Triggers forgetting actions

Strategies

ValuesStatistics

Forgetting strategies for

different types of resources

Resources Meta-Info

Resources Values + Decisions

Input: strategy meta-infomation (content, context,

neigbours )previous values

Processing Resources based on stategies and

information values

Storing the new values and sending them back to IMS

Integration Framework

50

Information Management System

• Resources + Meta data:• ResourceID• Content (size, tags, aging, geo)• Context (folder/file usage)• Social features • Resources neighbours (Graph)

Forgettor

Assessorcalculates:+ Memory Buoyancy+ Perservation Value

Analyzer1. Classification of resources

w.r.t. startegies2. Triggers forgetting actions

Strategies

ValuesStatistics

Forgetting strategies for

different types of resources

Resources Meta-Info

Resources Values + Decisions

Input: strategy meta-infomation (content, context,

neigbours )previous values

Processing Resources based on stategies and

information values

Storing the new values and sending them back to IMSArchives

Acce

ss

Stor

e

Store &access data

51

Application: Organizational Preservation

Starting point: existing and popular CMS (TYPO3)Sophisticated workflows for content creation and publicationBut: Separation of publication and preservation/archival Access to archived content is difficult and costly obsolete and even outdated information stays online

ForgetIT approach:Preservation as integral part (binary model gradual managed forgetting)

Bolder attitude towards removing content possibleAutomated support of cleaning up processesSupport of many stages of archiving, e.g. offline but still in index, aggregates online/ content in archive, only aggregates kept, etc.

Dissemination/Exploitation: Involvement of TYPO3 community, TYPO3 with preservation extension as open source project to TYPO3 community

52

Application: Personal Preservation

Starting point:tremendous growth of information in personal sphereDiversity and fast evolution of devices, platforms and formatsKeeping info sustainably available: Only ad hoc solutions for mid-term, long-term solutions

ForgetIT approach: Preservation solution for personal information spaceBased on concept of Semantic DesktopConsideration of social web content, multimedia content, other types of personal content, knowledge structuresAdditional short/mid-term benefit: de-cluttering information space by managed forgettingConsideration of multi-level infrastructures (e.g. mobile, PC, cloud)

Dissemination/Exploitation: Personal Preservation as a service (e.g. to customers of a telco company)

53

Variables & Dimensions

Personal Organization

Scenarios • Personal events (years at school, holidays, social events, graduations, marriage, etc)

• Public events

• Work-related events (project starts/closing, business trips, new products, etc.)

Data Type • Local: photos, mobile contacts, sms• Online: user-generated content• Feature:

1. documents2. user behaviors3. social context

• Local: textual documents• Online: web pages• Feature:

1. documents2. user roles3. policies

Interaction(user vs. system)

• search/retrieve, re-find• organize• explore• preserve

Action summarization, aggregation, delete

54

Information Value Assessment

Memory Buoyancy Preservation Value

Short-term relevance/interestsE.g., current meeting documents

Long-term interestsE.g. important life events

Subjective metrics+ usage logs (views, edits, modifies)+ social context, influences

Objective metrics+ diversity, coverage, quality

55

Thank you

http://ForgetIT-Project.eu/

Enter EventForgetIT Project, GA 600826

top related