digitally connecting the scattered heritage: a polish ... · ±allow smaller institutions to secure...

40
Digitally connecting the scattered heritage: a Polish perspective Marcin Werla [email protected] :URFáDZ September 8, 2014

Upload: others

Post on 19-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Digitally connecting the scattered heritage: a Polish perspective

Marcin [email protected]

September 8, 2014

Page 2: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Development of digital libraries infrastructure in Poland

0

10

20

30

40

50

60

70

80

90

2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013

Increase of  the  number of  digitallibraries between 2002 and  2013

10

1

121

15

2

3

1

4

3

111

1

1

2

1

1

1

1

1

1

1

2

1

1

1

1

1

1

1

Digital  libraries in  the  PIONIER  network

- Several hundredsinstitutions- 1.8M objects

Page 3: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Digital libraries in the PIONIER Network

~70 institutional digita libraries ~40 regional digital libraries

In total around 2 mln of digital objects

Name Size

1 Cyfrowa  Biblioteka  Narodowa  Polona 308  9332 258  2803 e-­‐biblioteka  Uniwersytetu  Warszawskiego 161  9654 46  9585 43  9896 Polska  Biblioteka  Internetowa 32  0717 23  5938 22  4449 Muzeum  Narodowe  w  Warszawie 13  060

10 11  789

Name Size

1 Wielkopolska  Biblioteka  Cyfrowa 222  5212 105  4233 86  9184 Kujawsko-­‐Pomorska  Biblioteka  Cyfrowa 75  6745 Biblioteka  Cyfrowa  -­‐ 52  3766 43  8477 39  9168 Pomorska  Biblioteka  Cyfrowa 38  221

9 Zachodniopomorska  Biblioteka  Cyfrowa  "Pomerania" 29  733

10 Podlaska  Biblioteka  Cyfrowa 28  927Average  size:  15  826  objectsMedian:  1  357  objects

Average  size:  24  020  objectsMedian:  9  399  objects

Page 4: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Digital libraries in the PIONIER Network

Regional digital librariesDevelopment of idea of regional collaboration shaped during the initiation of Wielkopolska Digital Library in 2002Allow smaller institutions to secure collections in digital form and to make them availableon-lineOptimize the use of shared IT infrastructureThey are implemented also in country scale (FIDES, RCIN) as well as in local scale (Tarnowska DL, DL, Make access to digital content easier by providing single point of access

Page 5: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Practice of regional digital libraries

For reader they are simply web portals giving access to collections of cultural heritage from many institutions under a single WWW addressIn practice, realized as consortia, which on the basis of knowledge exchange and collaboration, give their participants:

Access to IT infrastructure necessary to put digital collections on-lineWays to professionally preserve digital copies for long timeKnow-how allowing to prepare high resolution digital materials and metadataWide promotion of resourcesVery good conditions to acquire additional funding in common projects

Page 6: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use
Page 7: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Digital library of Wielkopolska popularity in 2013According to Google Analytics

Page 8: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Practice of regional digital libraries

Structure of collections of regional digital library often reflects complexity of the consortiumRegional collectionsThematic collectionsInstitutional collections

Regional collaboration gives many benefits, but also requires compromisesCommon metadata schemaCommon web interface

Page 9: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Practice of regional digital libraries

Good solution to balance collaboration and promotion of individual institutions are virtual repositories built on top of regional digital libraries

Page 10: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Role of regional digital libraries

Regional digital libraries are more often a basis for new information services related to the heritage of a regionThey are used as repositories of source data, making the information services more rich and trusted

Page 11: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

DInGO software Digitise

Technical ingredient of regional digital librariesdLibra: system for digital libraries (e.g.: http://jbc.bj.uj.edu.pl/) dMuseion: system for digital museums (e.g.: http://cyfrowe.mnw.art.pl/) dLab: system for management of digitisation processesdArceo: system for long-term digital preservation

http://dingo.psnc.pl/

Page 12: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Digitisation process and DInGO software

Planned  objects

Presentation  files

MASTER  files

Digitisation,  standarisation

On-­‐line  access

Preparation  of  digital  object

Selection  of  objects  for  digitisation

Archiving

On-­‐line  publishing

Page 13: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Promotion of regional heritage on (inter)national level

Regional consortia allow small institutions to appear on the InternetRegional digital libraries aggregate local and regional heritage in a digital formNational level access and promotion is organized on the basis of metadata aggregation from distributed sources to one central databaseThis is the responsibility of Digital Libraries Federation of the PIONIER NetworkFederation collaborates with Europena, moving these regional collections even higher, to international level

http://fbc.pionier.net.pl/

Page 14: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Digital Libraries Federation (DLF)http://fbc.pionier.net.pl/

Public  portalSearching,  browsingDigitisation  plans,  persistent identifiers

Data  provider  for  external  servicesEuropeana,  DART-­‐EuropeKaRo

Information  website  for  DL  creatorsNews,  publicationsDigital  libraries  database

Advanced  services  for  DL  administratorsTraffic  monitoringMetadata  analysis  module

Competence  center  for  professionalsE-­‐learning  coursesQ&A platform

Page 15: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Who is providing data to DLF?

Hundreds of institutions from entire Poland

Digital  libraries,  repositories,  digital  museums,  digital  archives

Page 16: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

What kind of objects can you find in DLF?

Based on metadata analysis, done on September 3, 2014

journal46%

article14%

book12%

photo4%

electronic  document

4%

PhD  thesis3%

ephemera3%

other16%

journal80%

book5%

postcard2%

oldprint2%

manuscript1%

ephemera1%

photo1%

archival  document

1%

other6%

80%  objects:  materials  created before  1945 20%  objects:  materials  created  after 1945

Page 17: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Increase of the number of objects in the DLF

2014   -­‐ ~2  million  objects

2007   public  opening  of  DLF,  

~75 thousand  objects

Page 18: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

DLF statistics

Presently: During  2013:  

105 data  sources

325  institutions

~2 million  objects

560  thousands  unique  users

1,1  million  visits

4,5  million  views

Page 19: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Collaboration with EuropeanaEuropeana.eu = European Digital Library, Museum and Archive

2009 2010 2011 2012 2013

Beginning  of  collaboration  in  EuropeanaLocal

Federation  connected  to  Europeana

Europeana  API  pilot  program  participation

Polish  edition  of  Hack4Europe

Two  more  Hack4Europe  contests  as  a  part  of  Europeana  Awareness  project

Collaboration  on  Europeana  1989  Europeana  Cloud project  started

Page 20: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Visibility of Polish collections in Europeana

Data from http://www.europeana.eu/ (September 3, 2014)

1  090  660

1  711  099

1  766  490

2  486  594

2  655  770

2  707  656

2  975  847

3  515  861

3  650  312

3  876  048

3,3%

5,2%

5,4%

7,6%

8,1%

8,3%

9,1%

10,8%

11,2%

11,9%

10.  Irlandia

9.  Polska

8.  Norwegia

7.  Wielka  Brytania

5.  Szwecja

4.  Hiszpania

3.  Holandia

2.  Niemcy

1.  Francja

Top  10  countries  in  Europeana

1  036  395

1  062  881

1  331  865

1  381  668

1  405  903

2  005  866

2  025  754

2  103  884

2  240  932

6  368  924

3,2%

3,3%

4,1%

4,2%

4,3%

6,1%

6,2%

6,4%

6,9%

19,5%

10.  CultureGrid

9.  Arts  Council  Norway

8.  Swedish  Open  Cultural  Heritage

7.  Linked  Heritage

6.  Federacja  Bibliotek  Cyfrowych

5.  CARARE

4.  Athena

3.  OpenUp!

2.  Hispana

1.  The  European  Library

Top  10  data  providers  to  Europeana

Page 21: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Public  collection  days  and  home  digitisation

Community  contributions

Long  term  preservations

Europeana and private collections How to save private collections together with their social context?

europeana1989.eu

fbc.pionier.net.pl/zbiorki

fbc.pionier.net.pl/zbiorki

Page 22: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Example of high value of private collections

Page 23: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Summarizing - Most important success factors

Regional collaborationDevelopment of digital libraries in Poland as they are at the moment was initiated as a series of regional projects, often WITHOUT any dedicated external funding

One host institution which is providing the technical infrastructureA number of partners providing content

First consortium was: Poznan Foundation of Scientific Libraries, PSNC, academic and public institutions from the Wielkopolska region http://www.wbc.poznan.pl/Such approach

Allows to lower the costs for each participating institution (in many aspects)Gives small libraries opportunity to promote their collections on-lineProvides natural platform for collaboration for next projects

Page 24: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Summarizing - Most important success factors

Good technical supportShared technology platform (in case of Poland: dLibra/DInGO)

Common development directionsShared development costsLack of typical risks related to project-based funding

Not maintained in-house solutionsAbandoned commercial softwareRising prices and vendor lock-in

Documentation and technical support available locallyNatural environment for development of good users community

Requires reliable technology partner with proper business model

Page 25: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Summarizing - Lessons learned

Bottom-up approach made all that possible Did I forget to mention any central institutions in my presentation?

Some things were not standardized initially on central levelcreated in many places in parallel

40+ variatons of Dublin CoreOther solutions were blindly copied, while they could be tailored to specific local needs

The curse of DjVu format popularity

Page 26: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Most important challenges

Quality in mass digitization projectsHow to check within a month the quality of what a commercial company was preparing for 6-8 months?How to eliminate cheating companies and not cancel the project?

Long-term digital preservationHow to make sure that results of hundreds of digitisation projects are properly secured for the future?

Page 27: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Most important challenges

Data interoperabilityHow to make sure that newly developed small systems follow best digital libraries practices?How to use data automatically with tools for digital humanities researchers?

Open access to data and proper rights labellingMetadata copyrighted or not?

Europeana requires CC0 statementContent

Is digitisation a creative process?Can commercial reuse of public domain materials be free?

Coordination of Europeana-related effortsAssuring proper representation of Polish heritage

Page 28: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Cloud technologies in the cultural sector

Cloud  servicesRemote  support  and  education

Europeana

Mapping

Aggregation

Enrichment

DLaaS

Small  libraries

Private  archives

Home  museums

Wide  access

Local  memory  institutions

Small institutions: LoCloud http://locloud.eu/

Page 29: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Cloud technologies in the cultural sector

LoCloud Collections Digital Library Service in a cloud

https://locloud.pl/The service is now open and available for testing1.0 version is planned for January 2015Until the end of 2015 the service is free, after that time it must becomeself sustainable

Page 30: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use
Page 31: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Cloud technologies in the cultural sectorEuropean infrastructure: Europeana Cloud

The  EuropeanLibrary

Digital  Libraries  Federation

EU-­‐Screen

The  EuropeanLibrary

Digital  Libraries  Federation

EU-­‐ScreenPortal  Europeana

Europeana  Research

vs

http://pro.europeana.eu/web/europeana-­‐cloud

Page 32: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use
Page 33: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

IMPACT European Center of Competence

IMPACT

 Co

Cin  Digitsation

Tools

Data

Services

Trainings

Fou

nd

ing

me

mb

ers

Shared  infrastructure  for  digital  libraries  competence  centers

Optimization  of  resources  usage  in  digitisation  processes  

Standardization  of  data  and  tools

Prizes,  contests,  events

Best  practicies

http://digitisation.eu/

Page 34: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use
Page 35: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use
Page 36: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Virtual Transcription Laboratory

Virtual Transcription Laboratory(http://wlt.synat.pcss.pl) offers:

A free tool supporting creation of textual versions of historical documentsDedicated OCR service for all VTL usersCrowdsourcing platform allowing to collaborate while creating transcriptions of digitized documents

Page 37: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Examples of projects in VTLhttp://wlt.synat.pcss.pl

Books, old-

Page 38: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

OCR training tool for profiling with historical documents

http://wlt.synat.pcss.pl/cutouts

OCR training tool

Page 39: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Thank you for your attention!Marcin Werla ([email protected])

http://dl.psnc.pl/

Page 40: Digitally connecting the scattered heritage: a Polish ... · ±Allow smaller institutions to secure collections in digital form and to make them available on-line ±Optimize the use

Supercomputing and Networking Center

ul. Noskowskiego 12/14, 61-Office: phone center: (+48 61) 858-20-00, fax: (+48 61) 852-59-54,

e-mail: [email protected], http://www.psnc.pl

affiliated to the Institute of Bioorganic Chemistry of the Polish Academy of Sciences,