experimental workflow development in digitisation
DESCRIPTION
Experimental Workflow Development in Digitisation 2nd Qualitative and Quantitative Methods in Libraries International Conference (QQML2010), 25-28 May 2010, Chania, Greece.TRANSCRIPT
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
Mustafa Dogan (Göttingen State and University Library)Clemens Neudecker (Koninklijke Bibliotheek)Gerd Zechmeister (Austrian National Library)Sven Schlarb (Austrian National Library)
Experimental workflow developmentin digitisationThe concept of collaborative workflow development in the IMPACT project
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
2
Agenda Background of IMPACT Digitisation workflows Collaborative workflow development Architectural principles Workflow development platform Key success factors Outlook and future scenarios
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
3
Background of IMPACT Project partners
– 26 Libraries, Research Institutes and Industry Partners Main objective
– Improve access to historical books and newspapers printed before 1900
Software tools and prototypes– Image Enhancement & Segmentation Toolkit– Improved ABBYY FineReader OCR Engine, IBM Adaptive OCR– Post-processing and -correction modules– Lexical resources for several European languages
Support to the MLA community– Best Practises & Strategic/Operational Guidelines– Online Helpdesk– Tool Showcases & Demonstrators– Centre of Competence
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
4
Digitisation workflows Digitisation: a sequence of steps, from selection of
analogue source material to presentation of digital objects for end-users
Workflow: software-based execution of a sequence without human interaction
Challenges and barriers– Workflows are tailored to specific needs– Lack of interoperability for applied software and input/outdata
data– Lack of collaboratively used and developed resources and
expertise
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
5
Collaborative workflow development Workflow Development as a community-driven activity
using an experimental platform Scientific workflows: using web services representing
individual software modules (Shiyong Lu et al. 2009) Providing highly innovative and efficient tools to a wider
community to design workflows Technical staff providing the platform,
conceptual/library staff designing workflows Using Web 2.0 features to share and expand knowledge
and resources
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
6
Architectural platform principles Modularity Transparency Flexibility Extensibility Open standards based Accessibility Scalability Collaboration
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
7
Community
ArchivesMuseums Libraries …..
Component A Component B Component C Component D
Workflow Registry
Workflow 1 Workflow 2 Workflow 3
Experimental workflow development platform
Workflow development platform
rate
modify
comment
compare
share
measure
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
8
Workflow development phases
Cen
tral
Dat
a R
epos
itory
Select
Design
Execute
Evaluate
Workflow Development Workbench
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
9
Evaluation criteria OCR: correctly recognised characters/words Segmentation: correctly identified text and graphical
regions Workflows: comparing workflows and identifiying most
suitable Statistical and provenance data: e.g. processing time
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
10
Outlook Keys to success
– Joint effort by library and software development staff– Usability of tools and platform– Incentive to collaborative work– Testing and adaptation of workflows– Permanently tailoring and optimizing workflows
Future work– Demonstration of current (web) services– Experimental platform as sustainable resource for a Centre of
Competence for the MLA community
27.5.2010 QQML Chania/Greece
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
11
Thank you very much!
Contact:Project Website: http://www.impact-project.euProject Office: [email protected]