report on digital preservation services
DESCRIPTION
Report on digital preservation services. 13th Joint INIS/ETDE Technical Committee Meeting 20-22 October 2011, Vienna, Austria. Germain St-Pierre. Digital Preservation Technician. Digital Preservation Projects at INIS. INIS Microfiche Collection Other Digitization Projects What comes next?. - PowerPoint PPT PresentationTRANSCRIPT
IAEAInternational Atomic Energy Agency
International Atomic Energy Agency
International Nuclear Information System (INIS)
Report on digital preservation services
13th Joint INIS/ETDE Technical Committee Meeting 20-22 October 2011, Vienna, Austria
Germain St-PierreDigital Preservation Technician
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Digital Preservation Projects at INIS
• INIS Microfiche Collection
• Other Digitization Projects
• What comes next?
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
1. Microfiche Digitization Project
Objectives
• Digitize the INIS microfiche collection
• Improve on-line access to full-texts
• Ensure long-term preservation of the digital collection
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Key Issues
• Prevent duplication of digitization efforts
• Ensure efficient monitoring and coordination of the project
• Ensure good communication with the contractors
• Ensure efficient matching of full-texts with INIS records
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Microfiche Scanning
• Status at the last (12th) INIS/ETDE JTC meeting:
• 8,7 million digitized pages
• Status on 20 October 2011:
• 12,5 million digitized pages
• Increase of 3,8 million pages
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Our Goal
• Provide the full-text for each NCL “available from INIS”• > 500 000 INIS NCL records
• Volume 1 to 27 (1970 to 1996)
• In > 304 000 cases:• 1 INIS record = 1 report on MF
• Full-text available right after MF digitization
• In ~ 186 000 cases, post-processing is necessary (document splitting)
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Microfiche Digitization Statistics
Year Full-Texts from MF Digitized Pages Size in GB
Before 2004 671 59240 4.1
2004 19982 1329912 36.8
2005 36964 1584308 32.3
2006 23127 1365559 33.3
2007 9308 667419 16.2
2008 25710 1228819 29.7
2009 81220 3936296 76.8
2010 33882 1968969 45.8
2011 22966 451605 14.5Total 253830 12574566 289.5
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Cumulative Full-Texts from Microfiche
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
Digitized Pages from Microfiche
IAEA
Our Tools
• Scanning and Image Enhancement
• PixEdit 7
• Kodak Capture Pro
• OCR
• ABBYY FineReader 10
• NCL Collection Management System
• Developed by SDSG
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
2. Other Digitization Projects
• Completed projects:• IAEA Bulletin
• IAEA General Conference
• On-going projects:• INDC
• Out of Print IAEA Publications• Technical Reports Series (TRS)
• Proceedings Series
• To support Member States digitization initiatives
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
3. What comes next? (1)
• Digitization of the remaining NCL on MF (mostly from U.S.A.)
• Progressive Release of “Restricted” full-texts for free access on the Web
• Testing and Implementation of new tools• ABBYY FineReader 11: now supports OCR of
Arabic
• OCR of Scientific Notation: e.g. InftReader
IAEA 20-22 October 2011, Vienna13th INIS/ETDE Joint Technical Committee Meeting
3. What comes next? (2)
• Add value to the existing Digital Collection by reprocessing (OCR) scanned images
• Conversion to PDF/A-2u (with Unicode text) to ensure long-term preservation of our full-texts and multilingual full-text search