department of information studies andy dawson lis1510 library and archives automation issues systems...

38
Andy Dawson DEPARTMENT OF INFORMATION STUDIES LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy Dawson Andy Dawson Department of Information Studies, UCL Department of Information Studies, UCL (University of Malta 2010)

Upload: miles-bennett

Post on 24-Dec-2015

220 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

Andy Dawson

DEPARTMENT OF INFORMATION STUDIES

LIS1510 Library and Archives Automation Issues

Systems management, digitisation and optical systems

Andy DawsonAndy Dawson

Department of Information Studies, UCLDepartment of Information Studies, UCL (University of Malta 2010)

Page 2: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

What we will be covering today

• The need to manage systems

• Role of management

• Ongoing support

• Development & maintenance

• Inhouse vs outhouse skills

• Security and viruses

Page 3: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

What we will be covering today

• Why digitise

• The digitisation process

• Graphics formats

• Indexing and retrieval

• OCR

• Management of digitally-based systems

Page 4: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Why manage?

• Fundamentally just like any other management task - about– planning– monitoring– control

• System criticality• Particular problems of a distributed environment• Importance of good record-keeping• Importance of good communications

Page 5: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Administration

• System management roles– What’s in a name?– System management– Network management– What’s a systems librarian?

• Variation with size• Required with small systems? • How intrusive?

Page 6: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Tasks

• Depend on scope

• System planning

• User control, disk allocations, weeding / maintenance, date-based removal/deletion

• Backups

• Prevention/maintenance– hardware, software, system

Page 7: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Backups

• value of central backups

• strategies for backup

• backup logs

• offsite storage

• methods– tape, removables, Mirrors, RAID

Page 8: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Other management tasks and considerations• Permissable downtime

• System criticality– risk assessment– backup systems? – manual alternatives?

• Day’s data?

• Power cuts?

• Cost/possibility of DATA recovery/loss

Page 9: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

etc etc etc

• (In)Consistency of workstation setup– Customising environments, troublehooting

• Software licences – site, per terminal, concurrent user

• Upgrades

• Dissemination of rules

• And not least…MANAGING THE USERS THEMSELVES!

Page 10: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Monitoring useage

• “Big brother is watching you”• Realtime monitoring• Logging

– counting accesses– reviewing activities– full keystroke/data capture– space required

• Site blocking, privacy, education

Page 11: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

User Support

• Helpdesks• Manpower• Expertise• Problem logs & support systems

– System logs/diaries– The importance of

GOOD RECORDS OF SYSTEM OPERATIONS

Page 12: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

User Support

• Message of the day

• Broadcast messaging– Forced logouts/shutdowns

• External access

• Training– Formal training– Online resources & self-help

Page 13: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Maintenance

• Lasts as long as the system lasts!

• Continuing development of the system– corrective– perfective– adaptive

• Software vs Hardware maintenance…

• …and Liveware maintenance?

Page 14: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Maintenance – Inhouse vs External• Contract vs in-house support

– pros and cons

• Levels of service– response time

– repair parameters

– swap-outs

– cost levels

– machine types

• On-site spares

Page 15: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Security

• Against what?– Unauthorised access– Unauthorised use– Accidental damage– Malicious damage

Page 16: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

External and internal access control

• Login control

• Access control

• Terminal restrictions– User to terminal– Activity to terminal

• Time restrictions

• Lockouts & alarms.

Page 17: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Information structure

• How will the information be accessed?

• Back to basic design:– Workgroups– Systems Analysis– Workflows

• Design of structure - with security in mind.

Page 18: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Access permissions

• The System Supervisor

• Backdoor entry

• User groups/classes

• Equivalences

Page 19: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Rights

• Hierarchies of rights

• Rights profiles – Personal rights– Group rights– Directory rights– File rights

• Flags and attributes

Page 20: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Anti-Virus procedures

• What is a virus?

• Anti-virus programs

• Approaches to protection

• The human problem

• The three-step guide

Page 21: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Coffee break!

Page 22: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Why digitise?

• To improve access to information– Multiple users– Better/different access forms

• To preserve precious/fragile materials– Access copies rather than originals– Allow more/better use than with original

• To increase storage capacity/reduce cost– Archival/single copy storage– BUT only where original is not intrinsically important!

Page 23: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Obtaining digital images

• Conversion of existing images into digital form– scanning, digitising, rekeying/redrawing

• Direct creation of digital images– Digital cameras and video recorders– Illustration or desktop publishing software

• Don’t forget the “original” source…

Page 24: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

The digitisation process

• What is a digital image?– A series of dots (pixels) in rows and columns –

“raster” graphics (bitmap)– Also, sometimes, vector graphics - used by CAD

and some graphic art software

• A digital image is literally a “picture”

• It possesses certain parameters set at its creation (resolution)

Page 25: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Resolution• The “quality” of an image• Size expressed in pixels• 72 dpi is quite normal for screen• Higher resolution monitors show more pixels• Typical PC resolutions

– 600 x 800 – low-end – 1024 x 768 - mainstream– 1600 x 1200 or higher – becoming more common

• Printers usually output 300-1200dpi • Scanning resolutions can go up to 9600dpi

Page 26: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Depth resolution – colour range

• Each pixel in the image needs colour information• Held in a number of bits per pixel of related data

storage• Affects colour or greyscale

– 1-bit - black or white– 8-bit - 256 colours– 16-bit – 64,000 colours– 24-bit -16.7 million colours

Page 27: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Image File Sizes

• Can be very large if 24-bit colour and high resolution

• Compression techniques therefore the norm

• Compression can affect final image quality– “Lossy” and “lossless” compression

• Compression is related to format

Page 28: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Graphics/Image File Formats

• Many different ones, some proprietary– Bmp - plain bitmap, as used by Windows– Gif - graphics image format– Jpeg - Joint Photographic Expert Group– Png - scaleable web-based format – Tiff - usual output from scanners– And many more!

• Gif & Jpeg are the most widely used

Page 29: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Gifs

• Suitable for flat colour images, line artwork, logos • Also used for web backgrounds, buttons, bars etc• Max is 8-bit 256 colours • Good compression for large blocks of the same colour• “Special” gif formats also available:

– Transparent– Interlaced– Animated

Page 30: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Jpegs

• Suitable for photos and anything with colour grading

• Lossy compression but not readily perceived by naked eye

• Fills in with approximated colours

• Can choose level of compression

Page 31: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Indexing and retrieval

• Retrieval requires “searchability”• Not inherent in images, therefore:

– Need to “identify” content– Difficulties of doing this automatically– In any practical system, requires “traditional”

cataloguing and indexing– BUT this is slow and expensive!

• However, scanned text can be processed into machine readable text by OCR

Page 32: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Optical Character Recognition

• How does OCR work?– Character matching– Shape recognition– Word recognition– Grammatical/syntactic analysis

Page 33: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Optical Character Recognition

• Benefits of OCR• Problems with OCR

– Accuracy (real vs theoretical)– Effect of:

• Quality of originals• Consistency of originals• Content• Layout

– Cost of checking/correction vs alternatives

Page 34: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Management of digitally-based systems

• Most issues similar to any other system!

• But some obvious peculiarities:

• Legal/copyright issues

• Choice of format

• Importance of adequate retrieval system

• Dangers of technical obsolescence

Page 35: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Legal issues

• What can be digitised?

• Who owns the rights?

• Digital manipulation of images — are the issues fully resolved?

• Evidential value and originality

Page 36: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Format selection

• What format to use?– Scanners will give an option to save as TIFF, or

may try to optimise format based on ‘evidence’– Cameras will save as JPEG

• How will the image be retrieved?

• How will the image be utilised?

• What resolution to use?

Page 37: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

Retrievability and obsolescence

• Importance of proper indexing for retrieval• Lack of browsability in electronic files• Dangers of obsolescence

– Hardware– Software– Formats

• Importance of backup/rewriting• If you can’t find it or use it, you might as well not

have it!

Page 38: DEPARTMENT OF INFORMATION STUDIES Andy Dawson LIS1510 Library and Archives Automation Issues Systems management, digitisation and optical systems Andy

DEPARTMENT OF INFORMATION STUDIES

Andy Dawson

That’s all folks…

• Any questions?

• Tomorrow: IT Lab

• Introduction to the Internet!