
About “information”

Paul.Nieuwenhuysen@vub.ac.be

• Vrije Universiteit Brussel

• Informatie- en Bibliotheekwetenschap, Universitaire Instelling Antwerpen

Belgium

For UNESCO-ODINAFRICA-MIM

May 2002


About “information”

Introductory concepts about information


Our world: future trends

Future trends in our world

• Complexity

• Dynamics and evolution; speed and acceleration

• Internationalization; globalization

• Economic products less based on natural resources and more on “knowledge”

Answers / Requirements / Solutions / Reactions

• Knowledge and skills

• Adaptability; flexibility

• Global co-operation; mobility

• Education, research and the exploitation of knowledge are important


!? Question !? Task !? Problem !?

Compare “information” for instance with “bananas”.


Information: some strange properties (Part 1)

• Information is never consumed and does not deteriorate. Nevertheless, information becomes obsolete; speed of delivery can be crucial. The context is important.

• There is no agreed measure of a unit of information.

• The price of an information item is not well linked to its value in a particular situation. Moreover, the benefit/value of information is hard to quantify.


Information: some strange properties (Part 2)

• One information item can be available to different persons at the same time. Information can be reproduced easily, which makes it cheap for wide consumption. However, copyright can keep the price high.


Information sources: people and documents

• Information sources come essentially in two formats:

»less formal: people communicating by

—telephone

—electronic mail,…

»more formal: documents such as

—hard copy documents

—electronic, digital documents; computer-based files

• Here we focus mainly on information that is stored in documents.


The flow of documentary information through many channels

[Diagram: documentary information flows from Author / Creator / Sender through many media / channels to Reader / User / Receiver]


The flow of documentary information with primary and secondary sources

[Diagram: the flow from Author / Creator / Sender to Reader / User / Receiver, showing two kinds of sources]

• Primary sources / systems: mainly journal articles / books / electronic mail / online sources /...

• Secondary sources / systems: mainly reference works (printed, CD-ROM, online) and library catalogues, including OPACs...


!? Question !? Task !? Problem !?

Why is secondary information created?


The role of secondary information sources

• The secondary information flow is generated on the basis of the primary flow, mainly because the large amount of primary information lowers the chance of retrieving and using the appropriate information item.

• Secondary information tries to bring some order to the great chaos.


Various categorisations of documentary information sources

Information sources can be categorised in various ways. For instance:

• Primary / Secondary

• Hard copy (not digital) / Digital

• Offline / Online

• Text / Image / Sound / Software / Data / Interactive

• Books / Serials


!? Question !? Task !? Problem !?

Explain that the distinction between books and serials is not sharp.


Documentary information sources: books versus serials

• Two types of documents are usually distinguished, irrespective of the subject of their contents:

»Books, monographs, in most cases with an International Standard Book Number (ISBN)

»Serials, serial publications, periodicals, journals, newsletters, in most cases with an International Standard Serial Number (ISSN)

• (However, the distinction is not sharp: some books belong to a series and in some cases they carry an ISBN as well as an ISSN.)
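Both identifiers mentioned above end in a check character, which makes it easy to catch typing errors. The following is a minimal sketch, assuming the ten-character ISBN format that was in use when these slides were written and the eight-character ISSN; both use a weighted sum modulo 11, with "X" standing for the value 10. The function names are illustrative only, and the later 13-digit ISBN, which uses a different scheme, is not covered.

```python
def _mod11_valid(code: str, length: int) -> bool:
    """Check a modulus-11 identifier whose last character may be 'X' (= 10)."""
    chars = [c for c in code.upper() if c not in "- "]  # ignore hyphens and spaces
    if len(chars) != length:
        return False
    total = 0
    for position, char in enumerate(chars):
        if char == "X" and position == length - 1:
            value = 10  # 'X' is only allowed as the check character
        elif char in "0123456789":
            value = int(char)
        else:
            return False
        # Weights run from `length` down to 1 (the check character has weight 1).
        total += value * (length - position)
    return total % 11 == 0


def is_valid_isbn10(isbn: str) -> bool:
    """Validate a ten-character ISBN."""
    return _mod11_valid(isbn, 10)


def is_valid_issn(issn: str) -> bool:
    """Validate an eight-character ISSN."""
    return _mod11_valid(issn, 8)
```

For instance, `is_valid_isbn10("0-306-40615-2")` and `is_valid_issn("0378-5955")` both pass, while changing the final digit of either makes the check fail.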


!? Question !? Task !? Problem !?

Which criteria do you know for the evaluation of the quality of a documentary information source?


Documentary information sources: criteria to evaluate their quality (1)

In view of the widely varying degrees of quality of information sources on the one hand, and of the costs associated with using information on the other hand, we should always be critical. Some evaluation criteria:

»Is the information valid, reliable, trustworthy, genuine, authentic? Is the author honest?

»Is the information accurate, correct?


Documentary information sources: criteria to evaluate their quality (2)

»Does the source have an author with high expertise and a good reputation?

»Is the information source unique? Does it offer a great amount of primary information, which is not obtainable from other sources?

»Is the information complete? Is the work available in its entirety?

»Does the source offer a wide coverage? Is the source comprehensive, substantive?


Documentary information sources: criteria to evaluate their quality (3)

»Is the source objective, without bias?

»Is the information current, up to date?

»Good clear format and lay-out of the information / User-friendly information system / Easy for users to orientate themselves within the resource and to find their way around it?

»Good user support / Good customer support?

»Appropriate type of distribution medium? (print, e-mail, online,...)


Why do researchers want to communicate and publish?

• To share their data and ideas with other members of their research community

• To advance their own careers


Which publication media are preferred by authors?

Good publication media

• reach a large fraction of the relevant community

• have high prestige within that community


Retrospective searching versus current awareness: scheme

[Diagram: a timeline Past / Now / Future, with the labels “Retrospective searching” and “Current awareness”]


Retrospective searching versus current awareness: the basics

• Searching for suitable information takes the form of retrospective searching mainly when we enter a new, unknown field or subject domain where we need supporting information.

• Once we have found enough information, we need to stay aware of new information because we are always challenged

»by the continuous flow of newly generated information and

»by the changing environment in which we work and live.


What is a current awareness service?

• A service which provides the recipient with information on the latest developments within the subject areas in which he/she has a specific interest or need to know.

• Aims:

»Saving time

»Covering many information sources

»...


!? Question !? Task !? Problem !?

Give examples of current awareness services.


!? Question !? Task !? Problem !?

Give evaluation criteria for current awareness services.


Current awareness services: considerations / evaluation criteria

• Same criteria as for information sources in general, plus:

• Frequency / timeliness / currency of the service

• Mechanism for creating user interest profiles + user interface

• In the case of bibliographic services: associated full document / full text delivery service
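The idea of a user interest profile can be illustrated with a small sketch. It assumes the simplest possible mechanism, which is not necessarily how any real service works: each user registers a set of keywords, and each newly arrived title is matched against those keywords. The names `Profile`, `words` and `alerts` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Profile:
    """A user's interest profile: a name and a set of keywords."""
    user: str
    keywords: set[str]


def words(text: str) -> set[str]:
    """Split a title into lower-case words, stripping common punctuation."""
    return {w.strip(".,;:!?()\"'").lower() for w in text.split()} - {""}


def alerts(profiles: list[Profile], new_titles: list[str]) -> dict[str, list[str]]:
    """For each user, list the new titles sharing at least one word with the profile."""
    result = {}
    for profile in profiles:
        wanted = {k.lower() for k in profile.keywords}
        result[profile.user] = [t for t in new_titles if wanted & words(t)]
    return result


if __name__ == "__main__":
    profiles = [
        Profile("anna", {"tides", "oceanography"}),
        Profile("ben", {"cataloguing"}),
    ]
    new_titles = [
        "Tides and currents in the Indian Ocean",
        "Cataloguing rules for serials",
        "A history of printing",
    ]
    for user, titles in alerts(profiles, new_titles).items():
        print(user, titles)
```

Exact word matching of this kind is sensitive to vocabulary: a profile containing “tides” would miss a title that only says “tidal”, which is one reason the quality of the profile mechanism is an evaluation criterion.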


Information retrieval: evolution of storage and distribution media

• 1450: printing with reusable characters/fonts (movable type)

• 1975 +: online access to databases; from the 1970s: the growing Internet

• 1985 +: CD-ROM

• 1990 +: World-Wide Web (based on the Internet)


Information retrieval: end user or information intermediaries

[Diagram: the End-user reaches Information either directly or through an Information intermediary (broker or library or ...)]


End user versus information intermediary

• People can retrieve information themselves, directly as so-called “end-users”.

• However,

»the information landscape is complex,

»it may take a lot of time to find the right information,

»it may be costly to search for information.

• Therefore it may be wise to obtain the assistance of an expert information intermediary, such as a reference librarian or an information broker.


Information retrieval: presentation of the results

The presentation to the user of information retrieved should ideally be

• eye-catching

• easy to read

• laid out in a standard format

• fully referenced

• ...


About “information”

Computer- and network-based information


Information: from bits to meaningful information

Digital computer data = bits (0 or 1)

Program code, meaningful for and to be interpreted / executed by a suitable / compatible computer

Information = “documents”, meaningful for and to be interpreted by human beings
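The step from bits to meaning can be made concrete with a short sketch. The bit pattern below is an arbitrary illustration chosen for this example; the point is that the same 32 bits yield either a word or a number, depending on the convention used to interpret them.

```python
# The same bits carry different "information" depending on how they are interpreted.
bits = "01001001 01101110 01100110 01101111"

# Group the bits into bytes (8 bits each).
data = bytes(int(group, 2) for group in bits.split())

# Interpretation 1: each byte is an ASCII character code, readable by a human.
as_text = data.decode("ascii")

# Interpretation 2: the four bytes together form one unsigned integer.
as_number = int.from_bytes(data, byteorder="big")

if __name__ == "__main__":
    print(as_text)
    print(as_number)
```

Read as ASCII text the bits spell "Info"; read as a single integer they are just a large number with no obvious meaning to a human reader.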


Information: digitally stored and managed information

Categories of digital, computer readable information / data, forming electronic “documents”, understandable by human beings.

[Diagram: bits (0 / 1) represent text, numbers, images, video and sounds; combined (+) these form multimedia]


Information: types of digital information

[Diagram: digital information (0 / 1) comprises linear text and hypertext, static images and video, sound, and programs for computers; the combination is labelled multimedia / hypermedia]


Some publication media compared

[Chart: the media Printed, CD-ROM and Online / Networked compared on the two axes Update speed and Volume]


!? Question !? Task !? Problem !?

What can you conclude from the comparison of the publication media print, CD-ROM and online?


Electronic publishing: some meanings of the term

[Diagram: the path from Author / producer to Reader / User. Labels shown:]

• Creation of input

• Computerized database

• Desktop publishing / Computerized typesetting

• Printed documents

• Published database: CD-ROM, floppy disk, magnetic tape, online access, ftp archives

• Transfer / Reproduction / Sorting / Formatting / Lay-out

• Selecting / Downloading / Sorting / Formatting

• Distribution / Access / Reading / Using

• Distribution / Storage / Buying / Lending / Reading


Electronic publishing: evolutionary stages

• To produce print on paper, using computers

• Dual mode: on paper and as database

• Simulation of print on computer display

• Repackaging of data for computer display (e.g. text to hypermedia)

• Creation by author directly for the computer (hypermedia) and no printed version


!? Question !? Task !? Problem !?

Which problems and advantages do you see in electronic publishing?


Electronic publishing: technological problems (Part 1)

• Lack of access to technological infrastructure (computers, peripherals, and networks) by potential readers/users.

• Computer or network downtime can hinder access to information.

• Some information technology expertise and skills are required.


Electronic publishing: technological problems (Part 2)

• Low quality of the computer interface: most computer displays are less attractive than printed papers and cause eyestrain.

• Long times required to download information, in particular when slow or saturated or unreliable networks are used.


Electronic publishing: social problems (Part 1)

• The reward system for the author is traditionally based on printed publications, and electronic publications are not (yet) accepted as credible by many in academic circles: a chicken-and-egg situation.

• Lack of a good, cheap, universal indexing system to index electronic publications.

• Reluctance of many users to pay for electronic information.


Electronic publishing: social problems (Part 2)

• The ease of copying creates opportunity for plagiarism.

• Mutability of the publication / Difficulty in establishing authenticity and authorship / Version control /...

Technologies such as digital signatures exist, but their acceptance and implementation are slow; changes in contents should be avoided so that a particular archived publication can still be referred to.


Publications on CD-ROM or online: advantages compared with hard copy


• Can be cheaper to produce, to transport and to store.

• Can offer better search features.

• Can offer various output formats.

• Can offer fast and efficient “copy and paste” by the reader/user of information to other documents.

Taken together, these features allow more efficient access to large, high volume documents or databases.


Publications on CD-ROM or online: advantages compared with hard copy


• Can offer multimedia and hypermedia contents, such as animation, video, static and dynamic virtual reality (instead of only formatted text and numbers plus graphics).

• Can offer “active contents” = accompanying / embedded programs to view / manipulate / manage / order / select the data / information / contents.


Publications online: advantages compared with hard copy

• Allows publishers faster, more up-to-date publication, which gives readers access to more current information.

• Allows access from any place connected to the network.

• Allows permanent access, 24 hours/day.

• Avoids loss and theft of information stored in hard copy material.


Publications online: advantages compared with hard copy

• Allows access to the same information by many users at the same time.

• Allows faster access.

• Allows the collection of usage statistics.

• In the case of journals: allows payment only for the articles (items) that you read/use, instead of payment of a flat subscription fee.

• ...


Publications online in WWW: advantages compared with hard copy

• Allows documents with hyperlinks to other WWW documents on the network, even on other server computers.

• Allows inclusion of the documents by any WWW-indexing system in a searchable full-text index.

• Allows efficient application of meta-information related to the documents, embedded in the documents or stored on other server computers.


Publications online in WWW: advantages compared with hard copy

• Allows a good integration with other network services like e-mail and Usenet.

• Allows better communication:

»comments can be added to an electronic publication at any time after it is put into the system, and subsequent readers can read these comments together with the publication

»allows easy and fast contact with author(s) by e-mail, if needed


Publications online in WWW: advantages compared with hard copy

• Allows extending “documents” to hybrid online services + documents + databases,

»which can execute programs

—on the server (using CGI, for instance), or

—on the client computer (using viewer programs, or plug-in programs or Java programs or ActiveX programs), and

»which can use information stored in the network, even on other server computers.


Convergence of media to computer-based communication

• Already based on computers and networks:

»CD-ROM / DVD / Hypermedia / Remote login into a computer / File transfer from a computer / Electronic mail / Usenet / the World-Wide Web...

• Evolving towards a computer- and network-based technology:

»Telephone / Radio / Television / Video / Fax / Journals / Books / ...


Scientific publishing in Utopia: an ideal scheme

[Diagram: many authors, many editors / publishers and many readers / users are linked by one global, international computer data communication network to an online remote access multimedia database server, through many database search clients and user interfaces. In science, author = reader.]


!? Question !? Task !? Problem !?

Indicate the differences between reality and that simplified, ideal scheme of the information flow.


!? Question !? Task !? Problem !?

Which basic problems/difficulties hinder people from finding / accessing / using information?


Information retrieval: basic difficulties (Part 1)

• In many cases it is not completely clear to the user of an information retrieval system which information is actually needed.

• In many cases the need for information cannot be expressed completely in the form of a query.

One of the reasons is that the complete context of the information need should ideally be expressed, including the knowledge and background of the searcher.


Information retrieval: basic difficulties (Part 2)

• Computer systems are artificial, but nevertheless most use human language in their interface with the human users, for instance in database search systems. This may cause difficulties related to language and vocabulary in particular. Some examples:

»People use different languages and different terms (vocabularies) to describe a similar concept.

»Concepts, vocabularies and meanings of words and terms may change over time.

»Meanings of words / terms may depend on their context.


Information retrieval: basic difficulties (Part 3)

• Many different and imperfect retrieval systems should or must be used.

»To retrieve and access the information that is in principle available, many different retrieval systems must be available and be mastered.

»Furthermore, perfect information retrieval software does not (yet) exist; scientific and technological evolution in the domain of information retrieval software has been fast since about 1970.


Information retrieval: basic difficulties (Part 4)

• Information overload

Users are often overwhelmed by the amount of available information and by the large influx of new information.


Information retrieval: basic difficulties (Part 5)

• The price (or inaccessibility) of particular information

A lot of information cannot be obtained at all, or at least not free of charge.


Information retrieval: browsing and searching as methods

• To make information available, the producer of an information system can offer the user basically two different ways to retrieve the right information from the system:

»by browsing or

»by searching.


Information retrieval: browsing versus searching

• Browsing a logically ordered list of terms

»Logical order / Sorted by subject

»Table of contents

»Classification

»Hypertext-Hypermedia: jump from a page to a linked page

• Searching by submitting a search term to the system

»Alphabetical order / Not sorted by subject

»Alphabetical index

»Thesaurus

»Hypertext-Hypermedia: search built in a page


Information retrieval: browsing systems support

• In browsing systems, the user can follow some of the paths offered by the system.

• The information is ordered, according to subject for instance.

• The user does not have to use his own words to indicate his needs.


Information retrieval: browsing systems

• To support organising and browsing of information items, some type of classification is applied in many cases.


Information retrieval: examples of browsing systems

• Examples of browsing systems are

»a table of contents in the front part of a book,

»a set of books placed on shelves according to some classification system,

»a hypertext hierarchical directory on the WWW, or more generally all hypermedia systems.
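A hierarchical directory of this kind can be sketched as a nested structure in which the user only chooses among the options offered at each level. The directory contents and the function name `browse` below are invented for illustration.

```python
# A tiny, made-up classification: categories contain subcategories or document titles.
DIRECTORY = {
    "Science": {
        "Oceanography": ["Tide tables", "Marine data centres"],
        "Physics": ["Optics primer"],
    },
    "Libraries": {
        "Catalogues": ["Union catalogue"],
    },
}


def browse(tree, path=()):
    """Follow the chosen path and return the sorted options offered at that point."""
    node = tree
    for step in path:
        node = node[step]  # the user picks one of the offered options
    return sorted(node)


if __name__ == "__main__":
    print(browse(DIRECTORY))
    print(browse(DIRECTORY, ("Science",)))
    print(browse(DIRECTORY, ("Science", "Oceanography")))
```

The user never types a search term; every step is a choice among labels written by whoever built the directory.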


Information retrieval: search systems

• In search systems, the user has to express his need for information by formulating a query, normally in a natural language or in a more formal language.

• In this case the information is normally not ordered according to some logic; in most cases it takes the form of a well-structured compilation of items of a similar form, such as the records of a database when a computer system is applied.


Information retrieval: search systems support

• To support searching by avoiding some of the difficulties caused by the use of natural language for retrieval purposes, a list of controlled keywords or a thesaurus or an ontology is applied in many cases.
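How a controlled vocabulary reduces the natural-language problem can be shown with a minimal sketch. It assumes a toy thesaurus that maps non-preferred terms to one preferred term; both the indexing keywords and the query are normalised before they are compared. The entries and the names `THESAURUS`, `normalise` and `search` are hypothetical.

```python
# Hypothetical thesaurus entries: non-preferred term -> preferred term.
THESAURUS = {
    "periodical": "serial",
    "journal": "serial",
    "monograph": "book",
}


def normalise(term: str) -> str:
    """Map a term to its preferred form; unknown terms are kept as they are."""
    cleaned = term.strip().lower()
    return THESAURUS.get(cleaned, cleaned)


def search(query: str, records: dict[str, list[str]]) -> list[str]:
    """Return the titles whose keywords match the query after normalisation."""
    wanted = normalise(query)
    return [
        title
        for title, keywords in records.items()
        if wanted in {normalise(k) for k in keywords}
    ]


if __name__ == "__main__":
    records = {
        "Doc A": ["Journal"],
        "Doc B": ["periodical", "library"],
        "Doc C": ["monograph"],
    }
    print(search("serial", records))
    print(search("Periodical", records))
```

Whether the searcher types "serial", "journal" or "periodical", the same two records are found, because all three terms are reduced to one preferred term.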


Information retrieval: examples of search systems

• Examples of search systems are

»the index (the register) in the back part of a book,

»a library or museum catalogue with a search interface,

»a search form on a web page.


!? Question !? Task !? Problem !?

Give some examples of concrete information systems that use browsing or keyword searching respectively.


!? Question !? Task !? Problem !?

List and discuss the advantages and problems of browsing and keyword searching respectively.


Information retrieval: pro and contra of browse systems

Advantages:

»Browsing is relatively easy for the user.

Difficulties for the user:

»The user can explore the information space only along paths that reflect the system designers' view of the world, not his own view.

Difficulties for the producer:

»It is relatively costly to construct an information system based on browsing.


Information retrieval: pro and contra of search systems

Advantages:

»Creation of keyword indexes for fast searching is relatively simple and cheap and can be automated.

Difficulties for the user:

»Searching is hindered by vocabulary / language problems.

»The users cannot always fully articulate their needs.
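The automated creation of a keyword index can be sketched in a few lines. The example assumes a simple inverted index, which maps each word to the set of documents containing it, and a query in which all terms must match; the sample documents and the function names are illustrative only.

```python
from collections import defaultdict


def build_index(documents: dict[int, str]) -> dict[str, set[int]]:
    """Build an inverted index: each word maps to the ids of the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            word = word.strip(".,;:!?()\"'")
            if word:
                index[word].add(doc_id)
    return index


def search(index: dict[str, set[int]], query: str) -> set[int]:
    """Return the ids of the documents that contain every term of the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


if __name__ == "__main__":
    documents = {
        1: "Library catalogues and OPACs",
        2: "Journals in the library",
        3: "Printing with movable type",
    }
    index = build_index(documents)
    print(search(index, "library"))
    print(search(index, "library journals"))
    print(search(index, "periodicals"))
```

The last query returns nothing although document 2 is about periodicals: the index was cheap to build, but the searcher's word does not occur in the text, which is exactly the vocabulary problem listed above.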