incorporating historical and geographical dimensions into a search interface michael buckland...

35
Incorporating Historical and Geographical Dimensions into a Search Interface Michael Buckland Electronic Cultural Atlas Initiative University of California, Berkeley Association of American geographers San Francisco, CA 17 April 2007

Post on 22-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Incorporating Historical and Geographical Dimensions into

a Search Interface

Michael Buckland Electronic Cultural Atlas Initiative

University of California, Berkeley

Association of American geographers

San Francisco, CA 17 April 2007

17 April 2007 Amer Assoc Geogr 2

Acknowledgements

Summarizes work done by and with Kim Carl, Fredric Gey, Ray Larson, Vivien Petras, Jeanette Zerneke and others.

Supported by the [Federal] Institute of Museum and Library Services.

17 April 2007 Amer Assoc Geogr 3

Support the Learner: What, Where, When and WhoSupported by the Institute of Museum and Library Services

Five ideas . . .

1. Understanding requires knowing context.

2. Using internet resources should be as easy as using a library reference collection.

3. Find context of any museum object, document, or performance: What is related to it: what it is, where it came from, when it originated, and who associated with it?

4. WHAT, WHERE, WHEN, and WHO as a useful structure.

5. Make better use of existing descriptive metadata.

17 April 2007 Amer Assoc Geogr 4

Any document, object, performance or query

Any resource:Audio, Images, Texts, Numeric data, Objects, Virtual reality, Webpages

Any catalog: Archives, Libraries, Museums, TV, Publishers

Connect it with its context – and other resources.

Facet Vocabulary Displays

WHAT Thesaurus Cross- e.g. LCSH references

WHERE Gazetteer Map

WHEN Period directory Timeline

WHO Biograph. dict. Interpersonal

e.g. Who’s Who relationships

17 April 2007 Amer Assoc Geogr 5

Linking portal with resources

Local: Relational database - - Generates pages dynamically - - Search term recommender system - - Vocabulary mapping tables - - Library of maps

Remote: - - “Federated” search e.g. Z39.50 - - Structured URLs

17 April 2007 Amer Assoc Geogr 6

WHAT Subject headings Cross-references within and between indexes

LCSH: Kung fu films see Martial Arts filmsPreviously Hand-to-hand fighting, oriental, in motion pictures

Automobile: - PASS MOT VEH, SPARK IGN ENG (U.S. Import/Export statistics) - TL 205 (Library of Congress Classification) - 180/280 (US Patent classification) - 3711 (Standard Industrial Classification)

Computer: HS 847120 Digital auto data proc mach contng in the same housing a CPU and input & output device.”(International Harmonized Commodity Classification System).

NEED TO MAP TO & BETWEEN UNFAMILIAR VOCABULARIES

17 April 2007 Amer Assoc Geogr 7

Guidance from user’s query to remote system’s vocabulary

17 April 2007 Amer Assoc Geogr 8

17 April 2007 Amer Assoc Geogr 9

But language evolves differently in different social groups.

Different words for the same thing

… or the same word for different things . . .

17 April 2007 Amer Assoc Geogr 10

“Cardiac arrest” A single topic, but different specialists don’t want same literature! So how to select differently?

17 April 2007 Amer Assoc Geogr 11

Linking vocabularies WHAT, WHERE, WHEN

Library subject headingsTopic – Geographic subdivision – Chronological subdivision

Place name gazetteer:Place name – Type – Spatial markers (Lat & long) – When

Time Period DirectoryPeriod name – Type – Time markers (Calendar) – Where

17 April 2007 Amer Assoc Geogr 12

Mapping diverse vocabularies“Feature types” to “Subject Headings”

National Geospatial Intelligence Agency Geographic Description Codes: -- 600+ types of physical object, e.g. School, Plateau, Dike

Library of Congress Subject Headings: >100,000 topics and combinations to form complex topics

Most GDC have comparable LCSH, ordinarily in plural. - GDC School = LCSH School buildings. LCSH School means an institution. - Ambiguity of Farm, Plantation, &c. physical / institution. - 38% LCSH same, usually plural; 61% match incl variant spellings & synonyms; 22% boader; 4% narrower; 12% problematic. - GDC weak on historic features, e.g. Ancient site. - Object / topic issues: North Dakota – Antiquities.

17 April 2007 Amer Assoc Geogr 13

Linking vocabularies WHAT, WHERE, WHEN

Library subject headingsTopic – Geographic subdivision – Chronological subdivision

Place name gazetteer:Place name – Type – Spatial markers (Lat & long) – When

Time Period DirectoryPeriod name – Type – Time markers (Calendar) – Where

Now re-align the WHAT, WHERE, and WHEN . . .

17 April 2007 Amer Assoc Geogr 14

Well-developed facet indexes include other facets.What Where When Who

WHAT (LCSH) A A A A

WHERE (Place Gazet.) M M M -

WHEN (Period dir.) M M M -

WHO (Biogr dict.) M M M MM = Mandatory; A = If Applicable

Need vertical interoperability between vocabularies, e.g. for “What” topical mapping from NGA Gazetteer Geographic Description Code “Lthse” (Lighthouse) to LCSH “Lighthouses.” and place name interoperability for “Where.” Horizontal associations occur within records.

17 April 2007 Amer Assoc Geogr 15

Linking portal with resources

Local: Relational database - - Generates pages dynamically - - Search term recommender system - - Vocabulary mapping tables - - Library of maps

Remote: - - “Federated” search e.g. Z39.50 - - Structured URLs

17 April 2007 Amer Assoc Geogr 16

Use external search engine to forward query to remote resource

Interface: Herzl, Theodor, founder of Israel, lived most of his life in Austria, 1860 to 1904

CHESHIRE Z39.50 query to Library of Congress template:

https://sherlock.sims.berkeley.edu/cgi-bin/CheshireZSearch.tcl?search=subject+______+______&target=lc&numwanted=20&format=html&recsyntax=marc

Insert namehttps://sherlock.sims.berkeley.edu/cgi-bin/CheshireZSearch.tcl?search=subject+Herzl+Theodor&target=lc&numwanted=20&format=html&recsyntax=marc

17 April 2007 Amer Assoc Geogr 17

https://sherlock.sims.berkeley.edu/cgi-bin/CheshireZSearch.tcl?search=subject+Herzl+Theodor&target=lc&numwanted=20&format=html&recsyntax=marc

17 April 2007 Amer Assoc Geogr 18

Structured URLs: templates for searching remote sitesWikipedia Template: http://en.wikipedia.org/wiki/_________

http://en.wikipedia.org/wiki/Theodor_Herzl

17 April 2007 Amer Assoc Geogr 19

Structured URLs: templates and cross-vocabulary mappings e.g. Metropolitan Museum of Art Time line Of Art History (TOAH)

11 time periods, 01-10, e.g.04 = 1,000 B.C. – 1 A.D.11 = 1900 A.D. – present

Geographical hierarchy (some variation by time period), e.g.ss = South & southeast Asia

ssa = South Asia (India, Himalayas,…)eu = Europe

euwcm = Austria, Germany, Switzerland

http://www.metmuseum.org/toah/ht/__/___/ht_____.htme.g. http://www.metmuseum.org/toah/ht/04/ssa/ht04ssa.htmhttp://www.metmuseum.org/toah/ht/11/euwcm/ht11euwcm.htm

17 April 2007 Amer Assoc Geogr 20

http://www.metmuseum.org/toah/ht/__/___/ht_____.htm Insert s04 and ssa

http://www.metmuseum.org/toah/ht/04/ssa/ht04ssa.htm

17 April 2007 Amer Assoc Geogr 21

http://www.metmuseum.org/toah/ht/__/___/ht_____.htm Insert 11 and euwcm http://www.metmuseum.org/toah/ht/11/euwcm/ht11euwcm.htm

17 April 2007 Amer Assoc Geogr 22

17 April 2007 Amer Assoc Geogr 23

Prototype “4W” search interface

17 April 2007 Amer Assoc Geogr 24

Entry Vocabulary Index suggests correct LCSH with different spelling

Buttons for searchable resources & local catalogs

Search term recommender service for LC Subject Headings

17 April 2007 Amer Assoc Geogr 25

Potentially related people

Recommender service lists statistically associated Subject Headings

17 April 2007 Amer Assoc Geogr 26

Potentially related period?

17 April 2007 Amer Assoc Geogr 27

Mostly in India 16th-18th century

17 April 2007 Amer Assoc Geogr 28

Find out more about this area.

17 April 2007 Amer Assoc Geogr 29

Different Browsing Options!

17 April 2007 Amer Assoc Geogr 30

Zooming in to South Asia

Restricting time frame

Select

17 April 2007 Amer Assoc Geogr 31

Interface generates menu page General information about the country of India…

17 April 2007 Amer Assoc Geogr 32

General information about the country of India…

WikipediaCIA Factbook

BBC Ethnologue

Berkeley Natural History Museums

17 April 2007 Amer Assoc Geogr 33

Historical events – linked to Library catalog & Wikipedia : none avail. for this time period

17 April 2007 Amer Assoc Geogr 34

ECAI Cultural Atlases: presenting history in its geographical & chronological contexts

17 April 2007 Amer Assoc Geogr 35

The Electronic Cultural Atlas InitiativeAdvancing scholarship through increased

attention to place and time.http://ecai.org

Join us at our next ECAI conferences!Moscow, Russia, May 28-June 1

Berkeley, CA, Oct 17-20.Project website: ecai.org/imls2004The “4W” portal at: ecai.org/imls4WThe “4W California” portal at: ecai.org/imls4W

[email protected]

Understanding means knowing context.