Why Are Taxonomies Necessary?

© 2007 by ContextualAnalysis, LLC Why Are Taxonomies Necessary? By Fred Leise ContextualAnalysis, LLC

Upload: fred-leise

Post on 12-May-2015

DESCRIPTION

Introduces basic information about what taxonomies (controlled vocabularies) are and why they are important for information finding.

TRANSCRIPT

Page 1: Why Are Taxonomies Necessary?

© 2007 by ContextualAnalysis, LLC

Why Are Taxonomies Necessary?

By Fred Leise

ContextualAnalysis, LLC

Page 2: Why Are Taxonomies Necessary?

Taxonomies are sets of terms (controlled vocabularies or CVs) used to tag documents or other content objects.

Taxonomies may also be used as browsing hierarchies or for search enhancement.

What Are Taxonomies?

Page 3: Why Are Taxonomies Necessary?

Taxonomy terms are collected into groups called attributes. Each attribute (or facet) describes one property of your content.

What Are Taxonomies?

Page 4: Why Are Taxonomies Necessary?

Example:

Attribute: Office Location

Terms:

London

New York City (alternate terms: NYC, Big Apple)

Washington, DC

What Are Taxonomies?

Page 5: Why Are Taxonomies Necessary?

In this example, “NYC” and “Big Apple” are given as variants for “New York City.”

Variant terms are used to expand search queries. If a user enters “New York City,” the search system expands the query to “New York City OR NYC OR Big Apple.”

What Are Taxonomies?

Page 6: Why Are Taxonomies Necessary?

Search query expansion ensures that more relevant information is found, even though it might use terms the searcher hasn’t thought of.

What Are Taxonomies?
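The query expansion described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the dictionary layout, the `expand_query` name, and the quoted OR syntax are hypothetical and do not reflect any particular search engine.

```python
# Hypothetical controlled vocabulary for the Office Location attribute:
# each preferred term maps to its variant (alternate) terms.
OFFICE_LOCATION = {
    "London": [],
    "New York City": ["NYC", "Big Apple"],
    "Washington, DC": [],
}


def expand_query(query, vocabulary):
    """Return an OR-query covering the matched term and all of its variants."""
    wanted = query.strip().lower()
    for preferred, variants in vocabulary.items():
        names = [preferred, *variants]
        if wanted in (name.lower() for name in names):
            return " OR ".join(f'"{name}"' for name in names)
    # Terms that are not in the vocabulary pass through unchanged.
    return f'"{query.strip()}"'


print(expand_query("NYC", OFFICE_LOCATION))
# "New York City" OR "NYC" OR "Big Apple"
```

The searcher types whichever form comes to mind, and the vocabulary supplies the forms the searcher did not think of.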

Page 7: Why Are Taxonomies Necessary?

Other typical attributes include:

Author

Creation Date

Audience

Version Number

Subject

What Are Taxonomies?

Page 8: Why Are Taxonomies Necessary?

There is an international standard for metadata, the Dublin Core Metadata Element Set, consisting of 15 attributes.

What Are Taxonomies?

Page 9: Why Are Taxonomies Necessary?

Good metadata schemas (collections of attributes) will adhere as closely as possible to the Dublin Core standard.

More information is available at: www.dublincore.org

What Are Taxonomies?

Page 10: Why Are Taxonomies Necessary?

Well-designed taxonomies:

1. Enable users to find relevant information quickly and efficiently (improved retrieval)

2. Lead users to additional relevant information, providing upselling and cross-selling opportunities

What Are Taxonomies?

Page 11: Why Are Taxonomies Necessary?

Well-designed taxonomies:

3. Assist authors in consistently tagging content

What Are Taxonomies?

Page 12: Why Are Taxonomies Necessary?

Proper use of taxonomies results in:

Less time wasted searching for information

Fewer failed searches

Fewer abandoned interactions

Increased income

Reduced customer assistance costs

What Are Taxonomies?

Page 13: Why Are Taxonomies Necessary?

English is rich in words that mean the same or nearly the same thing:

feline/cat

car/automobile

travel/journey/excursion/trip

jeans/denims/Levi's/501s

Why Are Taxonomies Important?

Page 14: Why Are Taxonomies Necessary?

Result: scattering of information.

No matter what term you use in a free-text search, you get only part of the relevant information.

The rest is not retrieved because it uses different terms to describe the same concept.

Why Are Taxonomies Important?

Page 15: Why Are Taxonomies Necessary?

Consider the example of mobile devices.

There are many ways that users can refer to them:

Personal digital assistants

Handheld computers

Blackberries

PDAs

Why Are Taxonomies Important?

Page 16: Why Are Taxonomies Necessary?

If users don’t know the term you use to label the information they are looking for, they waste time browsing or give up their search completely.

They are victims of a communication chasm.

Why Are Taxonomies Important?

Page 17: Why Are Taxonomies Necessary?

You use the term “cat.” I use “feline.” If we each search a recipe database that uses both terms with equal frequency, we will each get back only half the appropriate recipes, a recall ratio of 50%.

Why Are Taxonomies Important?

Page 18: Why Are Taxonomies Necessary?

Solution: Add a controlled vocabulary to the search system that gives “feline” and “cat” as equivalent terms.

Search queries will be expanded appropriately.

Why Are Taxonomies Important?
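Recall is the number of relevant items retrieved divided by the total number of relevant items. As a rough sketch, the toy search below shows recall moving from 50% to 100% once “cat” and “feline” are treated as equivalent. The four documents and every name in the code are invented for illustration.

```python
# Invented collection: two documents say "cat", two say "feline".
DOCS = {
    1: "grooming your cat",
    2: "feline nutrition basics",
    3: "cat behavior explained",
    4: "feline dental care",
}

# Hypothetical controlled vocabulary of equivalent terms.
EQUIVALENTS = {"cat": {"cat", "feline"}, "feline": {"cat", "feline"}}


def search(term, expand=False):
    """Return ids of documents containing the term (or any equivalent term)."""
    terms = EQUIVALENTS.get(term, {term}) if expand else {term}
    return {doc_id for doc_id, text in DOCS.items() if terms & set(text.split())}


def recall(retrieved, relevant):
    """Relevant items retrieved divided by all relevant items."""
    return len(retrieved & relevant) / len(relevant)


relevant = set(DOCS)  # every document in this toy collection is about cats
print(recall(search("cat"), relevant))               # 0.5
print(recall(search("cat", expand=True), relevant))  # 1.0
```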

Page 19: Why Are Taxonomies Necessary?

English is rich in words that have more than one disparate meaning:

Pitch

To throw a baseball

A tar-like substance

A salesman’s monologue

Why Are Taxonomies Important?

Page 20: Why Are Taxonomies Necessary?

Bank

Where you store money

The side of a river

To carom a cue ball off a pool table rail

To prepare a fire for the night

To maneuver a plane for a turn

Why Are Taxonomies Important?

Page 21: Why Are Taxonomies Necessary?

Result: Lots of false drops (irrelevant information), resulting in poor precision.

Why Are Taxonomies Important?

Page 22: Why Are Taxonomies Necessary?

Solution: use a CV that includes scope notes (definitions) or that uses facets.

Example: Think about searching for the term “Rembrandt.” You might get the following results.

Why Are Taxonomies Important?

Page 23: Why Are Taxonomies Necessary?

Why Are Taxonomies Important?

Search: Rembrandt [Go]

The painter Rembrandt was one of the greatest of all the Dutch realists….

If you want to whiten and brighten your teeth, there is no better brand than Rembrandt.

Page 24: Why Are Taxonomies Necessary?

Why Are Taxonomies Important?

You probably are interested in only one of these “Rembrandts.” So half of your search results are irrelevant.

Now consider what happens if you are able to specify the type of object you are looking for, either an artist or a toothpaste brand.

Page 25: Why Are Taxonomies Necessary?

Why Are Taxonomies Important?

Artist: Rembrandt

The painter Rembrandt was one of the greatest of all the Dutch realists….

Brand Name: Rembrandt

If you want to whiten and brighten your teeth, there is no better brand than Rembrandt.

Page 26: Why Are Taxonomies Necessary?

Why Are Taxonomies Important?

You get only results relevant to what you are interested in.

Here, having search boxes identified by attribute (faceted searching) lets you home in quickly on the particular information you want.
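The Rembrandt example can be sketched as follows. The two items reuse the text from the slides, but the tag layout and the function names are assumptions made for illustration. Precision is the number of relevant items retrieved divided by the total number of items retrieved.

```python
# The two search results from the slides, each tagged with one attribute.
ITEMS = [
    {"text": "The painter Rembrandt was one of the greatest of all the Dutch realists.",
     "tags": {"Artist": "Rembrandt"}},
    {"text": "If you want to whiten and brighten your teeth, "
             "there is no better brand than Rembrandt.",
     "tags": {"Brand Name": "Rembrandt"}},
]


def free_text_search(word):
    """Match the word anywhere in the text, whatever it refers to."""
    return [item for item in ITEMS if word.lower() in item["text"].lower()]


def faceted_search(attribute, term):
    """Match only items tagged with the term under the given attribute."""
    return [item for item in ITEMS if item["tags"].get(attribute) == term]


def precision(retrieved, relevant):
    """Relevant items retrieved divided by all items retrieved."""
    if not retrieved:
        return 0.0
    return sum(1 for item in retrieved if item in relevant) / len(retrieved)


wanted = [ITEMS[0]]  # the searcher is interested in the painter only
print(precision(free_text_search("Rembrandt"), wanted))         # 0.5
print(precision(faceted_search("Artist", "Rembrandt"), wanted))  # 1.0
```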

Page 27: Why Are Taxonomies Necessary?

Why Are Taxonomies Important?

You could also use one search and let users filter or narrow results after their search.

Page 28: Why Are Taxonomies Necessary?

Page 29: Why Are Taxonomies Necessary?

Roles for Taxonomies

Tagging documents for a content management system

Provides administrative metadata to control authoring and publishing processes

How are Taxonomies Used?

Page 30: Why Are Taxonomies Necessary?

Roles for Taxonomies

Administrative metadata: example

Document #

Author

Department

Creation date

Publication date

Expiration date

How are Taxonomies Used?

Page 31: Why Are Taxonomies Necessary?

Roles for Taxonomies

Tagging document contents for a content management system

Provides metadata to support search

Ensures inter-indexer consistency

How are Taxonomies Used?

Page 32: Why Are Taxonomies Necessary?

Roles for Taxonomies

Tagging document contents for a content management system

Controls subject scattering

Increases search results relevance: tags “aboutness,” not just mentions of a word

How are Taxonomies Used?

Page 33: Why Are Taxonomies Necessary?

Roles for Taxonomies

Search engine component

Translates user’s terms into those used to tag items (increases precision and recall)

Offers options for expanding or reducing scope of search using broader or narrower terms

How are Taxonomies Used?

Page 34: Why Are Taxonomies Necessary?

Roles for Taxonomies

Search engine component

Differentiates between multiple meanings of terms

How are Taxonomies Used?

Page 35: Why Are Taxonomies Necessary?

Taxonomy Use: Search Results

rei.com

Page 36: Why Are Taxonomies Necessary?

Roles for Taxonomies

Operating as a browsing hierarchy

Organizes content using taxonomy terms as category labels

Represents taxonomy hierarchy by browsing levels

How are Taxonomies Used?

Page 37: Why Are Taxonomies Necessary?

[Screenshot of rei.com, with browsing Levels 1 through 4 labeled]

Page 38: Why Are Taxonomies Necessary?

Synonym Ring

Identifies words with equivalent meanings (in a given context)

rock = stone

CD-ROM = CD = disk

money = dough = bucks = greenbacks = legal tender

Types of Taxonomies

Page 39: Why Are Taxonomies Necessary?

Synonym Ring

When one of the words in a synonym ring is searched for, the search engine expands the search and returns items containing any of the words in the ring.

Types of Taxonomies

Page 40: Why Are Taxonomies Necessary?

Authority File

Has all the features of a synonym ring, plus the identification of preferred terms (approved terms/descriptors/keywords) for tagging content.

Types of Taxonomies
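One way to picture an authority file is as a lookup from every known form of a term to its single preferred term, so that taggers always apply the same label. The sketch below assumes the Office Location terms used earlier in the deck. The names `AUTHORITY` and `normalize_tag` are hypothetical.

```python
# Hypothetical authority file: preferred term -> variant terms.
AUTHORITY = {
    "London": [],
    "New York City": ["NYC", "Big Apple"],
    "Washington, DC": [],
}

# Reverse lookup from any known form (lowercased) to the preferred term.
LOOKUP = {}
for preferred, variants in AUTHORITY.items():
    for name in [preferred, *variants]:
        LOOKUP[name.lower()] = preferred


def normalize_tag(raw):
    """Return the preferred term for a tag, or None if it is not in the vocabulary."""
    return LOOKUP.get(raw.strip().lower())


print(normalize_tag("big apple"))  # New York City
print(normalize_tag("Paris"))      # None
```

Returning `None` for unknown terms is a design choice in this sketch: it lets a tagging tool reject or flag labels that are not approved rather than silently accepting them.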

Page 41: Why Are Taxonomies Necessary?

Taxonomy

Also called hierarchy or classification.

All features of authority files, plus the broader term (BT) and narrower term (NT) relationships.

Types of Taxonomies

Page 42: Why Are Taxonomies Necessary?

Taxonomy

All terms must be part of a hierarchical relationship (no orphan terms).

Taxonomies may be presented in hierarchical or alphabetical format.

Types of Taxonomies

Page 43: Why Are Taxonomies Necessary?

total compensation
. compensation
. . base salary (salary)
. . deferred payments (deferred compensation)
. . variable pay
. benefits
. . 401(k) plan
. . health benefits
. . . dental plan
. . . disability insurance

Types of Taxonomies: Taxonomy Example
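As a rough sketch, the compensation hierarchy can be loaded from its dotted outline into a map from each term to its broader term (BT), from which narrower terms (NT) follow. The parser and function names are assumptions for illustration, and the parenthetical variant terms are left out to keep the example short.

```python
# The example hierarchy, one term per line; each leading "." is one level of depth.
OUTLINE = """\
total compensation
. compensation
. . base salary
. . deferred payments
. . variable pay
. benefits
. . 401(k) plan
. . health benefits
. . . dental plan
. . . disability insurance
"""


def parse_outline(text):
    """Return {term: broader term}; the top term maps to None."""
    broader = {}
    stack = []  # stack[d] holds the most recent term seen at depth d
    for line in text.splitlines():
        parts = line.split()
        if not parts:
            continue
        depth = 0
        while depth < len(parts) and parts[depth] == ".":
            depth += 1
        term = " ".join(parts[depth:])
        del stack[depth:]  # drop terms at this depth or deeper
        broader[term] = stack[-1] if stack else None
        stack.append(term)
    return broader


def narrower_terms(term, broader):
    """Return every term below the given term, at any depth."""
    found = [t for t, bt in broader.items() if bt == term]
    for child in list(found):
        found.extend(narrower_terms(child, broader))
    return found


BT = parse_outline(OUTLINE)
print(BT["dental plan"])               # health benefits
print(narrower_terms("benefits", BT))
# ['401(k) plan', 'health benefits', 'dental plan', 'disability insurance']
```

A search system could use `narrower_terms` to widen a search on “benefits” to everything beneath it, and the BT map to check that only the top term lacks a broader term (no orphan terms).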

Page 44: Why Are Taxonomies Necessary?

Thesaurus

Plural form: thesauri

All the features of taxonomies, plus the associative relationship of related terms (RT)

Types of Taxonomies

Page 45: Why Are Taxonomies Necessary?

Types of Taxonomies: Thesaurus Example, Alphabetical

Building Permits
  BT Permits

Business Licenses
  BT Licenses

Business Taxes
  BT Taxes

Fees
  RT Taxes

Licenses
  NT Business Licenses
  RT Permits

Operating Permits
  BT Permits

Permits
  NT Building Permits; Operating Permits
  RT Licenses

Taxes
  NT Business Taxes
  RT Fees
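Thesaurus relationships come in reciprocal pairs: if A has BT B, then B should have NT A, and RT should hold in both directions. The sketch below stores the alphabetical example as a dictionary and checks that every relationship has its reciprocal. The data layout and the `missing_reciprocals` function are assumptions for illustration.

```python
# The alphabetical thesaurus example, keyed by term and relationship type.
THESAURUS = {
    "Building Permits": {"BT": ["Permits"]},
    "Business Licenses": {"BT": ["Licenses"]},
    "Business Taxes": {"BT": ["Taxes"]},
    "Fees": {"RT": ["Taxes"]},
    "Licenses": {"NT": ["Business Licenses"], "RT": ["Permits"]},
    "Operating Permits": {"BT": ["Permits"]},
    "Permits": {"NT": ["Building Permits", "Operating Permits"], "RT": ["Licenses"]},
    "Taxes": {"NT": ["Business Taxes"], "RT": ["Fees"]},
}

# Each relationship type and the type expected on the other term.
INVERSE = {"BT": "NT", "NT": "BT", "RT": "RT"}


def missing_reciprocals(thesaurus):
    """Return (term, relationship, target) triples that lack a reciprocal entry."""
    problems = []
    for term, relations in thesaurus.items():
        for rel, targets in relations.items():
            for target in targets:
                back = thesaurus.get(target, {}).get(INVERSE[rel], [])
                if term not in back:
                    problems.append((term, rel, target))
    return problems


print(missing_reciprocals(THESAURUS))  # []
```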

Page 46: Why Are Taxonomies Necessary?

Types of Taxonomies: Thesaurus Example, Hierarchical

Vocabulary Terms              Related Terms
Licenses, Permits & Taxes
. Fees                        Taxes
. Licenses                    Permits
. . Business Licenses
. Permits                     Licenses
. . Building Permits
. . Operating Permits
. Taxes                       Fees
. . Business Taxes

Page 47: Why Are Taxonomies Necessary?

Synonym Ring

+ preferred terms

= Authority File

+ broader/narrower terms

= Taxonomy

+ related terms

= Thesaurus

Types of Taxonomies—Summary

Page 48: Why Are Taxonomies Necessary?

Facets are fundamental categories by which an object or concept may be described.

Example: some facets describing a toy ball:

size, weight, shape, color, texture, material

Taxonomies and Facets

Page 49: Why Are Taxonomies Necessary?

Uses of Facets: Browsing Hierarchies

Facets allow users to follow the path best matching the way they think (their mental model).

Taxonomies and Facets

Page 50: Why Are Taxonomies Necessary?

Uses of Facets: Browsing Hierarchies

Example: epicurious.com > recipes > browse

Main ingredient Cuisine Preparation method Season/occasion Course/dish

Taxonomies and Facets
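Faceted browsing can be pictured as one index per facet over the same tagged items, so a user can start from whichever facet matches how they think. The sketch below uses three of the facet names listed above. The recipes, their tags, and the function name are invented for illustration and are not taken from epicurious.com.

```python
from collections import defaultdict

# Invented recipes, each tagged with a term for three facets.
RECIPES = [
    {"title": "Grilled salmon", "Main ingredient": "Fish",
     "Cuisine": "American", "Preparation method": "Grill"},
    {"title": "Spaghetti alle vongole", "Main ingredient": "Shellfish",
     "Cuisine": "Italian", "Preparation method": "Saute"},
    {"title": "Baked cod", "Main ingredient": "Fish",
     "Cuisine": "Italian", "Preparation method": "Bake"},
]
FACETS = ["Main ingredient", "Cuisine", "Preparation method"]


def build_browse_index(items, facets):
    """Return {facet: {term: [titles]}} so each facet is its own browsing path."""
    index = {facet: defaultdict(list) for facet in facets}
    for item in items:
        for facet in facets:
            if facet in item:
                index[facet][item[facet]].append(item["title"])
    return index


index = build_browse_index(RECIPES, FACETS)
print(index["Main ingredient"]["Fish"])  # ['Grilled salmon', 'Baked cod']
print(index["Cuisine"]["Italian"])       # ['Spaghetti alle vongole', 'Baked cod']
```

The same recipe is reachable by ingredient or by cuisine, which is the point of facets: no single hierarchy has to be the only way in.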

Page 51: Why Are Taxonomies Necessary?

Taxonomies and Facets

epicurious.com

Page 52: Why Are Taxonomies Necessary?

Uses of Facets: Fielded Search

Allows for greater specificity, thus increasing search precision.

But this is usually more complicated for users than simple searching, so it is often introduced as an option on the results page.

Taxonomies and Facets

Page 53: Why Are Taxonomies Necessary?

alibris.com Advanced Search

Page 54: Why Are Taxonomies Necessary?

epicurious.com Advanced Search

Page 55: Why Are Taxonomies Necessary?

Requirements for Browsing/Search Facets

Development of metadata schema

Development of appropriate controlled vocabularies

Proper content tagging

Taxonomies and Facets

Page 56: Why Are Taxonomies Necessary?

Aitchison, Jean. Thesaurus Construction and Use: A Practical Manual. 4th ed. Chicago: Fitzroy Dearborn Publishers

Resources

Page 57: Why Are Taxonomies Necessary?

Resources

International standard for metadata: Dublin Core Metadata Element Set (ISO 15836:2003)

http://www.niso.org/international/SC4/n515.pdf

Page 58: Why Are Taxonomies Necessary?

National Information Standards Organization. ANSI/NISO Z39.19:1993. Guidelines for the Construction, Format and Management of Monolingual Thesauri. Bethesda, MD: NISO Press, 1994

Rosenfeld, Lou, and Peter Morville. Information Architecture for the World Wide Web: Designing Large-Scale Websites. 3d ed. O’Reilly Publishers, 2006.

Resources

Page 59: Why Are Taxonomies Necessary?

Sinha, Rashmi. Beyond Cardsorting: Free-listing Methods to Explore User Categorizations.

Available at: http://www.boxesandarrows.com/archives/beyond_cardsorting_freelisting_methods_to_explore_user_categorizations.php

Steckel, Mike, Karl Fast and Fred Leise. Creating a Controlled Vocabulary. 2002.

Available at: http://www.boxesandarrows.com/archives/creating_a_controlled_vocabulary.php

Resources

Page 60: Why Are Taxonomies Necessary?

Contact Information

Fred Leise

www.contextualanalysis.com

@ChicagoIndexer