user-driven taxonomies

27
© Copyright 2008 Dow Jones and Company, Inc. User-Driven Taxonomies Christine Connors iKMS, Singapore, 13 March 2008

Upload: christine-connors

Post on 01-Nov-2014

1.929 views

Category:

Technology


1 download

DESCRIPTION

Presentation to the Information & Knowledge Management Society in Singapore, March 2008, on approaches to integrating controlled and uncontrolled vocabularies.

TRANSCRIPT

Page 1: User-Driven Taxonomies

© Copyright 2008 Dow Jones and Company, Inc.

User-Driven Taxonomies

Christine Connors

iKMS, Singapore, 13 March 2008

Page 2: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

The problem with…

Formal taxonomies High cost

Taxonomy creation experts Subject Matter Experts (SMEs) Software & Hardware Purchase & modify Consultants

Scope and timeline Implementation Maintenance Hard to sell an ROI

Page 3: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

The problem with…

Informal taxonomies Consistency, clarity, context Scope and timeline Implementation Maintenance Hard to sell an ROI

Page 4: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

The benefits of a hybrid approach

Expertise in taxonomy design User-centered language Contextual variety User-driven prioritization of knowledge modeling Grow the model faster

Guided by taxonomists to avoid chaos Distributed costs

Does require A champion Change Control Board / Taxonomy Advisory Board

Page 5: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Literary and user warrant in the enterprise

Object Repositories

Metadata Registries/Repositories

Search & BrowseMechanisms (UI)

Page 6: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

What is a folksonomy?

“People’s classification management”

Wisdom of the crowd

User-generated tags applied to digital objects

Informal, uncontrolled vocabularies

Usually Subject or Task based

Provide little to no context on their own

Page 7: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Examples

Primary examples are del.icio.us and flickr

Blogs are anothergood place to look

Page 8: User-Driven Taxonomies

© Copyright 2008 Dow Jones and Company, Inc.

Lessons learned: Hybrid methods and Social tagging pilots

Page 9: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Evolution

In the beginning… Best Bets were created by the search

administrator Search terms parsed out of the query sent

from the browser to the search engine Terms compared to manually created list of

Best Bet sites Matches were programmatically inserted

into the SERP before the #1 hit, with special formatting to highlight their existence

Intranet site owners called the search administrator to beg inclusion

Page 10: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

An early pilot

• Each resource can only be placed in one bucket, need to duplicate entries for full coverage

• Not integrated with any other system - ILMS, DMS, CMS, FS

• Administered by Research Librarians

• Rarely used!

• How do we integrate Enterprise Search, Suggested Sites, Public Bookmarks and Social Tagging?

Page 11: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Updates to enterprise search

In search, search terms are tagged to bring back certain websitesUsers call, email or submit via web-form sites they would like to see addedTaxonomy team reviewed the submission for appropriateness, accuracy of tags, uniqueness of tags Sites and associated terms are manually entered into a flat fileDuring the regular index refresh cycle the flat file is programmatically converted to XML and ingested into search

Page 12: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

2006 Social bookmarking pilot

We wanted to see *what* would happen if we “opened” up the tagging

Goal was to help our users find commonly requested information and most useful information by Tagging favorite internal websites

Maintain security by NOT posting intranet URLs to public sites like del.icio.us

Linking directly to a resource, be it internal or external Sharing and searching other user’s bookmarks Removing a bottleneck and relieving resource constraints in

a moderated hybrid system Reviewed available systems

Public sites not an option due to security considerations Connotea, Scuttle, del.irio.us, Freetag

Page 13: User-Driven Taxonomies

© Copyright 2008 Dow Jones and Company, Inc.

How can folksonomies improve discovery?

Page 14: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

As Inputs

To taxonomies, thesauri, ontologies? What folksonomy terms are popular? What synonyms can you derive? What relationships can you identify? What entity types are you discovering?

To search Identify Best Bets As inputs to a recommendation engine

To the content management strategy What do they tell you about how your content is

perceived? What do they tell you about how your content is used? Do they tell you when your users go elsewhere for

their content needs?

Page 15: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

User driven

Enables user warrant Useful for understanding users – how do they think

about the objects you are providing to them? Allows the users to find things their own way, rather

than forcing them to do it the site’s way

Improves user experience Combine with search and web logs to

Improve navigation Improve browse mechanisms Improve search Identify content gaps Prioritize content and UI related tasks

Page 16: User-Driven Taxonomies

© Copyright 2008 Dow Jones and Company, Inc.

How can you implement folksonomy tools?

Page 17: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Sample Pure methods

Install a tagging tool Tools similar to del.icio.us

Connotea Scuttle ConnectBeam Semantic applications such as Annotea (W3C) or

semantic blogging tools Modules for blogs/CMSs, examples:

Taxonomy modules for Drupal Tagging system in Wordpress or Typepad Extensions for MediaWiki

Make sure you review the reports available in the tools you consider Can you get actionable data?

Page 18: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Sample implementations of social/hybrid methods

Best bets Allow users to submit sites, along with keywords, to

improve search results

File properties / repository check-in form Encourage (or require!) that users

fill out the properties of the files they create, using any terms they deem appropriate

Automate whenever possible

Page 19: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Commercial Example

Buzzillions.com Combines formal taxonomy with folksonomy terms to

guide users to the products right for them

Page 20: User-Driven Taxonomies

© Copyright 2008 Dow Jones and Company, Inc.

Thank you!

Christine ConnorsGlobal Director, Semantic Technology SolutionsDow Jones & [email protected]

Page 21: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Announcing Synaptica 7.0!

Synaptica 7.0 provides standardized, Semantic Web-enabled tools to manage your global business vocabulary in order to add structure and value to existing information assets, improve the online user experience and connect professionals in your organization with the information they need, where and when they need it.

Customer Benefits

Easy configuration Scalable for the enterprise with multi-user permissions Customizable and flexible with audience-centric views Supports collaboration and workgroups Standards based, semantic Web enabled Multiple data formats (HTML,XML,etc.) API level access for simple integration

http://solutions.dowjones.com/djcs/index.asp

Page 22: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Synaptica’s new side by side relationship editor makes the creation and editing of terms a one step process.

Easily find and edit a key term and multiple related terms

Page 23: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Synaptica drag and drop hierarchical relationship editing provides a simple, convenient way to manage vocabulary hierarchies.

Easily Manage and edit vocabulary hierarchies

Page 24: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Term Information Summary Window provides quick views of term details

Gain quick views of term information without leaving current interface

Page 25: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

In addition to CSV, HTML and XML formats, reports may be created in Microsoft Word and Excel.

Expanded Reporting Functionality for Easier, More Flexible Information Sharing

Page 26: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Synaptica User and Administrative Guides are now available online directly from the application to browse and search

Quickly and easily access Help right from the application

Page 27: User-Driven Taxonomies

Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.

Dow Jones Client Solutions Offers Comprehensive, Business Taxonomy Solutions for Fast, Relevant Information Retrieval

Industry-focused

integrated solutions

Build & CustomizeTo Suit Your

Information Needs

Stay Informed with

Taxonomywarehouse.com

Optimize and ManageWith Synaptica

License & IntegrateIndustry-focused

Taxonomies