© 2001-2007 franz j. kurfess knowledge organization 1 cpe/csc 581: knowledge management dr. franz...

72
© 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

Post on 20-Dec-2015

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 1

CPE/CSC 581: Knowledge Management

CPE/CSC 581: Knowledge Management

Dr. Franz J. Kurfess

Computer Science Department

Cal Poly

Page 2: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 2

Course OverviewCourse Overview Introduction Knowledge Processing

Knowledge Acquisition, Representation and Manipulation

Knowledge Organization Classification, Categorization Ontologies, Taxonomies, Thesauri

Knowledge Retrieval Information Retrieval Knowledge Navigation

Knowledge Presentation Knowledge Visualization

Knowledge Exchange Knowledge Capture, Transfer,

and Distribution Usage of Knowledge

Access Patterns, User Feedback

Knowledge Management Techniques Topic Maps, Agents

Knowledge Management Tools

Knowledge Management in Organizations

Page 3: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 3

Overview Knowledge Organization

Overview Knowledge Organization

Motivation Objectives Chapter Introduction

Review of relevant concepts Overview new topics Terminology

Identification of Knowledge Object Selection Naming and Description

Categorization Feature-based Categorization Hierarchical Categorization

Knowledge Organization Frameworks Dublin Core Resource Description

Framework Topic Maps

Case Studies Northern Light EPA TRS Getty Vocabularies

Important Concepts and Terms

Chapter Summary

Page 4: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 4

LogisticsLogistics Introductions Course Materials

handouts lecture notes

Web page readings

Term Project description deliverables e-group account roles in teams

Homework Assignments description assignment 1

Page 5: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 5

Bridge-InBridge-InHow do you organize your knowledge?

brain paper computer

Page 6: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 7

MotivationMotivationeffective utilization of knowledge depends critically

on its organization quick access identification of relevant knowledge assessment of available knowledge

source, reliability, applicability

knowledge organization is a difficult task, and requires complementary skills expertise in the domain knowledge organization skills

librarians

Page 7: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 8

ObjectivesObjectivesbe able to identify the main aspects dealing with the

organization of knowledgeunderstand knowledge organization methodsapply the capabilities of computers to support

knowledge organizationpractice knowledge organization on small bodies of

knowledgeevaluate frameworks and systems for knowledge

organization

Page 8: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 10

Identification of Knowledge

Identification of Knowledge

Object SelectionNaming and Description

Page 9: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 11

Object SelectionObject Selectionwhat constitutes a “knowledge object” that is relevant

for a particular task or topic physical object, document, concept

how can this object be made available in the systemexample: library

is it worth while to add an object to the library’s collection if so, how can it be integrated

physical document: book, magazine, report, etc. digital document: file, data base, Web page, etc.

Page 10: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 12

Naming and DescriptionNaming and Descriptionnames serve two important roles

identification ideally, a unique descriptor that allows the unambiguous selection

of the object often an ambiguous descriptor that requires context information

location especially in digital systems, names are used as “address” for an

object

names, descriptions and relationships to related objects are specified in listings dictionary, glossary, thesaurus, ontology, index

Page 11: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 13

Naming and Description Devices

Naming and Description Devices

type dictionary glossary thesaurus ontology index

issues arrangement of terms

alphabetical, hierarchical

purpose explanation, unique identifier, clarification of relationships to other terms,

access to further information

Page 12: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 14

DictionaryDictionarylist of words together with a short explanation of their

meanings, or their translations into another languagehelpful for the identification of knowledge objects, and

their distinction from related oneseach entry in a dictionary may be considered an atomic

knowledge object, with the word as name and “entry point” may provide cross-references to related knowledge objects

straightforward implementation in digital systems, and easy to integrate into knowledge management systems

Page 13: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 15

GlossaryGlossarylist of words, expressions, or technical terms with an

explanation of their meanings usually restricted to a particular book, document, activity,

or topic

provides a clarification of the intended meaning for knowledge objects

otherwise similar to dictionary

Page 14: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 16

ThesaurusThesauruscollection of synonyms (word sets with identical or

similar meanings) frequently includes words that are related in some other

way, e.g. antonyms (opposite meanings), homonyms (same pronounciation or spelling)

identifies and clarifies relationships between words not so much an explanation of their meanings

may be used to expand search queries in order to find relevant documents that may not contain a particular word

Page 15: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 17

Thesaurus TypesThesaurus Typesknowledge-basedlinguisticstatistical

[Liddy 2000]

Page 16: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 18

Knowledge-based ThesaurusKnowledge-based Thesaurus manually constructed for a specific domain intended for human indexers and searchers contains

synonyms (“use for” UF) more general (“broader term” BT) more specific (“narrower” NT) otherwise associated words (“related term” RT)

example: “data base management systems” UF data bases BT file organization, management information systems NT relational databases RT data base theory, decision support systems

[Liddy 2000]

Page 17: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 19

Linguistic ThesaurusLinguistic Thesauruscontains explicit concept hierarchies of several

increasingly specified levelswords in a group are assumed to be (near-)

synonymous selection of the right sense for terms can be difficult

examples: Roget’s, WordNetoften used for query expansion

synonyms (similar terms) hyponyms (more specific terms; subclass) hypernyms (more general terms; super-class)

[Liddy 2000]

Page 18: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 20

Example 1: Linguistic Thesaurus

Example 1: Linguistic Thesaurus

AbstractRelations

Space Physics Matter SensationIntellectVilitionAffections

The World

Sensationin General

Touch Taste Smell Sight Hearing

OdorFragranceStench Odorless

.1 .9.8.2 .3 .4 .5 .7.6

Incense; joss stick;pastille; frankincense or olibanum; agallock or aloeswood; calambac

[Liddy 2000]

Page 19: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 21

[Liddy 2000]

Example 2: Linguistic Thesaurus

Example 2: Linguistic Thesaurus

Page 20: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 22

Query Expansion in Search Engines

Query Expansion in Search Engines

look up each word in Word Net if the word is found, the set of synonyms from all Synsets are

added to the query representation weigh each added word as 0.8 rather than 1.0 results better than plain SMART

variable performance over queries major cause of error: the use of ambiguous words’ Synsets

general thesauri such as Roget’s or WordNet have not been shown conclusively to improve results may sacrifice precision to recall not domain specific not sense disambiguated

[Liddy 2000, Voorhees 1993]

Page 21: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 23

Statistical ThesaurusStatistical Thesaurus automatic thesaurus construction

classes of terms produced are not necessarily synonymous, nor broader, nor narrower

rather, words that tend to co-occur with head term effectiveness varies considerably depending on technique used

[Liddy 2000]

Page 22: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 24

Automatic Thesaurus Construction (Salton)Automatic Thesaurus

Construction (Salton) document collection based

based on index term similarities compute vector similarities for each pair of documents if sufficiently similar, create a thesaurus entry for each term which

includes terms from similar document

[Liddy 2000]

Page 23: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 25

Sample Automatic Thesaurus EntriesSample Automatic Thesaurus Entries

408 dislocation 411 coercive

junction demagnetize

minority-carrier flux-leakage

point contact hysteresis

recombine induct

transition insensitive

409 blast-cooled magnetoresistance

heat-flow square-loop

heat-transfer threshold

410 anneal 412 longitudinal

strain transverse

[Liddy 2000]

Page 24: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 26

Dynamic Automatic Thesaurus Construction

Dynamic Automatic Thesaurus Construction

thesaurus short-cut run at query time take all terms in the query into consideration at once look at frequent words and phrases in the top retrieved documents and

add these to the query

= automatic relevance feedback

[Liddy 2000]

Page 25: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 27

Expansion by Association Thesaurus

Expansion by Association Thesaurus

Query: Impact of the 1986 Immigration LawPhrases retrieved by association in corpus

- illegal immigration - statutes- amnesty program - applicability- immigration reform law- seeking amnesty- editorial page article - legal status- naturalization service - immigration act- civil fines - undocumented workers- new immigration law - guest worker- legal immigration - sweeping immigration law- employer sanctions - undocumented aliens

[Liddy 2000]

Page 26: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 28

IndexIndexlisting of words that appear in a (set of) documents,

together with pointers to the locations where they appear

provides a reference to further information concerning a particular word or concept

constitutes the basis for computer-based search engines

Page 27: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 29

IndexingIndexingthe process of creating an index from a set of

documents one of the core issues in Information Retrieval

manual indexing controlled vocabularies, humans go through the documents

semi-automatic humans are in control, machines are used for some tasks

automatic statistical indexing natural-language based indexing

Page 28: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 30[Liddy 2000]

NLP-based IndexingNLP-based Indexingthe computational process of identifying, selecting,

and extracting useful information from massive volumes of textual data for potential review by indexers stand-alone representation of content using Natural Language Processing

Page 29: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 31[Liddy 2000]

Natural Language Processing

Natural Language Processing

a range of computational techniques for analyzing and representing naturally occurring texts at one or more levels of linguistic analysis for the purpose of achieving human-like language

processing for a range of tasks or applications

Page 30: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 32

Levels of Language Understanding

[Liddy 2000]

Morphological

Lexical

PragmaticDiscourse

Semantic Syntactic

Page 31: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 33[Liddy 2000]

What can NLP Indexing do?What can NLP Indexing do? phrase recognition disambiguation concept expansion

Page 32: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 34

Knowledge Organization Example

Knowledge Organization Example

Semantic Annotation

XML textMulti -Media

Deep Web

Web Services

Semantic Content

Semantic

Indexing /Integration

Indexing

Semantic Retrieval

Reason

Learn

annotation

Efficient

indexing engine

Learn

Reasoning rule

Semantic

Retrieval

Ontology base

Ontology base

Learn Mapping/

Link

Semantic Crawler

texttextWWW

??

Active LearningMachine Learning : BC ,

NN, GA, ARMulti -view Learning Multi -view detection

Application

Make it efficient in

domain

Active

learning driven OM /

OL

Ontology based

Summary

http://keg.cs.tsinghua.edu.cn/persons/tj/Reports/Pswmp-Jie-Tang.ppt

Page 33: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 35

^

Communication Principle

Communication Principle

ReferentFormStands for

refers toevokes

Concept

“Jaguar“

[Odwen, Richards, 1923]

[Hotho, Sure, 2003]

Page 34: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 36

Views on OntologiesViews on OntologiesFront-End

Back-End

TopicMaps

Extended ER-Models

Thesauri

Predicate Logic

Semantic Networks

Taxonomies

Ontologies

Navigation

Queries

Sharing of Knowledge

Information Retrieval

Query Expansion

MediationReasoning

Consistency CheckingEAI

[Hotho, Sure, 2003]

Page 35: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 37

Object

Person Topic Document

ResearcherStudent Semantics

OntologyDoctoral Student

Taxonomy := Segementation, classification and ordering of elements into a classification system according to their relationships between each other

PhD Student F-Logic

Menu

[Hotho, Sure, 2003]

TaxonomyTaxonomy

Page 36: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 38

Object

Person Topic Document

ResearcherStudent Semantics

PhD StudentDoktoral Student

• Terminology for specific domain• Graph with primitives, 2 fixed relationships (similar, synonym) • originate from bibliography

similarsynonym

OntologyF-Logic

Menu

ThesaurusThesaurus

[Hotho, Sure, 2003]

Page 37: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 39

Object

Person Topic Document

ResearcherStudent Semantics

PhD StudentDoktoral Student

knows described_in

writes

AffiliationTel

• Topics (nodes), relationships and occurences (to documents)• ISO-Standard• typically for navigation- and visualisation

OntologyF-Logic

similarsynonym

Menu

Topic MapTopic Map

[Hotho, Sure, 2003]

Page 38: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 40

OntologyF-Logic

similar

PhD StudentDoktoral Student

Object

Person Topic Document

Tel

Semantics

knows described_in

writes

Affiliationdescribed_in is_about

knowsP writes D is_about T P T

DT T D

Rules

subTopicOf

• Representation Language: Predicate Logic (F-Logic)• Standards: RDF(S); coming up standard: OWL

ResearcherStudent

instance_of

is_a

is_a

is_a

Affiliation

Affiliation

York Sure

AIFB+49 721 608 6592

OntologyOntology

[Hotho, Sure, 2003]

Page 39: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 41

PhD StudentPhD Student AssProfAssProf

AcademicStaffAcademicStaff

rdfs:subClassOfrdfs:subClassOf

cooperate_withcooperate_with

rdfs:rangerdfs:domainOntology

<swrc:AssProf rdf:ID="sst"> <swrc:name>Steffen Staab </swrc:name>...</swrc:AssProf>

http://www.aifb.uni-karlsruhe.de/WBS/sst

Anno- tation

<swrc:PhD_Student rdf:ID="sha"> <swrc:name>Siegfried Handschuh</swrc:name>

...</swrc:PhD_Student>

Web Page

http://www.aifb.uni-karlsruhe.de/WBS/shaURL

<swrc:cooperate_with rdf:resource = "http://www.aifb.uni-karlsruhe.de/WBS/sst#sst"/>

instance ofinstance of

Cooperate_with

Ontology & MetadataOntology & Metadata

Links have explicit meanings!

Page 40: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 42

Example: OntoWeb.orgExample: OntoWeb.org

Portal Generation

Navigation

Query/Serach

Content

Integration Collect metadata from participating partners

Annotation [Hotho, Sure, 2003]

Page 41: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 43

OntologyOntologyexamines the relationships between words, and the

corresponding concepts and objects in practice, it often combines aspects of thesaurus and

dictionary frequently uses a graph-based visual representation to

indicated relationships between words

used to identify and specify a vocabulary for a particular subject or task

Page 42: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 44

The Notion of OntologyThe Notion of Ontologyontology

explicit specification of a shared conceptualization that holds in a particular context

captures a viewpoint on a domain: taxonomies of species physical, functional, & behavioral system descriptions task perspective: instruction, planning

[Schreiber 2000]

Page 43: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 45

Ontology Should Allow for “Representational

Promiscuity”

Ontology Should Allow for “Representational

Promiscuity”ontology

parameterconstraint -expression

knowledge base A

cab.weight + safety.weight = car.weight:

cab.weight < 500:

knowledge base B

parameter(cab.weight)parameter(safety.weight)parameter(car.weight)constraint-expression(

cab.weight + safety.weight = car.weight)constraint-expression(

cab.weight < 500)

rewritten as

viewpointmapping rules

[Schreiber 2000]

Page 44: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 46[Schreiber 2000]

Ontology TypesOntology Types domain-oriented

domain-specific medicine => cardiology => rhythm disorders traffic light control system

domain generalizations components, organs, documents

task-oriented task-specific

configuration design, instruction, planning task generalizations

problems solving, e.g. upml

generic ontologies “top-level categories” units and dimensions

Page 45: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 47

Using OntologiesUsing Ontologiesontologies needed for an application are typically a

mix of several ontology types technical manuals

device terminology: traffic light system document structure and syntax instructional categories

e-commerceraises need for

modularization integration

import/export mapping

[Schreiber 2000]

Page 46: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 48

Domain Standards and Vocabularies As

Ontologies

Domain Standards and Vocabularies As

Ontologies example: Art and Architecture Thesaurus (AAT) contains ontological information

AAT: structure of the hierarchy structure needs to be “extracted”

not explicit can be made available as an ontology

with help of some mapping formalism lists of domain terms are sometimes also called “ontologies”

implies a weaker notion of ontology scope typically much broader than a specific application domain example: domain glossaries, wordnet contain some meta information: hyponyms, synonyms, text

[Schreiber 2000]

Page 47: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 49

Ontology DevelopmentOntology Development

Scott Patterson, CS8350

Kietz, Maedche, Voltz; A Method for Semi-Automatic Ontology acquisition from a Corporate Intranet

Maedche & Stabb; Ontology Learning for the Semantic Web

DomainOntology

Extract

Import/Reuse

Prune

Refine

Select Sources

Concept Learning

Relation learning

Evaluation

Page 48: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 50

Ontology SpecificationOntology Specificationmany different languages

KIF Ontolingua Express LOOM UML XML to the rescue: Web Ontology Language (OWL)

common basis class (concept) subclass with inheritance relation (slot)

[Schreiber 2000]

Page 49: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 51

Art & Architecture Thesaurus

Art & Architecture Thesaurus

used forindexing stolen art objects in Europeanpolice

databases

[Schreiber 2000]

Page 50: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 52

AAT OntologyAAT Ontology

descriptionuniverse

descriptiondimension

descriptor

value set

value

descriptorvalue

object

object type object class

classconstraint

has feature

descriptorvalue set

in dimension

instance of

class of

hasdescriptor

1+

1+

1+

1+

1+

1+

[Schreiber 2000]

Page 51: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 53

Document Fragment Ontologies: Instructional

Document Fragment Ontologies: Instructional

[Schreiber 2000]

Page 52: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 54

Domain Ontology of a Traffic Light Control

System

Domain Ontology of a Traffic Light Control

System

[Schreiber 2000]

Page 53: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 55

Two Ontologies of Document FragmentsTwo Ontologies of Document Fragments

[Schreiber 2000]

Page 54: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 56

Ontology for E-commerceOntology for E-commerce

[Schreiber 2000]

Page 55: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 57

Top-level Categories:Many Different ProposalsTop-level Categories:

Many Different Proposals

Chandrasekaran et al. (1999)

[Schreiber 2000]

Page 56: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 58

A Few Observations about Ontologies

A Few Observations about Ontologies Simple ontologies can be built by non-experts

Consider Verity’s Topic Editor, Collaborative Topic Builder, GFP interface, Chimaera, etc.

Ontologies can be semi-automatically generated from crawls of site such as yahoo!, amazon, excite, etc. Semi-structured sites can provide starting points

Ontologies are exploding (business pull instead of technology push) most e-commerce sites are using them - Google, MySimon, Affinia, Amazon, Yahoo!

Shopping, etc. Controlled vocabularies (for the web) abound - SIC codes, UMLS, UN/SPSC, Open

Directory, Rosetta Net, … DTDs, schemata are making more ontology information available Business ontologies are including roles Businesses have ontology directors “Real” ontologies are becoming more central to applications

[McGuiness 2000]

Page 57: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 59

OntoSeek Example 1OntoSeek Example 1

<URL1>Mountain BikePurposeCross-country Ride

ModelModel:Team FRSPartPart

AluminiumModel:7000 SeriesModel:Rocker-SuspensionSuspensionTubing

MaterialModel ModelTriangleShape

<URL2>Component

LanguageFunction

ObjectFunctionFunctionFunctionEditCompileBrowseDebugProgram

Language:JavaObjectObjectObject

LanguageLanguage:JavaModel:JavaEditor 1.02

Model

[Guarino et al. 2000]

Page 58: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 60

OntoSeek Screen ShotOntoSeek Screen Shot

[Guarino et al. 2000]

Page 59: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 61

OntoSeek DisambiguationOntoSeek Disambiguation

CarColorRed

English Word"4-wheeled; usually propelled by an internal combustion engine""a visual attribute that results from the light they emit or transmit or reflect""a color"

Interpretation[car, auto, automobile, machine, motocar][color, colour, coloring, colouring]

[red, redness]

Corresponding Synset

RadioPart "something less than the whole of a human artifact""an electronic device that detects and demodulates and amplifies transmitted signals"

[part, portion][radio receiver, receiving set, radio set, radio, tuner, wireless]

<URL>[car, auto, automobile, machine, motocar]:c#3272[color, colour, coloring, coluring][red, redness]Disambiguation

<URL>Car:c#3272ColorRedRadio

Part[radio receiver, receiving set, radio set, radio, tuner, wireless][part, portion]

[Guarino et al. 2000]

Page 60: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 62

OntoBroker ArchitectureOntoBroker Architecture

[Studer. 2000]

Page 61: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 63

OntoPadOntoPad

[Studer. 2000]

Page 62: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 64

Query InterfaceQuery Interface

[Studer. 2000]

Page 63: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 65

Hyperbolic View InterfaceHyperbolic View Interface

[Studer. 2000]

Page 64: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 66

CategorizationCategorizationFeature-based CategorizationHierarchical Categorization

Page 65: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 67

Hierarchical CategorizationHierarchical

Categorizationa set of objects is divided into smaller and smaller

subset, forming a hierarchical structure (tree) with the elementary objects as leaf nodes typically one feature is used to distinguish one category

from another often constitutes a relatively stable “backbone” of a

knowledge organization scheme re-organization requires a major effort

Page 66: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 68

Feature-based CategorizationFeature-based Categorization

objects or documents are assigned to categories according to commonalties in specific features

can be used to dynamically group objects into categories that are of interest for a particular task or purpose re-organization is easy with computer support

Page 67: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 69

Knowledge Organization Frameworks

Knowledge Organization Frameworks

Dublin CoreResource Description FrameworkTopic Maps

Page 68: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 70

Case StudiesCase StudiesNorthern LightEPA TRSGetty VocabulariesRDFSemantic Web

Page 69: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 71

Post-TestPost-Test

Page 70: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 73

Important Concepts and Terms

Important Concepts and Terms natural language processing

neural network predicate logic propositional logic rational agent rationality Turing test

agent automated reasoning belief network cognitive science computer science hidden Markov model intelligence knowledge representation linguistics Lisp logic machine learning microworlds

Page 71: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 74

Summary Knowledge Organization

Summary Knowledge Organization

Page 72: © 2001-2007 Franz J. Kurfess Knowledge Organization 1 CPE/CSC 581: Knowledge Management Dr. Franz J. Kurfess Computer Science Department Cal Poly

© 2001-2007 Franz J. Kurfess Knowledge Organization 75