terminology services diane vizine-goetz senior research scientist oclc research

Download Terminology Services Diane Vizine-Goetz Senior Research Scientist OCLC Research

Post on 10-Feb-2016

33 views

Category:

Documents

0 download

Embed Size (px)

DESCRIPTION

Terminology Services Diane Vizine-Goetz Senior Research Scientist OCLC Research. Presentation History. A version of this presentation was given at: New Dimensions in Knowledge Organization Systems: A Joint NKOS/CENDI Workshop The World Bank Washington, DC 11 September 2008 - PowerPoint PPT Presentation

TRANSCRIPT

  • Terminology Services

    Diane Vizine-GoetzSenior Research ScientistOCLC Research

  • Presentation HistoryA version of this presentation was given at:New Dimensions in Knowledge Organization Systems: A Joint NKOS/CENDI Workshop The World Bank Washington, DC 11 September 2008OCLC / ISKO-NA Preconference Universit de Montral Montral, Canada 5 August 2008

  • Moving Vocabularies to the Network LevelRequirements:Expressive data structuresVocabularies encoded for the WebAccess mechanisms for search and retrievalURI accessible contentUse of open protocols and standards

  • Success will be measured by the appearance of applications that use or combine vocabulary data to create new derivative works or tools.

  • OCLC Terminology Services Prototype

    Employs library and Web standards to make the terms and relationships in controlled vocabularies available as Web resources.

  • Top-level site intended for machines

  • Human interface for everyone else

  • ContentApplicationsQuery ExpansionSearching Heterogeneous CollectionsMetadata Creationfastgsafdlcshmeshlctgm & gmgpcVocabulariesWeb Services

  • ContentApplicationsQuery ExpansionSearching Heterogeneous CollectionsMetadata Creationfastgsafdlcshmeshlctgm & gmgpcVocabulariesWeb Services

  • Types of Controlled Vocabularies (Hodge 2000) * indicates availability in Terminology Services PrototypeTerm ListsAuthority Files*GlossariesDictionariesGazetteersClassifications and CategoriesSubject Headings*Classification Schemes*TaxonomiesCategorization SchemesRelationship ListsThesauri*Semantic NetworksOntologies

  • Vocabularies (August 2008)Faceted Application of Subject Terminology (fast)Form and Genre Terms for Fiction and Drama (gsafd)Library of Congress Subject Headings (lcsh)Medical Subject Headings (mesh)Thesaurus for Graphic Materials: TGM I, Subject Terms (lctgm)Thesaurus for Graphic Materials: TGM II, Genre and Physical Characteristics (gmgpc)

  • Data Structures for Controlled VocabulariesMARC 21 Format for Authority DataA format for the use and exchange of information about the authorized forms of names and subjects used as access points in MARC bibliographic records. Simple Knowledge Organization System (SKOS)SKOS Core is a model and an RDF vocabulary for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, 'folksonomies', [etc.]ZthesA model for representing thesauri* and a specification for expressing them in XML. Zthes also provides specifications for searching Zthes compliant data using SRU/SRW or Z39.50.

  • ContentApplicationsQuery ExpansionSearching Heterogeneous CollectionsMetadata Creationfastgsafdlcshmeshlctgm & gmgpcVocabulariesWeb Services

  • Encoding MechanismsXML (Extensible Markup Language) A data-interchange format for custom markup languages. RDF (Resource Description Framework) A data-interchange format for the representation of graph models. JSON (Javascript Object Notation) A data-interchange format based on a subset of the JavaScript Programming Language defined by the ECMA-262 3rd Edition standard.

  • Access MechanismsREST (Representational State Transfer) A software architecture style used for building distributed systems that retrieve Web resources.

    SRU & SRW (Search/Retrieve via URL) A standard search protocol that utilizes the Contextual Query Language (CQL) syntax to retrieve Web resources.

  • ContentApplicationsQuery ExpansionSearching Heterogeneous CollectionsMetadata Creationfastgsafdlcshmeshlctgm & gmgpcVocabulariesWeb Services

  • http://tspilot.oclc.org/lctgm/?query=oclcts.expandedHeading+exact+%22temples%22&version=1.1&operation=searchRetrieve039__$a (DLC)lctgm-010644039__$a (DLC)lctgm-10644040__$a DLC$b eng$c OCoLC$d OCoLC$d OCoLC-O$f lctgm$9 lctgm150__$a Temples$9 temples550__$w g$a Religious facilities$0 (DLC)lctgm008761$9 religious facilities550__$w h$a Buddhist temples$0 (DLC)lctgm001379$9 buddhist temples550__$w h$a Confucian temples$0 (DLC)lctgm002437$9 confucian temples550__$w h$a Greek temples$0 (DLC)lctgm004717$9 greek temples550__$w h$a Hindu temples$0 (DLC)lctgm004994$9 hindu temples550__$w h$a Roman temples$0 (DLC)lctgm008977$9 roman temples550__$w h$a Taoist temples$0 (DLC)lctgm010519$9 taoist temples550__$a Churches$0 (DLC)lctgm002048$9 churches550__$a Pagodas$0 (DLC)lctgm007367$9 pagodas550__$a Pronaoi$0 (DLC)lctgm008289$9 pronaoi550__$a Torii$0 (DLC)lctgm010868$9 torii341User Enters Search2Query Sent to TS PrototypeMARC XML ReturnedClient application extracts terms for query expansion

  • 040__$a DLC$b eng$c OCoLC$d OCoLC$d OCoLC-O$f lctgm$9 lctgm150__$a Temples$9 temples550__$w g$a Religious facilities$0 (DLC)lctgm008761550__$w h$a Buddhist temples$0 (DLC)lctgm001379550__$w h$a Confucian temples$0 (DLC)lctgm002437550__$w h$a Greek temples$0 (DLC)lctgm004717550__$w h$a Hindu temples$0 (DLC)lctgm004994550__$w h$a Roman temples$0 (DLC)lctgm008977550__$w h$a Taoist temples$0 (DLC)lctgm010519550__$a Churches$0 (DLC)lctgm002048550__$a Pagodas$0 (DLC)lctgm007367550__$a Pronaoi$0 (DLC)lctgm008289550__$a Torii$0 (DLC)lctgm010868Narrower terms

  • URI accessible content{URL for the service}/{vocabulary}/{identifier}.{format}http://tspilot.oclc.org/lcsh/sh95000541.htmlhttp://tspilot.oclc.org/lcsh/sh95000541.jsonhttp://tspilot.oclc.org/lcsh/sh95000541.marcxmlhttp://tspilot.oclc.org/lcsh/sh95000541.skoshttp://tspilot.oclc.org/lcsh/sh95000541.zthes

  • The template shows how to link to vocabulary data in the prototype{URL for the service}/{vocabulary}/{identifier}.{format}Base URL for the service http://tspilot.oclc.orgVocabulary - the code for the controlled vocabulary in the MARC code list for termshttp://www.loc.gov/marc/relators/relasour.html#rela6xxhttp://www.loc.gov/marc/relators/relasour.html#rela655Identifier a control number associated with a concept or term (e.g., an LCCN - Library of Congress Control Number)Format the representation of the vocabulary data

  • In the QueueMore vocabulariesAccess to complete term hierarchiesMADS profileAdditional mappings

  • Learn moreTerminology Services Prototypehttp://tspilot.oclc.org/resources/http://tspilot.oclc.org (machine interface)Project pagehttp://www.oclc.org/research/projects/termservices/

    http://tspilot.oclc.org

    Provides information about the services and vocabularies accessible through the site.http://tspilot.oclc.org/resources/Provides access to documentation and standards for the project.

    Vocabularies are searchable through a basic SRU interface. The following examples are for LCSH and TGM I: http://tspilot.oclc.org/lcsh/?operation=explain&version=1.1 http://tspilot.oclc.org/lctgm/?operation=explain&version=1.1

    The Terminology Services Prototype has three main components:Content (controlled vocabularies)Web servicesApplications (created by software developers internal & external to OCLC) Hodge, Gail. 2000. Systems of Knowledge Organization for Digital Libraries: Beyond Traditional Authority Files. Available at: http://www.clir.org/pubs/abstract/pub91abst.html

    Metadata for each vocabulary is retrievable from the meta database. Copy the following link into a browser address bar to see the metadata record for GSAFD:

    http://tspilot.oclc.org/meta/?query=dc.subject+exact+%22fiction%22&version=1.1&operation=searchRetrieveIndiana University Digital Library Program has built an application that uses the Terminology Services prototype to provide query expansion.http://fedora-dev.dlib.indiana.edu:8080/search/index.jspFrom IU DLP interface, a user searches for templesSRU query sent to OCLC Terminology Service prototypeOCLC Terminology Service prototype returns MARC XML authority record for templesClient application extracts broader, narrower, and related terms and searches Indiana Universitys digital library collection using original term + narrower terms

Recommended

View more >