linked data: oclc offerings and experiments linked data: oclc offerings and experiments oclc...

Download LINKED DATA: OCLC OFFERINGS AND EXPERIMENTS LINKED DATA: OCLC OFFERINGS AND EXPERIMENTS OCLC Informationstag

Post on 01-May-2020

1 views

Category:

Documents

0 download

Embed Size (px)

TRANSCRIPT

  • LINKED DATA: OCLC

    OFFERINGS AND

    EXPERIMENTS

    OCLC Informationstag 2012 – Frankfurt

    Titia van der Werf Senior Program Officer OCLC Research

    O C L C R e se a rch

    1

  • CONTENT

    �Digital library innovation in Europe

    �Role of OCLC Research

    �OCLC Research Linked Data Activity

    �Discussion

    2

    O C L C R e se a rch

  • DIGITAL LIBRARY INNOVATION IN

    EUROPE

    Innovation at the European level: EC ICT-FP

    research projects for digital libraries

    � High investment in innovation

    � Building an EU-wide network of experts

    � Gap between academic research and practical

    implementation of research outcomes

    � High project management overhead

    � Driven by EU-policy and politics

    � Mutual competition and temporary alliances – no long-

    term/self-sustainable cooperation in innovation. 3

    O C L C R e se a rch

  • DIGITAL LIBRARY INNOVATION IN

    EUROPE

    Innovation at the national level:

    � Funds managed by national agencies like JISC, DFG,

    SURF, etc.

    � Funding of institutions with national tasks and

    responsibilities

    � Driven by national policy and politics

    � Mutual competition and temporary alliances – no long-

    term/self-sustainable cooperation in innovation.

    4

    O C L C R e se a rch

  • DIGITAL LIBRARY INNOVATION IN

    EUROPE

    Innovation at the institutional level:

    � Complex ecosystem to manage

    5

    O C L C R e se a rch

  • ECOSYSTEM OF A SINGLE LIBRARY

    TODAY

    6

    O C L C R e se a rch

    Users

    Print

    Vendors

    Library OPAC

    ILS

    Circulation

    Cataloging

    Self

    Service

    Acquisitions

    Cataloging

    Utility

    National/

    Global

    System

    Consortial

    System

    Electronic

    Vendor

    A to Z

    List

    Resolver

    ERM

    Institutional

    Repository

    Meta-

    search

  • DIGITAL LIBRARY INNOVATION IN

    EUROPE

    Innovation at the institutional level:

    � Complex ecosystem to manage (previous slide)

    � Wide range of staff expertise/skills required to keep

    this ecosystem up and running – local capacities are

    overstretched

    � Fragmention of innovation effort

    � Budget constraints aggravated by economic crisis

    � Need to evolve towards a model of shared innovation

    � Need to redefine boundaries of local/shared/external

    ecosystem

    7

    O C L C R e se a rch

  • BOUNDARIES: THINGS ONLY WE CAN

    DO, THINGS WE CAN DO TOGETHER,

    THINGS WE SHOULDN’T HAVE TO DO

    ANYMORE

    8

    O C L C R e se a rch

    Engagement Innovation

    Infrastructure

    Develop new services

    Take-up new technologies

    Speed/flexibility important

    Attract and build

    relationships with

    end-user,

    “service-oriented”,

    customization, etc.

    Back office capacities that

    support day-to-day operations

    “Routinized” workflows

    Economies of scale important

  • SHARED OCLC PLATFORM

    9

    O C L C R e se a rch

    Platform

    Management

    Platform

    Management

    Infrastructure

    Data

    Web Services

    App GalleryApp Gallery

    OCLC-built

    Applications

    OCLC-built

    Applications Library-built

    Applications

    Library-built

    Applications Partner-built

    Applications

    Partner-built

    Applications Flexible, open

    platform for the

    community to

    share

    applications and

    innovation

  • OCLC Research Website

  • MEET THE OCLC RESEARCH TEAM

    11

    O C L C R e se a rch

    and many more…

    Dublin – San Mateo - Leiden

  • OBJECTIVE AND ROLES OF OCLC

    RESEARCH

    � Objective

    To expand knowledge that advances the OCLC’s public

    purposes of furthering access to the world’s information

    and reducing library costs.

    � Roles

    1. Act as a community resource for shared R&D

    2. Advance ideas for service improvement, new

    services and technology adoption within OCLC

    product groups

    3. Engage the library and archive community through

    research focused working groups and collaborations.

    12

    O C L C R e se a rch

  • OCLC – 3 CONSTITUENCIES

    13

    O C L C R e se a rch

    OCLC

    Services

    OCLC Research

    Library Partnership

    OCLC Membership

  • COMPOSITION OF THE

    PARTNERSHIP

    14

    O C L C R e se a rch

  • THE OCLC RESEARCH PROCES

    15

    O C L C R e se a rch

    BUILD

    COMMUNITY

    CREATE

    CONSENSUS

    IDENTIFY BEST

    PRACTICE

    PERFORM RESEARCH &

    BUSINESS

    INTELLIGENCE

    PRODUCE

    OUTCOMES

    TRANSFER

    TECHNOLOGY

    DEVELOP &

    DEPLOY

    BUILD

    PROTOTYPES

    CONVENE

    EXPERTS

    DEVELOP

    ARCHITECTURE & STANDARDS

    Shared uncertainties Community solutions

  • OCLC RESEARCH STRANDS

    16

    O C L C R e se a rch

  • OCLC RESEARCH STRANDS

    17

    O C L C R e se a rch

  • FROM BIBL. RECORDS TO LINKED

    DATA O C L C R e se a rch

    18

  • LINKED DATA

    � Global nodes for referencing:

    � information about entities (persons, organisations,

    books, events, geographical locations, plants and

    crops, etc.)

    � identified on the web with an HTTP URI;

    � The same entity can be described by different

    URIs:

    � http://viaf.org/viaf/102333412 identifies the same

    person as http://dbpedia.org/resource/Jane_Austen,

    without claiming that the information about this

    person is the same. 19

    O C L C R e se a rch

  • LINKED DATA

    � Linked data are published as a dataset using the

    web standards for browsing (human readable) and for

    re-use across applications (machine readable: RDF,

    SPARQL).

    � Linked data usually include links to other URIs

    that are relevant (eg. descriptions of the same entity

    in other languages).

    20

    O C L C R e se a rch

  • LINKED DATA

    � Library Linked Data Incubator Group http://www.w3.org/2005/Incubator/lld/XGR-lld-usecase/

    � Use cases for semantic web technology

    implementation in libraries.

    � To demonstrate the benefit of linked data for library

    resources and the value of sharing these descriptions

    among libraries and beyond. 21

    Make your stuff available on the web.

    Make it available as structured data…

    …in a non-proprietary format.

    Use HTTP URIs to identify things.

    Link your data to other people’s data. S o u rc

    e :

    W 3 C

  • LINKED DATA INITIATIVES

    � Dbpedia: http://en.wikipedia.org/wiki/Dbpedia

    A community effort to extract structured information

    from Wikipedia and to make this information

    available as linked data. Dbpedia dataset describes

    more than 3.64 million things.

    � Linking Open Government Data http://logd.tw.rpi.edu/home

    Instance Hub Project: demos of linked data across

    diverse categories derived from US Government

    data (US agencies, crops, toxic chemicals, etc.).

    � BBC Sport: http://www.bbc.co.uk/blogs/bbcinternet/2012/04/sports_dyn

    amic_semantic.html

    Linked data of sports (players, teams, matches,

    leagues and divisions, events and competitions)

    22

    O C L C R e se a rch

  • OCLC LINKED DATA ACTIVITY

    � Which entities in bibliographic descriptions are

    useful to publish as linked data?

    Authorities

    VIAF: combines multiple name authority records into a

    single name cluster and identifies it with one URI.

    VIAF initiative started in 2003 with LoC, DNB, BnF,

    OCLC Research.

    2012: 9M name clusters; cooperation with 22 agencies

    from 19 countries; transitioned to production. 23

    O C L C R e se a rch

  • O C L C R e se a rch

    24

  • OCLC LINKED DATA ACTIVITY

    � Which entities in bibliographic descriptions are

    useful to publish as linked data?

    Subject headings

    FAST: simplified and faceted version of the LCSH.

    FAST as Linked Data is available at

    http://id.worldcat.org/fast/), and is made available

    under the Open Data Commons Attribution Licence.

    It is also available for download under this license at

    http://www.oclc.org/research/activities/fast/download.

    htm. 25

    O C L C R e se a rch

  • MOVING BEYOND MARC: THE QUALITY CHALLENGE

    � OCLC Data Architecture Group

    Bibliographic records contain an enormous