The GND initiative 2017-2021:

Developing a backbone for the web of cultural and scientific data

Sarah Hartmann, Jürgen Kett

The Integrated Authority File = Gemeinsame Normdatei (GND)

Corporate Bodies 11%

Conferences 6%

Geographic Names 2%

Persons 30%

Names of Persons 47%

Subject Headings 2%

Works 2%

| 19 | GND initiative 2017-2021 | LIBER 2017 | 7th July 2017

Interfaces and formats

Formats: PICA, MARC 21 XML, RDF/XML, Turtle, JSON

DNB OPAC ● ●

Data Service FTP ● ● ●

OAI-PMH ● ●

SRU ● ●

Linked Data Service ● ● ● ●

Entity Facts ●

Cataloguing Client ●

API SRU Record Update ●

API Webcat (Persons) ●

active

passive

GND

GND characteristics and sustainability

Alfred Stieglitz

Georgia OꞌKeeffe

Sky above Clouds IV

Art Institute of Chicago

Women painters

Maler (painter)

Sun Prairie, Wis.

1918

Künstlerin (artist)

Malerin (painter)

Georgia OꞌKeeffe, Hands

Ghost Ranch, Abiquiu, NM

1887

1986

Santa Fe, NM

1965

• Each record describes one entity (exception: names)

• Unique, persistent identifier (basis for the URI)

• Entities have attributes and relationships to other entities

• Relations are designated by codes

• Modular data structure
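
The characteristics above can be illustrated with a minimal sketch. It is not the GND data model itself: the class names, the `attributes` dictionary and the relation codes `place` and `family` are assumptions made for illustration. The identifiers and the `http://d-nb.info/gnd/` prefix are the ones shown on the back-up slides of this deck.

```python
from dataclasses import dataclass, field

GND_BASE = "http://d-nb.info/gnd/"  # URI prefix as it appears on the slides


@dataclass
class Relation:
    code: str        # relation code; the values used below are illustrative only
    target_id: str   # identifier of the related entity


@dataclass
class Entity:
    gnd_id: str                                   # unique, persistent identifier
    label: str
    attributes: dict = field(default_factory=dict)
    relations: list = field(default_factory=list)

    @property
    def uri(self) -> str:
        # The identifier is the basis for the URI.
        return GND_BASE + self.gnd_id

    def related(self, code: str) -> list:
        """Return identifiers of entities linked with the given relation code."""
        return [r.target_id for r in self.relations if r.code == code]


berlin = Entity("4005728-8", "Berlin")
alexander = Entity(
    "118554700",
    "Humboldt, Alexander von",
    relations=[
        Relation("place", "4005728-8"),    # hypothetical code: associated place
        Relation("family", "118554727"),   # hypothetical code: family member
    ],
)

print(alexander.uri)
print(alexander.related("family"))
```

Because relations only store the identifier of the target, each record stays small and the network emerges from following identifiers from one record to the next.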

The GND initiative 2017-2021

Organisation

Guidelines

Work program

Organisation

GND cooperative – organizational structure

Policy + Reconcilement: STA, GND Committee

Office + Infrastructure: GND Central Office

Coordination + Curation: Agency, Agency, Agency, ...

Creating the Data: Participants

Guidelines

trusted quality

stable

transparent

permanently accessible

open and free

binding rules

run cooperatively

neutral

domain-transcending

unambiguous

Work program

▪ Action field 1 – organisation and communication

▪ Action field 2 – data management, maintenance, standardisation

▪ Action field 3 – import and data mining

▪ Action field 4 – visualisation and end user applications

▪ Action field 5 – data supply and cataloging processes

▪ Action field 6 – collaboration with other communities

Opening up to museums and archives

Future

2010 | 2012 | 2014 | 2016 | 2018

Agency

... ...

Museum

new

Archive

new

new

STA

GND Committee

GND Central Office

Action field 1 - organisation and communication

Develop a domain-transcending authority data system

Action field 2 - data management, maintenance, standardisation

GND-CORE: common minimum standard

GND-PLUS: community- and application-specific extensions

– modularize the data structure

– easy-to-use APIs and interfaces

– optimize data management

- tracking of changes and provenance

- suggestions for modification, add comments

Analysing, linking and integrating data

Action field 3 - import and data mining

– "GND assistants"

– extend semantic links (internal and external)

– monitoring data quality

- clearing up inconsistencies, errors, duplicates

– clustering (e.g. works)
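
One simple way to surface duplicate candidates, sketched here under stated assumptions, is to group records whose labels become identical after normalisation. This is not the method of any GND project; the function names, record identifiers and sample labels are made up for illustration.

```python
import unicodedata
from collections import defaultdict


def normalise(label: str) -> str:
    """Lower-case, strip diacritics and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", label)
    without_marks = "".join(c for c in decomposed if not unicodedata.combining(c))
    cleaned = "".join(c if c.isalnum() else " " for c in without_marks.lower())
    return " ".join(cleaned.split())


def duplicate_candidates(records: dict) -> list:
    """Group record identifiers whose labels normalise to the same key."""
    groups = defaultdict(list)
    for record_id, label in records.items():
        groups[normalise(label)].append(record_id)
    return [sorted(ids) for ids in groups.values() if len(ids) > 1]


# Made-up identifiers and labels, for illustration only.
sample = {
    "rec-1": "Müller, Heinrich",
    "rec-2": "Muller,  Heinrich",
    "rec-3": "Humboldt, Alexander von",
}
candidates = duplicate_candidates(sample)
print(candidates)
```

Identical names do not imply identical persons, so a grouping like this can only produce candidates for review, not automatic merges.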

– function as a "signpost"

– modules for visualization and navigation

– (test) implementation of a semantic search system

Action field 4 - visualization and end user applications

Improve access to GND network

– modernize the infrastructure of data supply

– integration of GND in current systems

– interlocking of the cataloging and indexing workflows

Action field 5 - data supply and cataloging processes

Cooperative data supply

– deepen cooperations (Wikipedia/Wikidata, research, publishers …)

– expand linking to other data sources or identifier systems

– provide applications that make participation easy

Source of image: https://commons.wikimedia.org/wiki/File:Illustration_of_overlapping_communities.jpg

Action field 6 - collaboration

Expand user groups and applications

2017: Establishment of GND cooperative

Project ARACHNE: infrastructure for linking data

Project GND4C: GND for cultural data

Start project DDUP: quality control

Project ORCID DE: GND and ORCID

In the pipeline: project GND for publishers

Next steps

Thanks!

Questions?

s.hartmann@dnb.de

j.kett@dnb.de

http://www.dnb.de/EN/gnd

http://d-nb.info/standards/elementset/gnd#

Back-up

– Integration of new partners

- other requirements of cultural and scientific data providers

- discussions about rules, different kinds of "views", quality levels

– responsibilities

- who is allowed to create and update/correct which records or values

- need for provenance data

Challenges: Develop a backbone for the web of cultural and scientific data


Links within the GND

http://d-nb.info/gnd/4005728-8

Berlin

http://d-nb.info/gnd/118554700

Humboldt, Alexander von

http://d-nb.info/gnd/4020214-8

Geograph

http://d-nb.info/gnd/4041423-1

Naturwissenschaftler

http://d-nb.info/gnd/118554727

Humboldt, Wilhelm von

http://d-nb.info/gnd/119247267

Humboldt, Elisabeth von

http://d-nb.info/gnd/7569879-1

Ideen zu einer Geographie der Pflanzen

Links to external sources

http://d-nb.info/gnd/118554700

Humboldt, Alexander von

Persons in the GND

http://d-nb.info/gnd/100799892

Müller, Heinrich

http://d-nb.info/gnd/

Müller, Heinrich

1886-

– two record types

- names of persons (Tn) and persons (Tp)

– Names of persons

- non-individualised records

- used for any number of persons with this (preferred) name

– Persons

- individualised records

- used for exactly one person

- monthly growth of approx. 20,000 records
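
The difference between the two record types can be shown with a small sketch. The class, the function and the record identifiers `example-tn` and `example-tp` are hypothetical; only the type codes Tn and Tp and the sample name come from the slide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PersonRecord:
    record_id: str       # placeholder values below, not real GND identifiers
    preferred_name: str
    record_type: str     # "Tn" = name of person, "Tp" = person

    def identifies_one_person(self) -> bool:
        # Only individualised (Tp) records stand for exactly one person.
        return self.record_type == "Tp"


def individualised_matches(records, name):
    """Return the Tp records carrying the given preferred name."""
    return [r for r in records
            if r.preferred_name == name and r.identifies_one_person()]


records = [
    PersonRecord("example-tn", "Müller, Heinrich", "Tn"),
    PersonRecord("example-tp", "Müller, Heinrich", "Tp"),
]
matches = individualised_matches(records, "Müller, Heinrich")
print([r.record_id for r in matches])
```

A link to a Tn record only says that some person of that name is meant, so an application that needs an unambiguous identity would filter for Tp records as above.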

Requirements for active participation

– Signing the cooperation agreement

- (including the guidelines)

– ISIL (International Standard Identifier for Libraries and Related Organizations) or MARC Organization Code for unambiguous identification

Actions

– Create new records

– Add additional information to existing records

- e.g. variant names, other attributes used for identifying the entity

– Correction of existing values

– Merging records / heading replaced by another (redirect; German: Umlenkung)

– In case of insufficient permission

- adding hints or requests to correct a record / values

User groups in GND

– Different levels mark the provenance of the records

– For the level, see MARC field 079 $c

– Levels

- 1 curation / quality team of library network

- 2 local curation / quality team

- 3 trained users

- 4 untrained users

- 5 other, non-librarian users

- 6 legacy data, not edited by curation team

- 7 automatically generated records

– Special responsibilities

- For musical works (level 1)

- For transcription or names in non-Latin script

Actions according to level

– Create new record: any user (with assigned level)

– Add additional information: any user

– Correct existing records (elements, values): same or lower level (certain elements)

– Replace / delete / split: any user

- but if lower level than 3: only a request to replace / delete / split
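
The rules on this slide can be read as a small decision function. The sketch below is one possible reading, not the GND implementation. It assumes that level 1 is the highest level, so "lower level" means a numerically larger number, and it ignores the restriction to certain elements. The action names and the `Outcome` type are made up for illustration.

```python
from enum import Enum


class Outcome(Enum):
    ALLOWED = "allowed"
    REQUEST_ONLY = "request only"


LEVELS = range(1, 8)  # levels 1 to 7 as listed on the user-groups slide


def decide(action, user_level, record_level=None):
    """Decide whether a user may perform an action or may only request it."""
    if user_level not in LEVELS:
        raise ValueError("user level must be between 1 and 7")
    if action in ("create", "add_information"):
        return Outcome.ALLOWED  # open to any user
    if action == "correct":
        if record_level is None:
            raise ValueError("correcting needs the level of the existing record")
        # Assumed reading of "same or lower level": record number >= user number.
        if record_level >= user_level:
            return Outcome.ALLOWED
        return Outcome.REQUEST_ONLY
    if action in ("replace", "delete", "split"):
        # Users below level 3 (numerically above 3) may only file a request.
        return Outcome.ALLOWED if user_level <= 3 else Outcome.REQUEST_ONLY
    raise ValueError(f"unknown action: {action}")


print(decide("correct", user_level=3, record_level=6))
print(decide("split", user_level=4))
```

Returning a request-only outcome instead of a refusal follows the "Actions" slide, where users with insufficient permission can still add hints or requests to correct a record.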
