anila angjeli 1 aparsen - interoperability of pi workshop, ipres, lisbon, 5 september 2013 viaf and...

12
Anila Angjeli 1 PARSEN - Interoperability of PI workshop, iPRES, Lisbon, 5 September 2013 VIA F and mber of the Board of directors of ISNI-IA

Upload: joseph-sparks

Post on 25-Dec-2015

215 views

Category:

Documents


2 download

TRANSCRIPT

Anila Angjeli

1APARSEN - Interoperability of PI workshop, iPRES, Lisbon, 5 September 2013

VIAFand

Member of the Board of directors of ISNI-IA

VIAF

• Merge of 32+ national level authority files• 34 + million authority records• 103 + million bibliographic records• 23 + million merged clusters

Persons, organizations, meetings, geographic names, works and expressions

though young (less than 10 years) …

from VIAF to ISNI

Libraries

Text Rights

Music RightsTrade Sources

Encyclopaedias

Researchers & Professional

cross-domain bridging-domains

Figures+ % confidence

- % confidence

Assigned ISNIs to VIAF July 2013

2 + independent sources 2,496,141

3+ VIAF sources 656,976

Unique name 2,643,958

Total 5,797,075Provisional: Unassigned

9,563,590

Provisional: Possible580,738

Assigned

6.87 million

Number of data contributors

VIAF 39

Others 25

Total(in permanent growth)

64

Cross links among sources through local IDs

Over 7,6 million

Governance & control infrastructureISNI-IA

ISO Registration Authority(the governing body)

Quality Teamlibraries

data

ISNI-AAAssignment Agency

(manages the central database)

Registration Agency

Registration Agency

Registration Agency

Registration AgencyRegistration

Agency

Member

Member

Member

enduser

enduser

database

• Samples data regularly – c. 2% VIAF clusters have mixed identities– Duplicate clusters are higher, nearer 5%

• Makes corrections at cluster level

– Merges, splits, error notifications– Access to cataloguing client / macros

• Makes system recommendations• Gives approval for single source assignment• Responds to End User input

ISNI Quality Team

Domain Cross-domain identities Authority files +

Main purpose

Certified ID (unique, persistent, international, cross-domain) 27729

Clustering, federating authority files for reuse

(initially not an ID system)

ID degree of persistency

Permanent As persistent as possible

Referent

PublicIdentities Persons Organizations Fictional

Authority Persons Organizations Meetings, Works/Expr, Geog

Data privacy Includes private data (not disclosed to public)

All public data

Assignment principles

Matching authoritative data sources No sparse records No undifferentiated identities

Matching authority files

Management Centrally managed Quality Team (BnF+BL)

Maintenance of source authority files in the contributing databases

Links

Among source data, Titles of works, Related identities, Wikipedia, Other encycl sources VIAF

Among source files, Wikipedia, ISNI

Linked data Soon Yes

differences – commonalities - complementarity

Interest for interoperability

VIAF – ISNI

Areas of interest• Same referent (semantic)• Communities of users sharing interests• Same user operating in multiple communities

8

VIAF-ISNI inter

Monthly updates

ISNIsReprocessing

after notification

Error notifications

Quality Team Quality

control

matching

Assignment

Error detection

Reminder: VIAF seed database for ISNI

VIAF-ISNI Task ForcePolicy on pseudonyms Study notification work flows Participate in cluster sampling in VIAF and ISNI Help define new anomaly detectors, etc

-relationship-operability

11

in BnF Catalog a library use case

Workflows with publishers(Legal deposit)

Bibliographic and

authority products

Out of commerce works

Research and discovery

Other uses…