ringgold webinar series: 2. core strength - standard identifiers as the foundation of healthy data...

Core Strength: Standard Identifiers as the Foundation of Healthy Data and the Basis for Linking Your Supply Chain. Ringgold Webinar Series: Session 2, 29 January 2014

Upload: ringgold-inc

Post on 21-May-2015


Category:

Marketing



DESCRIPTION

The second session took place on Wednesday, January 29, and discussed Ringgold IDs: what they are and what other identifiers can do for your business. We addressed:
- The current landscape of standard identifiers applicable to scholarly publishing, including Ringgold IDs, ISNI, and ORCID. What are they, and why are they important?
- How and why to incorporate them into your internal data silos and into your supply chain activities
- Ringgold Identifiers and the Identify database: service overview and typical use cases

TRANSCRIPT

  • 1. Ringgold Webinar Series: Session 2 29 January 2014

2. Today's Agenda: (1) Unique identifiers at the core of good data health; (2) Identifiers in scholarly publishing; (3) Embedding identifiers into your records; (4) Related Ringgold services
3. At the core of good data health
4. Regardless of the state of your data's health, it can be improved by the addition of unique identifiers
5. What are standard identifiers? Numeric or alphanumeric persistent designations associated with a single entity. Entities can be an institution, person, or piece of content.
6. and what do they do, exactly? (1) Disambiguate, aka enforce uniqueness; (2) Enable linking, aka data integration. In other words, they provide a simple basis for data governance.
7. Enforcing uniqueness means disambiguating things that have the same name but are actually different:
   UCL: University College London (UK); Université Catholique de Louvain (Belgium); Universidad Cristiana Latinoamericana (Ecuador); University College Lillebælt (Denmark); Centro Universitario Celso Lisboa (Brazil); Union County Library (USA)
   NPL: National Physical Laboratory (UK); National Physical Laboratory (India)
   York University: University of York (UK); York University (Canada)
   Northeastern University: Northeastern University (Boston, USA); Northeastern University (Shenyang, China)
8. ...and consolidating the things that have different names but are actually the same:
   University of Oxford; Univ. Oxford; Oxford University; Library, Oxford Univ.; Radcliffe Science Library; Bodleian Library; Bodleian, Oxford; Oxford, University of
   University of Northampton; Northampton Business School; School of Education; School of Health; School of Science and Technology; Division of Computing; Division of Engineering; Environmental & Geographical Sciences; Institute for Creative Leather Technologies; School of Social Sciences; School of The Arts
9. Why is disambiguation important?
Uniquely identify institutions within records; eradicate duplication of data; ensure correct delivery, entitlements and access rights; better understand your customer base and relationships with institutions; improve trust in data; map institutions into their hierarchy
10. Data integration, or linking: identifiers are a single data element that provides an unambiguous hook into a record
11. What can you do with linked data? Using institutional identifiers to link internal systems: CRM; electronic document storage; financial system; authentication; membership system; usage statistics; author database; fulfilment system. Break down silos; keep data up to date and systems synchronised; enable staff to use data more effectively; simplify data transmission; improve overall data quality.
12. Linking author and institution IDs. When authors and their affiliations are linked correctly, publishers gain: market intelligence about authors and institutions; author and subscriber information mapped together; knowledge of where research funding is concentrated; reduction in time taken calculating open access charges (APCs). Institutions gain information about their overall research output. Funders gain information about where authors reside and publish.
13. The supply chain using identifiers [diagram; surviving label: Consortium]
14. People, places, publications...
15. What do we need to identify?
   People: authors; subscribers / customers; members; editors & other contributors
   Institutions: funders; publishers / licensors; customers / subscribers; aggregators; sales & subscription agents
   Content: books & ebooks; journals; articles
16. Personal Identifiers: International Standard Name Identifier, ISNI (www.isni.org); Open Researcher and Contributor ID, ORCID (www.orcid.org); Scopus Author ID (www.elsevier.com/onlinetools/scopus); ResearcherID (http://wokinfo.com/researcherid/). And many other proprietary system IDs: Mendeley, Microsoft Academic, Google Scholar, etc.
17.
ISNI: ISO Standard 27729. ISNI is designed to be a bridge identifier. Covers any type of entity. [Diagram labels: ISNI Number; Party ID 1; Party ID 2; Proprietary Information and/or Metadata]
18. ISNI Personal Record
19. ISNI Institutional Record
20. Institutional Identifiers: JISC and CASRAI (Consortia Advancing Standards in Research Administration Information) report on Organisation IDs: http://repository.jisc.ac.uk/5381/1/CC549D001-1.0_org_ID_landscape_study.pdf Examined the landscape of organizational identifiers in the UK and identified 23 different IDs. Lots of detail on use cases for publishing, funders, and institutions.
21. CASRAI report findings: disambiguating organizational information from multiple sources is typically described as a nightmare. Benefits from effective unique identifiers are truly realized when data is shared. Key aspects of identifiers that support the widest range of uses: governance; trust; transparency; temporal; appropriate metadata.
22. Global Identifiers [comparison table; layout lost in extraction]. Identifier names: Dun & Bradstreet; FundRef; ISNI; ORCID; Ringgold's Identify; MACE & UK Federation; VIAF; Research Analytics. Surviving labels: Coverage; Publishers; Funders; Companies; HEIs; Curated; Regulated; Historic; Global; Mainly used for linking.
23. Ringgold ID: covers institutions in the scholarly supply chain
24. FundRef
25. Content-related Identifiers: ISSN, eISSN; ISBN; DOI; LCCN
26. Where & When to Include IDs: adding them to existing records; embedding IDs as new records are created (make them a required data field). Priority record sets? Existing workflows? Which IDs do you need? Create dedicated fields for selected IDs.
27. In-House Options: use internal resources & personnel to join existing records to IDs or an authority file; build customized solutions mapping systems together, i.e. data loaders and transformation tools; improve data capture to require an ID upon record creation; manual vs. programmatic; ORCID tools: http://support.orcid.org/
28.
Outsourcing Considerations: mapping data elements in your records to standard identifiers vs. data normalization services; normalizing against a standard taxonomy; computer mapping vs. manual process
29. How to build a linked supply chain: urge your vendors and partners to adopt identifiers; request dedicated data fields in any systems implementations; embed IDs in data exchange processes with your vendors and partners (e.g. subscription agents); encourage authors and contributors to register with ORCID
30. Identify; Auditing; Validate
31. Use Cases: Identify can act as an authority file of institutions in any number of systems: editorial, MSS submissions, CRMs, financial, fulfillment, etc. Understand & analyze your customer base; analyze the wider market for opportunities; disambiguate institutions & find duplicate accounts; reveal institutional relationships with hierarchies; enhance customer records with Identify metadata; support pricing decisions & policies
32. The world of institutions from a publisher's point of view
33. Identify Database: catalogs & classifies institutions in the scholarly publishing supply chain...
34. organizes them into hierarchies (aka family trees)
35. and spans all industries, market segments, and regions: Academia; Medical; Not-for-profit; Public libraries; Corporate; Government; Publishers; Funding bodies; Intermediaries. More than 370,000 institutions and growing.
36. Delivery & Access. Access is enterprise wide: all divisions may utilize the complete array of Identify features and data. Weekly data feed: direct feed of the complete Identify database for incorporation into your own data warehouse or systems. Identify Online: Ringgold's own web interface; may be accessed via username/password and IP addresses. API: web service permits calls to Identify and returns selected data elements.
37. Licensing terms. Annual subscription: provides ongoing access to the Identify database.
Upon cancellation, Ringgold Numbers and Ringgold Names may be retained; Ringgold will require deletion of all other Ringgold data from the customer's systems. Perpetual-use licence: provides ownership of all of the data provided by Ringgold in the Identify database at time of purchase and archival rights to the data supplied. The annual maintenance fee covers the supply of a continuing data feed and ownership of the data held within. Upon cancellation, Ringgold will cease to provide the data feed.
38. Mapping your accounts to Identify
39. Audit Service: turn your customer records from this... into this.
40. Auditing is: a manual process, ideal for high-value records such as institutional subscribers; conducted by our team of 40 researchers, speaking more than 30 languages and expert in their assigned regions. Delivers the following for each unique institution: unique Ringgold Identifier; institutional hierarchy; additional metadata.
41. Audit Process: receive files from client; normalise data (de-duplication and automatching); researcher checks and matches to Ringgold IDs, hierarchy etc.; researcher creates new IDs for unidentified organizations; data split into countries; data assigned to appropriate country expert; data uploaded to Identify system; client sent encrypted file via FTP with IDs and metadata
42. Deliverables & Fees. Audit Files for Systems: intended for sequential upload into multiple data systems. Audit Files for Humans: Excel files for direct analysis by any member of staff. Identify Online incorporation: with an Identify subscription, you can see your accounts in a custom, secure view of Identify Online. View your accounts vs. the wider market for prospecting, penetration analysis, etc. Per-record fees apply.
43. Audit Data [sample table; column alignment partly lost in extraction]
   Columns, as far as they can be read: Consortia Member Parent Ringgold ID (RIN) and Inst Name | Ringgold ID and Inst Name | Customer ID and Customer Name | Product | Price | Format
   27003 Universite de Caen Basse-Normandie | 27003 Universite de Caen Basse-Normandie | 1008564 Bibliotheque Univ. de Caen | Advances in Warp Speed Engine Efficiency | $ (amount missing) | (missing)
   27003 Universite de Caen Basse-Normandie | 56820 Universite de Caen Faculte de Medecine | 1151389 UFR De Medecine De Caen | Enterprise-Wide Alls Package | $10,100 | Print + Online
   27015 Universite Joseph Fourier | 27015 Universite Joseph Fourier | 58596 U Joe Fourier Bibliotheque | Enterprise-Wide Alls Package | $10,100 | Print + Online
   27015 Universite Joseph Fourier | 72758 Universite Joseph Fourier Faculte de Medecine de Grenoble | (missing) | Journal of Interspecies Bioengineering | $4,500 | Print + Online
   27092 Universite Francois-Rabelais de Tours | 27092 Universite Francois-Rabelais de Tours | (missing) | Enterprise-Wide Alls Package | $10,100 | Print + Online
   27092 Universite Francois-Rabelais de Tours | 56555 Universite de Tours Faculte de Medecine de Tours | (missing) | Annals of Mind Meld Research | $1,600 | Print + Online
   27092 Universite Francois-Rabelais de Tours | 56555 Universite de Tours Faculte de Medecine de Tours | 484855 Medical School, Univ of Tours | Journal of Interspecies Bioengineering | $3,995 | Online
   128791 Aix-Marseille Universite | 128791 Aix-Marseille Universite | (two rows; remaining cells not aligned)
   Values whose row could not be recovered: 1216879 BU Gren1 Med Lot; 2332568 Tour Univ Library; 1303611 Service De Documentation (I894); 1037952 Bibl. Univ Med Odontolgie; 889965 Med Biblio - Aix; LeaderSHIP Quarterly; Journal of Interspecies Bioengineering; $1,230 Print + Online; 970 Print + Online; 3,995 Online
44. Beta Affiliation Matching Service: matches institutional affiliations in personal records to Identify; combines machine matching with manual processes; ideal for datasets such as members, authors, reviewers, etc. Fees are levied on a per-record basis.
45. Instant creation of new Ringgold Identifiers
46. Validate: Validate enables Ringgold's Identify customers to obtain Ringgold IDs for institutions which are not currently held in the Identify database, with immediate effect. Users search for an institution; if the institution does not appear to be in Identify, it can be added and the Ringgold number obtained immediately.
Ringgold's staff and researchers manually check all entries made in the Validate system.
47. How Validate works: user searches for an institution in Identify; cannot find institution; adds institution in Validate with required location information; obtains new and unique Ringgold ID instantly; researcher checks entry for duplication or mistakes; researcher adds metadata for new records; report sent back to publisher next day; duplicate Ringgold IDs deleted
48. Enabling effective supply chain linking
49. ProtoView: a service that creates and disseminates book and e-book metadata on behalf of scholarly publishers; developed from a successful model as the next generation of services to meet the needs of an evolving market; guided by industry best practices and standards; built on the Book News, Inc. foundation and its 35 years of experience in providing promotional services for publishers
50. ProtoView in the supply chain
51. Upcoming Webinars. Session 3: Lean and Mean: Publication Metadata to Enhance Discovery, Purchase and Use of Your Content. Wednesday, February 12. 60 minutes. Session 4: 30-Minute Workout: Quick Tips for Better Customer Data Health. Wednesday, February 26. 30 minutes. Visit www.ringgold.com to see full descriptions & to register.
52. Jay Henry, Chief Marketing Officer; Christine Orr, Sales Director; www.ringgold.com
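
Editor's illustration (not part of the slides): the deck describes identifiers as a way to consolidate differently named records, link internal systems, and map institutions into a hierarchy. The Python sketch below shows one way that idea could look in practice. The institution IDs, names, parent link, and customer records are as read from the sample audit table on slide 43; the dictionary layout, the `FULFILMENT` records, and all function names are assumptions made for the example and do not describe Ringgold's data feed or API.

```python
from collections import defaultdict

# Illustrative authority file keyed by institutional identifier.
# IDs, names and the parent link are as read from slide 43;
# the dictionary layout itself is an assumption.
AUTHORITY = {
    27003: {"name": "Universite de Caen Basse-Normandie", "parent": None},
    56820: {"name": "Universite de Caen Faculte de Medecine", "parent": 27003},
}

# Records from two internal systems. The CRM rows use customer IDs and
# names from slide 43; the FULFILMENT rows are invented for illustration.
CRM = [
    {"customer_id": 1008564, "name": "Bibliotheque Univ. de Caen", "inst_id": 27003},
    {"customer_id": 1151389, "name": "UFR De Medecine De Caen", "inst_id": 56820},
]
FULFILMENT = [
    {"order": "A-1", "ship_to": "Univ. Caen, Bibliotheque", "inst_id": 27003},
    {"order": "A-2", "ship_to": "Caen, Faculte de Medecine", "inst_id": 56820},
]


def top_parent(inst_id):
    """Follow parent links to the top of the institution's hierarchy."""
    while AUTHORITY[inst_id]["parent"] is not None:
        inst_id = AUTHORITY[inst_id]["parent"]
    return inst_id


def link_by_identifier(*systems):
    """Group records from several systems under the identifier they share."""
    linked = defaultdict(list)
    for records in systems:
        for record in records:
            linked[record["inst_id"]].append(record)
    return dict(linked)


def roll_up(records):
    """Count records per top-level institution, whatever name each record uses."""
    counts = defaultdict(int)
    for record in records:
        counts[top_parent(record["inst_id"])] += 1
    return dict(counts)


if __name__ == "__main__":
    for inst_id, records in link_by_identifier(CRM, FULFILMENT).items():
        print(inst_id, AUTHORITY[inst_id]["name"], len(records))
    print(roll_up(CRM + FULFILMENT))
```

Because every record carries the identifier, the free-text names never need to match: the join is on `inst_id` alone, and the hierarchy lookup rolls the medical faculty up to its parent university.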