Cross-linking and Referencing Data and Publications in CLADDIER

Download Cross-linking and Referencing Data and Publications in CLADDIER

Post on 04-Jan-2016

46 views

Category:

Documents

1 download

Embed Size (px)

DESCRIPTION

Cross-linking and Referencing Data and Publications in CLADDIER. Brian Matthews, E-Science Centre, STFC Rutherford Appleton Laboratory. Bryan Lawrence (PI, BADC) Sam Pepler (Project Manager, BADC) Sue Latham (BADC) Pauline Simpson (NOCS) Jessie Hey (Southampton) Brian Matthews (STFC) - PowerPoint PPT Presentation

TRANSCRIPT

<ul><li><p>Cross-linking and Referencing Data and Publications in CLADDIER</p><p>Brian Matthews, E-Science Centre,STFC Rutherford Appleton Laboratory</p></li><li><p>About CLADDIERBryan Lawrence (PI, BADC)Sam Pepler (Project Manager, BADC) Sue Latham (BADC)Pauline Simpson (NOCS)Jessie Hey (Southampton)Brian Matthews (STFC)Catherine Jones (STFC)Alistair Miles (STFC)Katie Portwin (STFC)Shoaib Sufi (STFC)Kevin ONeil (STFC)Katherine Bouton (Reading, NCAS)Citation, Location and Deposition in Discipline and Institutional RepositoriesFunded via a JISC grant, through the Digital Repositories programme - July 2005-Oct 2007</p></li><li><p>Citation and linking in repositories</p><p>In order to achieve this scenario we need to provide a set of key mechanisms Publishing of Data Conventions for the citation of dataCan then treat data citation in similar way to publications Browsing and searchingacross different repositoriesacross data and publication Cross-citation of data and publicationforward and backward citationneed to maintain currency of citation links A simple mechanism to push citation information between repositories</p><p>A practical look at citation of data and how repositories could communicate citation information.</p></li><li><p>Data PublicationIn this context publication is defined as the process through which data is fixed and made retrievable over the long term, and may imply that there has been some quality control process. </p><p>Defining data : fixing and encapsulating a meaningful data setQuality Control : Publishers, Data Centres</p><p>Natural Environment Research Council, Mesosphere-Stratosphere-Troposphere Radar Facility [Thomas, L.; Vaughan, G.] . Mesosphere-Stratosphere-Troposphere Radar Facility at Aberystwyth, [Internet]. Version 2, Cartesian products. British Atmospheric Data Centre (BADC), 1990- [cited 2006 Apr 25]. Available from http://badc.nerc.ac.uk/data/mst.</p></li><li><p>Browsing and SearchingBrowsing and searching across different repositories across data and publication</p><p>CLADDIER has provided a harvesting and search tool to support cross-repository searching</p></li><li><p>Discovery ServiceThe Discovery Service gives a broad-brush search</p><p> Give you both publications and data sets indexed by keyword</p><p>Google across repositories.</p><p>Uses OAI-PMH a conventional approach</p><p> Simple but it works! Simple key-word searching Three participating repositories in the pilot: BADC, STFC ePubs, SOTON ePrints</p></li><li><p>Adding Cross-CitationsTraditional CitationCross CitationCannot tell whether the data and publication are actually related.what data and publications inspire a piece of work (generating a new data set)what publications arise from a data setWe need to exploit the concept of cross-citation to see whether items are actually related.</p></li><li><p>Maintaining LinksIdeally the archives holding the datasets and publications would be notified that a paper citing them had been submitted.</p><p>Metadata associated with those records would be updated to reflect the citations. The metadata in the publication repository should also link to the metadata in the data archives and vice versa.It would be great if this notification could be done automatically.Tedious to enter citationsforward citations (cited-by) are hard to track</p><p>We adapted a protocol from the world of BloggingTrackbackDesigned to allow cross-referencing of blog articles Extended to allow richer metadata</p></li><li><p>Trackback Protocol</p></li><li><p>Sender PublicationThis publication has a citation to a technical report</p></li><li><p>Adds CitationSends trackback call to this URI</p></li><li><p>Embedded MetadataTrackback URIFormats accepted</p></li><li><p>After Trackback cited-by link addedReceiver PublicationAdded this cited by link</p></li><li><p>Notes on TrackbackA simple existing protocolP2P loosely federates repositoriesExtended to carry metadata of the citationTo add cited-by linksCan also indicate which metadata is expectedSimple Dublin Core ePrints Application ProfileCan also use the metadata of the receiver Improves the citation metadataImplemented in ePubsAlso partially in BADCReceiver only send email to admin.Some problems or extensions are under considerationLink to metadata not full textSpamming anyone could send trackbacksWhitelistsAdministrator interventionMultiple entriesSame citation multiple timesSame citation in different repositoriesRetraction of citationA delete protocol</p></li><li><p>ConclusionsCLADDIER supports the scientific process with federated repositories</p><p>This requires the cross-linking network of information objects.Which needs to be stored, maintained and searchedNow doing some user testing </p><p>Tools and ideas relatively straightforwardLots of gluing of existing componentsKeep it simple so it will get used</p><p>http://claddier.badc.ac.uk/ </p></li></ul>