Linking Data with sameAs: Challenges and Solutions - Workshop
Feedback from the 'Linking Data with sameAs: Challenges and Solutions' 3-hour workshop given at ELAG 2014 in Bath, UK. http://elag2014.org/programme/elag-2014-workshops/stevenson/
![Page 1: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/1.jpg)
ELAG 2014 Workshop. Bath, UK. 11–12th June 2014
Adrian Stevenson and Jane Stevenson
Mimas, University of Manchester, UK
@adrianstevenson @janestevenson
Linking Data with sameAs: Challenges and Solutions
![Page 2: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/2.jpg)
Linking Lives
• An interface to biographical data, using
  – the Archives Hub
  – VIAF
  – DBpedia
  – the British National Bibliography (BNB)
  – Copac
• http://archiveshub.ac.uk/linkinglives/
![Page 3: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/3.jpg)
owl:sameAs
<Archives Hub Person> owl:sameAs <VIAF Person>
<http://data.archiveshub.ac.uk/id/person/nra/webbmarthabeatrice1858-1943socialreformer>
owl:sameAs
<http://viaf.org/viaf/86607236> .
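As a rough illustration of the statement above, the following Python sketch writes the same link as one N-Triples line. The function name `same_as_triple` and the variable names are hypothetical, introduced only for this example; the two URIs are the ones shown on the slide, and the full `owl:sameAs` URI is the standard OWL one.

```python
# Full URI behind the owl:sameAs prefixed name.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def same_as_triple(subject_uri: str, object_uri: str) -> str:
    """Return one N-Triples line asserting subject owl:sameAs object.

    Hypothetical helper for illustration; it does no URI validation.
    """
    return f"<{subject_uri}> <{OWL_SAME_AS}> <{object_uri}> ."


# URIs taken from the slide.
hub_person = (
    "http://data.archiveshub.ac.uk/id/person/nra/"
    "webbmarthabeatrice1858-1943socialreformer"
)
viaf_person = "http://viaf.org/viaf/86607236"

print(same_as_triple(hub_person, viaf_person))
```

In practice an RDF library would normally handle serialisation and escaping; plain string formatting is used here only to keep the sketch self-contained.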
![Page 4: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/4.jpg)
http://data.archiveshub.ac.uk/id/person/nra/webbmarthabeatrice1858-1943socialreformer
foaf:familyName + foaf:givenName + hub:dates
“Webb, Martha Beatrice, 1858-1943”
http://viaf.org/viaf/86607236/
foaf:name
“Webb, Martha Beatrice, 1858-1943”
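The comparison on this slide can be sketched in Python as follows. This is a minimal sketch, not the workshop's actual workflow: the helper `hub_label` and the dictionary layout are hypothetical, and the split of the label into family name, given name and dates is an assumption inferred from the combined string shown above.

```python
def hub_label(family_name: str, given_name: str, dates: str) -> str:
    """Combine the three Archives Hub properties into one comparable label.

    Hypothetical helper; the "family, given, dates" layout is assumed
    from the label shown on the slide.
    """
    return f"{family_name}, {given_name}, {dates}"


# Assumed property values for the Archives Hub person.
hub_person = {
    "foaf:familyName": "Webb",
    "foaf:givenName": "Martha Beatrice",
    "hub:dates": "1858-1943",
}

# foaf:name value shown for the VIAF person.
viaf_name = "Webb, Martha Beatrice, 1858-1943"

combined = hub_label(
    hub_person["foaf:familyName"],
    hub_person["foaf:givenName"],
    hub_person["hub:dates"],
)

# An exact match on the combined label marks the pair as a sameAs candidate.
is_candidate = combined == viaf_name
print(combined, is_candidate)
```

Exact equality is the strictest possible test; any difference in punctuation or spacing between the two sources would make it fail, which is why matching tools usually offer looser comparisons.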
![Page 5: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/5.jpg)
Matching
• LOD Refine
  • http://code.zemanta.com/sparkica/download.html
• SILK Framework
  • http://wifo5-03.informatik.uni-mannheim.de/bizer/silk/#workbench
![Page 6: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/6.jpg)
LOD Refine
![Page 7: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/7.jpg)
SILK
![Page 8: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/8.jpg)
Comments on the workshop
• ‘great lead-through on LOD refine’
• LOD Refine and Silk seem to be workable tools for creating sameAs triples that can help matching
• ‘purpose and possibilities of Silk perhaps a little rushed for me’
• ‘made me realize how disconnected my concept of Silk restrictions and Sparql was. This is now fixed. Ta!’
![Page 9: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/9.jpg)
Comments on Linking Lives
• ‘Great to see the British National Biography (BNB) being used’
• ‘Linking Lives project shows the need for more open data!’
• ‘We need robust Sparql endpoints!’
![Page 10: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/10.jpg)
Comments…
• ‘Funny how hard it is to find useful stuff to link to, and how the user is to make sense of it’.
• ‘I feel reconciled!’
• ‘Linking = hard work’
![Page 11: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/11.jpg)
Challenges
Identifying entities:
• One of the main problems we came up against in our linked data pilot connecting library catalogue data and theatre performance data was the lack of identifiers for people and works
• String matching on personal names and work titles in legacy heterogeneous systems is extremely important
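One way to picture string matching on names from heterogeneous systems is the sketch below, which normalises each string and then scores similarity with Python's standard `difflib`. It is an assumption-laden illustration rather than the method used in the pilot or by LOD Refine or Silk: the function names are hypothetical, and the 0.9 threshold is an arbitrary value chosen for the example.

```python
import unicodedata
from difflib import SequenceMatcher

# Hypothetical cut-off; a real project would tune this against known matches.
THRESHOLD = 0.9


def normalise(name: str) -> str:
    """Strip accents and punctuation, fold case, and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in no_accents
    )
    return " ".join(cleaned.casefold().split())


def similarity(a: str, b: str) -> float:
    """Return a ratio between 0.0 and 1.0 for the two normalised strings."""
    return SequenceMatcher(None, normalise(a), normalise(b)).ratio()


def is_probable_match(a: str, b: str, threshold: float = THRESHOLD) -> bool:
    """Treat two names as a candidate match when similarity meets the threshold."""
    return similarity(a, b) >= threshold


print(is_probable_match(
    "Webb, Martha Beatrice, 1858-1943",
    "WEBB, Martha Beatrice 1858-1943",
))
```

A score above the threshold only suggests a candidate pair; without shared identifiers, a human check or additional evidence such as dates is still needed before asserting sameAs.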
![Page 12: Linking Data with sameAs: Challenges and Solutions - Workshop](https://reader035.vdocuments.mx/reader035/viewer/2022062405/55563155d8b42a5b528b4fde/html5/thumbnails/12.jpg)
Challenges
• The question is how to match work titles in multiple languages.