SlideShare a Scribd company logo
ELAG 2014 Workshop. Bath, UK. 11–12th June 2014
Adrian Stevenson and Jane Stevenson
Mimas, University of Manchester, UK
@adrianstevenson @janestevenson
Linking Data with sameAs:
Challenges and Solutions
Linking Lives
• An interface to biographical data, using
– the Archives Hub
– VIAF
– DBPedia
– the British National Biography (BNB)
– Copac
• http://archiveshub.ac.uk/linkinglives/
owl:sameAs
<Archives Hub Person> owl:sameAs <VIAF Person>
<http://data.archiveshub.ac.uk/id/person/nra/webbma
rthabeatrice1858-1943socialreformer>
owl:sameAs
<http://viaf.org/viaf/86607236> .
3
http://data.archiveshub.ac.uk/id/person/nra/web
bmarthabeatrice1858-1943socialreformer
foaf:familyName + foaf:givenName + hub:dates
“Webb, Martha Beatrice, 1858-1943”
http://viaf.org/viaf/86607236/
foaf:name
“Webb, Martha Beatrice, 1858-1943”
4
Matching
• LOD Refine
• http://code.zemanta.com/sparkica/download.html
• SILK Framework
• http://wifo5-03.informatik.uni-
mannheim.de/bizer/silk/#workbench
5
LOD Refine
6
SILK
7
Comments on the workshop
• ‘great lead-through on LOD refine’
• LOD Refine and Silk seem to be workable tools
for creating sameAs triples that can help
matching
• ‘purpose and possibilities of Silk perhaps a
little rushed for me’
• ‘made me realize how disconnected my
concept of Silk restrictions and Sparql was.
This is now fixed. Ta!’
Comments on Linking Lives
• ‘Great to see the British National Biography
(BNB) being used’
• Linking Lives project shows the need for more
open data!’
• ‘We need robust Sparql endpoints!’
Comments…
• ‘Funny how hard it is to find useful stuff to link
to, and how the user is to make sense of it’.
• ‘I feel reconciled!’
• ‘Linking = hard work’
Challenges
Identifying entities:
• One of the main problems we came up with in
our linked data pilot connecting library
catalogue data and theatre performance data
was the lack of identifiers for people and
works
• String matching on personal names and work
titles in legacy heterogenous systems is
extremely important
Challenges
• Question is how to match work titles in
multiple languages.

More Related Content

What's hot (18)

PPTX
2011 11 grdi-presentation
Johannes Keizer
 
PPTX
Biodiversity—A Healthy Ecosystem Thrives on Fresh Ideas (Part 1 of 3), Phil J...
Allen Press
 
PPTX
Robust Links - a proposed solution to reference rot in scholarly communication
Martin Klein
 
PPTX
ProQuest: The Road to Open Access - An Aggregator Journey (LundOnline 2014)
ProQuest
 
PPT
Cultural Heritage Information Dashboards
Richard Urban
 
PDF
The Past's Present Future: Emerging Trends in Online Cultural Heritage
Richard Urban
 
PPT
IMLS DCC Progress Update to the Chief Officers of State Library Agencies (COSLA)
Richard Urban
 
PDF
Oadoi and libraries
Heather Piwowar
 
PPTX
We Are Generation Open
Jonathan Tennant
 
KEY
Creating Visualizations with Linked Open Data
Alvaro Graves
 
PPT
The Great Twentieth-Century Hole Or, what the Digital Humanities Miss
TU Delft, Netherlands
 
PDF
Digitised historic newspapers in Europe
TU Delft, Netherlands
 
PPT
Europeana in a Research Context
TU Delft, Netherlands
 
PPT
Europeana Newspapers -
TU Delft, Netherlands
 
PPTX
Intellectual Freedom Through Subject Headings: Can FAST Help?
Emily Nimsakont
 
PPTX
Rich Data? Poor Data? Depends on...
Lars G. Svensson
 
PPTX
eluxemburgensia: the portal for Luxembourg's historic newspapers
Europeana Newspapers
 
PPTX
Exploring Aggregation of Personal, Private, and Institutional Web Archives
Mat Kelly
 
2011 11 grdi-presentation
Johannes Keizer
 
Biodiversity—A Healthy Ecosystem Thrives on Fresh Ideas (Part 1 of 3), Phil J...
Allen Press
 
Robust Links - a proposed solution to reference rot in scholarly communication
Martin Klein
 
ProQuest: The Road to Open Access - An Aggregator Journey (LundOnline 2014)
ProQuest
 
Cultural Heritage Information Dashboards
Richard Urban
 
The Past's Present Future: Emerging Trends in Online Cultural Heritage
Richard Urban
 
IMLS DCC Progress Update to the Chief Officers of State Library Agencies (COSLA)
Richard Urban
 
Oadoi and libraries
Heather Piwowar
 
We Are Generation Open
Jonathan Tennant
 
Creating Visualizations with Linked Open Data
Alvaro Graves
 
The Great Twentieth-Century Hole Or, what the Digital Humanities Miss
TU Delft, Netherlands
 
Digitised historic newspapers in Europe
TU Delft, Netherlands
 
Europeana in a Research Context
TU Delft, Netherlands
 
Europeana Newspapers -
TU Delft, Netherlands
 
Intellectual Freedom Through Subject Headings: Can FAST Help?
Emily Nimsakont
 
Rich Data? Poor Data? Depends on...
Lars G. Svensson
 
eluxemburgensia: the portal for Luxembourg's historic newspapers
Europeana Newspapers
 
Exploring Aggregation of Personal, Private, and Institutional Web Archives
Mat Kelly
 

Similar to Linking Data with sameAs: Challenges and Solutions - Workshop (6)

PPTX
Linked dataworkshopintro14aug2014
Jane Stevenson
 
PPTX
Very Gentle Linked Data Workshop
Adrian Stevenson
 
PPTX
“Il n’y a pas de hors-texte” - Challenges for Archival Linked Data
Adrian Stevenson
 
PDF
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
Biblioteca Nacional de España
 
PPTX
Lessons from ‘Linking Lives’ and ‘WW1 Discovery’ Projects
Adrian Stevenson
 
PPT
Locah Project Show and Tell
Adrian Stevenson
 
Linked dataworkshopintro14aug2014
Jane Stevenson
 
Very Gentle Linked Data Workshop
Adrian Stevenson
 
“Il n’y a pas de hors-texte” - Challenges for Archival Linked Data
Adrian Stevenson
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
Biblioteca Nacional de España
 
Lessons from ‘Linking Lives’ and ‘WW1 Discovery’ Projects
Adrian Stevenson
 
Locah Project Show and Tell
Adrian Stevenson
 
Ad

More from Adrian Stevenson (20)

PPTX
Exploring British Design
Adrian Stevenson
 
PPTX
SEO Matters
Adrian Stevenson
 
PPTX
Wrapping and Unwrapping History: What’s Gained and What’s Lost
Adrian Stevenson
 
PPTX
Digital Humanities and the First World War
Adrian Stevenson
 
PPTX
Introduction to APIs and Linked Data
Adrian Stevenson
 
PPTX
GLAM Rocks! London Semantic Web Meetup
Adrian Stevenson
 
PPTX
Linked Data - the Future for Open Repositories. Kultivate Workshop
Adrian Stevenson
 
PPTX
High and Lows of Library Linked Data
Adrian Stevenson
 
PPTX
2 minutes on LOCAH Linking Lives at Europeana Tech 2011
Adrian Stevenson
 
PPTX
Linked Open Data: Opportunities & Barriers for Archives
Adrian Stevenson
 
PPTX
Report on the International Linked Open Data for Libraries, Archives and Muse...
Adrian Stevenson
 
PPT
Aggregation Using Linked Data – LOCAH Project Experiences
Adrian Stevenson
 
PPT
Linked Data - the Future for Open Repositories?
Adrian Stevenson
 
PPT
LOCAH Project and Considerations of Linked Data Approaches
Adrian Stevenson
 
PPT
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Adrian Stevenson
 
PPT
RDFa From Theory to Practice
Adrian Stevenson
 
PPT
Linked Data and the Semantic Web - Mimas Seminar
Adrian Stevenson
 
PPT
Semantic Technologies: Which Way Now? – UKOLN Response
Adrian Stevenson
 
PPT
SWORD 3 Kick-off Meeting
Adrian Stevenson
 
PPT
Linked Data and the Semantic Web: What Are They and Should I Care?
Adrian Stevenson
 
Exploring British Design
Adrian Stevenson
 
SEO Matters
Adrian Stevenson
 
Wrapping and Unwrapping History: What’s Gained and What’s Lost
Adrian Stevenson
 
Digital Humanities and the First World War
Adrian Stevenson
 
Introduction to APIs and Linked Data
Adrian Stevenson
 
GLAM Rocks! London Semantic Web Meetup
Adrian Stevenson
 
Linked Data - the Future for Open Repositories. Kultivate Workshop
Adrian Stevenson
 
High and Lows of Library Linked Data
Adrian Stevenson
 
2 minutes on LOCAH Linking Lives at Europeana Tech 2011
Adrian Stevenson
 
Linked Open Data: Opportunities & Barriers for Archives
Adrian Stevenson
 
Report on the International Linked Open Data for Libraries, Archives and Muse...
Adrian Stevenson
 
Aggregation Using Linked Data – LOCAH Project Experiences
Adrian Stevenson
 
Linked Data - the Future for Open Repositories?
Adrian Stevenson
 
LOCAH Project and Considerations of Linked Data Approaches
Adrian Stevenson
 
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Adrian Stevenson
 
RDFa From Theory to Practice
Adrian Stevenson
 
Linked Data and the Semantic Web - Mimas Seminar
Adrian Stevenson
 
Semantic Technologies: Which Way Now? – UKOLN Response
Adrian Stevenson
 
SWORD 3 Kick-off Meeting
Adrian Stevenson
 
Linked Data and the Semantic Web: What Are They and Should I Care?
Adrian Stevenson
 
Ad

Recently uploaded (20)

PPTX
Nutri-QUIZ-Bee-Elementary.pptx...................
ferdinandsanbuenaven
 
PPTX
CLEFT LIP AND PALATE: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
Gall bladder, Small intestine and Large intestine.pptx
rekhapositivity
 
PPTX
Views on Education of Indian Thinkers J.Krishnamurthy..pptx
ShrutiMahanta1
 
PDF
Federal dollars withheld by district, charter, grant recipient
Mebane Rash
 
PPTX
Optimizing Cancer Screening With MCED Technologies: From Science to Practical...
i3 Health
 
PPTX
ANORECTAL MALFORMATIONS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
Accounting Skills Paper-I, Preparation of Vouchers
Dr. Sushil Bansode
 
PDF
07.15.2025 - Managing Your Members Using a Membership Portal.pdf
TechSoup
 
PPTX
Presentation: Climate Citizenship Digital Education
Karl Donert
 
PPTX
SCHOOL-BASED SEXUAL HARASSMENT PREVENTION AND RESPONSE WORKSHOP
komlalokoe
 
PDF
IMP NAAC-Reforms-Stakeholder-Consultation-Presentation-on-Draft-Metrics-Unive...
BHARTIWADEKAR
 
PPTX
Explorando Recursos do Summer '25: Dicas Essenciais - 02
Mauricio Alexandre Silva
 
PPTX
nutriquiz grade 4.pptx...............................................
ferdinandsanbuenaven
 
PPTX
Nutrition Month 2025 TARP.pptx presentation
FairyLouHernandezMej
 
PPTX
How to Configure Access Rights of Manufacturing Orders in Odoo 18 Manufacturing
Celine George
 
PDF
IMP NAAC REFORMS 2024 - 10 Attributes.pdf
BHARTIWADEKAR
 
PPTX
Maternal and Child Tracking system & RCH portal
Ms Usha Vadhel
 
PPTX
Optimizing Cancer Screening With MCED Technologies: From Science to Practical...
i3 Health
 
PDF
BÀI TẬP BỔ TRỢ THEO LESSON TIẾNG ANH - I-LEARN SMART WORLD 7 - CẢ NĂM - CÓ ĐÁ...
Nguyen Thanh Tu Collection
 
Nutri-QUIZ-Bee-Elementary.pptx...................
ferdinandsanbuenaven
 
CLEFT LIP AND PALATE: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Gall bladder, Small intestine and Large intestine.pptx
rekhapositivity
 
Views on Education of Indian Thinkers J.Krishnamurthy..pptx
ShrutiMahanta1
 
Federal dollars withheld by district, charter, grant recipient
Mebane Rash
 
Optimizing Cancer Screening With MCED Technologies: From Science to Practical...
i3 Health
 
ANORECTAL MALFORMATIONS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Accounting Skills Paper-I, Preparation of Vouchers
Dr. Sushil Bansode
 
07.15.2025 - Managing Your Members Using a Membership Portal.pdf
TechSoup
 
Presentation: Climate Citizenship Digital Education
Karl Donert
 
SCHOOL-BASED SEXUAL HARASSMENT PREVENTION AND RESPONSE WORKSHOP
komlalokoe
 
IMP NAAC-Reforms-Stakeholder-Consultation-Presentation-on-Draft-Metrics-Unive...
BHARTIWADEKAR
 
Explorando Recursos do Summer '25: Dicas Essenciais - 02
Mauricio Alexandre Silva
 
nutriquiz grade 4.pptx...............................................
ferdinandsanbuenaven
 
Nutrition Month 2025 TARP.pptx presentation
FairyLouHernandezMej
 
How to Configure Access Rights of Manufacturing Orders in Odoo 18 Manufacturing
Celine George
 
IMP NAAC REFORMS 2024 - 10 Attributes.pdf
BHARTIWADEKAR
 
Maternal and Child Tracking system & RCH portal
Ms Usha Vadhel
 
Optimizing Cancer Screening With MCED Technologies: From Science to Practical...
i3 Health
 
BÀI TẬP BỔ TRỢ THEO LESSON TIẾNG ANH - I-LEARN SMART WORLD 7 - CẢ NĂM - CÓ ĐÁ...
Nguyen Thanh Tu Collection
 

Linking Data with sameAs: Challenges and Solutions - Workshop

  • 1. ELAG 2014 Workshop. Bath, UK. 11–12th June 2014 Adrian Stevenson and Jane Stevenson Mimas, University of Manchester, UK @adrianstevenson @janestevenson Linking Data with sameAs: Challenges and Solutions
  • 2. Linking Lives • An interface to biographical data, using – the Archives Hub – VIAF – DBPedia – the British National Biography (BNB) – Copac • http://archiveshub.ac.uk/linkinglives/
  • 3. owl:sameAs <Archives Hub Person> owl:sameAs <VIAF Person> <http://data.archiveshub.ac.uk/id/person/nra/webbma rthabeatrice1858-1943socialreformer> owl:sameAs <http://viaf.org/viaf/86607236> . 3
  • 4. http://data.archiveshub.ac.uk/id/person/nra/web bmarthabeatrice1858-1943socialreformer foaf:familyName + foaf:givenName + hub:dates “Webb, Martha Beatrice, 1858-1943” http://viaf.org/viaf/86607236/ foaf:name “Webb, Martha Beatrice, 1858-1943” 4
  • 5. Matching • LOD Refine • http://code.zemanta.com/sparkica/download.html • SILK Framework • http://wifo5-03.informatik.uni- mannheim.de/bizer/silk/#workbench 5
  • 8. Comments on the workshop • ‘great lead-through on LOD refine’ • LOD Refine and Silk seem to be workable tools for creating sameAs triples that can help matching • ‘purpose and possibilities of Silk perhaps a little rushed for me’ • ‘made me realize how disconnected my concept of Silk restrictions and Sparql was. This is now fixed. Ta!’
  • 9. Comments on Linking Lives • ‘Great to see the British National Biography (BNB) being used’ • Linking Lives project shows the need for more open data!’ • ‘We need robust Sparql endpoints!’
  • 10. Comments… • ‘Funny how hard it is to find useful stuff to link to, and how the user is to make sense of it’. • ‘I feel reconciled!’ • ‘Linking = hard work’
  • 11. Challenges Identifying entities: • One of the main problems we came up with in our linked data pilot connecting library catalogue data and theatre performance data was the lack of identifiers for people and works • String matching on personal names and work titles in legacy heterogenous systems is extremely important
  • 12. Challenges • Question is how to match work titles in multiple languages.

Editor's Notes

  • #2: Mention this is a very gentle and won’t go into much detail given only 4 hours Aim is to get people actually creating some linked data Will inevitably have to gloss over a number of issues. Please leave if think it might be too simple