SlideShare a Scribd company logo
Linked Data
           from
      Digital Object
    Management System
  @ the National Library of Latvia

Uldis Bojārs - SWIB12 – 28-Nov-2012
Uldis Bojārs
• uldis.bojars@gmail.com
• @CaptSolo

• National Library of Latvia
• Semantic Web expert

• PhD in Computer Science, DERI Galway
  (National University of Ireland, Galway)
Foto: Ligita Ieviņa / Latvijas Nacionālā bibliotēka
DOM2
Digital Object Management System (DOM2)

work – in – progress
• feedback, suggestions = very welcome :)
• Core functionality – digital object
  management and preservation
  – production system (not a pilot)


• Development: custom, outsourced

• Linked Data functions (added on)
Context
• Core functionality
   – Must be reliable and with good performance

• Linked Data functions (added on)
   – Aim: bootstrap linked data at NLL
   – Linked data interface (URIs, HTTP conneg, RDF data)
   – SPARQL endpoint

• Developers
   – Lack of developers who have experience building
     production-level systems based on RDF stores
https://twitter.com/nichtich/status/273460676222152704
Architecture
• Core system (MSSQL, C#, .Net)
   – Ingest, object management, …
   – (DB allows to add links to other objects, web pages)
   – (new Digital Object metadata fields can be added)

• RDF / Linked Data adaptor module
   – URIs, HTTP content negotiation
   – HTML, RDF, XML
   – (for new Digital Object fields can specify how to export in RDF)

• Separate RDF / SPARQL server
   – SPARQL endpoint
   – (no impact on core system)
Synchronisation
• Named graphs

• Push-sync
  – core system knows when something is updated
    and sends changes to the RDF store
  – updates at object level (named graphs)
     • SPARQL CLEAR, INSERT
Data
• Digital object packages (XML)
   – from various sources
   – mapped to RDF: (mix of various vocabs)

• Authority records
   – from ALEPH: ~170 k records
   – may use DOM2 to expose authority data as RDF
   – in RDF: SKOS
       • via https://github.com/kefo/marcauth-2-madsrdf

• Classifiers
   – digital object types, access rights, languages, …
   – in RDF: SKOS
<http://example.org/data/obj/11>
  dc:creator
    <http://example.org/data/auth/104168> ;
  dc:rights <http://example.org/data/clas/copyright#Public> ;
  dc:title "Garās magones" ;
  dc:type
    <http://example.org/data/subtype#Postcard>,
    <http://example.org/data/type#Image> ;
  dct:accessRights <http://example.org/data/clas/accessright#AllowPublic> ;
  dct:captured "2012-07-04"^^xsd:dateTime ;
  dct:modified "2012-07-04"^^xsd:dateTime ;
  a ore:Aggregation .

<http://example.org/data/auth/104168>
  rdfs:label "Губайдуллин, Г. С., (Газиз Салихович)" ;
  rdfs:seeAlso "http://example.org/data/auth/104168.rdf" .
<file id="91"
     mimeType="image/jpeg" name="junijs15-16_040.jpg" size="2112976" … >
    <fileMetadata>
     <field name="Type">JPEG image</field>
     <field name="Name">91.jpg</field>
     <field name="Size">2.01 MB</field>
     <field name="Title">OLYMPUS DIGITAL CAMERA</field>
     <field name="Subject">OLYMPUS DIGITAL CAMERA</field>
     <field name="Content created">30.10.2012 11:37:52</field>
     <field name="Date last saved">30.10.2012 11:37:52</field>
     <field name="Program name">Version 1.1</field>
     <field name="Width">2736 pixels</field>
     <field name="Height">3648 pixels</field>
     <field name="Horizontal resolution">96 dpi</field>
     <field name="Vertical resolution">96 dpi</field>
     …

         What about modeling file metadata (for various content types)?
         Source XML data not very useful.
Issues / Questions
• Technical issues
   – how to reliably work with RDF stores
• Modeling
   – Digital object metadata
       • using a mix of vocabs. can BIBFRAME help?
   – File metadata (for various file types)
       • https://answers.semanticweb.com/questions/19810/file-metadata-
         ontology
   – Classifiers
       • Existing vocabs that can be reused? (for digital object types, …)
• Best practices
   – Have you done something similar?
   – What choices did you make?
Looking for:
Suggestions and feedback:
  – modeling, technical decisions, …
  – … anything else that comes to mind …


Collaboration ideas, projects:
  – to do useful things with this information
     • (re digital objects, authority data, …)
  – further research and development

                           uldis.bojars@gmail.com / @CaptSolo

More Related Content

What's hot (20)

PDF
Getting Started with Knowledge Graphs
Peter Haase
 
PDF
Metadata Management in Islandora
David Wilcox
 
PPTX
DSpace-CRIS: new features and contribution to the DSpace mainstream
Andrea Bollini
 
PPTX
DSpace standard Data model and DSpace-CRIS
Andrea Bollini
 
PPTX
NoSQL: what does it mean, how did we get here, and why should I care? - Hugo ...
South London Geek Nights
 
PPTX
Repository technologies
Andrea Bollini
 
PPTX
NATE-Central-Log
Stefan Coetzee
 
PPTX
DSpace-CRIS technical level introduction
4Science
 
PPTX
Or2019 DSpace 7 Enhanced submission &amp; workflow
4Science
 
PDF
guacamole: an Object Document Mapper for ArangoDB
Max Neunhöffer
 
PDF
EUDAT data architecture and interoperability aspects – Daan Broeder
OpenAIRE
 
PPTX
Hybrid Enterprise Knowledge Graphs
Peter Haase
 
PDF
Scaling ELK Stack - DevOpsDays Singapore
Angad Singh
 
PPTX
Data Day Seattle 2015: Sarah Guido
Bitly
 
PDF
Putting Historical Data in Context: how to use DSpace-GLAM
4Science
 
PDF
Practical Use of a NoSQL
IBM Cloud Data Services
 
Getting Started with Knowledge Graphs
Peter Haase
 
Metadata Management in Islandora
David Wilcox
 
DSpace-CRIS: new features and contribution to the DSpace mainstream
Andrea Bollini
 
DSpace standard Data model and DSpace-CRIS
Andrea Bollini
 
NoSQL: what does it mean, how did we get here, and why should I care? - Hugo ...
South London Geek Nights
 
Repository technologies
Andrea Bollini
 
NATE-Central-Log
Stefan Coetzee
 
DSpace-CRIS technical level introduction
4Science
 
Or2019 DSpace 7 Enhanced submission &amp; workflow
4Science
 
guacamole: an Object Document Mapper for ArangoDB
Max Neunhöffer
 
EUDAT data architecture and interoperability aspects – Daan Broeder
OpenAIRE
 
Hybrid Enterprise Knowledge Graphs
Peter Haase
 
Scaling ELK Stack - DevOpsDays Singapore
Angad Singh
 
Data Day Seattle 2015: Sarah Guido
Bitly
 
Putting Historical Data in Context: how to use DSpace-GLAM
4Science
 
Practical Use of a NoSQL
IBM Cloud Data Services
 

Similar to Linked Data from a Digital Object Management System (20)

PPT
Linked data and voyager
Edmund Chamberlain
 
PPT
Lifting the Lid on Linked Data
Jane Stevenson
 
PPTX
Why I don't use Semantic Web technologies anymore, event if they still influe...
Gautier Poupeau
 
PDF
Metadata
Dorothea Salo
 
PPTX
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Cory Lampert
 
PPTX
How Much to Semanticize? Looking at the future of Library Data and the Semant...
Jenn Riley
 
PDF
Adventures in Linked Data Land (presentation by Richard Light)
jottevanger
 
PPTX
Intro to the semantic web (for libraries)
robin fay
 
PPTX
New Directions in Information Organization: A Linked Data Model with BIBFRAME
SharonYang
 
PPTX
TPDL2013 tutorial linked data for digital libraries 2013-10-22
jodischneider
 
PPT
RDF and Open Linked Data, a first approach
horvadam
 
PDF
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
eswcsummerschool
 
PDF
The state of the art in Linked Data
Joshua Shinavier
 
PPTX
VRA_2015_CatalogingRoundup_Seneff
Heather Seneff
 
PPTX
Linked data at the Science Museum
Tristan Roddis
 
PPTX
Adlug annual meeting 2013
@CULT Srl
 
PDF
Linked data experience at Macmillan: Building discovery services for scientif...
Michele Pasin
 
PPTX
Linked Data MLA 2015
Cason Snow
 
PPTX
Linked data MLA 2015
Cason Snow
 
PDF
What is New in W3C land?
Ivan Herman
 
Linked data and voyager
Edmund Chamberlain
 
Lifting the Lid on Linked Data
Jane Stevenson
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Gautier Poupeau
 
Metadata
Dorothea Salo
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Cory Lampert
 
How Much to Semanticize? Looking at the future of Library Data and the Semant...
Jenn Riley
 
Adventures in Linked Data Land (presentation by Richard Light)
jottevanger
 
Intro to the semantic web (for libraries)
robin fay
 
New Directions in Information Organization: A Linked Data Model with BIBFRAME
SharonYang
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
jodischneider
 
RDF and Open Linked Data, a first approach
horvadam
 
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
eswcsummerschool
 
The state of the art in Linked Data
Joshua Shinavier
 
VRA_2015_CatalogingRoundup_Seneff
Heather Seneff
 
Linked data at the Science Museum
Tristan Roddis
 
Adlug annual meeting 2013
@CULT Srl
 
Linked data experience at Macmillan: Building discovery services for scientif...
Michele Pasin
 
Linked Data MLA 2015
Cason Snow
 
Linked data MLA 2015
Cason Snow
 
What is New in W3C land?
Ivan Herman
 
Ad

More from Uldis Bojars (18)

PPTX
Linked Digital Collection "Rainis and Aspazija"
Uldis Bojars
 
PDF
Case study: Towards a linked digital collection of Latvian Cultural Heritage
Uldis Bojars
 
PPTX
OWLGrEd Ontology Visualizer
Uldis Bojars
 
PDF
Library Linked Data in Latvia - #LIBER2014 poster
Uldis Bojars
 
PPTX
Semantiskais tīmeklis un Atvērtie dati
Uldis Bojars
 
PPTX
Linked Open Data / Atvērtie saistītie dati
Uldis Bojars
 
PDF
Web Science - 1. lekcija
Uldis Bojars
 
PPTX
Exploring the Networks in Open Public Data
Uldis Bojars
 
PPT
Envisioning Social Applications of Library Linked Data
Uldis Bojars
 
PPTX
Web Science 01.12.2011 - Linked Data
Uldis Bojars
 
PPTX
Web Science 29.09.2011
Uldis Bojars
 
PPTX
Web Science 15.09.2011
Uldis Bojars
 
PPTX
Web Science seminārs - intro
Uldis Bojars
 
PPT
Weaving SIOC into the Web of Linked Data
Uldis Bojars
 
PPT
Data Portability with SIOC and FOAF
Uldis Bojars
 
PDF
FOAF for Social Network Portability
Uldis Bojars
 
PPT
SIOC: Semantic Web for Social Media Sites
Uldis Bojars
 
PPT
XUL - Mozilla Application Framework
Uldis Bojars
 
Linked Digital Collection "Rainis and Aspazija"
Uldis Bojars
 
Case study: Towards a linked digital collection of Latvian Cultural Heritage
Uldis Bojars
 
OWLGrEd Ontology Visualizer
Uldis Bojars
 
Library Linked Data in Latvia - #LIBER2014 poster
Uldis Bojars
 
Semantiskais tīmeklis un Atvērtie dati
Uldis Bojars
 
Linked Open Data / Atvērtie saistītie dati
Uldis Bojars
 
Web Science - 1. lekcija
Uldis Bojars
 
Exploring the Networks in Open Public Data
Uldis Bojars
 
Envisioning Social Applications of Library Linked Data
Uldis Bojars
 
Web Science 01.12.2011 - Linked Data
Uldis Bojars
 
Web Science 29.09.2011
Uldis Bojars
 
Web Science 15.09.2011
Uldis Bojars
 
Web Science seminārs - intro
Uldis Bojars
 
Weaving SIOC into the Web of Linked Data
Uldis Bojars
 
Data Portability with SIOC and FOAF
Uldis Bojars
 
FOAF for Social Network Portability
Uldis Bojars
 
SIOC: Semantic Web for Social Media Sites
Uldis Bojars
 
XUL - Mozilla Application Framework
Uldis Bojars
 
Ad

Linked Data from a Digital Object Management System

  • 1. Linked Data from Digital Object Management System @ the National Library of Latvia Uldis Bojārs - SWIB12 – 28-Nov-2012
  • 2. Uldis Bojārs • [email protected] @CaptSolo • National Library of Latvia • Semantic Web expert • PhD in Computer Science, DERI Galway (National University of Ireland, Galway)
  • 3. Foto: Ligita Ieviņa / Latvijas Nacionālā bibliotēka
  • 4. DOM2 Digital Object Management System (DOM2) work – in – progress • feedback, suggestions = very welcome :)
  • 5. • Core functionality – digital object management and preservation – production system (not a pilot) • Development: custom, outsourced • Linked Data functions (added on)
  • 6. Context • Core functionality – Must be reliable and with good performance • Linked Data functions (added on) – Aim: bootstrap linked data at NLL – Linked data interface (URIs, HTTP conneg, RDF data) – SPARQL endpoint • Developers – Lack of developers who have experience building production-level systems based on RDF stores
  • 8. Architecture • Core system (MSSQL, C#, .Net) – Ingest, object management, … – (DB allows to add links to other objects, web pages) – (new Digital Object metadata fields can be added) • RDF / Linked Data adaptor module – URIs, HTTP content negotiation – HTML, RDF, XML – (for new Digital Object fields can specify how to export in RDF) • Separate RDF / SPARQL server – SPARQL endpoint – (no impact on core system)
  • 9. Synchronisation • Named graphs • Push-sync – core system knows when something is updated and sends changes to the RDF store – updates at object level (named graphs) • SPARQL CLEAR, INSERT
  • 10. Data • Digital object packages (XML) – from various sources – mapped to RDF: (mix of various vocabs) • Authority records – from ALEPH: ~170 k records – may use DOM2 to expose authority data as RDF – in RDF: SKOS • via https://github.com/kefo/marcauth-2-madsrdf • Classifiers – digital object types, access rights, languages, … – in RDF: SKOS
  • 11. <http://example.org/data/obj/11> dc:creator <http://example.org/data/auth/104168> ; dc:rights <http://example.org/data/clas/copyright#Public> ; dc:title "Garās magones" ; dc:type <http://example.org/data/subtype#Postcard>, <http://example.org/data/type#Image> ; dct:accessRights <http://example.org/data/clas/accessright#AllowPublic> ; dct:captured "2012-07-04"^^xsd:dateTime ; dct:modified "2012-07-04"^^xsd:dateTime ; a ore:Aggregation . <http://example.org/data/auth/104168> rdfs:label "Губайдуллин, Г. С., (Газиз Салихович)" ; rdfs:seeAlso "http://example.org/data/auth/104168.rdf" .
  • 12. <file id="91" mimeType="image/jpeg" name="junijs15-16_040.jpg" size="2112976" … > <fileMetadata> <field name="Type">JPEG image</field> <field name="Name">91.jpg</field> <field name="Size">2.01 MB</field> <field name="Title">OLYMPUS DIGITAL CAMERA</field> <field name="Subject">OLYMPUS DIGITAL CAMERA</field> <field name="Content created">30.10.2012 11:37:52</field> <field name="Date last saved">30.10.2012 11:37:52</field> <field name="Program name">Version 1.1</field> <field name="Width">2736 pixels</field> <field name="Height">3648 pixels</field> <field name="Horizontal resolution">96 dpi</field> <field name="Vertical resolution">96 dpi</field> … What about modeling file metadata (for various content types)? Source XML data not very useful.
  • 13. Issues / Questions • Technical issues – how to reliably work with RDF stores • Modeling – Digital object metadata • using a mix of vocabs. can BIBFRAME help? – File metadata (for various file types) • https://answers.semanticweb.com/questions/19810/file-metadata- ontology – Classifiers • Existing vocabs that can be reused? (for digital object types, …) • Best practices – Have you done something similar? – What choices did you make?
  • 14. Looking for: Suggestions and feedback: – modeling, technical decisions, … – … anything else that comes to mind … Collaboration ideas, projects: – to do useful things with this information • (re digital objects, authority data, …) – further research and development [email protected] / @CaptSolo

Editor's Notes

  • #6: data about Digital Objects, Authorities, Classifiers