dans.knaw.nl
DANS is an institute of KNAW and NWO
Data standardization process
for arts and humanities
Vyacheslav Tykhonov
Senior Information Scientist
(DANS-KNAW, Netherlands)
Developing the SSHOC Reference Ontology workshop
ICS-FORTH, Heraklion, Crete
21-22 May, 2019
DANS-KNAW core services
Outline
• Standardization process during data deposit and archiving
(metadata level created by users)
• Research data management and harmonization of deposited
datasets (file level)
• Standardization and enrichment of harvested content (metadata
level provided by different data providers)
• Tracking provenance information for data and tools, moving to FAIR
Big problem: researchers and librarians are not talking to each other
and there is no common Reference model!
Metadata schemas
• EASY TDR has its own metadata schema, developed for the Dutch
scientific landscape, but allows Dublin Core export from its
OAI-PMH endpoint
• NARCIS is an aggregator that harvests metadata from
various repositories; it has no standardization pipeline
• Metadata from Dataverse can be exported as:
Controlled vocabulary and thesaurus
• Linked data is one step forward (or actually backward in the right
direction) in solving some of the standardization problems.
• When shared controlled vocabularies (CVs) are created and
maintained by experts in various domains, digital items can
be annotated with them and easily retrieved by other experts
from the same domain, without having to be a librarian. This gives a clear
indication of which vocabulary is good enough and shared by a
critical mass.
• A thesaurus is a semantic network of unique concepts, including
relationships between synonyms, broader and narrower
(parent/child) contexts, and other related concepts. A thesaurus provides
a hierarchy for controlled vocabularies.
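The thesaurus structure described above can be sketched as a small data model. This is only an illustration: the class names, the concept labels and the `resolve`, `narrower` and `ancestors` helpers are hypothetical and are not taken from any DANS or CESSDA service.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Concept:
    """One thesaurus concept: a preferred label plus its relations."""
    pref_label: str
    alt_labels: set[str] = field(default_factory=set)  # synonyms
    broader: set[str] = field(default_factory=set)     # parent concepts
    related: set[str] = field(default_factory=set)     # associative links


class Thesaurus:
    def __init__(self) -> None:
        self.concepts: dict[str, Concept] = {}
        self._index: dict[str, str] = {}  # lower-cased label -> preferred label

    def add(self, concept: Concept) -> None:
        self.concepts[concept.pref_label] = concept
        for label in {concept.pref_label, *concept.alt_labels}:
            self._index[label.lower()] = concept.pref_label

    def resolve(self, term: str) -> Optional[str]:
        """Map a free-text term or synonym to its preferred label."""
        return self._index.get(term.strip().lower())

    def narrower(self, pref_label: str) -> set[str]:
        """Child concepts, derived by inverting the broader links."""
        return {c.pref_label for c in self.concepts.values() if pref_label in c.broader}

    def ancestors(self, pref_label: str) -> list[str]:
        """Walk broader links upward, nearest parents first."""
        found: list[str] = []
        queue = sorted(self.concepts[pref_label].broader)
        while queue:
            current = queue.pop(0)
            if current in found:
                continue
            found.append(current)
            if current in self.concepts:
                queue.extend(sorted(self.concepts[current].broader))
        return found


# Hypothetical three-level hierarchy used only for demonstration.
thesaurus = Thesaurus()
thesaurus.add(Concept("Social sciences"))
thesaurus.add(Concept("Sociology", broader={"Social sciences"}))
thesaurus.add(Concept(
    "Social stratification",
    alt_labels={"Class structure"},
    broader={"Sociology"},
    related={"Social mobility"},
))
```

Storing only the broader links and deriving the narrower ones keeps the two directions of the hierarchy from drifting apart, and the label index is what lets a non-librarian find a concept by one of its synonyms.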
SSHOC data repository
DANS-KNAW is leading the development of the SSHOC DataverseEU project.
We are developing a multilingual web interface, localizing metadata fields, and have developed a data
standardization technique based on APIs for CESSDA CVs, Topic Classification and CESSDA CV Manager
services.
SSHOC/CESSDA DataverseEU:
• Hungary (TARKI)
• Sweden (SND)
• Slovenia (ADP)
• Germany (GESIS)
• France (Sciences Po)
• Austria (AUSSDA)
• United Kingdom (UKDA)
• Italy (CNR, UniData)
• Belgium (SODA)
• Latvia (LSZDA)
• Poland (PSNC)
• Norway (DataverseNO)
• Netherlands (DANS-KNAW)
SKOS RDF Vocabularies (CESSDA)
We import thesauri delivered as SKOS RDF, for example:
A REST API endpoint returns JSON suitable for web applications.
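A minimal sketch of the SKOS-to-JSON step, using only the Python standard library. The two-concept vocabulary, the `example.org` URIs and the shape of the JSON records are assumptions made for illustration; they are not copied from a CESSDA vocabulary or from the actual endpoint.

```python
import json
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS = "http://www.w3.org/2004/02/skos/core#"
XML_NS = "http://www.w3.org/XML/1998/namespace"  # namespace of xml:lang

# Hypothetical vocabulary in SKOS RDF/XML, not taken from a real CESSDA file.
SKOS_XML = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://example.org/cv/Interview">
    <skos:prefLabel xml:lang="en">Interview</skos:prefLabel>
    <skos:prefLabel xml:lang="nl">Interview</skos:prefLabel>
  </skos:Concept>
  <skos:Concept rdf:about="http://example.org/cv/Interview.FaceToFace">
    <skos:prefLabel xml:lang="en">Face-to-face interview</skos:prefLabel>
    <skos:broader rdf:resource="http://example.org/cv/Interview"/>
  </skos:Concept>
</rdf:RDF>
"""


def skos_to_records(xml_text: str) -> list[dict]:
    """Flatten each skos:Concept into a JSON-friendly dictionary."""
    root = ET.fromstring(xml_text.strip())
    records = []
    for concept in root.findall(f"{{{SKOS}}}Concept"):
        labels = {
            el.get(f"{{{XML_NS}}}lang", "und"): el.text
            for el in concept.findall(f"{{{SKOS}}}prefLabel")
        }
        broader = [
            el.get(f"{{{RDF}}}resource")
            for el in concept.findall(f"{{{SKOS}}}broader")
        ]
        records.append({
            "uri": concept.get(f"{{{RDF}}}about"),
            "prefLabel": labels,  # language code -> label
            "broader": broader,   # URIs of parent concepts
        })
    return records


records = skos_to_records(SKOS_XML)
payload = json.dumps(records, indent=2)  # what a web client would receive
print(payload)
```

Keying the labels by language code is one way to let a multilingual interface pick the right label without re-parsing the RDF.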
Metadata standardization during deposit process
Standardized metadata in Dataverse
Standardized metadata in RDF
All relations are exported and available in the Knowledge Graph,
ready for further querying and exploration:
Research data management
The data standardization process plays a key role in the data
management plan of any organization, but the current situation in
research data management is very complex:
• too much data chaos in datasets
• no data transparency
• sometimes no standards available
• no provenance information attached to data
• homonyms, synonyms, generalizations, specializations,
spelling variations and mistakes, and language versions all
complicate keyword-based search and retrieval of
information
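To make the last point concrete, here is a small sketch of how free-text keywords could be mapped onto a controlled vocabulary. The vocabulary, the `standardize` function and the 0.8 similarity cutoff are all hypothetical choices for illustration. The sketch covers synonyms, language versions and spelling mistakes; homonyms cannot be resolved this way, because they need context.

```python
import difflib
from typing import Optional

# Hypothetical controlled vocabulary: preferred term -> known variants
# (synonyms, narrower terms and Dutch language versions).
VOCABULARY = {
    "unemployment": {"joblessness", "werkloosheid"},
    "migration": {"immigration", "migratie"},
    "education": {"schooling", "onderwijs"},
}

# Flat lookup from every known label to its preferred term.
LOOKUP = {}
for preferred, variants in VOCABULARY.items():
    LOOKUP[preferred] = preferred
    for variant in variants:
        LOOKUP[variant] = preferred


def standardize(keyword: str, cutoff: float = 0.8) -> Optional[str]:
    """Return the preferred term for a keyword, or None if nothing is close."""
    key = keyword.strip().lower()
    if key in LOOKUP:
        return LOOKUP[key]
    # Fall back to fuzzy matching to absorb spelling variations and mistakes.
    close = difflib.get_close_matches(key, list(LOOKUP), n=1, cutoff=cutoff)
    return LOOKUP[close[0]] if close else None


for raw in ["Werkloosheid", "Unemployement", "schooling", "volcano"]:
    print(raw, "->", standardize(raw))
```

Returning `None` for unmatched keywords, instead of guessing, leaves room for a human or a vocabulary maintainer to decide whether a new concept is needed.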
Data standardization pipeline based on a chatbot
Mapping produced by AI as a result
mappings:
  Image-image:
    predicateobjects:
      - [a, 'http://xmlns.com/foaf/0.1/Image']
      - [a, 'http://schema.org/ImageObject']
      - [a, 'http://schema.org/CreativeWork']
      - [a, 'http://xmlns.com/foaf/0.1/Document']
      - ['http://www.w3.org/2000/01/rdf-schema#label', $(image)]
      - ['http://schema.org/image', $(image)]
    source: dataset-source
    subject: https://data.opendatasoft.com/ld/resources/roman-emperors@public/Image/$(image)
  Person-name:
    predicateobjects:
      - [a, 'http://schema.org/Person']
      - [a, 'http://www.w3.org/2000/10/swap/pim/contact#Person']
      - [a, 'http://xmlns.com/foaf/0.1/Person']
      - [a, 'http://purl.org/dc/terms/Agent']
      - [a, 'http://purl.org/goodrelations/v1#BusinessEntity']
      - [a, 'http://rhizomik.net/ontologies/copyrightonto.owl#LegalPerson']
      - [a, 'http://schema.org/Thing']
      - [a, 'http://www.w3.org/2003/01/geo/wgs84_pos#SpatialThing']
      - [a, 'http://xmlns.com/foaf/0.1/Agent']
      - ['http://www.w3.org/2000/01/rdf-schema#label', $(name)]
    source: dataset-source
    subject: https://data.opendatasoft.com/ld/resources/roman-emperors@public/Person/$(name)
  Place-birth_cty:
    predicateobjects:
      - [a, 'http://schema.org/Place']
      - [a, 'http://purl.org/goodrelations/v1#Location']
      - [a, 'http://rdfs.co/juso/SpatialThing']
      - [a, 'http://schema.org/Thing']
      - ['http://www.w3.org/2000/01/rdf-schema#label', $(birth_cty)]
      - ['http://dbpedia.org/ontology/era', $(era)]
      - ['http://www.ontologydesignpatterns.org/ont/dul/DUL.owl#isDescribedBy', $(era)]
      - ['http://dbpedia.org/ontology/birthDate', $(birth), 'http://www.w3.org/2001/XMLSchema#datetime']
    source: dataset-source
    subject: https://data.opendatasoft.com/ld/resources/roman-emperors@public/Place/$(birth_cty)
  Place-birth_prv:
    predicateobjects:
      - [a, 'http://schema.org/Place']
      - [a, 'http://purl.org/goodrelations/v1#Location']
      - [a, 'http://rdfs.co/juso/SpatialThing']
      - [a, 'http://schema.org/Thing']
      - ['http://www.w3.org/2000/01/rdf-schema#label', $(birth_prv)]
      - ['http://dbpedia.org/ontology/deathDate', $(death), 'http://www.w3.org/2001/XMLSchema#datetime']
    source: dataset-source
    subject: https://data.opendatasoft.com/ld/resources/roman-emperors@public/Place/$(birth_prv)
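To show how such a mapping turns one row of the source dataset into triples, here is a rough sketch of a template expander. It is not the engine used in the pipeline: the `fill` and `apply_mapping` functions, the sample row and the decision to percent-encode values inside the subject IRI are assumptions. The mapping is a trimmed copy of the Person-name block, and the optional datatype (third item of a predicate-object entry) is ignored.

```python
import re
from urllib.parse import quote

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
PLACEHOLDER = re.compile(r"\$\(([^)]+)\)")  # matches $(column_name)

# Trimmed copy of the Person-name mapping, written as a Python dict.
MAPPING = {
    "subject": "https://data.opendatasoft.com/ld/resources/roman-emperors@public/Person/$(name)",
    "predicateobjects": [
        ["a", "http://schema.org/Person"],
        ["a", "http://xmlns.com/foaf/0.1/Person"],
        ["http://www.w3.org/2000/01/rdf-schema#label", "$(name)"],
    ],
}


def fill(template: str, row: dict, encode: bool = False) -> str:
    """Replace every $(column) placeholder with the value from the row."""
    def substitute(match: re.Match) -> str:
        value = str(row[match.group(1)])
        # Assumption: values placed inside an IRI are percent-encoded.
        return quote(value, safe="") if encode else value
    return PLACEHOLDER.sub(substitute, template)


def apply_mapping(mapping: dict, row: dict) -> list[tuple[str, str, str]]:
    """Expand one mapping block into (subject, predicate, object) triples."""
    subject = fill(mapping["subject"], row, encode=True)
    triples = []
    for entry in mapping["predicateobjects"]:
        predicate, obj = entry[0], entry[1]  # a third item (datatype) is ignored
        if predicate == "a":                 # "a" is shorthand for rdf:type
            triples.append((subject, RDF_TYPE, obj))
        else:
            triples.append((subject, predicate, fill(obj, row)))
    return triples


row = {"name": "Marcus Aurelius"}  # hypothetical row of the source dataset
triples = apply_mapping(MAPPING, row)
for triple in triples:
    print(triple)
```

The same entity receives several class assertions because the AI proposes candidate types from multiple ontologies; a curator would normally prune that list before the mapping is used.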
NARCIS metadata (example)
No authority linking or controlled vocabulary support, but…
Tracking Provenance information
Prov-O example from PARTHENOS project
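As a generic illustration of what a PROV-O record for a standardized dataset might contain (this is not the PARTHENOS example itself), the sketch below emits a few provenance statements as N-Triples. The `example.org` namespace, the identifiers and the `provenance_triples` helper are hypothetical; only the PROV-O class and property names come from the W3C vocabulary.

```python
PROV = "http://www.w3.org/ns/prov#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/"  # hypothetical namespace for local identifiers


def provenance_triples(dataset: str, source: str, tool: str, agent: str):
    """Describe that `dataset` was derived from `source` by one run of `tool`."""
    derived = f"{EX}dataset/{dataset}"
    original = f"{EX}dataset/{source}"
    run = f"{EX}activity/{tool}-run"
    who = f"{EX}agent/{agent}"
    return [
        (derived, RDF_TYPE, PROV + "Entity"),
        (original, RDF_TYPE, PROV + "Entity"),
        (run, RDF_TYPE, PROV + "Activity"),
        (who, RDF_TYPE, PROV + "Agent"),
        (run, PROV + "used", original),            # the run read the source
        (derived, PROV + "wasGeneratedBy", run),   # and produced the result
        (derived, PROV + "wasDerivedFrom", original),
        (run, PROV + "wasAssociatedWith", who),    # who was responsible
    ]


def to_ntriples(triples) -> str:
    """Serialize IRI-only triples as N-Triples, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


prov_doc = to_ntriples(
    provenance_triples("emperors-standardized", "emperors-raw", "cv-mapper", "dans")
)
print(prov_doc)
```

Recording the activity as its own node, and not only a direct derivation link, is what allows the tool and its maturity to be tracked alongside the data.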
Time Machine association
• large-scale project with 300+
partners
• development and support of
sustainable networked services
• trend watching and tracking of
software maturity level
• reliable governance model
• the foundation for further
innovation!
Conclusion
• development of large-scale networked services out of research
pipelines
• every service should be mature enough, maintainable and follow
a continuous integration pipeline
• tracking provenance information for every tool and dataset is the
highest priority
• creation and governance of standardization pipelines based on
services providing access to domain-specific controlled vocabularies
and ontologies
• providing access to data, metadata and provenance (processes) in the
Knowledge Graph
• further integration of services maintained by different partners and
deployed in the Cloud
Questions?
Feel free to ask questions!
Vyacheslav (Slava) Tykhonov
e-mail: vyacheslav.tykhonov@dans.knaw.nl
website: http://dans.knaw.nl (DANS-KNAW)
