Wikidata as a hub for the linked data cloud
TUTORIAL AT DCMI CONFERENCE, SEOUL, 2019-09-25
Tom Baker, Joachim Neubert, Andra Waagmeester
Slides (partially) at https://jneubert.github.io/wd-dcmi2019/#/
Overview
Part 1: Using and querying Wikidata
Part 2: Wikidata as a linking hub
Part 3: Applications based on Wikidata
Part 4: Wikidata usage scenarios
Scenarios
Intro and details
Hands-on: Mix-n-match
Quality control tools and procedures
Wikidata community
Wikidata as linking hub
The idea of linking hubs
Connect concepts via identifiers/URLs
Existing hubs: VIAF, sameAs.org, ...
Image by Jakob Voss
Different linking properties
1. exact match (datatype URL)
generic link to URL in the meaning of skos:exactMatch
2. Pxxxx: more than 4000 specialized properties (datatype external identifier)
Examples for external identifiers
GND / VIAF identifiers
geographical entities
proteins
Swedish cultural heritage objects
African plants
baseball players
TED conference speakers
Property definitions
start at the property page, e.g., for the ISSN: https://www.wikidata.org/wiki/Property:P236
subject item for the property
examples
constraints on values, cardinality, etc.
formatter URL: creates a clickable link for the ID
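To illustrate how a formatter URL turns a stored identifier into a clickable link, here is a minimal sketch. It assumes the formatter URL contains the placeholder $1 (as in the GeoNames patterns shown later in this deck); the example.org pattern and the percent-encoding step are assumptions for illustration, not a description of how Wikidata itself renders links.

```python
from urllib.parse import quote


def expand_formatter_url(formatter_url: str, identifier: str) -> str:
    """Replace the $1 placeholder of a formatter URL with an identifier."""
    if "$1" not in formatter_url:
        raise ValueError("formatter URL must contain the $1 placeholder")
    # Percent-encode unsafe characters; keeping "/" and ":" is an assumption.
    return formatter_url.replace("$1", quote(identifier, safe="/:"))


# Hypothetical formatter URL, used for illustration only.
EXAMPLE_FORMATTER = "https://example.org/id/$1"
print(expand_formatter_url(EXAMPLE_FORMATTER, "0378-5955"))
```

The same function works for a "formatter URI for RDF resource", since both follow the $1 convention.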
Property Documentation
Beyond sameness - mapping relations
Wikidata external ids imply "sameness" of linked concepts
even with geographic names, other mapping relations are required in some cases
examples:
close matches, e.g., "Yugoslavia" (1918-1992) (Wikidata) ≅ "Yugoslavia (until 1990)" (STW)
related matches, e.g., a company and its founder
Mapping relation type (P4390)
introduced after a community discussion in October 2017
to be used as qualifier for external id entries
fixed value set - SKOS mapping relations (exact, close, broad, narrow, related match)
EXAMPLE AT ITEM ASSESSMENT CENTER
How does that relate to the Linked Data model?
Internal data model and storage (Wikibase) is transformed to RDF for:
RDF dumps
Query Service
RDF linking from Wikidata
formatter URI for RDF resource: linked data URI
e.g., http://sws.geonames.org/$1/ (vs. formatter URL https://www.geonames.org/$1)
List of 130+ relationships to external RDF datasets
26+ million linked external RDF resources
plus ~950,000 exact match relations to individual URIs
Links in the RDF dumps
Output has full URLs to external resources, however with Wikidata-specific properties:
wd:Q123 wdt:P234 "External-ID" ;
        wdtn:P234 <http://example.com/reference/External-ID> .
This creates a hurdle for generic Linked Data browsers and tools - not even exact match is translated to skos:exactMatch
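A consumer of the dump could work around this by re-expressing the normalized links as skos:exactMatch triples. The sketch below uses plain tuples instead of an RDF library and reuses the placeholder identifiers Q123 and P234 from the snippet above. Treating every wdtn: link as an exact match is a simplifying assumption: it ignores any mapping relation type qualifier that may say otherwise.

```python
# Namespace of the "direct normalized" properties in the Wikidata RDF export.
WDTN = "http://www.wikidata.org/prop/direct-normalized/"
SKOS_EXACT_MATCH = "http://www.w3.org/2004/02/skos/core#exactMatch"


def to_skos_exact_match(triples):
    """Yield a skos:exactMatch triple for every triple with a wdtn: predicate."""
    for subject, predicate, obj in triples:
        if predicate.startswith(WDTN):
            yield (subject, SKOS_EXACT_MATCH, obj)


# Placeholder data mirroring the Turtle snippet (Q123 and P234 are examples).
triples = [
    ("http://www.wikidata.org/entity/Q123",
     "http://www.wikidata.org/prop/direct/P234",
     "External-ID"),
    ("http://www.wikidata.org/entity/Q123",
     WDTN + "P234",
     "http://example.com/reference/External-ID"),
]

for s, p, o in to_skos_exact_match(triples):
    print(f"<{s}> <{p}> <{o}> .")
```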
Federated SPARQL queries
Example use case: The GND authority file has information about the professions/occupations of people which is not known in Wikidata.
So get that information dynamically from a GND SPARQL endpoint.
Here, we are interested in economists, in particular.
From Wikidata to a remote endpoint: query to WDQS
From a remote endpoint to Wikidata: query to GND endpoint <== not working currently
Several points for attention
Direction and sequence of statements often matter for performance
To reach out from Wikidata, endpoints have to be approved (full list)
In the other direction, access is normally not restricted
Some federated queries get extremely slow when large sets of bindings exist before the remote service is invoked
be sure to exclude variables bound to blank nodes ('unknown value' in Wikidata)
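The points above can be sketched as a small query builder. It is only an illustration under stated assumptions: the remote endpoint URL and the base URI used to turn a GND ID into a resource URI are parameters the caller must supply, the remote pattern is deliberately generic (?gndUri ?p ?o) because the vocabulary of the remote dataset is not given here, and whether the sub-select is really evaluated before the SERVICE call depends on the query engine.

```python
def build_federated_query(remote_endpoint: str, uri_base: str, limit: int = 10) -> str:
    """Build a SPARQL query for WDQS that reaches out to a remote endpoint.

    remote_endpoint and uri_base are supplied by the caller (assumptions).
    """
    return f"""PREFIX wdt: <http://www.wikidata.org/prop/direct/>

SELECT ?item ?gndUri ?p ?o WHERE {{
  # Keep the set of bindings small before the remote service is invoked.
  {{
    SELECT ?item ?gndUri WHERE {{
      ?item wdt:P227 ?gndId .
      # Exclude 'unknown value' entries, which show up as blank nodes.
      FILTER(!isBlank(?gndId))
      BIND(IRI(CONCAT("{uri_base}", ?gndId)) AS ?gndUri)
    }}
    LIMIT {limit}
  }}
  SERVICE <{remote_endpoint}> {{
    ?gndUri ?p ?o .
  }}
}}
"""


# Hypothetical endpoint and URI base, for illustration only.
print(build_federated_query("https://example.org/sparql", "https://example.org/gnd/", limit=5))
```

The local part is wrapped in a sub-select with a LIMIT so that only a handful of bindings are passed on; for real use, the generic remote pattern would be replaced by the specific property of interest.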
Further reading on Wikidata/RDF
RDF dump format (documentation)
Waagmeester: Integrating Wikidata and other linked data sources - Federated SPARQL queries (more examples)
Malyshev et al.: Semantic Technology Usage in Wikipedia’s Knowledge Graph
Critical comments/suggestions: Freire/Isaac: Technical usability of Wikidata's linked data
Application process for a new property
Double-check that the property does not already exist
Prepare a property proposal in the appropriate section, e.g., Wikidata:Property proposal/Authority control
Hints for getting it approved smoothly
Clearly lay out the motivation and planned use for the property
Provide working examples (with the formatter URI you are suggesting)
Be responsive to comments
Wikidata as a universal linking hub
easy extensibility with new properties for external identifiers
immense stock of existing items, with the full set of SKOS mapping relations for more or less exact mappings to these
immediate extensibility with new items
Linking content via Mix-n-Match
Mix-n-match is a widely used tool (by Magnus Manske) to link external databases, catalogs, etc. to existing Wikidata items (or to create new ones).
Example list: Newspapers and journals from the 20th Century Press Archive
Please navigate to our example catalog
Mix-n-match manual
Tasks
Login through Widar
In "Automatically matched" list:
Connect matching items
Remove non-matching entries
In "Unmatched" list:
Search for existing items
Create missing items (suggested properties)
Supplementary material
Item creation "on the go"
With Mix-n-match "New item": rudimentary, no references
Custom list of prepared QuickStatements insert blocks (example from STW Thesaurus for Economics - please don't mess with it, this is work in progress)
Workflow-wise, use the same sequence for M-n-m input and prepared insert blocks
Recommendations for item creation
Pay attention to Wikidata's notability criteria (much more relaxed than Wikipedia's)
Explain your plan and ask for feedback in the Wikidata project chat
Apply for a bot account to make mass edits (example)
Source every statement (hints)
Create input in QuickStatements text format
Check with a few statements, verify result
Run as batch, document input and batch URL
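As an illustration of preparing sourced input in QuickStatements text format, the following sketch generates one tab-separated insert block for a new item. It assumes the version 1 command syntax (CREATE, LAST, Len/Den for English label and description, and an S-prefixed property for the source, here S854 for a reference URL). The label, description, reference URL and the quoted identifier value are placeholders, and labels containing double quotes are not handled.

```python
def quickstatements_block(label_en, description_en, statements, reference_url):
    """Return a QuickStatements (V1, tab-separated) block creating one item.

    statements is a list of (property, value) pairs; values must already be
    formatted (item ids bare, strings in double quotes).
    """
    lines = ["CREATE"]
    lines.append(f'LAST\tLen\t"{label_en}"')
    lines.append(f'LAST\tDen\t"{description_en}"')
    for prop, value in statements:
        # Every statement carries its own source (reference URL).
        lines.append(f'LAST\t{prop}\t{value}\tS854\t"{reference_url}"')
    return "\n".join(lines)


# Placeholder values for illustration only.
block = quickstatements_block(
    "Jane Doe",
    "example person",
    [("P31", "Q5"), ("P227", '"EXAMPLE-ID"')],
    "https://example.org/source",
)
print(block)
```

Generating the blocks in the same order as the Mix-n-match input keeps both workflows aligned; running only the first block is an easy way to check with a few statements before starting a batch.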
Matching from WD to the external database entries
Normally requires an endpoint for the external source, where you can search for the labels, aliases or other data of Wikidata items
Insert statements for the external id can be prepared for copy and paste or even semi-automatic execution in QuickStatements
Some hints and linked code here
Import catalog data to Mix-n-Match
Prepare data
... as tab-separated table (one line per record) with three columns
1. identifier
2. name
3. description
Input file for the example used earlier
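A minimal sketch of producing such a three-column file is shown below. The record content is invented for illustration; collapsing tabs and line breaks inside a field into single spaces is an assumption made here to keep exactly one line per record.

```python
def catalog_to_tsv(records):
    """Serialize (identifier, name, description) records as tab-separated lines."""
    lines = []
    for identifier, name, description in records:
        # Collapse tabs, newlines and repeated spaces inside each field.
        fields = [" ".join(str(f).split()) for f in (identifier, name, description)]
        if not fields[0] or not fields[1]:
            raise ValueError("identifier and name must not be empty")
        lines.append("\t".join(fields) + "\n")
    return "".join(lines)


# Invented example records; put the most used entries first.
records = [
    ("id-001", "Example Journal", "newspaper, Hamburg, 1900-1930"),
    ("id-002", "Another  Journal", "weekly\tfounded 1921"),
]
print(catalog_to_tsv(records), end="")
```

Since the order of lines is preserved, sorting the records before serializing is a simple way to structure the later matching work.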
Pay attention to
description column: include everything useful for intellectual identification
order: the sequence may help structuring your workflow (e.g., most used entries first)
Load data via web interface
... at https://tools.wmflabs.org/mix-n-match/import.php
Sync existing ids from Wikidata
Quality control tools and procedures
Perception: Anybody can edit anything - so Wikidata is not a reliable source of knowledge
Seen as a threat for information systems based on Wikidata
particularly by some large Wikipedias (e.g., the English one)
Basic policy to address this: Statements should be referenced
QA support for editors
Constraint definitions for properties
raise warnings during data input when, e.g.,
a format definition (ISBN, DOI etc.) is violated
a supposedly unique identifier is added to more than one item
generated lists of constraint violations (e.g., ZDB ID format)
Constraints can be very helpful, but do not cover complex cases
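The two kinds of checks named above, a format definition and a uniqueness rule, can be sketched outside of Wikidata as follows. The ISSN-like regular expression and the item names are assumptions chosen for illustration; the actual constraint definitions live on the respective property pages.

```python
import re
from collections import defaultdict

# Assumed ISSN-like pattern: four digits, hyphen, three digits, digit or X.
ISSN_PATTERN = re.compile(r"\d{4}-\d{3}[\dX]")


def violates_format(value, pattern=ISSN_PATTERN):
    """Return True if the value does not match the format pattern as a whole."""
    return pattern.fullmatch(value) is None


def shared_identifiers(statements):
    """Return identifiers that are attached to more than one item."""
    items_by_id = defaultdict(set)
    for item, identifier in statements:
        items_by_id[identifier].add(item)
    return {i: sorted(items) for i, items in items_by_id.items() if len(items) > 1}


# Placeholder statements: (item, identifier value).
statements = [("item-1", "0378-5955"), ("item-2", "0378-5955"), ("item-3", "03785955")]
print([value for _, value in statements if violates_format(value)])
print(shared_identifiers(statements))
```

Checks of this kind only look at single values or pairs of statements, which is why more complex cases need reports built with SPARQL or shape-based validation.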
More QA support for editors
Additional reports can be created via SPARQL queries
Shape Expressions (ShEx) allow defining complex constraints and conformance checks
ShEx Primer
How to get started with Shape Expressions in Wikidata?
Revision control and patrolling
Versioned edits and version control
Manual and tool-supported vandalism prevention
Watchlists
Automated flagging of suspect edits (e.g., "new editor deleting statements")
Technically very easy to revert edits
Semi-protection or protection of frequently vandalized items
Patrolling
Automated tools for vandalism detection
Fighting to keep up with the rate of human edits in Wikidata (multiple per second)
... requires reducing the manual workload, e.g. via
Wikidata Abuse Filter
Objective Revision Evaluation Service (ORES)
and other rule-based and machine-learning tools
Ongoing research
Heindorf et al.: Vandalism Detection in Wikidata
Sarabadani et al.: Building automated vandalism detection tools for Wikidata
The Wikidata community
Everybody can participate
No central "committee" or decision structure
Decisions are made via discussion and community consensus
Project chat
Main entry point for all kinds of discussions
Resolved discussions archived after 7 days - searchable
Beginner's questions welcome (but please try to find the answer online first, particularly in the FAQ, which has a search link to the help pages)
Compared to Wikipedia, the overall atmosphere is constructive (though exceptions sometimes exist in some sub-communities)
English is the lingua franca, but a few questions show up in other languages, and receive comments, too
User page and user talk page
Introduce yourself - especially if you work in a professional context with Wikidata (example)
Activate notifications to your email address
Be responsive to comments on your talk page
You can address other users on their talk page, too
Talk pages of properties
Questions on the use of a certain property
Suggestions for changes or enhancements of the property definition (example)
Consider adding properties you are interested in to your watchlist
WikiProjects
Current WikiProjects on Wikidata
Often a great source to find documentation about the community consensus in certain fields
Many WikiProjects pages contain data structuring recommendations - see, e.g., for periodicals
Thank you - questions welcome!
Joachim Neubert
j.neubert@zbw.eu
Jneubert on WD
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
Joachim Neubert
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata
Joachim Neubert
 
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsMaking Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
Joachim Neubert
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through Wikidata
Joachim Neubert
 
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Joachim Neubert
 
Wikidata as authority linking hub
Wikidata as authority linking hubWikidata as authority linking hub
Wikidata as authority linking hub
Joachim Neubert
 
EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)
Joachim Neubert
 
Using Wikidata as an Authority for the SowiDataNet Research Data Repository
Using Wikidata as an Authority for the SowiDataNet Research Data RepositoryUsing Wikidata as an Authority for the SowiDataNet Research Data Repository
Using Wikidata as an Authority for the SowiDataNet Research Data Repository
Joachim Neubert
 
Change Tracking in Knowledge Organization Systems with skos-history
Change Tracking in Knowledge Organization Systems with skos-historyChange Tracking in Knowledge Organization Systems with skos-history
Change Tracking in Knowledge Organization Systems with skos-history
Joachim Neubert
 
Anforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebAnforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic Web
Joachim Neubert
 
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for EconomicsLeveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Leveraging SKOS to trace the overhaul of the STW Thesaurus for Economics
Joachim Neubert
 
skos-history: Tracking the evolution of Knowledge Organization Systems
skos-history: Tracking the evolution of Knowledge Organization Systemsskos-history: Tracking the evolution of Knowledge Organization Systems
skos-history: Tracking the evolution of Knowledge Organization Systems
Joachim Neubert
 
KOS evolution in Linked Data
KOS evolution in Linked DataKOS evolution in Linked Data
KOS evolution in Linked Data
Joachim Neubert
 
Wikidata as a hub for the linked data cloud

  • 1. Wikidata as a hub for the linked data cloud. Tutorial at DCMI Conference, Seoul, 2019-09-25. Tom Baker, Joachim Neubert, Andra Waagmeester. Slides (partially) at https://jneubert.github.io/wd-dcmi2019/#/
  • 2. Overview: Part 1: Using and querying Wikidata. Part 2: Wikidata as a linking hub. Part 3: Applications based on Wikidata. Part 4: Wikidata usage scenarios. Scenarios; Intro and details; Hands-on: Mix-n-match; Quality control tools and procedures; Wikidata community.
  • 3. Wikidata as linking hub
  • 4. The idea of linking hubs: Connect concepts via identifiers/URLs. Existing hubs: VIAF, sameAs.org, ... (Image by Jakob Voss)
  • 5. Different linking properties: 1. exact match (datatype URL): generic link to a URL, in the meaning of skos:exactMatch. 2. Pxxxx: more than 4000 specialized properties (datatype external identifier).
  • 6. Examples for external identifiers: GND / VIAF identifiers; geographical entities; proteins; Swedish cultural heritage objects; African plants; baseball players; TED conference speakers.
  • 8. Property definitions: subject item for the property; examples; constraints on values, cardinality, etc.; formatter URL: creates a clickable link for the ID. Start at the property page, e.g., for the ISSN: https://www.wikidata.org/wiki/Property:P236
  • 11. Beyond sameness - mapping relations: Wikidata external ids imply "sameness" of linked concepts. Even with geographic names, other mapping relations are required in some cases. Examples: close matches, e.g., "Yugoslavia" (1918-1992) (Wikidata) ≅ "Yugoslavia (until 1990)" (STW); related matches, e.g., a company and its founder.
  • 12. Mapping relation type (P4390): introduced after a community discussion in October 2017; to be used as a qualifier for external id entries; fixed value set: the SKOS mapping relations (exact, close, broad, narrow, related match).
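How the qualifier could be interpreted by a consumer can be sketched in a few lines of Python. This is only an illustration: the function name, the label-keyed lookup table, and the sample URIs are assumptions made here, and treating an unqualified external id as an exact match is one possible reading of the "sameness" implied by external ids.

```python
# Sketch: derive a SKOS mapping triple from an external-id statement.
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Labels of the fixed P4390 value set -> SKOS mapping properties.
RELATION_TO_SKOS = {
    "exact match": SKOS + "exactMatch",
    "close match": SKOS + "closeMatch",
    "broad match": SKOS + "broadMatch",
    "narrow match": SKOS + "narrowMatch",
    "related match": SKOS + "relatedMatch",
}


def mapping_triple(item_uri, external_uri, relation_label=None):
    """Return (subject, predicate, object) for one external-id statement.

    Assumption: a statement without a P4390 qualifier is treated as exact match.
    Unknown labels raise a KeyError.
    """
    label = "exact match" if relation_label is None else relation_label
    return (item_uri, RELATION_TO_SKOS[label], external_uri)


# Hypothetical item and external concept URIs, for illustration only.
print(mapping_triple(
    "http://www.wikidata.org/entity/Q123",
    "http://example.com/reference/External-ID",
    "close match",
))
```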
  • 13. Example at item: Assessment center
  • 14. How does that relate to the Linked Data model? The internal data model and storage (Wikibase) is transformed to RDF for: RDF dumps; Query Service.
  • 15. RDF linking from Wikidata: formatter URI for RDF resource: linked data URI, e.g., http://sws.geonames.org/$1/ (vs. formatter URL https://www.geonames.org/$1). List of 130+ relationships to external RDF datasets: 26+ million linked external RDF resources, plus ~950,000 exact match relations to individual URIs.
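The difference between the two formatter patterns can be shown with a small Python sketch. The helper name and the sample identifier are assumptions for illustration; the two templates are the GeoNames patterns quoted on the slide.

```python
def expand_formatter(template, external_id):
    """Substitute the external id for the $1 placeholder of a formatter pattern."""
    return template.replace("$1", external_id)


# Patterns as quoted on the slide.
FORMATTER_URL = "https://www.geonames.org/$1"       # clickable web page
FORMATTER_URI_RDF = "http://sws.geonames.org/$1/"   # linked data URI

sample_id = "1835848"  # arbitrary sample identifier, used for illustration only

print(expand_formatter(FORMATTER_URL, sample_id))
print(expand_formatter(FORMATTER_URI_RDF, sample_id))
```

Both values are built from the same stored identifier; only the second is meant to identify an RDF resource.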
  • 16. Links in the RDF dumps: Output has full URLs to external resources, however with Wikidata-specific properties: wd:Q123 wdt:P234 "External-ID" ; wdtn:P234 <http://example.com/reference/External-ID> This creates a hurdle for generic Linked Data browsers and tools - not even exact match is translated to skos:exactMatch.
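A consumer who wants generic SKOS links has to do the translation locally. The following Python sketch shows one way, under the assumption that triples are available as plain (subject, predicate, object) tuples of full URIs; the function name is made up here. It deliberately treats every normalized external-id link as an exact match, so it ignores any mapping relation type qualifiers.

```python
# Namespace of the wdtn: prefix (normalized external-id values as URIs).
WDTN = "http://www.wikidata.org/prop/direct-normalized/"
SKOS_EXACT = "http://www.w3.org/2004/02/skos/core#exactMatch"


def wdtn_to_skos(triples):
    """Keep only wdtn: links and emit them as N-Triples lines with skos:exactMatch."""
    lines = []
    for subject, predicate, obj in triples:
        if predicate.startswith(WDTN):
            lines.append(f"<{subject}> <{SKOS_EXACT}> <{obj}> .")
    return lines


# The example statement from the slide, with prefixes expanded.
dump = [
    ("http://www.wikidata.org/entity/Q123",
     "http://www.wikidata.org/prop/direct/P234",
     "External-ID"),
    ("http://www.wikidata.org/entity/Q123",
     WDTN + "P234",
     "http://example.com/reference/External-ID"),
]
for line in wdtn_to_skos(dump):
    print(line)
```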
  • 17. Federated SPARQL queries: Example use case: The GND authority file has information about the professions/occupations of people which is not known in Wikidata. So get that information dynamically from a GND SPARQL endpoint. Here, we are interested in economists in particular.
  • 18. From Wikidata to a remote endpoint: query to WDQS. From a remote endpoint to Wikidata: query to GND endpoint (not working currently).
  • 19. Several points for attention: Direction and sequence of statements often matter for performance. To reach out from Wikidata, endpoints have to be approved (full list). In the other direction, access is normally not restricted. Some federated queries get extremely slow when large sets of bindings exist before the remote service is invoked. Be sure to exclude variables bound to blank nodes ('unknown value' in Wikidata).
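The shape of such a federated query can be sketched as a Python function that assembles the SPARQL text. This is not the query used in the tutorial: the endpoint URL and the occupation URI are hypothetical placeholders, and the use of wdtn:P227 (GND ID) and gndo:professionOrOccupation is an assumption about how the two datasets could be joined. The SERVICE block is placed first so that the remote, presumably smaller, result set is produced before the Wikidata join, and a blank-node filter illustrates the last point above.

```python
def federated_query(remote_endpoint, occupation_uri, limit=10):
    """Build a SPARQL query (to be run at WDQS) that reaches out to a remote endpoint.

    Both arguments are supplied by the caller; nothing here is a known, working value.
    """
    return f"""PREFIX wdtn: <http://www.wikidata.org/prop/direct-normalized/>
PREFIX gndo: <https://d-nb.info/standards/elementset/gnd#>
SELECT ?item ?gnd WHERE {{
  SERVICE <{remote_endpoint}> {{
    ?gnd gndo:professionOrOccupation <{occupation_uri}> .
  }}
  ?item wdtn:P227 ?gnd .
  FILTER(!isBlank(?gnd))
}}
LIMIT {limit}"""


# Hypothetical endpoint and occupation URI, for illustration only.
print(federated_query("https://example.org/gnd/sparql",
                      "https://example.org/occupation/economist"))
```

Whether the join succeeds in practice depends on the remote endpoint being approved and on both sides using identical URIs for the same GND record.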
  • 20. Further reading on Wikidata/RDF: RDF dump format (documentation). Waagmeester: Integrating Wikidata and other linked data sources - Federated SPARQL queries (more examples). Malyshev et al.: Semantic Technology Usage in Wikipedia's Knowledge Graph. Critical comments/suggestions: Freire/Isaac: Technical usability of Wikidata's linked data.
  • 21. Application process for a new property: Double-check that the property does not already exist. Prepare a property proposal in the appropriate section, e.g., Wikidata:Property proposal/Authority control.
  • 23. Hints for getting it approved smoothly: Clearly lay out the motivation and planned use for the property. Provide working examples (with the formatter URI you are suggesting). Be responsive to comments.
  • 24. Wikidata as a universal linking hub: easy extensibility with new properties for external identifiers; immense fund of existing items, with the full set of SKOS mapping relations for more or less exact mappings to these; immediate extensibility with new items.
  • 25. Linking content via Mix-n-Match
  • 26. Mix-n-match is a widely used tool (by Magnus Manske) to link external databases, catalogs, etc. to existing Wikidata items (or to create new ones).
  • 27. Example list: Newspapers and journals from the 20th Century Press Archive
  • 32. Please navigate to our example catalog. See also: Mix-n-match manual.
  • 33. Tasks: Login through Widar. In the "Automatically matched" list: connect matching items; remove non-matching entries. In the "Unmatched" list: search for existing items; create missing items (suggested properties).
  • 35. Item creation "on the go": With Mix-n-match "New item": rudimentary, no references. Custom list of prepared QuickStatements insert blocks (example from STW Thesaurus for Economics - please don't mess with it, this is work in progress). Workflow-wise, use the same sequence for M-n-m input and prepared insert blocks.
  • 36. Recommendations for item creation: Pay attention to Wikidata's notability criteria (much more relaxed than Wikipedia's). Explain your plan and ask for feedback in the Wikidata project chat. Apply for a bot account to make mass edits (example). Source every statement (hints). Create input in QuickStatements text format. Check with a few statements, verify result. Run as batch, document input and batch URL.
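Preparing such input can be scripted. The sketch below generates a QuickStatements insert block in the tab-separated text format (CREATE, then LAST lines, with a source given as an S-prefixed property). The function name and all identifiers in the usage example are placeholders chosen for illustration, and the reference item in particular is hypothetical.

```python
def quickstatements_block(label_en, statements, source):
    """Build one insert block for a new item.

    statements: list of (property, value) pairs; string values must already be
                wrapped in double quotes, item values are plain Q-ids.
    source:     (source property such as "S248", value) appended to every
                statement, so that each one is referenced.
    """
    lines = ["CREATE", "\t".join(["LAST", "Len", f'"{label_en}"'])]
    for prop, value in statements:
        lines.append("\t".join(["LAST", prop, value, source[0], source[1]]))
    return "\n".join(lines)


# Placeholder values: Q99999999 stands in for the item describing the source.
block = quickstatements_block(
    "Example person",
    [("P31", "Q5"), ("P227", '"123456789"')],
    ("S248", "Q99999999"),
)
print(block)
```

Pasting a single block first and checking the resulting item corresponds to the "check with a few statements" step before running the full batch.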
  • 37. Matching from WD to the external database entries
  • 39. Normally requires an endpoint for the external source, where you can search for the labels, aliases or other data of Wikidata items. The insert statement for the external id into Wikidata can be prepared for cut&paste or even semi-automatic execution in QuickStatements. Some hints and linked code here.
  • 40. Import catalog data to Mix-n-Match
  • 41. Prepare data ... as a tab-separated table (one line per record) with three columns: 1. identifier, 2. name, 3. description. Input file for the example used earlier.
  • 42. Pay attention to: description column: include everything useful for intellectual identification; order: the sequence may help structuring your workflow (e.g., most used entries first).
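The preparation step can be sketched in Python as follows. The record fields ("id", "name", "description", "uses") and the sample records are assumptions made for this illustration; only the three-column, tab-separated layout and the "most used entries first" ordering come from the slides.

```python
def clean(value):
    """Collapse tabs, newlines and repeated spaces so a field stays in one cell."""
    return " ".join(str(value).split())


def to_mixnmatch_tsv(records):
    """Return the import table: identifier, name, description, one line per record.

    Records are ordered by a hypothetical usage count, most used first.
    """
    ordered = sorted(records, key=lambda r: r.get("uses", 0), reverse=True)
    rows = (
        "\t".join(clean(r[key]) for key in ("id", "name", "description"))
        for r in ordered
    )
    return "\n".join(rows) + "\n"


# Made-up sample records, for illustration only.
sample = [
    {"id": "A1", "name": "Journal One", "description": "weekly,\tcity A", "uses": 3},
    {"id": "B2", "name": "Paper Two", "description": "daily, city B, 1900-1930", "uses": 12},
]
print(to_mixnmatch_tsv(sample), end="")
```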
  • 43. Load data via web interface ... at https://tools.wmflabs.org/mix-n-match/import.php
  • 48. Sync existing ids from Wikidata
  • 52. Quality control tools and procedures: Perception: Anybody can edit anything - so Wikidata is no reliable source of knowledge. Seen as a threat for information systems based on Wikidata, particularly by some large Wikipedias (e.g., the English one). Basic policy to address this: Statements should be referenced.
  • 53. QA support for editors: Constraint definitions for properties raise warnings during data input when, e.g., a format definition (ISBN, DOI, etc.) is violated, or a supposedly unique identifier is added to more than one item. Generated lists of constraint violations (e.g., ZDB ID format). Constraints can be very helpful, but do not cover complex cases.
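What the two constraint types check can be illustrated offline with a short Python sketch. It does not reproduce Wikidata's constraint machinery: the function name, the data layout, and the ISSN pattern (four digits, a hyphen, three digits, then a digit or X) are assumptions chosen for the example.

```python
import re
from collections import defaultdict

# Assumed pattern for an ISSN-like value; a real property defines its own format.
ISSN_FORMAT = re.compile(r"\d{4}-\d{3}[\dX]")


def check_constraints(statements):
    """Check (item, value) pairs against a format and a uniqueness constraint.

    Returns (format_violations, uniqueness_violations):
    - format_violations: list of (item, value) whose value does not match the pattern
    - uniqueness_violations: dict mapping a value to the sorted items sharing it
    """
    format_violations = [
        (item, value) for item, value in statements
        if not ISSN_FORMAT.fullmatch(value)
    ]
    items_by_value = defaultdict(set)
    for item, value in statements:
        items_by_value[value].add(item)
    uniqueness_violations = {
        value: sorted(items)
        for value, items in items_by_value.items()
        if len(items) > 1
    }
    return format_violations, uniqueness_violations


# Made-up item ids and values, for illustration only.
data = [("Q1", "1234-5678"), ("Q2", "1234-5678"), ("Q3", "12345678")]
print(check_constraints(data))
```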
  • 54. More QA support for editors: Additional reports can be created via SPARQL queries. Shape Expressions (ShEx) allow defining complex constraints and conformance checks. ShEx Primer. How to get started with Shape Expressions in Wikidata?
  • 55. Revision control and patrolling: Versioned edits and version control. Manual and tool-supported vandalism prevention: watchlists; automated flagging of suspect edits (e.g., "new editor deleting statements"); technically very easy to revert edits; semi-protection or protection of often-vandalized items; patrolling.
  • 57. Automated tools for vandalism detection: Fighting to keep up with the rate of human edits in Wikidata (multiple per second) ... requires reducing the manual workload, e.g., via the Wikidata Abuse Filter, the Objective Revision Evaluation Service (ORES), and other rule-based and machine-learning tools.
  • 58. Ongoing research: Heindorf et al.: Vandalism Detection in Wikidata. Sarabadani et al.: Building automated vandalism detection tools for Wikidata.
  • 59. The Wikidata community: Everybody can participate. No central "committee" or decision structure. Decisions are made via discussion and community consensus.
  • 60. Project chat: Main entry point for all kinds of discussions. Resolved discussions are archived after 7 days (searchable). Beginner's questions are welcome (but please try to find the answer online first, particularly in the FAQ, which has a search link to the help pages). Compared to Wikipedia, the overall atmosphere is constructive (though exceptions sometimes exist in some sub-communities). English is the lingua franca, but a few questions show up in other languages, and receive comments, too.
  • 61. User page and user talk page: Introduce yourself, especially if you work in a professional context with Wikidata (example). Activate notifications to your email address. Be responsive to comments on your talk page. You can address other users on their talk page, too.
  • 62. Talk pages of properties: Questions on the use of a certain property. Suggestions for changes or enhancements of the property definition (example). Consider adding properties you are interested in to your watchlist.
  • 65. Often a great source to find documentation about the community consensus in certain fields. Many WikiProjects pages contain data structuring recommendations; see, e.g., for periodicals. Current WikiProjects on Wikidata.
  • 66. Thank you - questions welcome! Joachim Neubert, Jneubert on WD