LSQ: The Linked SPARQL Queries Dataset
Muhammad Saleem (1), Intizar Ali (2), Aidan Hogan (3), Qaiser Mehmood (2), Axel Ngonga (1)
(1) AKSW, University of Leipzig, Germany
(2) INSIGHT, NUIG, Ireland
(3) DCC, Universidad de Chile
International Semantic Web Conference, Bethlehem, USA, 2015
Saleem et al. (AKSW) LSQ ISWC 2015 1 / 11
LSQ: The Linked SPARQL Queries Dataset
Linked Dataset of SPARQL queries extracted from endpoint logs
DBpedia (232 million triples)
30/04/2010–20/07/2010
Linked Geo Data (1 billion triples)
24/11/2010–06/07/2011
Semantic Web Dog Food (300 thousand triples)
16/05/2014–12/11/2014
British Museum (1.4 million triples)
08/11/2014–01/12/2014
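To make the four collection periods easier to compare, the following minimal sketch converts each date range above into a number of days. It assumes the dates are written day/month/year and that both the first and the last day count; the names `LOG_PERIODS` and `period_days` are illustrative and not part of LSQ.

```python
from datetime import date

# Log collection periods as listed above, read as day/month/year (an assumption).
LOG_PERIODS = {
    "DBpedia": (date(2010, 4, 30), date(2010, 7, 20)),
    "Linked Geo Data": (date(2010, 11, 24), date(2011, 7, 6)),
    "Semantic Web Dog Food": (date(2014, 5, 16), date(2014, 11, 12)),
    "British Museum": (date(2014, 11, 8), date(2014, 12, 1)),
}


def period_days(start, end):
    """Calendar days covered by a log, counting both endpoints (an assumption)."""
    return (end - start).days + 1


for name, (start, end) in LOG_PERIODS.items():
    print(f"{name}: {period_days(start, end)} days")
```

Under these assumptions the periods differ widely in length, which is worth keeping in mind when comparing per-dataset query counts.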
LSQ Data Model
LSQ Statistics
90% of the agents issue fewer than 3% of the queries
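One reading of this statistic is that the 90% least active agents together account for less than 3% of all logged queries. The sketch below shows how such a share could be computed from a list holding one agent identifier per logged query. The function name, the `agent_percent` parameter, and the toy log are hypothetical; they are not taken from LSQ or its data.

```python
from collections import Counter


def share_of_least_active(agent_per_query, agent_percent=90):
    """Fraction of all queries issued by the least active `agent_percent`% of agents."""
    if not agent_per_query:
        raise ValueError("the log must contain at least one query")
    # Queries per agent, sorted from least to most active.
    counts = sorted(Counter(agent_per_query).values())
    # Number of agents in the low-activity group (integer arithmetic, rounded down).
    n_agents = len(counts) * agent_percent // 100
    return sum(counts[:n_agents]) / sum(counts)


# Hypothetical log: nine agents issue one query each, one agent issues 291.
toy_log = [f"agent{i}" for i in range(9)] + ["heavy_agent"] * 291
print(share_of_least_active(toy_log))
```

In the toy log, nine of ten agents contribute 9 of 300 queries, which illustrates how a handful of heavy agents can dominate a query log.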
Use-Cases
Custom Benchmarks
SPARQL Adoption
Caching
Usability
Meta-Querying
Conclusion and Future Work
First Linked Dataset of real-world SPARQL queries
5.7 million query executions, 73 million triples
90% of the agents issue fewer than 3% of the queries
LSQ is available from http://aksw.github.io/LSQ/
Add more logs, e.g., Bioportal, Strabon
Update current logs (esp. DBpedia)
Link to the benchmark generation framework FEASIBLE (http://feasible.aksw.org/)
