SlideShare a Scribd company logo
WTF is the Semantic Web and
         Linked Data
         Juan F. Sequeda
  Department of Computer Science
    University of Texas at Austin
           Nov 17, 2011
Semantic Web? Linked Data?

          WTF?
WTF is the Semantic Web?
WTF is the Semantic Web?
Internet != Web
WTF is the Semantic Web and Linked Data
What is the Web?

“… the Web, is a system of
interlinked hypertext documents
accessed via the Internet. With a
web browser, one can view web
pages that may contain text,
images […] and navigate between
them via hyperlinks”
             http://en.wikipedia.org/wiki/World_Wide_Web
Current Web = internet + links + docs
History of the Web
•   Created by Tim Berners-Lee at CERN in 1989
•   Mosaic browser in 1993
•   W3C created in 1994
•   Exponential growth mid 90s
•   Amazon, Ebay – 1995
•   Search engines – Google 1998
•   Dot-com boom 1997 – 2001
•   Web 2.0 – blogs, Facebook, Twitter, etc
What is the problem?
WHAT’S THE
                                                  WEATHER IN
                                                 AUSTIN TODAY?




http://www.flickr.com/photos/jamieca/31631256/
WTF is the Semantic Web and Linked Data
WTF is the Semantic Web and Linked Data
WTF is the Semantic Web and Linked Data
What is the problem?
• The web is full of documents
• We aren’t always interested in documents
  – We are interested in THINGS
  – These THINGS might be in documents
• We can read a HTML document rendered in a
  browser and find what we are searching for
  – This is hard for computers.
  – Computers have to guess (even though they are
    pretty good at it)
The Web of Documents
     Search



Search
Engine


Crawler
The Web is a Data Shredder




Structured                 Unstructured
   Data                        Data

                    Thanks Martin Hepp
What would we like?
• Make it easy for computers/software to find
  THINGS



  Do you SEARCH or do you FIND?
Search for

   Football Players who went to the
University of Texas at Austin, played for
  the Dallas Cowboys as Cornerback
WTF is the Semantic Web and Linked Data
WTF is the Semantic Web and Linked Data
WTF is the Semantic Web and Linked Data
Why can’t we just FIND it…
WTF is the Semantic Web and Linked Data
WTF is the Semantic Web and Linked Data
Guess how I FOUND out?
On a Semantic Web
• Besides publishing documents on the web
  – which computers can’t understand easily
• Let’s publish on the web something that
  computers can understand



                DATA
The Semantic Web is a
     web of data
   The current web is a
    web of documents
But wait… doesn’t the web
    already have data?
Current Data on the Web
•   Relational Databases
•   APIs
•   XML
•   CSV
•   XLS
•   …
•   Can’t computers and applications already
    consume that data on the web?
Yes! But it is all in different
formats and data models!
This makes it hard to integrate
             data
The data in different
data sources aren’t linked
For example, how do I know that the
  Juan Sequeda in Facebook is the
  same as Juan Sequeda in Twitter
Or if I create a mashup from
different services, I have to learn
different APIs and I get different
       formats of data back
Data is Siloed
Wouldn’t it be great if we had a
standard way of publishing data on
             the Web?
We have a standardized way of
publishing documents on the web,
              right?
             HTML
Then why can’t we have a standard
way of publishing data on the Web?
Good question! And the answer
       is YES. There is!
             RDF
Resource Description Framework
               (RDF)
• Data Model = a way to model data
  – i.e. Relational databases use relational data model
• RDF is a graph data model
Key Value vs Graph
• Key Values
  – firstName Juan
  – lastName  Sequeda
  – livesIn  Austin
  – knows  Stephane Corlosquet
• But what are these key/values describing?
  – ME!
RDF is a Graph
• Let’s group the Key/Values together
  – <JuanSequeda> <firstName> “Juan”
  – <JuanSequeda> <lastName> “Sequeda”
  – <JuanSequeda> <livesIn> “Austin”
  – <JuanSequeda> <knows> <StephaneCorlosquet>
  – ..
  – <StephaneCorlosquet> <firstName> “Stephane”
  – <StephaneCorlosquet> <lastName> “Corlosquet”
  – <StephaneCorlosquet> <livesIn> “Boston”
Identifier for
the “group”      RDF is a Graph               Key/Value

• Let’s group the Key/Values together
    – <JuanSequeda> <firstName> “Juan”
    – <JuanSequeda> <lastName> “Sequeda”
    – <JuanSequeda> <livesIn> “Austin”
    – <JuanSequeda> <knows> <StephaneCorlosquet>
    – ..
    – <StephaneCorlosquet> <firstName> “Stephane”
    – <StephaneCorlosquet> <lastName> “Corlosquet”
    – <StephaneCorlosquet> <livesIn> “Boston”
RDF can be serialized in different ways
•   RDF/XML
•   RDFa (RDF in HTML)
•   N3
•   Turtle
•   JSON
WTF is the Semantic Web and Linked Data
RDFa
RDF/XML
RDF/N-triples
RDF/Turtle
So does that mean that I have to
  publish my data in RDF now?
You don’t have to… but we would
          like you to 
           Schema.org
          Rich Snippets
                …
An example
Document on the Web
Databases back up documents

                                        THINGS have PROPERTIES:
                                        A Book as a Title, an author, …

Isbn           Title           Author             PublisherID       ReleasedData
978-0-596-     Programming     Toby Segaran       1                 July 2009
15381-6        the Semantic
               Web
…              …               …                  …                 …


                                              PublisherID       PublisherName
This is a THING:
A book title “Programming the                 1                 O’Reilly Media
Semantic Web” by Toby Segaran, …              …                 …
Lets represent the data in RDF
Isbn     Title               Author    PublisherID   ReleasedData

978-0-   Programming         Toby      1             July 2009
596-     the Semantic        Segaran
15381-   Web
6
                                                                                  Programming the
PublisherID      PublisherName                       title
                                                                                   Semantic Web
1                O’Reilly Media


                                                             author
                                           book                                   Toby Segaran




                                                             isbn
                                                                              978-0-596-15381-6
                                                  publisher
                                                                                      name
                                                                      Publisher                  O’Reilly
Remember that we are on the
          web
  Everything on the web is identified
               by a URI
And now let’s link the data to other
                data
                                                      Programming the
                        title
                                                       Semantic Web



           http://…/i           author
                                                      Toby Segaran
            sbn978



                                isbn
                                                  978-0-596-15381-6
                  publisher
                                         http://…/p       name
                                          ublisher1                  O’Reilly
And now consider the data from
                    Revyu.com
http://…/      hasReview    http://…/i
 review1                     sbn978
              description
reviewer

              Awesome
                Book


  http://…/
                     name
  reviewer

                        Juan
                      Sequeda
Let’s start to link data
  http://…/      hasReview     http://…/i
   review1                      sbn978
                                                                             Programming the
                description                    title
                                                                              Semantic Web
hasReviewer                   owl:sameAs

                Awesome                                author
                                  http://…/i
                  Book                                                       Toby Segaran
                                   sbn978


    http://…/
                       name
    reviewer                                           isbn
                                                                         978-0-596-15381-6
                          Juan              publisher
                        Sequeda                                 http://…/p       name
                                                                 ublisher1                  O’Reilly
Juan Sequeda publishes data too




http://juanse   livesIn      http://dbpedia.org/Austin
queda.com/id    name      Juan Sequeda
Let’s link more data
  http://…/      hasReview    http://…/i
   review1                     sbn978
                description
hasReviewer

                Awesome
                  Book


    http://…/
                       name
    reviewer

    sameAs                Juan
                        Sequeda

    http://juanse             livesIn         http://dbpedia.org/Austin
    queda.com/id              name         Juan Sequeda
And more
  http://…/      hasReview     http://…/i
   review1                      sbn978
                                                                          Programming the
                description                     title
                                                                           Semantic Web
hasReviewer                   owl:sameAs

                Awesome                                 author
                                  http://…/i
                  Book                                                    Toby Segaran
                                   sbn978


    http://…/
                       name
    reviewer                                            isbn
                                                                          978-0-596-15381-6
    owl:sameAs            Juan              publisher
                                                                 http://…/p
                        Sequeda                                               name
                                                                  ublisher1
                                                                                         O’Reilly
    http://juanse              livesIn            http://dbpedia.org/Austin
    queda.com/id               name            Juan Sequeda
Data on the Web that is in RDF and
   is linked to other RDF data is
            LINKED DATA
Linked Data Principles
1. Use URIs as names for
   things
2. Use HTTP URIs so that
   people can look up
   (dereference) those
   names.
3. When someone looks up
   a URI, provide useful
   information.
4. Include links to other
   URIs so that they can
   discover more things.
Linked Data makes the web appear as
               ONE
             GIANT
             HUGE
           GLOBAL
         DATABASE!
I can query a database with SQL. Is
 there a way to query Linked Data
      with a query language?
Yes! There is actually a
standardize language for that
           SPARQL
FIND all the reviews on the book
“Programming the Semantic Web”
   by people who live in Austin
SPARQL

SELECT ?review ?comment
WHERE {
  isbn:978 ex:hasReview ?review .
  ?review ex:description ?comment .
  ?review ex:hasReviewer ?person .
  ?person ex:lives dbpedia:Austin .
}
SELECT ?review ?comment
                                                WHERE {
                                                isbn:978 ex:hasReview ?review .
                                                ?review ex:description ?comment .
  http://…/      hasReview                      ?review ex:hasReviewer ?person .
                                http://…/i
   review1                                      ?person ex:lives dbpedia:Austin .
                                 sbn978
                                                }
                                                                        Programming the
                description                      title
                                                                         Semantic Web
hasReviewer                   sameAs

                Awesome                              author
                                   http://…/i
                  Book                                                     Toby Segaran
                                    sbn978


    http://…/
                       name
    reviewer                                         isbn
                                                                       978-0-596-15381-6
    sameAs                Juan               publisher
                        Sequeda                               http://…/p       name
                                                               ublisher1                  O’Reilly
     http://juanse
      queda.com               livesIn           http://dbpedia.org/Austin
                              name           Juan Sequeda
OWL
•   Here is where the real semantics shows up
•   Web Ontology Language
•   Define schema/vocabulary
•   Classes, Properties, Inheritance, etc
•   Subclasses, Subproperties
•   …
•   You can get more complicated with rules…
auth: <http://dblp.l3s.de/d2r/page/authors/>
                                                         dexa: <http://dblp.l3s.de/d2r/page/publications/conf/dexa/>
                                                         dc: <http://purl.org/dc/elements/1.1/>
                                                         sw: <http://data.semanticweb.org/person/>
                                                         swrc: <http://swrc.ontoware.org/ontology#>
                                                         owl: <http://www.w3.org/2002/07/owl#>
                                                         rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
                                                         rdfs: <http://www.w3.org/2000/01/rdf-schema#>

  swrc:Publicatio
        n


          rdfs:subClassOf

                             dc:creator
swrc:InProceedings                              foaf:Person
                                                                                                          OWL
                                                      rdf:type
                                                                        owl:sameAs
          rdf:type                          auth:Juan_Sequeda                                  sw:juan-f-sequeda


                            dc:creator                                   owl:sameAs
dexa:TirmiziSM08                          auth:Daniel_P._Miranker                             sw:daniel-miranker


          dc:title                                                       owl:sameAs
                                          auth:Syed_Hamid_Tirmizi                                sw:syed-tirmizi

 “Translating SQL
Applications to the
 Semantic Web"                                                                                            RDF
RDB and the Semantic Web

            TRIGGERS             RIF


          CONSTRAINTS            OWL
TIME




         TABLE DEFINITION        RDFS

           RELATIONAL
                                 RDF
             MODEL




                            74
This looks cool, but let’s be realistic.
  What is the incentive to publish
            Linked Data?
What was your incentive to
publish an HTML page in 1990?
1) Share data in documents
2) Because you neighbor was doing it
            … later on …
  3) Marketing, Advertising, …, SEO
So why should we publish
  Linked Data in 2011?
1) Share data as data
2) Because you neighbor is doing it
                 …
 3) Marketing, Advertising, SEO ++
Linked Data Publishers
•   UK Government
•   US Government
•   BBC
•   Open Calais – Thomson Reuters
•   Freebase/Google
•   NY Times
•   Best Buy
•   Sears
•   Kmart
•   Overstock.com
•   CNET
•   Dbpedia
•   O’Reilly Media
•   …
May 2007
Oct 2007
Nov 2007
Feb 2008
Mar 2008
Sept 2008
Mar 2009 (1)
Mar 2009 (2)
July 2009
September 2010
September 2011




Linking Open Data
cloud diagram, by
Richard Cyganiak and
Anja Jentzsch.
 http://lod-cloud.net/
YOU GET THE PICTURE
ITS BIG and getting

BIGGER and
 BIGGER
What is the Web
•   Web of Documents  HTML
•   Web of Data  RDF
•   Global Unique IDs  HTTP URIs
•   Schema/Ontologies  OWL
•   Query RDF  SPARQL
Now what can we do with this
          data?
Generic Applications
Linked Data Browsers
Linked Data Browsers
• Not actually separate browsers. Run inside of
  HTML browsers
• View the data that is returned after looking up
  a URI in tabular form
• User can navigate between data sources by
  following RDF Links
• (IMO) No usability
WTF is the Semantic Web and Linked Data
Linked Data Browsers
•   http://browse.semanticweb.org/
•   Tabulator
•   OpenLink Dataexplorer
•   Zitgist
•   Marbles
•   Explorator
•   Disco
•   LinkSailor
Linked Data (Semantic Web)
      Search Engines
Linked Data (Semantic Web)
                Search Engines
• Just like conventional search engines (Google, Bing, Yahoo),
  crawl RDF documents and follow RDF links.
   – Current search engines don’t crawl data, unless it’s RDFa 
• Human focus Search
   – Falcons - Keyword
   – SWSE – Keyworkd
   – VisiNav – Complex Queries
• Machine focus Search
   –   Sindice – data instances
   –   Swoogle - ontologies
   –   Watson - ontologies
   –   Uberblic – curated integrated data instances
(Semantic) SEO ++
• Markup your HTML with RDFa
• Use standard vocabularies (ontologies)
  – Google Vocabulary
  – Good Relations
  – Dublin Core
• Google and Yahoo will crawl this data and use
  it for better rendering
WTF is the Semantic Web and Linked Data
On-the-fly Mashups
http://sig.ma
Domain Specific Applications
Domain Specific Applications
• Government
  – Data.gov
  – Data.gov.uk
  – http://data-gov.tw.rpi.edu/wiki/Demos
• Music
  – Seevl.net
• Dbpedia Mobile
• Life Science
  – LinkedLifeData
• Sports
  – BBC World Cup
Faceted Browsers
http://dbpedia.neofonie.de/browse/
Query your data
Find all the locations of all the original
         paintings of Modigliani
Select all proteins that are linked to a curated interaction from
          the literature and to inflammatory response




                                                 http://linkedlifedata.com/
http://tata.csres.utexas.edu:8080/specify/data/taxon51807
http://tata.csres.utexas.edu:8080/specify/data/taxon51807




                                          Links to other Data Sources
Linked Data is Data Integration

  SPARQL
   Query



Diamond


           Ultrawrap
                                   Ultrawrap
            Specify    Ultrawrap
                                   Morphster
                       Morphbank
Example 1 (Specify – DBpedia)
• Get full name and guid from taxon with id
  http://tata.csres.utexas.edu:8080/specify/data/t
  axon51807#thing
• AND fin any subjects it may have “skos:subject”
Result Example 1
• Note that
  http://dbpedia.org/resource/Category:Fish_of_
  Australia comes from a different data source
  (dbpedia.org)
Example 2 (Specify-Morphbank)
• Get full name and guid from taxon with id
  http://tata.csres.utexas.edu:8080/specify/data/t
  axon42947#thing
• AND the rank and kingdom from Morphbank
Result Example 2
• Note that full name and guid come from Specify
  http://tata.csres.utexas.edu:8080/specify/data/t
  axon42947
• AND rank and kingdom come from
  Morphbank
  http://tata.csres.utexas.edu:8080/morphbank/d
  ata/taxa398354
The killer app for        A little semantics
Semantic Technology is    goes a long way
YOUR life (online)            - Jim Hendler
   – Tom Gruber
                                Knowledge is Power
          Occupy Your Data        - Jim Hendler
            - Tim Finin
                             Linked Data is the
                             (Semantic) Web done right
The novel part of the           - Tim Berners-Lee
Semantic Web is not the
Semantics, but the Web
   - Frank van Harmelen         RAW DATA NOW
                                  - Tim Berners-Lee
QUESTIONS?

More Related Content

What's hot (20)

PPTX
NCompass Live: RDA: Are We There Yet?
Nebraska Library Commission
 
KEY
Semantic Web: A web that is not the Web
Bruce Esrig
 
PDF
Beyond Google: Advanced Search
GenealogyMedia.com
 
PPTX
Name That Graph !
Fabien Gandon
 
KEY
How do you Google?
mackas
 
PDF
Serendipity in Linked Open Data
i_serena
 
ZIP
Dagstuhl FOAF history talk
Dan Brickley
 
PPT
Yahoo Making The Web Searchable
kksst
 
PPTX
Graph and RDF databases
Nassim Bahri
 
PPTX
Research Skills
Brett30
 
PPT
Mpl brownbag sept2011
Jason Coleman
 
PDF
Searching tricks and tips
Imogen Bertin
 
PDF
The Simple Power of the link
Richard Wallis
 
DOC
Searching techniques
PCTE
 
PDF
Linked Data Challenge and Opportunity
Richard Wallis
 
KEY
The Simple Power of the Link
Richard Wallis
 
PDF
Publishing and Using Linked Data
ostephens
 
PPT
Search engine-optimization-tips
Trí Tuệ Việt
 
PDF
Combining Social Music and Semantic Web for music-related recommender systems
Alexandre Passant
 
PPTX
Smart Internet Searching for Genealogists
GenealogyMedia.com
 
NCompass Live: RDA: Are We There Yet?
Nebraska Library Commission
 
Semantic Web: A web that is not the Web
Bruce Esrig
 
Beyond Google: Advanced Search
GenealogyMedia.com
 
Name That Graph !
Fabien Gandon
 
How do you Google?
mackas
 
Serendipity in Linked Open Data
i_serena
 
Dagstuhl FOAF history talk
Dan Brickley
 
Yahoo Making The Web Searchable
kksst
 
Graph and RDF databases
Nassim Bahri
 
Research Skills
Brett30
 
Mpl brownbag sept2011
Jason Coleman
 
Searching tricks and tips
Imogen Bertin
 
The Simple Power of the link
Richard Wallis
 
Searching techniques
PCTE
 
Linked Data Challenge and Opportunity
Richard Wallis
 
The Simple Power of the Link
Richard Wallis
 
Publishing and Using Linked Data
ostephens
 
Search engine-optimization-tips
Trí Tuệ Việt
 
Combining Social Music and Semantic Web for music-related recommender systems
Alexandre Passant
 
Smart Internet Searching for Genealogists
GenealogyMedia.com
 

Viewers also liked (20)

PPTX
Linked Data: principles and examples
Victor de Boer
 
PPT
Web of Data - Introduction (english)
Thomas Francart
 
PDF
Linked data and Semantic Web Applications for Libraries
Vikas Bhushan
 
PPTX
Linked data for Libraries, Archives, Museums
ljsmart
 
PDF
Guest Lecture: Linked Open Data for the Humanities and Social Sciences
Laura Hollink
 
PDF
Graph Processing with Titan and Scylla
Jason Plurad
 
PDF
TinkerPop: a story of graphs, DBs, and graph DBs
Joshua Shinavier
 
PPTX
Introduccion a la Web Semantica
Juan Sequeda
 
PDF
Introduction to Linked Data - WWW2010
Juan Sequeda
 
PDF
Linked Data Applications - WWW2010
Juan Sequeda
 
PDF
Welcome to Consuming Linked Data tutorial WWW2010
Juan Sequeda
 
PPTX
Conclusions - Linked Data
Juan Sequeda
 
PPTX
What is the Semantic Web
Juan Sequeda
 
PDF
Consuming Linked Data by Humans - WWW2010
Juan Sequeda
 
PDF
Virtualizing Relational Databases as Graphs: a multi-model approach
Juan Sequeda
 
PPTX
Drupal 7 and Semantic Web Hands-on Tutorial
Juan Sequeda
 
PPTX
Welcome to Linked Data 0/5 Semtech2011
Juan Sequeda
 
PDF
Consuming Linked Data by Machines - WWW2010
Juan Sequeda
 
PPTX
Free Money (a.k.a Fellowships)
Juan Sequeda
 
PDF
Do I need a Graph Database?
Juan Sequeda
 
Linked Data: principles and examples
Victor de Boer
 
Web of Data - Introduction (english)
Thomas Francart
 
Linked data and Semantic Web Applications for Libraries
Vikas Bhushan
 
Linked data for Libraries, Archives, Museums
ljsmart
 
Guest Lecture: Linked Open Data for the Humanities and Social Sciences
Laura Hollink
 
Graph Processing with Titan and Scylla
Jason Plurad
 
TinkerPop: a story of graphs, DBs, and graph DBs
Joshua Shinavier
 
Introduccion a la Web Semantica
Juan Sequeda
 
Introduction to Linked Data - WWW2010
Juan Sequeda
 
Linked Data Applications - WWW2010
Juan Sequeda
 
Welcome to Consuming Linked Data tutorial WWW2010
Juan Sequeda
 
Conclusions - Linked Data
Juan Sequeda
 
What is the Semantic Web
Juan Sequeda
 
Consuming Linked Data by Humans - WWW2010
Juan Sequeda
 
Virtualizing Relational Databases as Graphs: a multi-model approach
Juan Sequeda
 
Drupal 7 and Semantic Web Hands-on Tutorial
Juan Sequeda
 
Welcome to Linked Data 0/5 Semtech2011
Juan Sequeda
 
Consuming Linked Data by Machines - WWW2010
Juan Sequeda
 
Free Money (a.k.a Fellowships)
Juan Sequeda
 
Do I need a Graph Database?
Juan Sequeda
 
Ad

Similar to WTF is the Semantic Web and Linked Data (20)

PDF
The Semantic Web: What IAs Need to Know About Web 3.0
Chiara Fox Ogan
 
PPTX
Linked data in the digital humanities skills workshop for realising the oppo...
jodischneider
 
PDF
When?
Dan Brickley
 
PPTX
Introduction to Linked Data 1/5
Juan Sequeda
 
PDF
Applying Semantic Extensions And New Services To Drupal Sem Tech June 2010
AI4BD GmbH
 
PDF
ISWC GoodRelations Tutorial Part 2
Martin Hepp
 
PDF
GoodRelations Tutorial Part 2
guestecacad2
 
PPTX
Beautifying Data in the real world
Tan Tran
 
PDF
Neo4j -- or why graph dbs kick ass
Emil Eifrem
 
PPTX
Hacking with Semantic Web
Tom Praison Praison
 
PDF
Semantic we bnext
Tapas Kumar Mishra
 
PPTX
Semantics
Ruju Gandhi
 
PDF
Eifrem neo4j
Shridhar Joshi
 
PPS
Linking Open Data with Drupal
emmanuel_jamin
 
PDF
121004 linking open_data_with_drupal_v1
manujam
 
PDF
Publishing linked data from relational databases
Iván Ruiz-Rube
 
PPTX
Semantic web
Tapas Kumar Mishra
 
PPT
Tutorial
Atner Yegorov
 
PPTX
Consuming Linked Data SemTech2010
Juan Sequeda
 
PDF
Питер Мика "Making the web searchable"
Yandex
 
The Semantic Web: What IAs Need to Know About Web 3.0
Chiara Fox Ogan
 
Linked data in the digital humanities skills workshop for realising the oppo...
jodischneider
 
Introduction to Linked Data 1/5
Juan Sequeda
 
Applying Semantic Extensions And New Services To Drupal Sem Tech June 2010
AI4BD GmbH
 
ISWC GoodRelations Tutorial Part 2
Martin Hepp
 
GoodRelations Tutorial Part 2
guestecacad2
 
Beautifying Data in the real world
Tan Tran
 
Neo4j -- or why graph dbs kick ass
Emil Eifrem
 
Hacking with Semantic Web
Tom Praison Praison
 
Semantic we bnext
Tapas Kumar Mishra
 
Semantics
Ruju Gandhi
 
Eifrem neo4j
Shridhar Joshi
 
Linking Open Data with Drupal
emmanuel_jamin
 
121004 linking open_data_with_drupal_v1
manujam
 
Publishing linked data from relational databases
Iván Ruiz-Rube
 
Semantic web
Tapas Kumar Mishra
 
Tutorial
Atner Yegorov
 
Consuming Linked Data SemTech2010
Juan Sequeda
 
Питер Мика "Making the web searchable"
Yandex
 
Ad

More from Juan Sequeda (11)

PDF
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Juan Sequeda
 
PDF
Integrating Semantic Web in the Real World: A Journey between Two Cities
Juan Sequeda
 
PDF
Integrating Relational Databases with the Semantic Web: A Reflection
Juan Sequeda
 
PDF
Graph Query Languages: update from LDBC
Juan Sequeda
 
PPTX
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
Juan Sequeda
 
PPTX
Consuming Linked Data 4/5 Semtech2011
Juan Sequeda
 
PPTX
Publishing Linked Data 3/5 Semtech2011
Juan Sequeda
 
PPTX
Creating Linked Data 2/5 Semtech2011
Juan Sequeda
 
PDF
Open Research Problems in Linked Data - WWW2010
Juan Sequeda
 
PPTX
Consuming Linked Data by Humans
Juan Sequeda
 
PPTX
Introduction to Linked Data
Juan Sequeda
 
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Juan Sequeda
 
Integrating Semantic Web in the Real World: A Journey between Two Cities
Juan Sequeda
 
Integrating Relational Databases with the Semantic Web: A Reflection
Juan Sequeda
 
Graph Query Languages: update from LDBC
Juan Sequeda
 
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
Juan Sequeda
 
Consuming Linked Data 4/5 Semtech2011
Juan Sequeda
 
Publishing Linked Data 3/5 Semtech2011
Juan Sequeda
 
Creating Linked Data 2/5 Semtech2011
Juan Sequeda
 
Open Research Problems in Linked Data - WWW2010
Juan Sequeda
 
Consuming Linked Data by Humans
Juan Sequeda
 
Introduction to Linked Data
Juan Sequeda
 

Recently uploaded (20)

PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
Survival Models: Proper Scoring Rule and Stochastic Optimization with Competi...
Paris Women in Machine Learning and Data Science
 
PPTX
Securing Model Context Protocol with Keycloak: AuthN/AuthZ for MCP Servers
Hitachi, Ltd. OSS Solution Center.
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PPTX
Agentforce World Tour Toronto '25 - Supercharge MuleSoft Development with Mod...
Alexandra N. Martinez
 
PDF
Modern Decentralized Application Architectures.pdf
Kalema Edgar
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PDF
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
PDF
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PPTX
Manual Testing for Accessibility Enhancement
Julia Undeutsch
 
PDF
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
Edge AI and Vision Alliance
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
PDF
Dev Dives: Accelerating agentic automation with Autopilot for Everyone
UiPathCommunity
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PPTX
Essential Content-centric Plugins for your Website
Laura Byrne
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Survival Models: Proper Scoring Rule and Stochastic Optimization with Competi...
Paris Women in Machine Learning and Data Science
 
Securing Model Context Protocol with Keycloak: AuthN/AuthZ for MCP Servers
Hitachi, Ltd. OSS Solution Center.
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
Agentforce World Tour Toronto '25 - Supercharge MuleSoft Development with Mod...
Alexandra N. Martinez
 
Modern Decentralized Application Architectures.pdf
Kalema Edgar
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Manual Testing for Accessibility Enhancement
Julia Undeutsch
 
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
Edge AI and Vision Alliance
 
Digital Circuits, important subject in CS
contactparinay1
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
Dev Dives: Accelerating agentic automation with Autopilot for Everyone
UiPathCommunity
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
Essential Content-centric Plugins for your Website
Laura Byrne
 

WTF is the Semantic Web and Linked Data

  • 1. WTF is the Semantic Web and Linked Data Juan F. Sequeda Department of Computer Science University of Texas at Austin Nov 17, 2011
  • 2. Semantic Web? Linked Data? WTF?
  • 3. WTF is the Semantic Web?
  • 4. WTF is the Semantic Web?
  • 7. What is the Web? “… the Web, is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images […] and navigate between them via hyperlinks” http://en.wikipedia.org/wiki/World_Wide_Web
  • 8. Current Web = internet + links + docs
  • 9. History of the Web • Created by Tim Berners-Lee at CERN in 1989 • Mosaic browser in 1993 • W3C created in 1994 • Exponential growth mid 90s • Amazon, Ebay – 1995 • Search engines – Google 1998 • Dot-com boom 1997 – 2001 • Web 2.0 – blogs, Facebook, Twitter, etc
  • 10. What is the problem?
  • 11. WHAT’S THE WEATHER IN AUSTIN TODAY? http://www.flickr.com/photos/jamieca/31631256/
  • 15. What is the problem? • The web is full of documents • We aren’t always interested in documents – We are interested in THINGS – These THINGS might be in documents • We can read a HTML document rendered in a browser and find what we are searching for – This is hard for computers. – Computers have to guess (even though they are pretty good at it)
  • 16. The Web of Documents Search Search Engine Crawler
  • 17. The Web is a Data Shredder Structured Unstructured Data Data Thanks Martin Hepp
  • 18. What would we like? • Make it easy for computers/software to find THINGS Do you SEARCH or do you FIND?
  • 19. Search for Football Players who went to the University of Texas at Austin, played for the Dallas Cowboys as Cornerback
  • 23. Why can’t we just FIND it…
  • 26. Guess how I FOUND out?
  • 27. On a Semantic Web • Besides publishing documents on the web – which computers can’t understand easily • Let’s publish on the web something that computers can understand DATA
  • 28. The Semantic Web is a web of data The current web is a web of documents
  • 29. But wait… doesn’t the web already have data?
  • 30. Current Data on the Web • Relational Databases • APIs • XML • CSV • XLS • … • Can’t computers and applications already consume that data on the web?
  • 31. Yes! But it is all in different formats and data models!
  • 32. This makes it hard to integrate data
  • 33. The data in different data sources aren’t linked
  • 34. For example, how do I know that the Juan Sequeda in Facebook is the same as Juan Sequeda in Twitter
  • 35. Or if I create a mashup from different services, I have to learn different APIs and I get different formats of data back
  • 37. Wouldn’t it be great if we had a standard way of publishing data on the Web?
  • 38. We have a standardized way of publishing documents on the web, right? HTML
  • 39. Then why can’t we have a standard way of publishing data on the Web?
  • 40. Good question! And the answer is YES. There is! RDF
  • 41. Resource Description Framework (RDF) • Data Model = a way to model data – i.e. Relational databases use relational data model • RDF is a graph data model
  • 42. Key Value vs Graph • Key Values – firstName Juan – lastName  Sequeda – livesIn  Austin – knows  Stephane Corlosquet • But what are these key/values describing? – ME!
  • 43. RDF is a Graph • Let’s group the Key/Values together – <JuanSequeda> <firstName> “Juan” – <JuanSequeda> <lastName> “Sequeda” – <JuanSequeda> <livesIn> “Austin” – <JuanSequeda> <knows> <StephaneCorlosquet> – .. – <StephaneCorlosquet> <firstName> “Stephane” – <StephaneCorlosquet> <lastName> “Corlosquet” – <StephaneCorlosquet> <livesIn> “Boston”
  • 44. Identifier for the “group” RDF is a Graph Key/Value • Let’s group the Key/Values together – <JuanSequeda> <firstName> “Juan” – <JuanSequeda> <lastName> “Sequeda” – <JuanSequeda> <livesIn> “Austin” – <JuanSequeda> <knows> <StephaneCorlosquet> – .. – <StephaneCorlosquet> <firstName> “Stephane” – <StephaneCorlosquet> <lastName> “Corlosquet” – <StephaneCorlosquet> <livesIn> “Boston”
  • 45. RDF can be serialized in different ways • RDF/XML • RDFa (RDF in HTML) • N3 • Turtle • JSON
  • 47. RDFa
  • 51. So does that mean that I have to publish my data in RDF now?
  • 52. You don’t have to… but we would like you to  Schema.org Rich Snippets …
  • 55. Databases back up documents THINGS have PROPERTIES: A Book as a Title, an author, … Isbn Title Author PublisherID ReleasedData 978-0-596- Programming Toby Segaran 1 July 2009 15381-6 the Semantic Web … … … … … PublisherID PublisherName This is a THING: A book title “Programming the 1 O’Reilly Media Semantic Web” by Toby Segaran, … … …
  • 56. Lets represent the data in RDF Isbn Title Author PublisherID ReleasedData 978-0- Programming Toby 1 July 2009 596- the Semantic Segaran 15381- Web 6 Programming the PublisherID PublisherName title Semantic Web 1 O’Reilly Media author book Toby Segaran isbn 978-0-596-15381-6 publisher name Publisher O’Reilly
  • 57. Remember that we are on the web Everything on the web is identified by a URI
  • 58. And now let’s link the data to other data Programming the title Semantic Web http://…/i author Toby Segaran sbn978 isbn 978-0-596-15381-6 publisher http://…/p name ublisher1 O’Reilly
  • 59. And now consider the data from Revyu.com http://…/ hasReview http://…/i review1 sbn978 description reviewer Awesome Book http://…/ name reviewer Juan Sequeda
  • 60. Let’s start to link data http://…/ hasReview http://…/i review1 sbn978 Programming the description title Semantic Web hasReviewer owl:sameAs Awesome author http://…/i Book Toby Segaran sbn978 http://…/ name reviewer isbn 978-0-596-15381-6 Juan publisher Sequeda http://…/p name ublisher1 O’Reilly
  • 61. Juan Sequeda publishes data too http://juanse livesIn http://dbpedia.org/Austin queda.com/id name Juan Sequeda
  • 62. Let’s link more data http://…/ hasReview http://…/i review1 sbn978 description hasReviewer Awesome Book http://…/ name reviewer sameAs Juan Sequeda http://juanse livesIn http://dbpedia.org/Austin queda.com/id name Juan Sequeda
  • 63. And more http://…/ hasReview http://…/i review1 sbn978 Programming the description title Semantic Web hasReviewer owl:sameAs Awesome author http://…/i Book Toby Segaran sbn978 http://…/ name reviewer isbn 978-0-596-15381-6 owl:sameAs Juan publisher http://…/p Sequeda name ublisher1 O’Reilly http://juanse livesIn http://dbpedia.org/Austin queda.com/id name Juan Sequeda
  • 64. Data on the Web that is in RDF and is linked to other RDF data is LINKED DATA
  • 65. Linked Data Principles 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up (dereference) those names. 3. When someone looks up a URI, provide useful information. 4. Include links to other URIs so that they can discover more things.
  • 66. Linked Data makes the web appear as ONE GIANT HUGE GLOBAL DATABASE!
  • 67. I can query a database with SQL. Is there a way to query Linked Data with a query language?
  • 68. Yes! There is actually a standardize language for that SPARQL
  • 69. FIND all the reviews on the book “Programming the Semantic Web” by people who live in Austin
  • 70. SPARQL SELECT ?review ?comment WHERE { isbn:978 ex:hasReview ?review . ?review ex:description ?comment . ?review ex:hasReviewer ?person . ?person ex:lives dbpedia:Austin . }
  • 71. SELECT ?review ?comment WHERE { isbn:978 ex:hasReview ?review . ?review ex:description ?comment . http://…/ hasReview ?review ex:hasReviewer ?person . http://…/i review1 ?person ex:lives dbpedia:Austin . sbn978 } Programming the description title Semantic Web hasReviewer sameAs Awesome author http://…/i Book Toby Segaran sbn978 http://…/ name reviewer isbn 978-0-596-15381-6 sameAs Juan publisher Sequeda http://…/p name ublisher1 O’Reilly http://juanse queda.com livesIn http://dbpedia.org/Austin name Juan Sequeda
  • 72. OWL • Here is where the real semantics shows up • Web Ontology Language • Define schema/vocabulary • Classes, Properties, Inheritance, etc • Subclasses, Subproperties • … • You can get more complicated with rules…
  • 73. auth: <http://dblp.l3s.de/d2r/page/authors/> dexa: <http://dblp.l3s.de/d2r/page/publications/conf/dexa/> dc: <http://purl.org/dc/elements/1.1/> sw: <http://data.semanticweb.org/person/> swrc: <http://swrc.ontoware.org/ontology#> owl: <http://www.w3.org/2002/07/owl#> rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> rdfs: <http://www.w3.org/2000/01/rdf-schema#> swrc:Publicatio n rdfs:subClassOf dc:creator swrc:InProceedings foaf:Person OWL rdf:type owl:sameAs rdf:type auth:Juan_Sequeda sw:juan-f-sequeda dc:creator owl:sameAs dexa:TirmiziSM08 auth:Daniel_P._Miranker sw:daniel-miranker dc:title owl:sameAs auth:Syed_Hamid_Tirmizi sw:syed-tirmizi “Translating SQL Applications to the Semantic Web" RDF
  • 74. RDB and the Semantic Web TRIGGERS RIF CONSTRAINTS OWL TIME TABLE DEFINITION RDFS RELATIONAL RDF MODEL 74
  • 75. This looks cool, but let’s be realistic. What is the incentive to publish Linked Data?
  • 76. What was your incentive to publish an HTML page in 1990?
  • 77. 1) Share data in documents 2) Because you neighbor was doing it … later on … 3) Marketing, Advertising, …, SEO
  • 78. So why should we publish Linked Data in 2011?
  • 79. 1) Share data as data 2) Because you neighbor is doing it … 3) Marketing, Advertising, SEO ++
  • 80. Linked Data Publishers • UK Government • US Government • BBC • Open Calais – Thomson Reuters • Freebase/Google • NY Times • Best Buy • Sears • Kmart • Overstock.com • CNET • Dbpedia • O’Reilly Media • …
  • 91. September 2011 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 92. YOU GET THE PICTURE ITS BIG and getting BIGGER and BIGGER
  • 93. What is the Web • Web of Documents  HTML • Web of Data  RDF • Global Unique IDs  HTTP URIs • Schema/Ontologies  OWL • Query RDF  SPARQL
  • 94. Now what can we do with this data?
  • 97. Linked Data Browsers • Not actually separate browsers. Run inside of HTML browsers • View the data that is returned after looking up a URI in tabular form • User can navigate between data sources by following RDF Links • (IMO) No usability
  • 99. Linked Data Browsers • http://browse.semanticweb.org/ • Tabulator • OpenLink Dataexplorer • Zitgist • Marbles • Explorator • Disco • LinkSailor
  • 100. Linked Data (Semantic Web) Search Engines
  • 101. Linked Data (Semantic Web) Search Engines • Just like conventional search engines (Google, Bing, Yahoo), crawl RDF documents and follow RDF links. – Current search engines don’t crawl data, unless it’s RDFa  • Human focus Search – Falcons - Keyword – SWSE – Keyworkd – VisiNav – Complex Queries • Machine focus Search – Sindice – data instances – Swoogle - ontologies – Watson - ontologies – Uberblic – curated integrated data instances
  • 102. (Semantic) SEO ++ • Markup your HTML with RDFa • Use standard vocabularies (ontologies) – Google Vocabulary – Good Relations – Dublin Core • Google and Yahoo will crawl this data and use it for better rendering
  • 107. Domain Specific Applications • Government – Data.gov – Data.gov.uk – http://data-gov.tw.rpi.edu/wiki/Demos • Music – Seevl.net • Dbpedia Mobile • Life Science – LinkedLifeData • Sports – BBC World Cup
  • 111. Find all the locations of all the original paintings of Modigliani
  • 112. Select all proteins that are linked to a curated interaction from the literature and to inflammatory response http://linkedlifedata.com/
  • 115. Linked Data is Data Integration SPARQL Query Diamond Ultrawrap Ultrawrap Specify Ultrawrap Morphster Morphbank
  • 116. Example 1 (Specify – DBpedia) • Get full name and guid from taxon with id http://tata.csres.utexas.edu:8080/specify/data/t axon51807#thing • AND fin any subjects it may have “skos:subject”
  • 117. Result Example 1 • Note that http://dbpedia.org/resource/Category:Fish_of_ Australia comes from a different data source (dbpedia.org)
  • 118. Example 2 (Specify-Morphbank) • Get full name and guid from taxon with id http://tata.csres.utexas.edu:8080/specify/data/t axon42947#thing • AND the rank and kingdom from Morphbank
  • 119. Result Example 2 • Note that full name and guid come from Specify http://tata.csres.utexas.edu:8080/specify/data/t axon42947 • AND rank and kingdom come from Morphbank http://tata.csres.utexas.edu:8080/morphbank/d ata/taxa398354
  • 120. The killer app for A little semantics Semantic Technology is goes a long way YOUR life (online) - Jim Hendler – Tom Gruber Knowledge is Power Occupy Your Data - Jim Hendler - Tim Finin Linked Data is the (Semantic) Web done right The novel part of the - Tim Berners-Lee Semantic Web is not the Semantics, but the Web - Frank van Harmelen RAW DATA NOW - Tim Berners-Lee