SlideShare a Scribd company logo
Drupal, Calais
& the Semantic Web
Prepared by Frank Febbraro, CTO & Presented by Jeff Walpole, CEO
Introductions (and sizing each other up)
 Raise your hand if you are a…
 Technologist?
 Journalist?
 SemHead?
 Raise your hand if you use or have used Drupal?
 Calais API?
 Lets play word association…
 Linked data
 RDF
 SPARQL
 GRDDL
Publishing tech Phase2 is working on
 CMS frameworks
 Drupal & Java Development
 Taxonomy solutions
 Geo-tagging & Mapping
 Charting & Graphing Data
 Semantic Web integration
 Open Data/APIs
 Topic Hubs
 Publishing workflow
 Feed Syndication
 Buzz and topic trend monitoring
 Community collaboration sites
 Multi-site & virtual site CMS
architecture
 An open source CMS installation
specifically for publishers – called
OpenPublish
DrupalDrupal
We heart Drupal
Why We use Drupal for CMS
 Performance/Reliability: Dozens of major
publishers turn to Drupal and tens of
thousands of high traffic sites because it is
an enterprise class platform
 Ease/Expense of Implementation: As one
of the leading shops developing for this
platform, we can be as efficient as anyone
and this platform is our preferred
technology.
 Evolving Technology Extensibility: You
need something modular/extensible that
allows you to add new features easily and
we know this is possible with Drupal.
 Easier Modular Enhancements: Drupal's
architecture is modular and integrates well
without requiring customization to core
components that would make them
difficult to maintain.
 P2 Expertise: Our entire development staff
of 12+ developers can support you on
Drupal and we are known as one of the
top firms in the country.
 Large Community Support: You need a
community that is active, robust,
responsive and growing. We are involved
in the Drupal community and have an ear
to the ground on features and changes
that would affect your site.
 Easy Staff Training: The Drupal CMS is
intuitive and we are well versed in training
others to use it. To support training, there
are numerous videos, online tutorials, local
classes and even books on how it works.
 Decreased Support Costs: Publishers find
they can do a lot more themselves and
when they do need help, the time is a
fraction of what a proprietary CMS would
cost for similar changes.
Drupal Semantic Modules
rdf, rdf cck, foaf, relations, sparql, sioc, calais collection
http://www.youtube.com/watch?v=r4WgTRIRoa0
The Calais API
connect. everything.
How does Calais work?
1. Categorizes and metatags the
people, places, companies, facts
and events in your content to make
it ‘machine-readable,’ and returns
that metadata to you.
2. Makes connections between the
entities in your content and related
data in Wikipedia, GeoNames, the
IMDB, Shopping.com and more
3. Empowers you to share your
metadata with search engines,
news aggregators, ‘related stories’
applications and others in the
content ecosystem.
<Topic>M&A</Topic>
<Acquisition offset="494" length="130">
<Company_Acquirer>Reuters</Company_Acquirer>
<Company_Acquired>ClearForest
Ltd.</Company_Acquired>
<Status>Planned</Status>
</Acquisition>
<Company>Reuters</Company>
<Company>ClearForest Ltd.</Company>
Reuters Announced the Acquisition of ClearForest
New York - April 30, 2007
Reuters, the global information company, has entered
into an agreement to acquire all of the outstanding
shares of ClearForest Ltd., a privately held provider of
Text Analytics solutions, whose tagging platform and
analytical products allow clients to derive precise
business information from huge amounts of textual
content.
ClearForest has received sufficient shareholder
approval to complete the transaction, which is
expected to close in approximately 30 days, subject to
customary closing conditions. The financial terms were
not disclosed. Reuters plans to retain and continue to
work with the existing management team and their
highly skilled workforces in the US and Israel. It also
plans to continue to support existing products and
customers.
Reuters believes that search will be a pivotal element to
the future of how financial information is sourced and
consumed. As part of its drive into this space, Reuters
has created a new strategic group and appointed
Gerry Campbell, who will oversee the integration of
ClearForest and drive this innovation.
<Product>Text Analytic Solution </Product>
<Company>ClearForest Ltd.</Company>
<Company>Reuters</Company>
<Country>United States</Country>
<Country>Israel</Country>
<Company>Reuters</Company>
<Person>Gerry Campbell</Person>
<ManagementChange offset="2789" length="92">
<Person>Gerry Campbell</Person>
<Company>Reuters</Company>
<Action>Enters</Position>
</ManagementChange>
What Would that Look Like (in code)?
DrupalDrupal
Calais for DrupalCalais for Drupal
Calais Collection
What does Calais for Drupal Look Like?
•suggest terms
allowing full user
control of the tagging
(think of del.icio.us
recommending tags).
Calais Terms or Taxonomy Terms?
configure per node type
save. magic happens.
Too much information? Limit it.
autodiscovery, what’s that mean?
RDF for your nodes
LOOK AWAY!!!!!!!
DrupalDrupal
Calais for DrupalCalais for Drupal
More Like ThisMore Like This Topic HubsTopic HubsGeoGeo
More Like This
automatically prefill from Calais
configure threshold
relevant on-site content to browse
relevant news from the web
DrupalDrupal
Calais for DrupalCalais for Drupal
More Like ThisMore Like This Topic HubsTopic HubsGeoGeo
self organizing content
create a contextual expression
configure plugins (or define your own)
tell your story
show content in various contexts
they are panels, so rearrange
DrupalDrupal
Calais for DrupalCalais for Drupal
Linked DataLinked Data
More Like ThisMore Like This Topic HubsTopic HubsGeoGeo
Linked Data
Linked Data Datasets
Linked Data
it’s all about the URIs
Drupal: http://dbpedia.org/resource/Drupal
Washington DC: http://d.opencalais.com/er/geo/city/ralg-geo1/f497898f-2b9b-7cda-
ec7b-85d896acbe3e
Calais linked data for humans
Calais linked data RDF
hello dbpedia (for geeks)
Calais geo config
on a map, but wait, there’s more
dbpedia data (or other sources)
semantic company data
company data from dbpedia
Calais URI for Toyota
Get the next link to follow
SPARQL query to get the data from DBPedia
render it to html, voilà
DrupalDrupal
Calais for DrupalCalais for Drupal
Linked DataLinked Data
More Like ThisMore Like This Topic HubsTopic HubsGeoGeo
MarmosetMarmoset
Marmoset: microformats for search agents
The Big Picture – OpenPublish
DrupalDrupal
Calais for DrupalCalais for Drupal
Linked DataLinked Data
More Like ThisMore Like This Topic HubsTopic HubsGeoGeo
MarmosetMarmoset
http://opensourceopenminds.com/openpublish
Enough Talk - lets see a demo…
Q&A

More Related Content

What's hot (20)

KEY
JavaScript in Drupal 7: What developers need to know
katbailey
 
PDF
Drush - use full power - DrupalCamp Donetsk 2014
Alex S
 
PPT
Drupal image gallery_workshop
Heather Bohn
 
PPTX
10 Things Every Plugin Developer Should Know (WordCamp Atlanta 2013)
arcware
 
PDF
Using RequireJS with CakePHP
Stephen Young
 
DOC
How to migrate Cakephp 1.x to 2.x
Andolasoft Inc
 
PDF
Assetic (Symfony Live Paris)
Kris Wallsmith
 
PDF
Drupal & javascript
Almog Baku
 
PPTX
Becoming A Drupal Master Builder
Philip Norton
 
PPTX
8 things to know about theming in drupal 8
Logan Farr
 
ZIP
Drupal Development (Part 2)
Jeff Eaton
 
PDF
Backbone
Glenn De Backer
 
PDF
Introduction to backbone presentation
Brian Hogg
 
ZIP
Learning the basics of the Drupal API
Alexandru Badiu
 
KEY
Bcblackpool jquery tips
Jack Franklin
 
PDF
Drupal Step-by-Step: How We Built Our Training Site, Part 1
Acquia
 
PDF
Drupal 8, Where Did the Code Go? From Info Hook to Plugin
Acquia
 
PDF
Drupal theming - a practical approach (European Drupal Days 2015)
Eugenio Minardi
 
KEY
【前端Mvc】之豆瓣说实践
taobao.com
 
PDF
TurboGears2 Pluggable Applications
Alessandro Molina
 
JavaScript in Drupal 7: What developers need to know
katbailey
 
Drush - use full power - DrupalCamp Donetsk 2014
Alex S
 
Drupal image gallery_workshop
Heather Bohn
 
10 Things Every Plugin Developer Should Know (WordCamp Atlanta 2013)
arcware
 
Using RequireJS with CakePHP
Stephen Young
 
How to migrate Cakephp 1.x to 2.x
Andolasoft Inc
 
Assetic (Symfony Live Paris)
Kris Wallsmith
 
Drupal & javascript
Almog Baku
 
Becoming A Drupal Master Builder
Philip Norton
 
8 things to know about theming in drupal 8
Logan Farr
 
Drupal Development (Part 2)
Jeff Eaton
 
Backbone
Glenn De Backer
 
Introduction to backbone presentation
Brian Hogg
 
Learning the basics of the Drupal API
Alexandru Badiu
 
Bcblackpool jquery tips
Jack Franklin
 
Drupal Step-by-Step: How We Built Our Training Site, Part 1
Acquia
 
Drupal 8, Where Did the Code Go? From Info Hook to Plugin
Acquia
 
Drupal theming - a practical approach (European Drupal Days 2015)
Eugenio Minardi
 
【前端Mvc】之豆瓣说实践
taobao.com
 
TurboGears2 Pluggable Applications
Alessandro Molina
 

Similar to Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009 (20)

PPT
Calais @ the Palo Alto Semantic Web Meetup
Krista Thomas
 
PPT
Calais @ the SD Forum
Krista Thomas
 
PPT
Open Calais Release 4.0
Krista Thomas
 
PPT
Final Calais For ONA
Krista Thomas
 
PPT
Calais For Ona
Thomas Tague
 
PPT
JahiaOne - Semantic Web with Jahia
Jahia Solutions Group
 
PDF
12 Things the Semantic Web Should Know about Content Analytics
Seth Grimes
 
PPT
Intro To The Calais Web Service @ OpenCalais.com
Krista Thomas
 
PPT
Five Ways To Calais V01
Thomas Tague
 
PPTX
Structured Data and Semantic SEO
Matthew Brown
 
PPT
SemanticWebApp
Adela Beres
 
PPT
The Semantic Web An Introduction
shaouy
 
PPT
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
Joel Amoussou
 
PDF
Drupal as a Semantic Web platform - ISWC 2012
scorlosquet
 
PPTX
Recent Trends in Semantic Search Technologies
Thanh Tran
 
PPTX
Sem tech2013 tutorial
Thengo Kim
 
PPTX
Presentation on SEO, .htaccess, Open-source, Ontology, Semantic web, etc.
Indian Institute of Mnagement, Ahmedabad
 
PDF
The Semantic Web and Drupal 7 - Loja 2013
scorlosquet
 
PPTX
Taking Advantage of Semantic Web
Sankaranand Balaradjou
 
PDF
Drupal and RDF
scorlosquet
 
Calais @ the Palo Alto Semantic Web Meetup
Krista Thomas
 
Calais @ the SD Forum
Krista Thomas
 
Open Calais Release 4.0
Krista Thomas
 
Final Calais For ONA
Krista Thomas
 
Calais For Ona
Thomas Tague
 
JahiaOne - Semantic Web with Jahia
Jahia Solutions Group
 
12 Things the Semantic Web Should Know about Content Analytics
Seth Grimes
 
Intro To The Calais Web Service @ OpenCalais.com
Krista Thomas
 
Five Ways To Calais V01
Thomas Tague
 
Structured Data and Semantic SEO
Matthew Brown
 
SemanticWebApp
Adela Beres
 
The Semantic Web An Introduction
shaouy
 
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
Joel Amoussou
 
Drupal as a Semantic Web platform - ISWC 2012
scorlosquet
 
Recent Trends in Semantic Search Technologies
Thanh Tran
 
Sem tech2013 tutorial
Thengo Kim
 
Presentation on SEO, .htaccess, Open-source, Ontology, Semantic web, etc.
Indian Institute of Mnagement, Ahmedabad
 
The Semantic Web and Drupal 7 - Loja 2013
scorlosquet
 
Taking Advantage of Semantic Web
Sankaranand Balaradjou
 
Drupal and RDF
scorlosquet
 
Ad

More from Krista Thomas (13)

PDF
Ad.ly Introduction
Krista Thomas
 
PDF
San diego
Krista Thomas
 
PDF
San diego
Krista Thomas
 
PPT
The OpenCalais Workshop at WeMedia 2010
Krista Thomas
 
PPT
Open Calais Workshop at WeMedia 2010
Krista Thomas
 
PDF
Web 3 0 Krista Thomas 1 26 10
Krista Thomas
 
PPT
Web 3 0 Krista Thomas 1 26 10
Krista Thomas
 
PDF
Simple OpenCalais Whitepaper
Krista Thomas
 
PDF
OpenCalais At The San Diego Software Industry Council
Krista Thomas
 
PDF
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
Krista Thomas
 
PPT
Open Calais @ Transparent Text
Krista Thomas
 
PDF
Tague Semtech Keynote 2009
Krista Thomas
 
PDF
Open Calais For SF And LA Meetups
Krista Thomas
 
Ad.ly Introduction
Krista Thomas
 
San diego
Krista Thomas
 
San diego
Krista Thomas
 
The OpenCalais Workshop at WeMedia 2010
Krista Thomas
 
Open Calais Workshop at WeMedia 2010
Krista Thomas
 
Web 3 0 Krista Thomas 1 26 10
Krista Thomas
 
Web 3 0 Krista Thomas 1 26 10
Krista Thomas
 
Simple OpenCalais Whitepaper
Krista Thomas
 
OpenCalais At The San Diego Software Industry Council
Krista Thomas
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
Krista Thomas
 
Open Calais @ Transparent Text
Krista Thomas
 
Tague Semtech Keynote 2009
Krista Thomas
 
Open Calais For SF And LA Meetups
Krista Thomas
 
Ad

Recently uploaded (20)

PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PPTX
Using Google Data Studio (Looker Studio) to Create Effective and Easy Data Re...
Orage Technologies
 
PDF
Basics of Electronics for IOT(actuators ,microcontroller etc..)
arnavmanesh
 
PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PPTX
Agile Chennai 18-19 July 2025 | Workshop - Enhancing Agile Collaboration with...
AgileNetwork
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
Machine Learning Benefits Across Industries
SynapseIndia
 
PDF
OpenInfra ID 2025 - Are Containers Dying? Rethinking Isolation with MicroVMs.pdf
Muhammad Yuga Nugraha
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PPTX
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
PPTX
AI Code Generation Risks (Ramkumar Dilli, CIO, Myridius)
Priyanka Aash
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
Researching The Best Chat SDK Providers in 2025
Ray Fields
 
PPTX
Farrell_Programming Logic and Design slides_10e_ch02_PowerPoint.pptx
bashnahara11
 
PDF
introduction to computer hardware and sofeware
chauhanshraddha2007
 
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Using Google Data Studio (Looker Studio) to Create Effective and Easy Data Re...
Orage Technologies
 
Basics of Electronics for IOT(actuators ,microcontroller etc..)
arnavmanesh
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
Agile Chennai 18-19 July 2025 | Workshop - Enhancing Agile Collaboration with...
AgileNetwork
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Machine Learning Benefits Across Industries
SynapseIndia
 
OpenInfra ID 2025 - Are Containers Dying? Rethinking Isolation with MicroVMs.pdf
Muhammad Yuga Nugraha
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
AI Code Generation Risks (Ramkumar Dilli, CIO, Myridius)
Priyanka Aash
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Researching The Best Chat SDK Providers in 2025
Ray Fields
 
Farrell_Programming Logic and Design slides_10e_ch02_PowerPoint.pptx
bashnahara11
 
introduction to computer hardware and sofeware
chauhanshraddha2007
 
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 

Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009

  • 1. Drupal, Calais & the Semantic Web Prepared by Frank Febbraro, CTO & Presented by Jeff Walpole, CEO
  • 2. Introductions (and sizing each other up)  Raise your hand if you are a…  Technologist?  Journalist?  SemHead?  Raise your hand if you use or have used Drupal?  Calais API?  Lets play word association…  Linked data  RDF  SPARQL  GRDDL
  • 3. Publishing tech Phase2 is working on  CMS frameworks  Drupal & Java Development  Taxonomy solutions  Geo-tagging & Mapping  Charting & Graphing Data  Semantic Web integration  Open Data/APIs  Topic Hubs  Publishing workflow  Feed Syndication  Buzz and topic trend monitoring  Community collaboration sites  Multi-site & virtual site CMS architecture  An open source CMS installation specifically for publishers – called OpenPublish
  • 5. Why We use Drupal for CMS  Performance/Reliability: Dozens of major publishers turn to Drupal and tens of thousands of high traffic sites because it is an enterprise class platform  Ease/Expense of Implementation: As one of the leading shops developing for this platform, we can be as efficient as anyone and this platform is our preferred technology.  Evolving Technology Extensibility: You need something modular/extensible that allows you to add new features easily and we know this is possible with Drupal.  Easier Modular Enhancements: Drupal's architecture is modular and integrates well without requiring customization to core components that would make them difficult to maintain.  P2 Expertise: Our entire development staff of 12+ developers can support you on Drupal and we are known as one of the top firms in the country.  Large Community Support: You need a community that is active, robust, responsive and growing. We are involved in the Drupal community and have an ear to the ground on features and changes that would affect your site.  Easy Staff Training: The Drupal CMS is intuitive and we are well versed in training others to use it. To support training, there are numerous videos, online tutorials, local classes and even books on how it works.  Decreased Support Costs: Publishers find they can do a lot more themselves and when they do need help, the time is a fraction of what a proprietary CMS would cost for similar changes.
  • 6. Drupal Semantic Modules rdf, rdf cck, foaf, relations, sparql, sioc, calais collection http://www.youtube.com/watch?v=r4WgTRIRoa0
  • 8. How does Calais work? 1. Categorizes and metatags the people, places, companies, facts and events in your content to make it ‘machine-readable,’ and returns that metadata to you. 2. Makes connections between the entities in your content and related data in Wikipedia, GeoNames, the IMDB, Shopping.com and more 3. Empowers you to share your metadata with search engines, news aggregators, ‘related stories’ applications and others in the content ecosystem.
  • 9. <Topic>M&A</Topic> <Acquisition offset="494" length="130"> <Company_Acquirer>Reuters</Company_Acquirer> <Company_Acquired>ClearForest Ltd.</Company_Acquired> <Status>Planned</Status> </Acquisition> <Company>Reuters</Company> <Company>ClearForest Ltd.</Company> Reuters Announced the Acquisition of ClearForest New York - April 30, 2007 Reuters, the global information company, has entered into an agreement to acquire all of the outstanding shares of ClearForest Ltd., a privately held provider of Text Analytics solutions, whose tagging platform and analytical products allow clients to derive precise business information from huge amounts of textual content. ClearForest has received sufficient shareholder approval to complete the transaction, which is expected to close in approximately 30 days, subject to customary closing conditions. The financial terms were not disclosed. Reuters plans to retain and continue to work with the existing management team and their highly skilled workforces in the US and Israel. It also plans to continue to support existing products and customers. Reuters believes that search will be a pivotal element to the future of how financial information is sourced and consumed. As part of its drive into this space, Reuters has created a new strategic group and appointed Gerry Campbell, who will oversee the integration of ClearForest and drive this innovation. <Product>Text Analytic Solution </Product> <Company>ClearForest Ltd.</Company> <Company>Reuters</Company> <Country>United States</Country> <Country>Israel</Country> <Company>Reuters</Company> <Person>Gerry Campbell</Person> <ManagementChange offset="2789" length="92"> <Person>Gerry Campbell</Person> <Company>Reuters</Company> <Action>Enters</Position> </ManagementChange> What Would that Look Like (in code)?
  • 10. DrupalDrupal Calais for DrupalCalais for Drupal Calais Collection
  • 11. What does Calais for Drupal Look Like? •suggest terms allowing full user control of the tagging (think of del.icio.us recommending tags).
  • 12. Calais Terms or Taxonomy Terms?
  • 17. RDF for your nodes
  • 19. DrupalDrupal Calais for DrupalCalais for Drupal More Like ThisMore Like This Topic HubsTopic HubsGeoGeo More Like This
  • 24. DrupalDrupal Calais for DrupalCalais for Drupal More Like ThisMore Like This Topic HubsTopic HubsGeoGeo self organizing content
  • 25. create a contextual expression
  • 26. configure plugins (or define your own)
  • 28. show content in various contexts
  • 29. they are panels, so rearrange
  • 30. DrupalDrupal Calais for DrupalCalais for Drupal Linked DataLinked Data More Like ThisMore Like This Topic HubsTopic HubsGeoGeo Linked Data
  • 32. Linked Data it’s all about the URIs Drupal: http://dbpedia.org/resource/Drupal Washington DC: http://d.opencalais.com/er/geo/city/ralg-geo1/f497898f-2b9b-7cda- ec7b-85d896acbe3e Calais linked data for humans
  • 36. on a map, but wait, there’s more
  • 37. dbpedia data (or other sources)
  • 39. company data from dbpedia
  • 40. Calais URI for Toyota
  • 41. Get the next link to follow
  • 42. SPARQL query to get the data from DBPedia
  • 43. render it to html, voilà
  • 44. DrupalDrupal Calais for DrupalCalais for Drupal Linked DataLinked Data More Like ThisMore Like This Topic HubsTopic HubsGeoGeo MarmosetMarmoset Marmoset: microformats for search agents
  • 45. The Big Picture – OpenPublish DrupalDrupal Calais for DrupalCalais for Drupal Linked DataLinked Data More Like ThisMore Like This Topic HubsTopic HubsGeoGeo MarmosetMarmoset
  • 47. Enough Talk - lets see a demo…
  • 48. Q&A

Editor's Notes

  • #7: Developing quite a few great SemWeb modules too. Arto is a maniac
  • #8: Calais provides the Semantic Engine for OpenPublish. It gives us the context to the world outside if our site. So lets talk about how Calais and Drupal work together.
  • #11: Has anyone used Calais? This represents the core of our discussions. The Calais module sits at the epicenter of this collection of modules. It is an API and integration with nodes. It provides auto tagging of your nodes, and these other modules we developed site on top of the Calais data to drive the power of the meta data into your site and to your users.
  • #13: As I said, Calais is an auto-tagger. It’s really just a taxonomy integration. Calais Terms are like the maternal twin of Taxonomy. We wanted to make use of taxonomy for the added benefits.
  • #14: How is it configured? Calais is configured per content type.
  • #15: Saving is where the magic happens.
  • #16: Use the relevance threshold to limit the amount of noise, you can also blacklist terms, subsititue, hook into, etc.
  • #17: Autodiscovery links allow bots, browsers, readers, etc to find content in other formats related to the current page. Seen here there are a few other related content formats, the application/rdf+xml is the related Calais RDF document in XML form.
  • #18: RDF is great for representing data, but awful for your eyes. That is why semwebbers all wear glasses. This is the #1 comment I have received. RDFa is a method for embedding RDF data into XHTML documents. GRDDL can be used to transform it into RDF. We did not tackle RDFa YET!!! in Calais b/c this is an area that is beign worked on and integrated into D7 (at the theme layer) and has already begun. Might be a nice back-port though.
  • #19: RDF can turn you into stone.
  • #20: A collection of modules that consist of a core “framework” module that provides a plugin architecture allowing modules to provide related content. On or off site content.
  • #21: Start with a More Like This Thumbprint (Terms). This is the thumbprint of a node, the terms that you feel most accurately represent the essence of your node content. In here you will select or enter terms, or have Calais prefill. Calais returns a relevancy score, we can use that to prefill these automatically.
  • #22: Configure the relevance score that a term must have to be automagically applied.
  • #23: When viewing a node, it now provides other relevant on site nodes matched based on taxonomy.
  • #24: It also does off site searching, seen here using Yahoo’s BOSS, Build your Own Search Service.
  • #25: Topic Hubs are site pages that aggregate content based on inclusion in taxonomy expressions.
  • #26: Here is where you can build your expressions. You can broaden or narrow the scope based on the expression you create. But simply put, all nodes/comments, etc that match this expression will be present in your topichub.
  • #27: There are a variety of plugins, or you can define your own.
  • #28: This represents how the various plugins represent the content on your site that is matching your contextual expression
  • #29: The map provides some nice features. Showing your content based on geo graphical terms. Cities, States or Countries.
  • #30: They are just panels so add whatever you want. Node content, views, blocks, define your own. What makes the TopicHub plugins unique is that they respond to the context of your Hub, using the expression.
  • #31: Linked Data refers to the linking of RDF datasets across the Semantic Web. Sony referenced over here, is the same Sony talked about over there. This has been a huge goal of the semantic web for quite some time and it is finally alive.
  • #32: Diagram shows the Linked Data world. There are new datasets being release all the time and this diagram is already obsolete as the Calais Linked Data is not in there
  • #33: Diagram shows the Linked Data world. There are new datasets being release all the time and this diagram is already obsolete as the Calais Linked Data is not in there
  • #34: Again, RDF is ugly.
  • #35: DBPedia human-readable data.
  • #36: Calais has disambiguated these geographical terms and provided lat/lon for us.
  • #37: But the Calais Linked Data URI allow much more.
  • #38: Here we are showing additional data retrieved from DBPedia
  • #39: Article about Toyota having a rough go at it. Who would have thought a car company would be in financial trouble in this day and age!?!?!
  • #40: This grabs the most relevant company from Calais and if it is disambiguated, looks up data on DBPedia.
  • #41: This is a view of the Taxonomy Term edit screen. The Calais Term for Toyota has the following Linked Data URI.
  • #42: With that URI, we grab the RDF from Calais for the disambiguated company. That RDF doc returned has a link to the DBPedia resource that is “the same as” this resource.
  • #43: With that Resource URI, we create a SPARQL query to get data from the DBPedia via it’s SPARQL endpoint. (endpoint is just a fancy name for webservice that responds to SPARQL queries)
  • #44: We then render the resultant data into HTML. Easy as Pie.
  • #45: Recognizes search bots (configurable) and sends your page to Calais and injects microformats into the body of your page that crawlers such as Yahoo SearchMonkey can comprehend. So what does this pyramid bring us to?
  • #46: OpenPublish is a Drupal semantic publishing platform. It consists of Drupal, and Install profile, and a number of Modules that we have combined to provide a great starting point for publishers filled with best practices from our experience. There is nothing you could not build yourself, but we have combined things you would likely want to safe you a few (or few hundred) hours. Save a newspaper.
  • #47: Go and download it, install it, kick the tires. Let us know your thoughts. We love feedback.
  • #49: We will be showing people how to install and configure OpenPublish and the Calais Collection modules. Work through issues, give feedback, provide ideas.