Thesauri, Controlled Vocabularies, and Metadata
  Information Architecture
Why?
  - They offer a way to view the network of relationships between the IA systems.
  - They are the glue that holds the systems together.
Metadata
  “Data about data.”
  Metadata provides information about, or documentation of, other data managed within an application or environment.
  For example: data about elements or attributes (name, size, data type, etc.).
Metadata
  Metadata tags are used to describe documents, pages, images, software, video and audio files, and other content objects for the purpose of improving navigation and retrieval.
  Example: <META name="keywords" content="information architecture, content management, knowledge management, user experience">
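To make the keywords tag concrete, here is a minimal sketch of how a program might read the terms back out of a page. It uses Python's standard html.parser module; the class name KeywordMetaParser, the helper extract_keywords, and the sample page are illustrative assumptions, not part of the slides.

```python
from html.parser import HTMLParser


class KeywordMetaParser(HTMLParser):
    """Collects the terms listed in <meta name="keywords" content="...">."""

    def __init__(self):
        super().__init__()
        self.keywords = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling us.
        if tag != "meta":
            return
        attributes = dict(attrs)
        if (attributes.get("name") or "").lower() == "keywords":
            content = attributes.get("content") or ""
            # Split the comma-separated list and drop empty entries.
            self.keywords.extend(
                term.strip() for term in content.split(",") if term.strip()
            )


def extract_keywords(html):
    """Return the keyword terms declared in an HTML document (hypothetical helper)."""
    parser = KeywordMetaParser()
    parser.feed(html)
    return parser.keywords


# Sample page built around the tag from the slide.
page = (
    '<html><head><META name="keywords" content="information architecture, '
    'content management, knowledge management, user experience"></head></html>'
)
keywords = extract_keywords(page)
print(keywords)
```

The extracted terms are only as useful as the vocabulary behind them, which is why the slides go on to pair metadata with controlled vocabularies.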
Metadata
  Metadata-driven websites take advantage of:
  - Content management software
  - Controlled vocabularies
  We only need to describe the documents; the software and the vocabulary take care of the rest.
Controlled Vocabulary
Controlled Vocabularies
  A controlled vocabulary is either a list of equivalent terms, in the form of a synonym ring, or a list of preferred terms, in the form of an authority file.
  It is a subset of natural language.
Types of Controlled Vocabularies (from simple to complex)
  Vocabularies:  Synonym Rings -> Authority Files -> Classification Schemes -> Thesauri
  Relationships: Equivalence -> Hierarchical -> Associative
Controlled Vocabularies: Synonym Rings
  A synonym ring connects a set of words that are defined as equivalent for the purpose of retrieval.
  Example ring: Cuisinart, Cuizinart, food processor, blender, Kitchen aid, Kitchenaid
Synonym Rings
  Pros:
  - Help users locate information using different terms.
  - Can be easily implemented using the standard capabilities of search engines.
  - Increase recall.
  Cons:
  - Users can be confused by results that don't actually include their keywords.
  - Might reduce precision.
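A synonym ring can be sketched as a set of terms that a search function treats as interchangeable. The example below is a minimal illustration, assuming a naive substring search over a hypothetical three-item catalog; the names SYNONYM_RINGS, expand_query, and search are invented for this sketch, and only the ring members come from the slide.

```python
# One ring per set; every member is treated as equivalent at query time.
SYNONYM_RINGS = [
    {"cuisinart", "cuizinart", "food processor", "blender",
     "kitchen aid", "kitchenaid"},
]


def expand_query(query, rings=SYNONYM_RINGS):
    """Return every term equivalent to the query, or the query alone."""
    term = query.strip().lower()
    for ring in rings:
        if term in ring:
            return sorted(ring)
    return [term]


def search(query, documents):
    """Naive substring search over the expanded query terms."""
    terms = expand_query(query)
    return [doc for doc in documents if any(t in doc.lower() for t in terms)]


# Hypothetical catalog entries, for illustration only.
documents = [
    "Cuisinart 14-cup food processor",
    "KitchenAid stand mixer",
    "Cast iron skillet",
]
results = search("cuizinart", documents)
print(results)
```

The misspelled query still finds the Cuisinart entry (higher recall), but it also returns the KitchenAid mixer, which does not contain the user's keyword at all. That is the confusion and loss of precision listed under the cons.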
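Synonym Rings
  Recall vs. precision trade-off: expanding a query with synonyms retrieves more of the relevant documents (higher recall) but may also retrieve more irrelevant ones (lower precision).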
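The trade-off can be made concrete with the standard definitions: precision is the share of retrieved documents that are relevant, and recall is the share of relevant documents that were retrieved. The sketch below computes both for two made-up result sets; the document IDs and the function name precision_recall are assumptions for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a set of retrieved document IDs."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical collection: four documents are actually relevant.
relevant = {"d1", "d2", "d3", "d4"}

# Exact keyword match finds only two of them, but nothing irrelevant.
exact_match = {"d1", "d2"}

# Synonym expansion finds all four, plus four irrelevant documents.
with_synonyms = {"d1", "d2", "d3", "d4", "d5", "d6", "d7", "d8"}

print(precision_recall(exact_match, relevant))    # (1.0, 0.5)
print(precision_recall(with_synonyms, relevant))  # (0.5, 1.0)
```

In this invented case, adding the synonym ring doubles recall and halves precision; real collections will show different numbers, but the direction of the trade-off is the same.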
Authority Files
  An authority file is a list of preferred terms or acceptable values. It may include variants or synonyms.
  Authority files are synonym rings in which one term has been defined as the preferred term.
Authority Files
  Example: a list of U.S. states
    AL :: Alabama
    AK :: Alaska
    AZ :: Arizona
    AR :: Arkansas
    ...
Authority Files
  Pros:
  - Can be a tool for improving consistency among content authors and indexers.
  - Can be used to “educate” users.
  - Preferred terms are useful for labeling and navigation.
  Cons:
  - If equivalent terms begin with different letters, the preferred terms must be complemented with “see” links from the other terms.
    Example: Bayer see Aspirin
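An authority file can be sketched as a mapping from each preferred term to its accepted variants, inverted into a lookup table so that any variant resolves to the preferred form. In the sketch below the two-letter codes are treated as the preferred values, which is an assumption about the state list; the abbreviation "Ariz." and the names AUTHORITY_FILE, build_lookup, and resolve are illustrative additions as well.

```python
# Preferred value -> variants. Treating the postal code as preferred is an
# assumption for this sketch; "Ariz." is a hypothetical extra variant.
AUTHORITY_FILE = {
    "AL": ["Alabama"],
    "AK": ["Alaska"],
    "AZ": ["Arizona", "Ariz."],
    "AR": ["Arkansas"],
}


def build_lookup(authority_file):
    """Map every preferred term and variant (case-insensitively) to the preferred term."""
    lookup = {}
    for preferred, variants in authority_file.items():
        for term in [preferred, *variants]:
            lookup[term.strip().casefold()] = preferred
    return lookup


LOOKUP = build_lookup(AUTHORITY_FILE)


def resolve(term):
    """Return the preferred term for an entry, or None if it is not in the file."""
    return LOOKUP.get(term.strip().casefold())


print(resolve("alabama"))   # AL
print(resolve(" Ariz. "))   # AZ
print(resolve("Texas"))     # None
```

Returning None for unknown values is what lets an authority file enforce consistency: an indexing tool can reject or flag anything that does not resolve.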
Classification Schemes
  A classification scheme, or taxonomy, is a hierarchical arrangement of preferred terms.
  Examples:
  - Dewey Decimal Classification (DDC)
  - Yahoo! hierarchy of categories
Controlled Vocabulary: Thesauri
  “A thesaurus is a controlled vocabulary in which equivalence, hierarchical, and associative relationships are identified for purposes of improved retrieval.”
Controlled Vocabulary
  [Diagram: a preferred term at the center, linked to its neighbors by three kinds of relationship]
  - Hierarchical relationship: Broader Term, Narrower Term
  - Equivalence relationship: Variant Terms
  - Associative relationship: Related Terms
Controlled Vocabulary: Technical Lingo
  - Preferred Term (PT)
  - Variant Term (VT)
  - Broader Term (BT)
  - Narrower Term (NT)
  - Related Term (RT)
  - Use (U)
  - Used For (UF)
  - Scope Note (SN)
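The abbreviations above map naturally onto a small data structure. The following is a minimal sketch, not a standard thesaurus format: the Term and Thesaurus classes and their method names are assumptions, while the sample terms are taken from elsewhere in this deck. Hierarchical and associative links are stored in both directions so that BT/NT and RT stay reciprocal.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    preferred: str                                # PT
    scope_note: str = ""                          # SN
    used_for: set = field(default_factory=set)    # UF: variant terms (VT)
    broader: set = field(default_factory=set)     # BT
    narrower: set = field(default_factory=set)    # NT
    related: set = field(default_factory=set)     # RT


class Thesaurus:
    def __init__(self):
        self.terms = {}   # preferred term -> Term record
        self.use = {}     # U: variant term -> preferred term

    def add_term(self, preferred, scope_note=""):
        """Return the record for a preferred term, creating it if needed."""
        return self.terms.setdefault(preferred, Term(preferred, scope_note))

    def add_variant(self, variant, preferred):
        # Equivalence: "variant U preferred" and "preferred UF variant".
        self.add_term(preferred).used_for.add(variant)
        self.use[variant] = preferred

    def add_broader(self, narrower, broader):
        # Hierarchical: record BT and the reciprocal NT together.
        self.add_term(narrower).broader.add(broader)
        self.add_term(broader).narrower.add(narrower)

    def add_related(self, a, b):
        # Associative: RT is symmetric.
        self.add_term(a).related.add(b)
        self.add_term(b).related.add(a)

    def lookup(self, word):
        """Resolve a preferred or variant term to its record, or None."""
        return self.terms.get(self.use.get(word, word))


thesaurus = Thesaurus()
thesaurus.add_variant("Bayer", "Aspirin")
thesaurus.add_variant("ASA", "Aspirin")
thesaurus.add_broader("Viral Pneumonia", "Virus Diseases")
thesaurus.add_related("Cardiology", "Heart")

print(thesaurus.lookup("Bayer").preferred)          # Aspirin
print(thesaurus.terms["Virus Diseases"].narrower)   # {'Viral Pneumonia'}
```

Keeping the reciprocal links in one method is a design choice that avoids the most common thesaurus maintenance error, a BT with no matching NT.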
Controlled Vocabulary
  Example of a thesaurus in web design:
  PubMed (National Library of Medicine): http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
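Types of Thesauri
  [Diagram: a two-by-two grid, by whether the thesaurus is used in indexing and whether it is used in searching]
  - Classic thesaurus (used in indexing and in searching): high-end, full-function tool.
  - Indexing thesaurus (used in indexing only): enables browsable indexes; value untapped by search.
  - Searching thesaurus (used in searching only): no tagging of content; can enrich queries.
  - No thesaurus: natural language search.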
Semantic Relationships: Equivalence
  Connects preferred terms and their variants.
  Example:
    Preferred term: Aspirin
    Variant terms: Acetysal, Acetylsalicylic Acid, ASA, Bayer, Polopirin
  [Diagram: A = B]
Semantic Relationships: Hierarchical
  Divides up the information space into categories and subcategories.
  Subtypes:
  - Generic
  - Whole-part
  - Instance
  [Diagram: A and B]
 
Semantic Relationships: Associative
  Strongly implied semantic connections that aren't captured within equivalence or hierarchical relationships.
  Examples:
  - Field and object of study: Cardiology RT Heart
  - Process and its agent: Termite Control RT Pesticides
  - Concepts and properties: Poison RT Toxicity
  - Action and product: Eating RT Indigestion
  - Causal dependency: Celebration RT New Year's Eve
  [Diagram: B and A]
Preferred Terms: Term Form
  - Grammatical form: usually nouns.
  - Spelling: the most common spelling employed by users.
  - Singular and plural form: count nouns in the plural (e.g., cars, roads, maps); conceptual nouns in the singular (e.g., math).
  - Abbreviations and acronyms: default to popular use.
Preferred Terms: Term Selection
  Term selection should be guided by your goals and by how the thesaurus will integrate with your website.
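Preferred Terms: Term Definition
  Terms must be extremely specific, because the goal is to control the vocabulary. Parenthetical qualifiers distinguish terms that are spelled alike.
  Examples:
  - Cells (biology)
  - Cells (electric)
  - Cells (prison)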
Preferred Terms: Term Specificity
  Whether or not to pre-coordinate terms. For example, one compound term:
    “Knowledge Management Software”
  or two separate terms:
    “Knowledge Management” and “Software”
  The decision depends on your context.
Polyhierarchy
  Polyhierarchy allows multiple parents for a single node.
  Example: under Diseases, Viral Pneumonia is listed under both Respiratory Tract Infections and Virus Diseases.
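A polyhierarchy is no longer a tree but a directed acyclic graph, so each term needs a list of parents rather than a single one. The sketch below assumes one reading of the slide's example, in which Viral Pneumonia sits under both Respiratory Tract Infections and Virus Diseases and both of those sit under Diseases; the names PARENTS, ancestors, and paths_to_root are illustrative.

```python
# child -> list of parents (more than one parent is allowed)
PARENTS = {
    "Diseases": [],
    "Respiratory Tract Infections": ["Diseases"],
    "Virus Diseases": ["Diseases"],
    "Viral Pneumonia": ["Respiratory Tract Infections", "Virus Diseases"],
}


def ancestors(term, parents=PARENTS):
    """Return every broader term reachable from the given term."""
    seen = set()
    stack = list(parents.get(term, []))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(parents.get(node, []))
    return seen


def paths_to_root(term, parents=PARENTS):
    """Return each path from a top-level term down to the given term."""
    if not parents.get(term):
        return [[term]]
    return [
        path + [term]
        for parent in parents[term]
        for path in paths_to_root(parent, parents)
    ]


print(sorted(ancestors("Viral Pneumonia")))
for path in paths_to_root("Viral Pneumonia"):
    print(" > ".join(path))
```

Each path corresponds to one place where the term would appear in a browsable hierarchy, so a user can reach Viral Pneumonia by either route.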
Faceted Classification
  Invented by Shiyali R. Ranganathan in the 1930s.
  Main principle: documents and objects have multiple dimensions, or facets.
Faceted Classification
  A faceted classification uses multiple taxonomies, each focused on a different dimension of the content.
Faceted Classification
  Ranganathan's universal facets:
  - Personality
  - Matter
  - Energy
  - Space
  - Time
Faceted Classification
  Most common facets used in the business world:
  - Topic
  - Product
  - Document Type
  - Audience
  - Geography
  - Price
Faceted Classification
  Example of a faceted classification in a website: wine.com

  Facet                   Sample controlled vocabulary values
  Type                    Red, White, Sparkling, Pink, Dessert
  Region (origin)         Australian, Californian, French, Italian
  Winery (manufacturer)   Blackstone, Clos du Bois, Cakebread
  Year                    1969, 1990, 1999, 2000, 2001, 2002
  Price                   $3.99, $29.99, <$199, Cheap, Moderate, Expensive
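Faceted browsing of this kind can be sketched as filtering a list of items on any combination of facet values, then counting the values that remain for the next refinement. The wine records, their field names, and the functions filter_by_facets and facet_counts below are invented for illustration; only the facet names and vocabulary values echo the table.

```python
from collections import Counter

# Hypothetical catalog; each item carries one value per facet.
WINES = [
    {"name": "Wine A", "type": "Red", "region": "Californian",
     "winery": "Blackstone", "year": 2001},
    {"name": "Wine B", "type": "White", "region": "Californian",
     "winery": "Clos du Bois", "year": 2002},
    {"name": "Wine C", "type": "Red", "region": "French",
     "winery": "Cakebread", "year": 1990},
    {"name": "Wine D", "type": "Red", "region": "Californian",
     "winery": "Cakebread", "year": 2000},
]


def filter_by_facets(items, **selected):
    """Keep the items that match every selected facet value."""
    return [
        item for item in items
        if all(item.get(facet) == value for facet, value in selected.items())
    ]


def facet_counts(items, facet):
    """Count how many items fall under each value of one facet."""
    return Counter(item[facet] for item in items)


reds = filter_by_facets(WINES, type="Red")
californian_reds = filter_by_facets(WINES, type="Red", region="Californian")

print([wine["name"] for wine in californian_reds])  # ['Wine A', 'Wine D']
print(facet_counts(reds, "region"))
```

Because each facet is an independent dimension, the user can apply them in any order and arrive at the same result set, which is the main practical advantage over a single fixed hierarchy.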
Faceted Classification
  More information about faceted classification:
  - KMconnection: http://www.kmconnection.com/DOC100100.htm
  - Presentation of Faceted Classification: http://www.asis.org/Conferences/Summit2002/Gruenberg.ppt
  - Innovation in classification: http://www.peterme.com/archives/00000063.html
Thesauri

  • 1. Thesauri, Controlled Vocabularies, and Metadata Information Architecture
  • 2. Why? A way to view the network of relationships between the IA systems. The glue that holds the systems together.
  • 3. Metadata: “Data about data.” Provides information about, or documentation of, other data managed within an application or environment. For example: data about elements or attributes (name, size, data type, etc.).
  • 4. Metadata: Metadata tags are used to describe documents, pages, images, software, video and audio files, and other content objects for the purpose of improving navigation and retrieval. Example: <META name="keywords" content="information architecture, content management, knowledge management, user experience">
  • 5. Metadata: Metadata-driven web sites take advantage of content management software and a controlled vocabulary. We only need to describe the documents; the software and the vocabulary take care of the rest.
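As a minimal sketch of the metadata-driven idea in slides 4 and 5, the snippet below renders a keywords tag only from terms that belong to a controlled vocabulary. The function name `keywords_meta` and the `VOCABULARY` set are hypothetical; the terms are taken from the slide's example tag.

```python
from html import escape

# Hypothetical controlled vocabulary; terms come from the slide's example tag.
VOCABULARY = {
    "information architecture",
    "content management",
    "knowledge management",
    "user experience",
}


def keywords_meta(terms, vocabulary=VOCABULARY):
    """Render a keywords meta tag, rejecting terms outside the vocabulary."""
    normalized = [t.strip().lower() for t in terms]
    unknown = [t for t in normalized if t not in vocabulary]
    if unknown:
        raise ValueError(f"not in controlled vocabulary: {unknown}")
    # Escape the attribute value so stray quotes cannot break the tag.
    content = escape(", ".join(normalized), quote=True)
    return f'<meta name="keywords" content="{content}">'


tag = keywords_meta(["Information Architecture", "user experience"])
```

Rejecting unknown terms at tagging time is one possible design; a real content management system might instead suggest the nearest preferred term.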
  • 7. Controlled Vocabularies: A controlled vocabulary is a list of equivalent terms in the form of a synonym ring, or a list of preferred terms in the form of an authority file. It is a subset of natural language.
  • 8. Types of Controlled Vocabularies, from simple to complex. Vocabularies: Synonym Rings, Authority Files, Classification Schemes, Thesauri. Relationships: Equivalence, Hierarchical, Associative.
  • 9. Controlled Vocabularies: Synonym Rings. A synonym ring connects a set of words that are defined as equivalent for the purpose of retrieval. Example: Cuisinart, Food processor, Blender, Kitchen aid, Kitchenaid, Cuizinart.
  • 10. Synonym Rings. Pros: help users locate information using different terms; can be easily implemented using standard capabilities of search engines; increase recall. Cons: users can be confused by results that don’t actually include their keywords; might reduce precision.
  • 11. Synonym Rings: the recall versus precision trade-off.
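The trade-off in slides 9 to 11 can be illustrated with a small sketch. The ring uses the slide's example terms; the four-document corpus and the set of "relevant" documents are invented for illustration, and the substring matching is a deliberate simplification of what a search engine would do.

```python
# Synonym ring built from the slide's example terms.
SYNONYM_RING = {
    "cuisinart", "cuizinart", "food processor",
    "blender", "kitchen aid", "kitchenaid",
}

# Toy corpus (invented for illustration): document id -> title.
DOCS = {
    1: "Cuisinart 14-cup food processor review",
    2: "KitchenAid food chopper manual",
    3: "Blender smoothie recipes",
    4: "Knife sharpening guide",
}


def expand_query(query, ring):
    """Return every term in the ring if the query is a member, else the query."""
    q = query.strip().lower()
    return set(ring) if q in ring else {q}


def search(query, docs, ring):
    """Return ids of documents containing any of the (expanded) query terms."""
    terms = expand_query(query, ring)
    return {doc_id for doc_id, text in docs.items()
            if any(term in text.lower() for term in terms)}


def precision_recall(retrieved, relevant):
    """Precision = hits / retrieved; recall = hits / relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


RELEVANT = {1, 2}  # assumed human judgment: documents about food processors

plain = search("food processor", DOCS, set())            # no ring
expanded = search("food processor", DOCS, SYNONYM_RING)  # with ring
```

With these assumed judgments, the plain search finds one of the two relevant documents (precision 1.0, recall 0.5), while the expanded search finds both but also returns the blender document (precision 2/3, recall 1.0).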
  • 12. Authority Files: An authority file is a list of preferred terms or acceptable values. It may include variants or synonyms. Authority files are synonym rings in which one term has been defined as the preferred term.
  • 13. Authority Files. Example: a list of U.S. states: AL :: Alabama, AK :: Alaska, AZ :: Arizona, AR :: Arkansas, . . .
  • 14. Authority Files. Pros: can be a tool for improving consistency among content authors and indexers; can be used to “educate” users; preferred terms are useful for labeling and navigation. Cons: if equivalent terms begin with different letters, preferred terms must be complemented with links from the other terms. Example: Bayer, see Aspirin.
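A rough sketch of how an authority file can enforce consistency: every accepted variant maps to one preferred term. The entries are the four states from slide 13; treating the full state name as the preferred term and the postal code as its variant is an assumption, and `normalize` is a hypothetical helper name.

```python
# Preferred term -> accepted variants (excerpt from the slide's example).
# Assumption: the full state name is the preferred term.
AUTHORITY_FILE = {
    "Alabama": ["AL"],
    "Alaska": ["AK"],
    "Arizona": ["AZ"],
    "Arkansas": ["AR"],
}


def build_lookup(authority):
    """Map every preferred term and variant (case-insensitive) to its preferred term."""
    lookup = {}
    for preferred, variants in authority.items():
        for term in [preferred, *variants]:
            lookup[term.casefold()] = preferred
    return lookup


LOOKUP = build_lookup(AUTHORITY_FILE)


def normalize(term):
    """Return the preferred term for an input, or None if it is not an accepted value."""
    return LOOKUP.get(term.strip().casefold())
```

An indexer entering "ak" or "Alaska" gets "Alaska" back, while a value outside the file returns `None` and can be flagged for review.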
  • 15. Classification Schemes: Classification schemes, or taxonomies, are hierarchical arrangements of preferred terms. Examples: Dewey Decimal Classification (DDC), the Yahoo! hierarchy of categories.
  • 16. Controlled Vocabulary: Thesauri. “A thesaurus is a controlled vocabulary in which equivalence, hierarchical, and associative relationships are identified for purposes of improved retrieval.”
  • 17. Controlled Vocabulary (diagram). A Preferred Term is linked by equivalence relationships to its Variant Terms, by hierarchical relationships to a Broader Term and a Narrower Term, and by associative relationships to Related Terms.
  • 18. Controlled Vocabulary: Technical Lingo. Preferred Term (PT), Variant Term (VT), Broader Term (BT), Narrower Term (NT), Related Term (RT), Use (U), Used For (UF), Scope Note (SN).
  • 19. Controlled Vocabulary: Example of a thesaurus in web design. PubMed (National Library of Medicine): http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  • 20. Types of Thesauri (matrix of thesaurus used in indexing versus thesaurus used in searching). No Thesaurus: natural language search. Searching Thesaurus: no tagging of content; can enrich queries. Indexing Thesaurus: enables browsable indexes; value untapped by search. Classic Thesaurus: high-end, full-function tool.
  • 21. Semantic Relationships: Equivalence (A = B). Connects preferred terms and their variants. Example: Preferred Term: Aspirin. Variant Terms: Acetysal, Acetylsalicylic Acid, ASA, Bayer, Polopirin.
  • 22. Semantic Relationships: Hierarchical (A and B). Divides up the information space into categories and subcategories. Subtypes: generic, whole-part, instance.
  • 23.  
  • 24. Semantic Relationships: Associative (A and B). Strongly implied semantic connections that aren’t captured within equivalence or hierarchical relationships. Examples: Field and object of study: Cardiology RT Heart. Process and its agent: Termite Control RT Pesticides. Concepts and properties: Poison RT Toxicity. Action and product: Eating RT Indigestion. Causal dependency: Celebration RT New Year’s Eve.
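The three relationship types from slides 21 to 24 can be held in one small data structure. The sketch below is only one possible representation; the `Term` and `Thesaurus` classes are hypothetical, the field names follow the abbreviations on slide 18 (UF, BT, NT, RT, U), and the sample data comes from the slides' own examples.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    name: str
    used_for: set = field(default_factory=set)  # UF: variant terms
    broader: set = field(default_factory=set)   # BT
    narrower: set = field(default_factory=set)  # NT
    related: set = field(default_factory=set)   # RT


class Thesaurus:
    def __init__(self):
        self.terms = {}  # preferred term name -> Term
        self.use = {}    # U: variant (case-folded) -> preferred term name

    def add_term(self, name):
        return self.terms.setdefault(name, Term(name))

    def add_variant(self, preferred, variant):
        """Equivalence: variant U preferred; preferred UF variant."""
        self.add_term(preferred).used_for.add(variant)
        self.use[variant.casefold()] = preferred

    def add_hierarchy(self, broader, narrower):
        """Hierarchical: recorded in both directions (BT and NT)."""
        self.add_term(broader).narrower.add(narrower)
        self.add_term(narrower).broader.add(broader)

    def add_related(self, a, b):
        """Associative: RT is symmetric."""
        self.add_term(a).related.add(b)
        self.add_term(b).related.add(a)

    def resolve(self, word):
        """Return the preferred term for a word, or None if unknown."""
        key = word.casefold()
        if key in self.use:
            return self.use[key]
        for name in self.terms:
            if name.casefold() == key:
                return name
        return None


t = Thesaurus()
t.add_variant("Aspirin", "ASA")
t.add_variant("Aspirin", "Bayer")
t.add_related("Cardiology", "Heart")
t.add_hierarchy("Virus Diseases", "Viral Pneumonia")
```

Here `t.resolve("bayer")` leads to the preferred term "Aspirin", and the related and broader sets can drive "see also" links or query broadening.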
  • 25. Preferred Terms: Term Form. Grammatical form: usually nouns. Spelling: the most common spelling employed by users. Singular and plural form: count nouns in the plural (e.g., cars, roads, maps); conceptual nouns in the singular (e.g., math). Abbreviations and acronyms: default to popular use.
  • 26. Preferred Terms: Term Selection. Term selection should be guided by your goals and by how the thesaurus will integrate with your web site.
  • 27. Preferred Terms: Term Definition. Extreme specificity, because we want to control the vocabulary. Examples: Cells (biology), Cells (electric), Cells (prison).
  • 28. Preferred Terms: Term Specificity. Whether or not to use pre-coordination of terms. For example: “Knowledge Management Software” or “Knowledge Management” plus “Software”. The decision depends on your context.
  • 29. Polyhierarchy: Polyhierarchy allows multiple parents for a single node. Example: under Diseases, Viral Pneumonia appears under both Respiratory Tract Infections and Virus Diseases.
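A polyhierarchy is a directed acyclic graph rather than a tree, which the following sketch makes concrete. The parent links are one reading of the diagram on slide 29 (Viral Pneumonia sitting under both Respiratory Tract Infections and Virus Diseases, which both sit under Diseases); the `ancestors` helper is hypothetical.

```python
# child -> set of parents; a node may have more than one parent.
PARENTS = {
    "Respiratory Tract Infections": {"Diseases"},
    "Virus Diseases": {"Diseases"},
    "Viral Pneumonia": {"Respiratory Tract Infections", "Virus Diseases"},
}


def ancestors(term, parents):
    """Collect every broader term reachable from `term`, visiting each once."""
    seen = set()
    stack = [term]
    while stack:
        node = stack.pop()
        for parent in parents.get(node, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen
```

Because "Diseases" is reachable along two paths, the `seen` set is what keeps it from being collected twice; a browsing interface would similarly need to decide which path to show in a breadcrumb.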
  • 30. Faceted Classification: Introduced by Shiyali R. Ranganathan in the 1930s. Main principle: documents and objects have multiple dimensions, or facets.
  • 31. Faceted Classification: A faceted classification uses multiple taxonomies, each focused on a different dimension of the content.
  • 32. Faceted Classification: Ranganathan’s universal facets: Personality, Matter, Energy, Space, Time.
  • 33. Faceted Classification: Most common facets used in the business world: Topic, Product, Document Type, Audience, Geography, Price.
  • 34. Faceted Classification: Example of a faceted classification in a web site: wine.com. Facet: sample controlled vocabulary values. Type: Red, White, Sparkling, Pink, Dessert. Region (origin): Australian, Californian, French, Italian. Winery (manufacturer): Blackstone, Clos du Bois, Cakebread. Year: 1969, 1990, 199, 2000, 2001, 2002. Price: $3.99, $29.99, <$199, Cheap, Moderate, Expensive.
  • 35. Faceted Classification: More information about faceted classification. KMconnection: http://www.kmconnection.com/DOC100100.htm. Presentation of Faceted Classification: http://www.asis.org/Conferences/Summit2002/Gruenberg.ppt. Innovation in classification: http://www.peterme.com/archives/00000063.html
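The wine example on slide 34 can be sketched as faceted filtering: each item carries one value per facet, and a user narrows the set by picking values in any combination. The wine records below are invented for illustration and use only vocabulary values listed on the slide; `filter_by_facets` and `facet_counts` are hypothetical helper names.

```python
from collections import Counter

# Invented records; facet values are drawn from the slide's sample vocabulary.
WINES = [
    {"name": "Wine A", "type": "Red", "region": "Californian", "year": 2001},
    {"name": "Wine B", "type": "White", "region": "French", "year": 2002},
    {"name": "Wine C", "type": "Red", "region": "French", "year": 2000},
    {"name": "Wine D", "type": "Sparkling", "region": "Italian", "year": 2001},
]


def filter_by_facets(items, **selected):
    """Keep items whose value matches every selected facet."""
    return [item for item in items
            if all(item.get(facet) == value for facet, value in selected.items())]


def facet_counts(items, facet):
    """Count how many items carry each value of one facet."""
    return Counter(item[facet] for item in items)


red_french = filter_by_facets(WINES, type="Red", region="French")
from_2001 = filter_by_facets(WINES, year=2001)
type_counts = facet_counts(WINES, "type")
```

The counts are what a site would typically show next to each facet value, so users can see how many results a further click would leave.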