SlideShare a Scribd company logo
Lecture 01
Information Retrieval
About the Course
 Book:
 An Introduction to Information Retrieval, Christopher D.
Manning Prabhakar Raghavan Hinrich Schütze, Cambridge
University Press, 2009.
 Other materials may be considered depending on the subject.
 Principal objective of this course:
 To introduce students to Information Retrieval concepts,
paradigms and techniques, with an emphasis on String and
Semantics based IR techniques.
About the Course
 Grading & Assessment:
 First Exam …………………….. 20%
 Second Exam ………………….. 20%
 Final Exam …………………….. 35%
 Other Activities ………………. 10%
 Major Assignment ……………. 15%
“You are to build a prototype for a search engine that employs
both text-based and semantics-based techniques for retrieving the
most relevant results to users’ queries. The search space will be a
collection of documents, in addition to a collection of images
associated with some textual descriptions”.
Course Topics
 Part 01 – Introduction
 What is IR?
 Examples of IR Systems.
 Other topics related to IR.
 Models of IR
 Part 02 – Boolean Retrieval
 What is Boolean IR?
 Term-Document Incidence Matrices
 Terminology and Notations
Course Topics
 Part 03 – Indexing
 Building Indexes
 Semantic Networks
 Part 04 – Retrieval
 Scoring, Ranking
 Relevance Feedback
 Precision/Recall
Course Topics
 Part 05 – Exploiting Ontologies in IR
 Ontologies
 Traditional vs. Semantics-based IR techniques
Introduction
What is IR
 Information Retrieval:
“Information retrieval (IR) is finding material (usually documents) of an
unstructured nature (usually text) that satisfies an information need from
within large collections (usually stored on computers).”
 Unstructured Data:
“refers to data which does not have clear, semantically overt, easy-for-a-
computer structure.”
e.g.  Textual information in web pages.
 Semistructured Data:
“refers to data which have a partially clear, semantically overt, easy-for-a-
computer structure.”
e.g.  finding a document where the title contains Java and the body
contains threading.
Introduction
What is IR
 Structured Data:
“refers to data which have a clear, semantically overt, easy-
for-a-computer structure.”
e.g.  Relational Databases.
 A look back: 1990s
 Studies showed that most people preferred getting
information from other people rather than from information
retrieval systems.
 Online booking systems?
 Following to this period and after relentless optimization of
IR:
 The field of information retrieval has moved from being a
primarily academic discipline to being the basis underlying
most people’s preferred means of information access.
Introduction
What is IR
 Information retrieval did not begin with the Web.
 The field began with scientific publications and library
records, but soon spread to other forms of content, particularly
those of information professionals, such as journalists, lawyers,
and doctors
Introduction
What is IR
Introduction
Other Topics Related to IR
 Cross-language IR
 Multimedia IR
 Speech retrieval
 User interfaces for IR
 Ontology and Semantics-based IR
 Natural Language Processing (NLP) techniques
 Dynamic IR
 Online Advertising !?
Introduction
Other Topics Related to IR
 The field of information retrieval also covers supporting users in
browsing or filtering document collections or further processing
a set of retrieved documents.
 Given a set of documents, clustering is the task of coming up
with a good grouping of the documents based on their contents.
 Given a set of topics, standing information needs, or other
categories (such as suitability of texts for different age groups),
classification is the task of deciding which class(es), if any,
each of a set of documents belongs to. It is often approached by
first manually classifying some documents and then hoping to
be able to classify new documents automatically.
Introduction
Classification of IR systems
 Scale-based Classification of IR systems: Distinguishing
between Information retrieval systems according to the scale at
which they operate.
1. Web search: The search is conducted over billions of
documents stored on millions of computers.
 Issues to consider:
1. Needing to gather documents for indexing.
2. Being able to build systems that work efficiently at this
enormous scale.
3. Handling particular aspects of the web, such as the
exploitation of hypertext and page ranking given the
commercial importance of the web.
2. Personal Information Retrieval: Integrating information
retrieval into consumer operating systems.
 Issues to consider:
1. Handling the broad range of document types on a typical
personal computer.
2. Making the search system maintenance free and
sufficiently lightweight in terms of startup, processing, and
disk space usage that it can run on one machine without
annoying its owner.
Introduction
Classification of IR systems
3. Enterprise, Institutional, and Domain-specific Search:
A corporation’s documents will typically be stored on
centralized file systems and one or a handful of
dedicated machines will provide search over the
collection.
 Issues to consider:
1. Handling the broad range of document types on a
centralized computer.
2. Scale and Efficiency of the IR system.
3. Maintenance of the search system.
Introduction
Classification of IR systems
Introduction
Classification of IR systems
 Technique-based Classification of IR systems:
Distinguishing between Information retrieval systems
according to the search technique that they employ.
1. Keyword-based search: String matching algorithms are
employed to find documents relevant to the user’s query.
 Issues to consider:
1. Precision and Recall of the search algorithm.
2. Gap between the textual information contained in the
document collections and the user’s information need.
Introduction
Classification of IR systems
2. Semantics-based search: Semantic aspects of the
user’s query are derived in an attempt to find documents
relevant to the user’s query.
 Issues to consider:
1. Precision and Recall of the search algorithm.
2. Lack of Semantic Resources.
3. Incompleteness of Background Knowledge
represented in existing Semantic Resources.
4. Semantic Heterogeneity problem between existing
Semantic Resources.
5. Lack of Multi-lingual Semantic Resources.
Introduction
Classification of IR systems
2. Hybrid Approaches: Keyword-based search is enriched with
Semantics-based search to retrieve more relevant results to the
user’s information needs.
 Issues to consider:
1. Precision and Recall of the search algorithm.
2. Lack of Semantic Resources.
3. Priority of the employed techniques.
4. Incompleteness of Background Knowledge represented in
existing Semantic Resources.
5. Types of queries that the system can handle (Single-term vs.
Verbose queries).
6. Lack of Multi-lingual Semantic Resources.
 Research is very active in this area.
 Example: Dbpedia based search engine (June 2015)
Ad

More Related Content

What's hot (20)

INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
anujessy
 
INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATION
Libcorpio
 
Information retrieval
Information retrievalInformation retrieval
Information retrieval
hplap
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Roi Blanco
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
Mounia Lalmas-Roelleke
 
Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal
EdiFaizal2
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
silambu111
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
Kira
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
El Habib NFAOUI
 
information retrieval Techniques and normalization
information retrieval Techniques and normalizationinformation retrieval Techniques and normalization
information retrieval Techniques and normalization
Ameenababs
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval system
Leslie Vargas
 
Text Indexing and Retrieval
Text Indexing and RetrievalText Indexing and Retrieval
Text Indexing and Retrieval
Rachmat Wahid Saleh Insani
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Carsten Eickhoff
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval s
silambu111
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
Nanthini Dominique
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
alaa223
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
sathish sak
 
Lec1,2
Lec1,2Lec1,2
Lec1,2
alaa223
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
maxfalc
 
Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction
Grace Hui Yang
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
anujessy
 
INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATION
Libcorpio
 
Information retrieval
Information retrievalInformation retrieval
Information retrieval
hplap
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Roi Blanco
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
Mounia Lalmas-Roelleke
 
Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal
EdiFaizal2
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
silambu111
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
Kira
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
El Habib NFAOUI
 
information retrieval Techniques and normalization
information retrieval Techniques and normalizationinformation retrieval Techniques and normalization
information retrieval Techniques and normalization
Ameenababs
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval system
Leslie Vargas
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Carsten Eickhoff
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval s
silambu111
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
Nanthini Dominique
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
alaa223
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
sathish sak
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
maxfalc
 
Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction
Grace Hui Yang
 

Viewers also liked (17)

Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
Mounia Lalmas-Roelleke
 
Representation theory slideshare
Representation theory slideshareRepresentation theory slideshare
Representation theory slideshare
chloelogan1
 
Bab ii
Bab iiBab ii
Bab ii
Najiebud Dien
 
Ir 02
Ir   02Ir   02
Ir 02
Mohammed Romi
 
Ch8
Ch8Ch8
Ch8
Mohammed Romi
 
Ir 09
Ir   09Ir   09
Ir 09
Mohammed Romi
 
Ir 03
Ir   03Ir   03
Ir 03
Mohammed Romi
 
Information Retrieval Techniques of Google
Information Retrieval Techniques of Google Information Retrieval Techniques of Google
Information Retrieval Techniques of Google
Cyr Ish
 
Ir 08
Ir   08Ir   08
Ir 08
Mohammed Romi
 
Ch2020
Ch2020Ch2020
Ch2020
Mohammed Romi
 
Ch7
Ch7Ch7
Ch7
Mohammed Romi
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Mauro Dragoni
 
Ai 02 intelligent_agents(1)
Ai 02 intelligent_agents(1)Ai 02 intelligent_agents(1)
Ai 02 intelligent_agents(1)
Mohammed Romi
 
IR
IRIR
IR
Girish Khanzode
 
Ian Sommerville, Software Engineering, 9th EditionCh 8
Ian Sommerville,  Software Engineering, 9th EditionCh 8Ian Sommerville,  Software Engineering, 9th EditionCh 8
Ian Sommerville, Software Engineering, 9th EditionCh 8
Mohammed Romi
 
Artifical intelligance
Artifical intelliganceArtifical intelligance
Artifical intelligance
Gangasailakshmi Tellakula
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
Amit Ranjan
 
Ad

Similar to Ir 01 (20)

CS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdfCS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdf
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
CS8080 IRT UNIT I NOTES.pdf
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
 
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptx
Habtamu100
 
Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...
Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...
Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...
shumawakjira26
 
Chapter 1 Intro Information Rerieval.pptx
Chapter 1 Intro Information Rerieval.pptxChapter 1 Intro Information Rerieval.pptx
Chapter 1 Intro Information Rerieval.pptx
bekidea
 
Introduction to Information Retrieval (concepts and principles)
Introduction to Information Retrieval  (concepts and principles)Introduction to Information Retrieval  (concepts and principles)
Introduction to Information Retrieval (concepts and principles)
ImtithalSaeed1
 
unit 1 INTRODUCTION
unit 1                      INTRODUCTIONunit 1                      INTRODUCTION
unit 1 INTRODUCTION
karthiksmart21
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Vikas Bhushan
 
Hci
HciHci
Hci
Er. Saurabh Singh
 
IRT Unit_I.pptx
IRT Unit_I.pptxIRT Unit_I.pptx
IRT Unit_I.pptx
thenmozhip8
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrieval
captainmactavish1996
 
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfChapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Habtamu100
 
Informationa Retrieval Techniques .pptx
Informationa Retrieval Techniques  .pptxInformationa Retrieval Techniques  .pptx
Informationa Retrieval Techniques .pptx
lekhacce
 
Mam assign
Mam assignMam assign
Mam assign
silambu111
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
apollobgslibrary
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
apollobgslibrary
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
ISAR Publications
 
Chap 1 general introduction of information retrieval
Chap 1  general introduction of information retrievalChap 1  general introduction of information retrieval
Chap 1 general introduction of information retrieval
Malobe Lottin Cyrille Marcel
 
1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
ChandraSekhar1115
 
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptx
Habtamu100
 
Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...
Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...
Chapter 1 - Introduction to IR Information retrieval ch1 Information retrieva...
shumawakjira26
 
Chapter 1 Intro Information Rerieval.pptx
Chapter 1 Intro Information Rerieval.pptxChapter 1 Intro Information Rerieval.pptx
Chapter 1 Intro Information Rerieval.pptx
bekidea
 
Introduction to Information Retrieval (concepts and principles)
Introduction to Information Retrieval  (concepts and principles)Introduction to Information Retrieval  (concepts and principles)
Introduction to Information Retrieval (concepts and principles)
ImtithalSaeed1
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Vikas Bhushan
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrieval
captainmactavish1996
 
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfChapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Habtamu100
 
Informationa Retrieval Techniques .pptx
Informationa Retrieval Techniques  .pptxInformationa Retrieval Techniques  .pptx
Informationa Retrieval Techniques .pptx
lekhacce
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
apollobgslibrary
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
apollobgslibrary
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
ISAR Publications
 
Chap 1 general introduction of information retrieval
Chap 1  general introduction of information retrievalChap 1  general introduction of information retrieval
Chap 1 general introduction of information retrieval
Malobe Lottin Cyrille Marcel
 
1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
ChandraSekhar1115
 
Ad

More from Mohammed Romi (14)

Ai 01 introduction
Ai 01 introductionAi 01 introduction
Ai 01 introduction
Mohammed Romi
 
Ai 03 solving_problems_by_searching
Ai 03 solving_problems_by_searchingAi 03 solving_problems_by_searching
Ai 03 solving_problems_by_searching
Mohammed Romi
 
Swiching
SwichingSwiching
Swiching
Mohammed Romi
 
Ch19 network layer-logical add
Ch19 network layer-logical addCh19 network layer-logical add
Ch19 network layer-logical add
Mohammed Romi
 
Ch12
Ch12Ch12
Ch12
Mohammed Romi
 
Angel6 e05
Angel6 e05Angel6 e05
Angel6 e05
Mohammed Romi
 
Chapter02 graphics-programming
Chapter02 graphics-programmingChapter02 graphics-programming
Chapter02 graphics-programming
Mohammed Romi
 
Swe notes
Swe notesSwe notes
Swe notes
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Ian Sommerville,  Software Engineering, 9th Edition Ch 4Ian Sommerville,  Software Engineering, 9th Edition Ch 4
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch2
Ian Sommerville,  Software Engineering, 9th Edition Ch2Ian Sommerville,  Software Engineering, 9th Edition Ch2
Ian Sommerville, Software Engineering, 9th Edition Ch2
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch1
Ian Sommerville,  Software Engineering, 9th Edition Ch1Ian Sommerville,  Software Engineering, 9th Edition Ch1
Ian Sommerville, Software Engineering, 9th Edition Ch1
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Ian Sommerville,  Software Engineering, 9th Edition Ch 23Ian Sommerville,  Software Engineering, 9th Edition Ch 23
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Mohammed Romi
 
Ch 6
Ch 6Ch 6
Ch 6
Mohammed Romi
 
Ch 4 software engineering
Ch 4 software engineeringCh 4 software engineering
Ch 4 software engineering
Mohammed Romi
 
Ai 03 solving_problems_by_searching
Ai 03 solving_problems_by_searchingAi 03 solving_problems_by_searching
Ai 03 solving_problems_by_searching
Mohammed Romi
 
Ch19 network layer-logical add
Ch19 network layer-logical addCh19 network layer-logical add
Ch19 network layer-logical add
Mohammed Romi
 
Chapter02 graphics-programming
Chapter02 graphics-programmingChapter02 graphics-programming
Chapter02 graphics-programming
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Ian Sommerville,  Software Engineering, 9th Edition Ch 4Ian Sommerville,  Software Engineering, 9th Edition Ch 4
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch2
Ian Sommerville,  Software Engineering, 9th Edition Ch2Ian Sommerville,  Software Engineering, 9th Edition Ch2
Ian Sommerville, Software Engineering, 9th Edition Ch2
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch1
Ian Sommerville,  Software Engineering, 9th Edition Ch1Ian Sommerville,  Software Engineering, 9th Edition Ch1
Ian Sommerville, Software Engineering, 9th Edition Ch1
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Ian Sommerville,  Software Engineering, 9th Edition Ch 23Ian Sommerville,  Software Engineering, 9th Edition Ch 23
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Mohammed Romi
 
Ch 4 software engineering
Ch 4 software engineeringCh 4 software engineering
Ch 4 software engineering
Mohammed Romi
 

Recently uploaded (20)

Geography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjectsGeography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjects
ProfDrShaikhImran
 
Operations Management (Dr. Abdulfatah Salem).pdf
Operations Management (Dr. Abdulfatah Salem).pdfOperations Management (Dr. Abdulfatah Salem).pdf
Operations Management (Dr. Abdulfatah Salem).pdf
Arab Academy for Science, Technology and Maritime Transport
 
GDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptxGDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptx
azeenhodekar
 
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
larencebapu132
 
One Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learningOne Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learning
momer9505
 
Introduction to Vibe Coding and Vibe Engineering
Introduction to Vibe Coding and Vibe EngineeringIntroduction to Vibe Coding and Vibe Engineering
Introduction to Vibe Coding and Vibe Engineering
Damian T. Gordon
 
Social Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy StudentsSocial Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy Students
DrNidhiAgarwal
 
SPRING FESTIVITIES - UK AND USA -
SPRING FESTIVITIES - UK AND USA            -SPRING FESTIVITIES - UK AND USA            -
SPRING FESTIVITIES - UK AND USA -
Colégio Santa Teresinha
 
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar RabbiPresentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Md Shaifullar Rabbi
 
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - WorksheetCBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
Sritoma Majumder
 
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdfExploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Sandeep Swamy
 
2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx
contactwilliamm2546
 
To study Digestive system of insect.pptx
To study Digestive system of insect.pptxTo study Digestive system of insect.pptx
To study Digestive system of insect.pptx
Arshad Shaikh
 
apa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdfapa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdf
Ishika Ghosh
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public SchoolsK12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
dogden2
 
Presentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem KayaPresentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem Kaya
MIPLM
 
Marie Boran Special Collections Librarian Hardiman Library, University of Gal...
Marie Boran Special Collections Librarian Hardiman Library, University of Gal...Marie Boran Special Collections Librarian Hardiman Library, University of Gal...
Marie Boran Special Collections Librarian Hardiman Library, University of Gal...
Library Association of Ireland
 
Metamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative JourneyMetamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative Journey
Arshad Shaikh
 
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Celine George
 
Geography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjectsGeography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjects
ProfDrShaikhImran
 
GDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptxGDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptx
azeenhodekar
 
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
larencebapu132
 
One Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learningOne Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learning
momer9505
 
Introduction to Vibe Coding and Vibe Engineering
Introduction to Vibe Coding and Vibe EngineeringIntroduction to Vibe Coding and Vibe Engineering
Introduction to Vibe Coding and Vibe Engineering
Damian T. Gordon
 
Social Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy StudentsSocial Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy Students
DrNidhiAgarwal
 
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar RabbiPresentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Md Shaifullar Rabbi
 
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - WorksheetCBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
Sritoma Majumder
 
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdfExploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Sandeep Swamy
 
2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx
contactwilliamm2546
 
To study Digestive system of insect.pptx
To study Digestive system of insect.pptxTo study Digestive system of insect.pptx
To study Digestive system of insect.pptx
Arshad Shaikh
 
apa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdfapa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdf
Ishika Ghosh
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public SchoolsK12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
dogden2
 
Presentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem KayaPresentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem Kaya
MIPLM
 
Marie Boran Special Collections Librarian Hardiman Library, University of Gal...
Marie Boran Special Collections Librarian Hardiman Library, University of Gal...Marie Boran Special Collections Librarian Hardiman Library, University of Gal...
Marie Boran Special Collections Librarian Hardiman Library, University of Gal...
Library Association of Ireland
 
Metamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative JourneyMetamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative Journey
Arshad Shaikh
 
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Celine George
 

Ir 01

  • 2. About the Course  Book:  An Introduction to Information Retrieval, Christopher D. Manning Prabhakar Raghavan Hinrich Schütze, Cambridge University Press, 2009.  Other materials may be considered depending on the subject.  Principal objective of this course:  To introduce students to Information Retrieval concepts, paradigms and techniques, with an emphasis on String and Semantics based IR techniques.
  • 3. About the Course  Grading & Assessment:  First Exam …………………….. 20%  Second Exam ………………….. 20%  Final Exam …………………….. 35%  Other Activities ………………. 10%  Major Assignment ……………. 15% “You are to build a prototype for a search engine that employs both text-based and semantics-based techniques for retrieving the most relevant results to users’ queries. The search space will be a collection of documents, in addition to a collection of images associated with some textual descriptions”.
  • 4. Course Topics  Part 01 – Introduction  What is IR?  Examples of IR Systems.  Other topics related to IR.  Models of IR  Part 02 – Boolean Retrieval  What is Boolean IR?  Term-Document Incidence Matrices  Terminology and Notations
  • 5. Course Topics  Part 03 – Indexing  Building Indexes  Semantic Networks  Part 04 – Retrieval  Scoring, Ranking  Relevance Feedback  Precision/Recall
  • 6. Course Topics  Part 05 – Exploiting Ontologies in IR  Ontologies  Traditional vs. Semantics-based IR techniques
  • 7. Introduction What is IR  Information Retrieval: “Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers).”  Unstructured Data: “refers to data which does not have clear, semantically overt, easy-for-a- computer structure.” e.g.  Textual information in web pages.  Semistructured Data: “refers to data which have a partially clear, semantically overt, easy-for-a- computer structure.” e.g.  finding a document where the title contains Java and the body contains threading.
  • 8. Introduction What is IR  Structured Data: “refers to data which have a clear, semantically overt, easy- for-a-computer structure.” e.g.  Relational Databases.
  • 9.  A look back: 1990s  Studies showed that most people preferred getting information from other people rather than from information retrieval systems.  Online booking systems?  Following to this period and after relentless optimization of IR:  The field of information retrieval has moved from being a primarily academic discipline to being the basis underlying most people’s preferred means of information access. Introduction What is IR
  • 10.  Information retrieval did not begin with the Web.  The field began with scientific publications and library records, but soon spread to other forms of content, particularly those of information professionals, such as journalists, lawyers, and doctors Introduction What is IR
  • 11. Introduction Other Topics Related to IR  Cross-language IR  Multimedia IR  Speech retrieval  User interfaces for IR  Ontology and Semantics-based IR  Natural Language Processing (NLP) techniques  Dynamic IR  Online Advertising !?
  • 12. Introduction Other Topics Related to IR  The field of information retrieval also covers supporting users in browsing or filtering document collections or further processing a set of retrieved documents.  Given a set of documents, clustering is the task of coming up with a good grouping of the documents based on their contents.  Given a set of topics, standing information needs, or other categories (such as suitability of texts for different age groups), classification is the task of deciding which class(es), if any, each of a set of documents belongs to. It is often approached by first manually classifying some documents and then hoping to be able to classify new documents automatically.
  • 13. Introduction Classification of IR systems  Scale-based Classification of IR systems: Distinguishing between Information retrieval systems according to the scale at which they operate. 1. Web search: The search is conducted over billions of documents stored on millions of computers.  Issues to consider: 1. Needing to gather documents for indexing. 2. Being able to build systems that work efficiently at this enormous scale. 3. Handling particular aspects of the web, such as the exploitation of hypertext and page ranking given the commercial importance of the web.
  • 14. 2. Personal Information Retrieval: Integrating information retrieval into consumer operating systems.  Issues to consider: 1. Handling the broad range of document types on a typical personal computer. 2. Making the search system maintenance free and sufficiently lightweight in terms of startup, processing, and disk space usage that it can run on one machine without annoying its owner. Introduction Classification of IR systems
  • 15. 3. Enterprise, Institutional, and Domain-specific Search: A corporation’s documents will typically be stored on centralized file systems and one or a handful of dedicated machines will provide search over the collection.  Issues to consider: 1. Handling the broad range of document types on a centralized computer. 2. Scale and Efficiency of the IR system. 3. Maintenance of the search system. Introduction Classification of IR systems
  • 16. Introduction Classification of IR systems  Technique-based Classification of IR systems: Distinguishing between Information retrieval systems according to the search technique that they employ. 1. Keyword-based search: String matching algorithms are employed to find documents relevant to the user’s query.  Issues to consider: 1. Precision and Recall of the search algorithm. 2. Gap between the textual information contained in the document collections and the user’s information need.
  • 17. Introduction Classification of IR systems 2. Semantics-based search: Semantic aspects of the user’s query are derived in an attempt to find documents relevant to the user’s query.  Issues to consider: 1. Precision and Recall of the search algorithm. 2. Lack of Semantic Resources. 3. Incompleteness of Background Knowledge represented in existing Semantic Resources. 4. Semantic Heterogeneity problem between existing Semantic Resources. 5. Lack of Multi-lingual Semantic Resources.
  • 18. Introduction Classification of IR systems 2. Hybrid Approaches: Keyword-based search is enriched with Semantics-based search to retrieve more relevant results to the user’s information needs.  Issues to consider: 1. Precision and Recall of the search algorithm. 2. Lack of Semantic Resources. 3. Priority of the employed techniques. 4. Incompleteness of Background Knowledge represented in existing Semantic Resources. 5. Types of queries that the system can handle (Single-term vs. Verbose queries). 6. Lack of Multi-lingual Semantic Resources.  Research is very active in this area.  Example: Dbpedia based search engine (June 2015)