0% found this document useful (0 votes)

24 views

Information Retrieval

This document outlines the course contents and objectives for an information retrieval course. The course covers basic and advanced techniques for building text-based information systems, including indexing, retrieval models, evaluation, query languages, advanced query operations, text preprocessing, searching, document clustering, multimedia retrieval, parallel and distributed systems, meta-ranking, web search, user interfaces, link analysis, crawling, and applications of search systems.

Uploaded by

Noureen Zafar

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views

Information Retrieval

Uploaded by

Noureen Zafar

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

PIR MEHR ALI SHAH ARID AGRICULTURE UNIVERSITY

University Institute of Information Technology

CS-802 Information retrieval

Credit Hours: 3(3-0) Prerequisites: None
Course Learning Outcomes (CLOs)
At the end of course the students will be able to: Domain BT Level*
1. Understand the basic concepts of the information C 1
retrieval.
2. Learn tools and techniques to do cutting-edge research in C 2
the area of information retrieval or text mining.
3. Identify the involvement of the information retrieval in C 3
modern life style & social media
4. Get hands on project experience by developing real- C 4
world applications, such as intelligent tools for
improving search accuracy from user feedback, email
spam detection, recommendation system, or scientific
literature organization and mining.
*BT- Bloom’s Taxonomy, C=Cognitive domain, P=Psychomotor domain, A=Affective domain

Course Contents:
Information retrieval is the process through which a computer system can respond to a user's
query for text-based information on a specific topic. IR was one of the first and remains one of
the most important problems in the domain of natural language processing (NLP). Web search
is the application of information retrieval techniques to the largest corpus of text anywhere --
the web -- and it is the area in which most people interact with IR systems most frequently.
In this course, we will cover basic and advanced techniques for building text-based
information systems, including the following topics:
Efficient text indexing
 Boolean and vector-space retrieval models
 Evaluation and interface issues
 IR techniques for the web, including crawling, link-based algorithms, and metadata
usage
 Document clustering and classification
 Traditional and machine learning-based ranking approaches

Course Objective:
By the end of this course the student should:
 understand the theoretical basis behind the standard models of IR (Boolean,

Vector-space, Probabilistic and Logical models),
  understand the difficulty of representing and retrieving documents, images,
speech, etc.,
  be able to implement, run and test a standard IR system,

  understand the standard methods for Web indexing and retrieval,

  understand how techniques from natural language processing, artificial

intelligence, human-computer interaction and visualization integrate with IR, and
  be familiar with various algorithms and systems.

Teaching Methodology:
Lectures, Written Assignments, Practical labs, Semester Project, Presentations
Courses Assessment:
Exams, Assignments, Quizzes. Course will be assessed using a combination of written
examinations.
Reference Materials:
There are several good textbooks for the topic of information retrieval. The first book listed
below is our official textbook, and the others are recommended references.
1. Introduction to Information Retrieval. Christopher D. Manning, Prabhakar Raghavan, and
Hinrich Schuetze, Cambridge University Press, 2007.
2. Search Engines: Information Retrieval in Practice. Bruce Croft, Donald Metzler, and
Trevor Strohman, Pearson Education, 2009.
3. Modern Information Retrieval. Baeza-Yates Ricardo and Berthier Ribeiro-Neto. 2nd
edition, Addison-Wesley, 2011. 1 SYLLABUS IFORMATION RETRIEVAL
4. Information Retrieval: Implementing and Evaluating Search Engines. Stefan Buttcher,
Charlie Clarke, Gordon Cormack, MIT Press, 2010.
Week Contents Theory
1 Introduction to Information Retrieval

 Motivation
 Information Retrieval vs Data Retrieval
 Flashback

2 Models of Information Retrieval

 Boolean Model
 Vector Space Model
 Probabilistic Model
 Alternative Models

3 Retrieval Evaluation

 Recall and Precision

 Alternative Measures
 Reference Collections and Evaluation of IR systems

4 Query Languages for IR

 Keywords
 Boolean Queries
 Context Queries
 Natural Language Queries
 Structural Queries

5 Advanced Query Operations

 Relevance Feedback
 Query Expansion
 Automatic Local Analysis
 Automatic Global Analysis

6 Text Indexing, Preprocessing and File Organization

 Stopwards, stemming, thesauri

 File (Text) organization (invert,suff)
 Text statistics (properties)
 Text compression

7 Text Searching

 Knuth-Morris-Pratt
 Boyer-Moore family
 Suffix automaton
 Phrases and Proximity

8 Document Clustering
MID TERM
9 Multimedia Information Retrieval

 Similarity Queries
 Feature-based Indexing and Searching
 Spatial Access Methods
 Searching in Multidimensional Spaces

10 Parallel and Distributed IR

 Architectures MIMD and SIMD

 Collection Partitioning
 Source Selection
 Query Processing
 Peer-2-Peer Architectures and Systems

11 Meta-Ranking

 Integrated vs Isolated Methods

 Interleaving
 Voting

12 Web Search

 History of Web
 Indexing
 Spidering/Crawling
 Link Analysis (HITS, PageRank)

13 User Interfaces and Visualization

14 Link Analysis
 Ranking the web frontier
 The WebGraph framework I: Compression techniques
 Extrapolation methods for accelerating PageRank computations
 Searching the workplace web

15 Crawling and near-duplicate pages

 Mercator: A scalable, extensible web crawler.

 A standard for robot exclusion

16 Search applications
Introduce modern applications in search systems, including recommendation,
personalization, and online advertising, if time allows.
Final Exam

1.introduction Information Retrival
No ratings yet
1.introduction Information Retrival
31 pages
Monday - IR Fundamentals - Grace Yang - AFIRM19-IR
No ratings yet
Monday - IR Fundamentals - Grace Yang - AFIRM19-IR
77 pages
Introduction To Information Retrieval
No ratings yet
Introduction To Information Retrieval
42 pages
Course No.: CS F469 Course Title: Information Retrieval Instructor-In-Charge: POONAM GOYAL (
No ratings yet
Course No.: CS F469 Course Title: Information Retrieval Instructor-In-Charge: POONAM GOYAL (
4 pages
Information Retrieval: Dr. Bassel ALKHATIB
No ratings yet
Information Retrieval: Dr. Bassel ALKHATIB
55 pages
Unit - I - IR
No ratings yet
Unit - I - IR
39 pages
An Overview of Information Retrieval Outline: A (Simple) Database Example Databases vs. IR
No ratings yet
An Overview of Information Retrieval Outline: A (Simple) Database Example Databases vs. IR
16 pages
Wollo University Kombolcha Institute of Technology College of Informatics Department of Information Technology
100% (1)
Wollo University Kombolcha Institute of Technology College of Informatics Department of Information Technology
35 pages
Tycs Sem Vi Informational Retrival Final Notes (WWW - Profajaypashankar.com-1
No ratings yet
Tycs Sem Vi Informational Retrival Final Notes (WWW - Profajaypashankar.com-1
103 pages
Lecture1 Chap1
No ratings yet
Lecture1 Chap1
22 pages
01 Introduction To ISR
No ratings yet
01 Introduction To ISR
48 pages
21ite09 Information Reterival
No ratings yet
21ite09 Information Reterival
2 pages
UNIT I IR Final
No ratings yet
UNIT I IR Final
26 pages
Gujarat Technological University: Page 1 of 2
No ratings yet
Gujarat Technological University: Page 1 of 2
2 pages
1 IR Chapter-One
No ratings yet
1 IR Chapter-One
47 pages
Materi Pertemuan Ke-1-Dno 2018-1
No ratings yet
Materi Pertemuan Ke-1-Dno 2018-1
42 pages
Chapter 1
No ratings yet
Chapter 1
52 pages
Chap 1
No ratings yet
Chap 1
23 pages
1-Introduction-MIR
No ratings yet
1-Introduction-MIR
35 pages
1stunit GN
No ratings yet
1stunit GN
36 pages
Introduction
No ratings yet
Introduction
25 pages
Information Storage and Retrival (Course Outline) - New
No ratings yet
Information Storage and Retrival (Course Outline) - New
7 pages
CS8080 Irt
100% (1)
CS8080 Irt
33 pages
Syllabus Information Retrieval Techniques
No ratings yet
Syllabus Information Retrieval Techniques
2 pages
Anand Institute of Higher Technology KAZHIPATTUR - 603 103
No ratings yet
Anand Institute of Higher Technology KAZHIPATTUR - 603 103
5 pages
Chapter 1 Introduction To ISR
No ratings yet
Chapter 1 Introduction To ISR
39 pages
IR Textbook
No ratings yet
IR Textbook
167 pages
IR UNIT I - Notes
No ratings yet
IR UNIT I - Notes
23 pages
All Units Notes TYBSC-CS-Information-Retrieval
No ratings yet
All Units Notes TYBSC-CS-Information-Retrieval
89 pages
Jeppiaar Institute of Technology: Department OF Computer Science and Engineering
No ratings yet
Jeppiaar Institute of Technology: Department OF Computer Science and Engineering
24 pages
DDB Ch27
No ratings yet
DDB Ch27
60 pages
Informaiton Retrieval and Web Search
No ratings yet
Informaiton Retrieval and Web Search
44 pages
Cs8080 - Irt - Notes All
No ratings yet
Cs8080 - Irt - Notes All
281 pages
Cs6007 - Information Retrieval: Objectives: The Student Should Be Made To
No ratings yet
Cs6007 - Information Retrieval: Objectives: The Student Should Be Made To
24 pages
Week 1
No ratings yet
Week 1
28 pages
IR chapter 1 (2)
No ratings yet
IR chapter 1 (2)
29 pages
Introduction To: Information Retrieval
No ratings yet
Introduction To: Information Retrieval
32 pages
01 - Lect - Introd
No ratings yet
01 - Lect - Introd
23 pages
Concepts of Information Retrieval System
No ratings yet
Concepts of Information Retrieval System
10 pages
Syllabus
No ratings yet
Syllabus
9 pages
1520784495 Lec5 Ir Introduction
No ratings yet
1520784495 Lec5 Ir Introduction
37 pages
Week 2 - Information Retrieval Basics
No ratings yet
Week 2 - Information Retrieval Basics
74 pages
Cs8080irtunitinotes 220515215754 E06d144b
No ratings yet
Cs8080irtunitinotes 220515215754 E06d144b
43 pages
IRS B Tech CSE Part 1
No ratings yet
IRS B Tech CSE Part 1
161 pages
CompletedUNIT 1 PPT 10.7.17
100% (6)
CompletedUNIT 1 PPT 10.7.17
87 pages
CS317 IR W1a
No ratings yet
CS317 IR W1a
20 pages
Introduction To Information Retrieval
No ratings yet
Introduction To Information Retrieval
50 pages
Information Retrieval CS485: Tibebe Beshah
No ratings yet
Information Retrieval CS485: Tibebe Beshah
137 pages
5 Unit Notes
100% (1)
5 Unit Notes
166 pages
IR Unit-1 - Updated
No ratings yet
IR Unit-1 - Updated
50 pages
Information Retrieval Systems (A70533)
No ratings yet
Information Retrieval Systems (A70533)
11 pages
Information Retrieval - Lecture 1
No ratings yet
Information Retrieval - Lecture 1
15 pages
Introduction To IR 2021
No ratings yet
Introduction To IR 2021
40 pages
1_IR_Introductionn (1)
No ratings yet
1_IR_Introductionn (1)
30 pages
Unit 1: Introduction and Data Pre-Processing
No ratings yet
Unit 1: Introduction and Data Pre-Processing
71 pages
Information Retrieval 1
100% (2)
Information Retrieval 1
12 pages
cs8080 Irt Local Author
No ratings yet
cs8080 Irt Local Author
168 pages

Information Retrieval

Uploaded by

Information Retrieval

Uploaded by

PIR MEHR ALI SHAH ARID AGRICULTURE UNIVERSITY

University Institute of Information Technology

CS-802 Information retrieval

  understand the standard methods for Web indexing and retrieval,

  understand how techniques from natural language processing, artificial

2 Models of Information Retrieval

 Recall and Precision

4 Query Languages for IR

5 Advanced Query Operations

6 Text Indexing, Preprocessing and File Organization

 Stopwards, stemming, thesauri

10 Parallel and Distributed IR

 Architectures MIMD and SIMD

 Integrated vs Isolated Methods

13 User Interfaces and Visualization

15 Crawling and near-duplicate pages

 Mercator: A scalable, extensible web crawler.

You might also like