Information Retrieval - Lecture 1
Information Retrieval - Lecture 1
Engines
BIS216E
• Textbook:
Essential Books:
– SEO 2018: Learn search engine optimization with
smart internet marketing strategies Adam Clarke,
Simple Effectiveness Publishing, 2018.
Recommended Books:
- Search Engine Optimization All-in-One For
Dummies by Bruce Clay (Author), Kristopher B.
Jones (Author) 2022 For Dummies (Business &
Personal Finance)) 4th Edition.
For success:
Achieving 50% of total score & achieving at least 12 out of
40 at the Final exam.
6
Course: Information Retrieval & Search Engines
The problem of IR
• Goal = find documents relevant to an information
need from a large document set
Inf
o.
ne
Query ed
IR
Document Retrieval
system
collection Answer list
7
Course: Information Retrieval & Search Engines
Example
Web
8
Course: Information Retrieval & Search Engines
What is a Document?
• Examples:
– web pages, email, books, news stories, scholarly
papers, text messages, Word, Powerpoint, PDF,
forum postings, patents, IM sessions, etc.
• Common properties
– Significant text content
– Some structure (e.g., title, author, date for papers;
subject, sender, destination for email)
12
Course: Information Retrieval & Search Engines
Unstructured (text) vs. structured
(database) data today
13
Course: Information Retrieval & Search Engines
Sec. 1.1
Basic assumptions of
Information Retrieval
• Collection: A set of documents
– Assume it is a static collection for the
moment
14
Course: Information Retrieval & Search Engines
The classic search model
User task Get rid of mice in a
politically correct way
Misconception?
Info need
Info about removing mice
without killing them
Misformulation?
Search
Query how trap mice
alive
Search
engine
Query Results
Collection
refinement