0% found this document useful (0 votes)

10 views10 pages

Learning Guide Unit 6 _ Home

Unit 6 focuses on evaluating the effectiveness of information retrieval (IR) systems, emphasizing metrics such as precision, recall, and the F Measure. The unit discusses methodologies for assessing the relevance of retrieved documents and the importance of using known document collections and queries for evaluation. Students are expected to engage in various activities including peer assessments, discussions, and reflective journal entries to deepen their understanding of the material.

Uploaded by

Reg

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views10 pages

Learning Guide Unit 6 _ Home

Uploaded by

Reg

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?

id=443868

Site: University of the People Printed by: Patrick Rolemodel Asante

Course: CS 3308-01 Information Retrieval - AY2025-T2 Date: Tuesday, 10 December 2024, 12:03 PM
Book: Learning Guide Unit 6

1 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

Learning Guide Unit 6

2 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

3 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

• Evaluating of the e�ectiveness of Information retrieval

• Measures of information retrieval e�ectiveness including precision and recall
• Assessing relevance
• User utility of information retrieval system

By the end of this Unit, you will be able to:

1. Describe methodologies for evaluating information retrieval systems

2. Implement techniques to evaluate unranked retrieval sets including:
◦ Precision
◦ Recall
3. Compute the F Measure which balances precision and recall
4. Recognize various techniques to compute recall precision
◦ Interpolated precision
◦ 11-Point interpolated average precision
◦ Mean Average Precision
◦ Precision at k
◦ R-Precision

• Peer assess Unit 5 Development Assignment

• Read the Learning Guide and Reading Assignments
• Participate in the Discussion Assignment (post, comment, and rate in the Discussion Forum)
• Make entries to the Learning Journal
• Take the Self-Quiz

4 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

Unit six explores the area of evaluation of the information retrieval system. Up to this point we have learned how to build the inverted
index, we have learned how to develop a weighted term retrieval scheme by computing the cosine similarity between a query and the
documents that are candidates to be relevant to the query and we have successfully put all of these components together to create a
complete search system that can successfully search our corpus of 2,476 Reuters news article documents.

What we have yet learned is how to determine if our search engine is really accomplishing the goal of searching the corpus and retrieving
the most relevant results. The weighting provided by the calculation of the tf-idf (term frequency inverse document frequency) metric
combined with the calculation of the cosine similarity is intended to identify those documents are likely to be relevant to the query terms
that the user of the system is using to search for information.

We really don’t know, however, if they in fact WERE relevant. In chapter 8 of our text we explore techniques to determine just how
e�ective our information retrieval system is at returning relevant documents. Some key metrics that will be used to determine relevance
is recall which is a ratio of relevant items received over the total number of potential relevant items. Obviously the higher the recall the
better.

Another key metric is precision which provides a ratio of the number of relevant items retrieved over the total number items retrieved.
This metric is a measure of the relevance retrieved in a query, where recall measures how e�ective the search was to get the relevant
items that are in the collection. Obviously a higher recall is a better more relevant query.

These two metrics are combined to form the F Measure which is an overall measure of the relevance and e�ciency of information
retrieval performance.

What is important to understand from this unit is how a corpus along with standard queries and results are used to test and validate the
e�ectiveness of an IR system. In chapter 8 of the text, we learn that there are a number of document collections (corpus) that are used for
this purpose. These document collections are used along with well known queries (keep in mind that a query is the terms that are used to
search the collection). An IR system to be evaluated �rst indexes the corpus and then the queries are used to test the results that the IR
system returns. What is important about this process is that for these queries there are known metrics such as the number of documents
in the collection that SHOULD be relevant. Because we have these standard queries for which the actual number of relevant documents
has been determined, we can use these same queries to determine how e�ective our IR system is. We will not be using most of these
corpus in this course because they are relatively large and would require considerable processing time.

5 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

Manning, C.D., Raghaven, P., & Schütze, H. (2009). An Introduction to Information Retrieval (Online ed.). Cambridge, MA: Cambridge
University Press. Available at http://nlp.stanford.edu/IR-book/information-retrieval-book.html

Chapter 8: Evaluation in Information Retrieval

• Relevance
• Gold Standard Around Truth
• Precision
• Recall
• Accuracy
• F-Measure
• Interpolated precision

6 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

Unit six looks at how to evaluate the e�ectiveness of an information retrieval system. Precision, recall, accuracy, and the F measure are all
discussed as metrics that can be used to measure the e�ectiveness of results retrieved from an IR system. In chapter 8 of the text, we
learn that there are a number of document collections (corpus) that are used for this purpose. These document collections are used along
with well known queries (keep in mind that a query is the terms that are used to search the collection). An IR system to be evaluated �rst
indexes the corpus and then the queries are used to test the results that the IR system returns. What is important about this process is
that for these queries there are known metrics such as the number of documents in the collection that SHOULD be relevant.

These measures of e�ectiveness are calculated based upon such known information and the results returned from a query submitted to
an IR system.

'For example, consider . We know that it contains documents which

Suppose that we have known metrics such as the fact that there are documents in this collection that are relevant to a query for
the terms .

Assume that the IR system that we have developed returns 8 relevant documents and 10 documents that are not relevant. Using this
information and the formulas for Precision, Recall, F-Measure, and Accuracy, calculate what each of these measures would be for the
example presented above. When you have determined the metric for each post a response that includes:

1. The Precision, Recall, F-Measure, and Accuracy e�ectiveness metrics which you will calculate using the metrics provided above.
2. Discuss which approach provides the most valid measure of the e�ectiveness of the IR system and why.

Keep in mind that Precision and Recall are used together a measure of e�ectiveness, the F-Measure provides a single measure that
balances Precision and Recall metrics and Accuracy provides a measure of the accuracy of classi�cations in the collection.

7 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

Your learning journal entry must be a re�ective statement that considers the following questions:

• Describe what you did. This does not mean that you copy and paste from what you have posted or the assignments you have
prepared. You need to describe what you did and how you did it.
• Describe your reactions to what you did
• Describe any feedback you received or any speci�c interactions you had. Discuss how they were helpful
• Describe your feelings and attitudes
• Describe what you learned

Another set of questions to consider in your learning journal statement include:

• What surprised me or caused me to wonder?

• What happened that felt particularly challenging? Why was it challenging to me?
• What skills and knowledge do I recognize that I am gaining?
• What am I realizing about myself as a learner?
• In what ways am I able to apply the ideas and concepts gained to my own experience?

Your Learning Journal must be a minimum of 500 words.

8 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

The Self-Quiz gives you an opportunity to self-assess your knowledge of what you have learned so far.

The results of the Self-Quiz do not count towards your �nal grade, but the quiz is an important part of the University’s learning process
and it is expected that you will take it to ensure understanding of the materials presented. Reviewing and analyzing your results will help
you perform better on future Graded Quizzes and the Final Exam.

Please access the Self-Quiz on the main course homepage; it will be listed inside the Unit.

9 of 10 12/10/2024, 12:03 PM
Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?id=443868

Peer assess Unit 5 Development Assignment

Read the Learning Guide and Reading Assignments

Participate in the Discussion Assignment (post, comment, and rate in the Discussion Forum)

Make entries to the Learning Journal

Take the Self-Quiz

10 of 10 12/10/2024, 12:03 PM

Research Strategies: Finding Your Way Through the Information Fog
From Everand
Research Strategies: Finding Your Way Through the Information Fog
William Badke
5/5 (1)
Project Management of Clinical Trials
From Everand
Project Management of Clinical Trials
Richard Chamberlain
No ratings yet
Learning Guide Unit 6_ Discussion Assignment _ Home
No ratings yet
Learning Guide Unit 6_ Discussion Assignment _ Home
1 page
1727759531-6 Evaluation in Information Retrieval
No ratings yet
1727759531-6 Evaluation in Information Retrieval
24 pages
Evaluation of Information Retrieval Systems: Thanks To Marti Hearst, Ray Larson, Chris Manning
No ratings yet
Evaluation of Information Retrieval Systems: Thanks To Marti Hearst, Ray Larson, Chris Manning
108 pages
IR Chapt 5
No ratings yet
IR Chapt 5
55 pages
IR - Chapter 5
No ratings yet
IR - Chapter 5
28 pages
CS 3308 Learning Journal Unit 6
No ratings yet
CS 3308 Learning Journal Unit 6
7 pages
lecture5-6
No ratings yet
lecture5-6
30 pages
TREC Evalution Measures
No ratings yet
TREC Evalution Measures
10 pages
IR Unit 5
No ratings yet
IR Unit 5
5 pages
5-Retrieval Effectiveness
No ratings yet
5-Retrieval Effectiveness
20 pages
Performance Evaluation of Information Retrieval Systems
No ratings yet
Performance Evaluation of Information Retrieval Systems
45 pages
5 Retrievalefective
No ratings yet
5 Retrievalefective
22 pages
Learning Guide Unit 5 _ Home
No ratings yet
Learning Guide Unit 5 _ Home
12 pages
IR Lecture 5b
No ratings yet
IR Lecture 5b
36 pages
IR Lecture 5b
No ratings yet
IR Lecture 5b
36 pages
5 Retrieval Evaluation
No ratings yet
5 Retrieval Evaluation
20 pages
CS 3308 Learning Journal 6
No ratings yet
CS 3308 Learning Journal 6
8 pages
5 retrievalEfective
No ratings yet
5 retrievalEfective
13 pages
5 Retrieval Effectiveness
No ratings yet
5 Retrieval Effectiveness
20 pages
chapter3-MA212-Evaluation
No ratings yet
chapter3-MA212-Evaluation
63 pages
Evaluation 1
No ratings yet
Evaluation 1
63 pages
Chapter 6-8IR Revised
No ratings yet
Chapter 6-8IR Revised
76 pages
10 Evaluation FSS20
No ratings yet
10 Evaluation FSS20
24 pages
Information Retrieval: IR Evaluation
No ratings yet
Information Retrieval: IR Evaluation
36 pages
Chapter 5 Retrieval Efective
No ratings yet
Chapter 5 Retrieval Efective
24 pages
3 Retrieval Evaluation
No ratings yet
3 Retrieval Evaluation
31 pages
IR END PYQ SOLS
No ratings yet
IR END PYQ SOLS
8 pages
Lecture 7 - Evaluation in IR, Relevance Feedback, Query Expansion
No ratings yet
Lecture 7 - Evaluation in IR, Relevance Feedback, Query Expansion
79 pages
Ch5 Retrieval Evaluation 2021
No ratings yet
Ch5 Retrieval Evaluation 2021
26 pages
Towards best practice in the Archetype Development Process
From Everand
Towards best practice in the Archetype Development Process
Alberto Moreno Conde
No ratings yet
IR Evaluation Tugas Kampus
No ratings yet
IR Evaluation Tugas Kampus
25 pages
CS336 MIR w5 Evaluation
No ratings yet
CS336 MIR w5 Evaluation
38 pages
Evaluation and Result Summaries
No ratings yet
Evaluation and Result Summaries
60 pages
SIT772 Lecture 10
No ratings yet
SIT772 Lecture 10
34 pages
4 IRinArabic2021 Ranked Retrieval I
No ratings yet
4 IRinArabic2021 Ranked Retrieval I
49 pages
3
No ratings yet
3
14 pages
Introduction To: Information Retrieval
No ratings yet
Introduction To: Information Retrieval
50 pages
Evaluation Metrics and Evaluation
No ratings yet
Evaluation Metrics and Evaluation
9 pages
09 Evaluation
No ratings yet
09 Evaluation
22 pages
NLP-week10-IR-enc-dec-annotated_by_Ces
No ratings yet
NLP-week10-IR-enc-dec-annotated_by_Ces
83 pages
ISR chap...6
No ratings yet
ISR chap...6
14 pages
Unit-V
No ratings yet
Unit-V
54 pages
Title: Perform Evaluation of Any Popular Search Engine Based On Relevancy. (E.g Google) Theory
No ratings yet
Title: Perform Evaluation of Any Popular Search Engine Based On Relevancy. (E.g Google) Theory
9 pages
6 Retrieval Effectiveness
No ratings yet
6 Retrieval Effectiveness
18 pages
Arpan Halder-0001 - 20230802234722 - Assessing The Reliability of Information Retrieval NLP and Fuzzy
No ratings yet
Arpan Halder-0001 - 20230802234722 - Assessing The Reliability of Information Retrieval NLP and Fuzzy
10 pages
2 Introduction To Information Retrieval
No ratings yet
2 Introduction To Information Retrieval
38 pages
Efficient Management of Large Metadata Catalogs in a Ubiquitous Computing Environment
From Everand
Efficient Management of Large Metadata Catalogs in a Ubiquitous Computing Environment
Daniel Beatty
No ratings yet
IR Practical Theory.docx
No ratings yet
IR Practical Theory.docx
9 pages
Mastering Business Research: A Practical Guide to Scholars and Practitioners
From Everand
Mastering Business Research: A Practical Guide to Scholars and Practitioners
Kondwani Monjeza
No ratings yet
Information Retrieval CMSC 476/676: Evaluation and Result Summaries
No ratings yet
Information Retrieval CMSC 476/676: Evaluation and Result Summaries
45 pages
Classwork For Information Retrieval
No ratings yet
Classwork For Information Retrieval
118 pages
Implementing the Stakeholder Based Goal-Question-Metric (Gqm) Measurement Model for Software Projects
From Everand
Implementing the Stakeholder Based Goal-Question-Metric (Gqm) Measurement Model for Software Projects
Dr. Prashanth Harish Southekal
No ratings yet
bulu
No ratings yet
bulu
47 pages
Unit V Easy To Learn
No ratings yet
Unit V Easy To Learn
21 pages
Automatic Image Annotation: Enhancing Visual Understanding through Automated Tagging
From Everand
Automatic Image Annotation: Enhancing Visual Understanding through Automated Tagging
Fouad Sabry
No ratings yet
NLP-week10-IR-enc-dec
No ratings yet
NLP-week10-IR-enc-dec
68 pages
Confirmative Evaluation: Practical Strategies for Valuing Continuous Improvement
From Everand
Confirmative Evaluation: Practical Strategies for Valuing Continuous Improvement
Joan C. Dessinger
No ratings yet
A Measurement Framework for Software Projects: A Generic and Practical Goal-Question-Metric(Gqm) Based Approach.
From Everand
A Measurement Framework for Software Projects: A Generic and Practical Goal-Question-Metric(Gqm) Based Approach.
Prashanth Harish Southekal
No ratings yet
Learning Guide Unit 1 _ Home
No ratings yet
Learning Guide Unit 1 _ Home
10 pages
CS 3308 Learning Journal Unit 5
No ratings yet
CS 3308 Learning Journal Unit 5
6 pages
MATH 1281 - Unit 4 Discussion Assignment
No ratings yet
MATH 1281 - Unit 4 Discussion Assignment
5 pages
CS 3308 Learning Journal Unit 7
No ratings yet
CS 3308 Learning Journal Unit 7
5 pages
MATH 1302 - Unit 2 Discussion Assignment
No ratings yet
MATH 1302 - Unit 2 Discussion Assignment
4 pages
MATH 1281 - Unit 8 Assignment
100% (1)
MATH 1281 - Unit 8 Assignment
2 pages
MATH 1281 - Unit 3 Assignment
No ratings yet
MATH 1281 - Unit 3 Assignment
5 pages
MATH 1281 - Unit 5 Assignment
No ratings yet
MATH 1281 - Unit 5 Assignment
4 pages
ENGL 1102-Unit 2 Discussion Assignment
No ratings yet
ENGL 1102-Unit 2 Discussion Assignment
3 pages
MATH 1280-Unit 2 Discussion Assignment
No ratings yet
MATH 1280-Unit 2 Discussion Assignment
2 pages
MATH 1280-Unit 1 Discussion Assignment
No ratings yet
MATH 1280-Unit 1 Discussion Assignment
3 pages
Lecture 2
No ratings yet
Lecture 2
29 pages
Korean 101: Elementary Korean I: Seongyeon Ko (CMAL, Queens College)
100% (1)
Korean 101: Elementary Korean I: Seongyeon Ko (CMAL, Queens College)
34 pages
Cognitive Linguistics
No ratings yet
Cognitive Linguistics
7 pages
Crime Scene Investigation
No ratings yet
Crime Scene Investigation
3 pages
Basic Methods of Linguistic Analysis
83% (6)
Basic Methods of Linguistic Analysis
4 pages
External Stimuli
No ratings yet
External Stimuli
27 pages
Introduction To AI
No ratings yet
Introduction To AI
27 pages
Portfolio Final 11
No ratings yet
Portfolio Final 11
15 pages
Keputusan PDF
No ratings yet
Keputusan PDF
4 pages
Developmental History Script
No ratings yet
Developmental History Script
5 pages
Kinds of Nouns
No ratings yet
Kinds of Nouns
4 pages
Module 4: Developmental Reading
No ratings yet
Module 4: Developmental Reading
8 pages
Artificial Intelligence in Supply Chain
No ratings yet
Artificial Intelligence in Supply Chain
19 pages
The Optimal Number of Vowels in Languages
No ratings yet
The Optimal Number of Vowels in Languages
16 pages
Can You or Cant You
No ratings yet
Can You or Cant You
2 pages
Final Research Report
No ratings yet
Final Research Report
4 pages
Introduction To Organizational Development
No ratings yet
Introduction To Organizational Development
20 pages
Command Words
No ratings yet
Command Words
15 pages
BOW Gr7 Quarter-4.edited
No ratings yet
BOW Gr7 Quarter-4.edited
5 pages
Impact of Early Second-Language Acquisition On The Development of First Language and Verbal Short-Term and Working Memory
No ratings yet
Impact of Early Second-Language Acquisition On The Development of First Language and Verbal Short-Term and Working Memory
13 pages
6.artificial Intelligence and Playable Media by Eric Freedman - Z-Library
No ratings yet
6.artificial Intelligence and Playable Media by Eric Freedman - Z-Library
4 pages
Agree - Disagree Essays
No ratings yet
Agree - Disagree Essays
9 pages
Sociolinguistic Report
No ratings yet
Sociolinguistic Report
2 pages
21AI641
No ratings yet
21AI641
5 pages
Pretest 205
No ratings yet
Pretest 205
2 pages
File 2 - Journal Writing
No ratings yet
File 2 - Journal Writing
6 pages
Effect of Mood On Problem Solving 1
No ratings yet
Effect of Mood On Problem Solving 1
21 pages
CSEN2031_AI_ASSIGNMENT_I
No ratings yet
CSEN2031_AI_ASSIGNMENT_I
4 pages
Understanding Individual Motivation: Michael Angelo Mendez, Mpsych
No ratings yet
Understanding Individual Motivation: Michael Angelo Mendez, Mpsych
31 pages
Article The Key To Develop Self Acceptance, Self Esteem and Reflective Skill
No ratings yet
Article The Key To Develop Self Acceptance, Self Esteem and Reflective Skill
6 pages
Masculinity and Mens Mental Health
100% (1)
Masculinity and Mens Mental Health
75 pages

Learning Guide Unit 6 _ Home

Uploaded by

Learning Guide Unit 6 _ Home

Uploaded by

Learning Guide Unit 6 | Home https://my.uopeople.edu/mod/book/tool/print/index.php?

Site: University of the People Printed by: Patrick Rolemodel Asante

Learning Guide Unit 6

• Evaluating of the e�ectiveness of Information retrieval

By the end of this Unit, you will be able to:

1. Describe methodologies for evaluating information retrieval systems

• Peer assess Unit 5 Development Assignment

Chapter 8: Evaluation in Information Retrieval

'For example, consider . We know that it contains documents which

Another set of questions to consider in your learning journal statement include:

• What surprised me or caused me to wonder?

Your Learning Journal must be a minimum of 500 words.

Peer assess Unit 5 Development Assignment

Read the Learning Guide and Reading Assignments

Make entries to the Learning Journal

Take the Self-Quiz

You might also like