0% found this document useful (0 votes)
155 views3 pages

IRS Assignment-I: 1. Define IRS & Goals. Ans

An informational retrieval system (IRS) aims to efficiently store, retrieve, and maintain information from a collection of documents. The goals of an IRS are to minimize the time a user spends searching for needed information. Key objectives include maximizing precision, or retrieving the most relevant documents, and recall, or retrieving all relevant documents from the collection. An IRS allows users to browse summaries of search results and uses indexing and cataloguing to organize documents for efficient searching. Signature files provide an alternative to inverted indexes for identifying relevant files during a search query.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
155 views3 pages

IRS Assignment-I: 1. Define IRS & Goals. Ans

An informational retrieval system (IRS) aims to efficiently store, retrieve, and maintain information from a collection of documents. The goals of an IRS are to minimize the time a user spends searching for needed information. Key objectives include maximizing precision, or retrieving the most relevant documents, and recall, or retrieving all relevant documents from the collection. An IRS allows users to browse summaries of search results and uses indexing and cataloguing to organize documents for efficient searching. Signature files provide an alternative to inverted indexes for identifying relevant files during a search query.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

IRS Assignment-I

1. Define IRS & goals.


Ans: An Informational Retrieval System is a system that is capable
of storing retrieval and maintenance of information.
It is the formal study of efficient and effective ways to extract the
right bit of information from a collection.
It consists of a software that facilitates a user to find the
information the user needs.
The goals of IRS are:
To minimize the user overhead for a user to find the needed
information.
Overhead of a user is the time required to find the information
needed, excluding the time for actual reading the relevant data.
2. Describe Objectives of IRS.
Ans: The Objectives are:
To minimize the user overhead for a user to find the needed
information.
Two major measures:
1) Precision: The ability to retrieve the top-ranked documents that
are relevant.
2) Recall: the ability to search all the relevant items from the
collection.
When a user decides to issue a search looking for information on a
topic then, the total database divides into 4 segments those are:
1) Relevant Retrieved
2) Non-Relevant Retrieved
3) Relevant Non-Retrieved
4) Non-Relevant Non-Retrieved
3. Explain browser capabilities.
Ans: Browse capabilities provide the user with the capability to
determine which items are of interest and select those to be
displayed.
There are two ways of displaying a summary of the items that are
associated with a query: line item status and data visualization.
4. Explain Catalogue & Indexing briefly.
Ans: Catalogue and indexing are the same thing. For IR systems,
in order to efficiently judge whether the documents from a corpus
match a given query, a pre-process called indexing is usually
applied. It is the way documents are managed in the collection.
To make searching more efficient, a retrieval system stores
documents in an abstract representation.
The indexing process includes several steps, which are described
as follows:
1) Tokenization
2) Stop Words Removal
3) Stemming
5. Explain signature file structure.
Ans: The signature can identify a file from within the file or be
kept in a separate file or database. In
document retrieval the signature file method competes with the
inverted index method to produce query results.
A signature block, or sig file is a block of text automatically
appended at the bottom of an email message.

You might also like