0% found this document useful (0 votes)
62 views3 pages

IRS Bits Unit-3

The document discusses different types of indexing used in information retrieval systems including automatic, probabilistic, natural language, concept, and hypertext linkage indexing. It provides examples and definitions of term frequency, document frequency, and other indexing concepts. Crawlers and automatically generated indexes are also covered.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
62 views3 pages

IRS Bits Unit-3

The document discusses different types of indexing used in information retrieval systems including automatic, probabilistic, natural language, concept, and hypertext linkage indexing. It provides examples and definitions of term frequency, document frequency, and other indexing concepts. Crawlers and automatically generated indexes are also covered.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

OBJECTIVE TYPE BITS

UNIT-3
1. is the process of analyzing an item to extract the information to be kept
permanently in an index.
a. Class Indexing b. Automatic Indexing c. Manual Indexing d. Any
Ans: b
2. is used mostly in commercial systems.
a. Statistical b. Natural Language c. Concept d. Hypertext Linkages
Ans: a
3. indexing stores the information that are used in calculating a probability that
a particular item satisfies a particular query.

a. Probalistic b. Bayesian c .vector space d. neural net. Ans: a

4. approaches store information used in generating a relative confedence level of


an item relevance to a query.
a. Bayesian b. vector space c. both d. none
Ans: c
5. are dynamic learning structures that are discussed under concept indexing where
they are used to determine concept classes.
a. Probalistic b. Bayesian c .vector space d. neural net. Ans: d
6. indexing uses words with in an item to correlate to concepts discussed in the item.
a. Statistical b. Natural Language c. Concept d. Hypertext Linkages
Ans: c
7. approach ia based upon direct application of the theory of probability to IRS.
a. Probabilistic b. Natural Language c. Concept d. Hypertext Linkages
Ans: a
8. produces the efficient results when data is retrieving from multiple databases.
a. Probabilistic b. Natural Language c. Concept d. Hypertext Linkages
Ans: a
9. processing is used to semantic information in addition to statistical information
to enhance the indexing of the item.
a. Probabilistic b. Natural Language c. Concept d. Hypertext Linkages
Ans: b
10. Tagged Text Parser structure allows for identification of potential term phrases based
upon identification.
a. verb b. noun c. adjective d. all
Ans: b
11. processing will use DR-LINK System.
a. Probabilistic b. Natural Language c. Concept d. Hypertext Linkages
Ans: b
12. system attempts to introduce a higher level of abstraction indexing on top of the
statistical processes.
a. Probabilistic b. Natural Language c. Concept d. Hypertext Linkages
Ans: b
13. indexing is a statistical technique whose goal is to determine a canonical
representation of concept.
a. Probabilistic b. Natural Language c. Concept d. Hypertext Linkages
Ans:c
14. techniques have very powerful representation.
a. Binary b. Vector c. Both c. None
Ans : b
15. pages at each Internet site are indexed automatically.
a. Automatically generated b. manually generated c. Crawlers d. All
Ans: a
16. In users define search terms, and it goes to various sites searching for the
desired information.
a. Automatically generated b. manually generated c. Crawlers d. All
Ans: c
17. is the example for WebCrawler’s
a. WebCrawler’s b. Open Text c. Path Finder d. All
Ans: d

Fill in the blanks


1. Term Frequency TFij is the frequency of occurrence of a term Ti in a
document Dj.
2. Total Term Frequency TTFi is the frequency of occurrence of a term Ti in
the entire collection.
3. Document Frequency DFi is the number of unique documents in the
collection that contain a term Ti..
4. Tagged Text Parser structure allows for identification of potential term phrases
based upon Noun identification.
5. Automatic Indexing is the process of analyzing an item to extract the
information to be kept permanently in an index.
6. Manually generated (e.g. Yahoo!) pages are indexed manually into a linked
hierarchy(an “index”). Users browse in the hierarchy by following links.
7. Automatically generated (e.g. Alta Vista) pages at each Internet site are
indexed automatically (creating a “searchable data structure”).
8. Automatically generated structures are used for querying, rather than
browsing.
9. Crawlers (e.g. WebCrawler) No a priori indexing.
10. Crawlers (e.g. WebCrawler) Users define search terms, and the crawler goes
to various sites searching for the desired information.
11. Hypertext Linkages Provides virtual threads of concepts between items
versus directly defining the concepts with in an item.
12. The SMART system uses Vector Model.

You might also like