Information retrieval (IR) is the process of searching for and retrieving relevant documents from a large collection based on a user's query. Key aspects of IR include:
- Representing documents and queries in a way that allows measuring their similarity, such as the vector space model.
- Ranking retrieved documents by relevance to the query using factors like term frequency and inverse document frequency.
- Allowing for similarity-based retrieval where documents similar to a given document are retrieved.