Recommender MidTerm - 2
Recommender MidTerm - 2
Part A
1. List the basic components of content based systems
Part B
1. Illustrate rule based classifers
Definition
Working Principle
Advantages
Disadvantages
Definition
How It Works
● The system collects and stores user data for continuous learning.
● Personalization is updated with each interaction, improving
recommendations.
Benefits
Definition
Working Principle
Definition
Working Principle
Offline Evaluation
Online Evaluation
Definition
Working Principle
Part C
1. Key Components
a. Input Data
b. Similarity Computation
● Measures how closely the features of an item match the user profile.
● Common Similarity Measures:
○ Cosine Similarity: Measures angle similarity between feature
vectors.
○ Euclidean Distance: Measures straight-line distance in feature
space.
○ Pearson Correlation: Measures linear correlation between
features.
c. Recommendation Engine
2. Workflow
● Input Data:
○ Movie profiles: Genre (Action, Comedy), Director (Spielberg), Cast
(Tom Hanks).
○ User profile: Interested in Comedy and movies by Spielberg.
● Feature Extraction:
○ Movie and user preferences converted into feature vectors.
● Similarity Computation:
○ Cosine similarity matches the user profile with the feature vectors
of movies.
● Output:
○ Recommends comedy movies directed by Spielberg, such as
Catch Me If You Can.
Step 1: User Input Users provide preferences or constraints (e.g., family size,
ZIP code).
Step 2: Rule Matching The system checks its knowledge base for rules
matching the user's input. Example: If a user specifies a family size of 6,
the system may infer additional constraints like "3+ bedrooms" and "2+
bathrooms."
Step 3: Query Refinement Logical rules expand the user’s input to refine
the search query.
Step 4: Item Retrieval The system retrieves items that satisfy all
constraints. Example: A user specifies a large family and a ZIP code; the
system refines the query to include "3+ bedrooms" and "Price ≥
$100,000."
2. Secondary Goals
a) Novelty
b) Serendipity
3. Soft Goals
● Purpose: Satisfied users are more likely to return, increasing loyalty and
site usage.
1. Accuracy
● Coverage evaluates the proportion of items or users that the system can
recommend to.
● User-space coverage: The fraction of users for whom
recommendations are made.
● Item-space coverage: The fraction of items included in
recommendations.
● Ensures the system addresses data sparsity and provides inclusive
recommendations.
4. Novelty
5. Serendipity
6. Diversity
8. Scalability