Module5 Recommender Systems PartA
Module5 Recommender Systems PartA
Which digital camera should I buy? What is the best holiday for me and
my family? Which is the best investment for supporting the education of my
children? Which movie should I rent? Which web sites will I find interesting?
Which book should I buy for my next vacation? Which degree and university
are the best for my future?
Problem domain
Recommendation systems (RS) help to match users with items
Recommendation Systems are software agents that elicit the interests and
preferences of individual consumers and make recommendations accordingly.
They have the potential to support and improve the quality of the
decisions consumers make while searching for and selecting products online.
(Xiao & Benbasat 20071)
Recommend items
from the long tail 20% of items
accumulate 74% of all
positive ratings
(1) Manning, Raghavan, and Schütze, Introduction to information retrieval, Cambridge University Press, 2008
Common operational and technical
goals of recommender systems
Relevance
◦ To recommend items that are relevant to the user at hand.
Novelty
◦ recommended item that the user has not seen in the past
Serendipity
◦ recommendations are truly surprising to the user, rather
than simply something they did not know about before.
Increasing recommendation diversity
◦ suggest a list of top-k items. When all these recommended
items are very similar, it increases the risk that the user
might not like any of these items
How RS function?
Recommender systems function with two
kinds of information:
𝒑 ∈𝑷(𝒓𝒂,𝒑 − 𝒓𝒂 )(𝒓𝒃,𝒑 − 𝒓𝒃 )
𝒔𝒊𝒎 𝒂, 𝒃 =
𝟐 𝟐
𝒑 ∈𝑷 𝒓𝒂,𝒑 − 𝒓𝒂 𝒑 ∈𝑷 𝒓𝒃,𝒑 − 𝒓𝒃
Measuring user similarity (2)
A popular similarity measure in user-based CF:
Pearson correlation
𝑎, 𝑏 : users
𝑟𝑎,𝑝 : rating of user 𝑎 for item 𝑝
: set ofItem2
𝑃 Item1 items,Item3
rated both
Item4 byItem5
𝑎 and 𝑏
Alice 5 3 4 4 ?
◦ Possible
User1 3
similarity
1
values
2
between
3
−1
3
and 1sim = 0,85
User2 4 3 4 3 5 sim = 0,00
User3 3 3 1 5 4 sim = 0,70
User4 1 5 5 2 1 sim = -0,79
Pearson correlation
Takes differences in rating behavior into account
6 Alice
5 User1
User4
4
Ratings
3
𝒃 ∈𝑵 𝒔𝒊𝒎 𝒂, 𝒃 ∗ (𝒓𝒃,𝒑 − 𝒓𝒃 )
𝒑𝒓𝒆𝒅 𝒂, 𝒑 = 𝒓𝒂 +
𝒃 ∈𝑵 𝒔𝒊𝒎 𝒂, 𝒃