0% found this document useful (0 votes)

5 views

Lecture 1_Collaborative Filtering

The lecture focuses on collaborative filtering recommender systems, discussing their definition, input data, and algorithms for measuring similarity and predicting ratings. It categorizes recommender systems into collaborative, content-based, knowledge-based, and hybrid types, with a primary focus on memory-based collaborative filtering. Key concepts include user-item interaction data, similarity measures, and methods for predicting unobserved ratings.

Uploaded by

rhwkdsk125

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Lecture 1_Collaborative Filtering

Uploaded by

rhwkdsk125

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

DIGB368 Lecture 1:

Collaborative Filtering
Tae-Sub Yun
Department of Digital Business, Korea University
[email protected]

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 1 ]

Lecture Objectives
In today's class, we aim to ...

learn about collaborative filtering recommender systems, covering the following topics:

• Definition of Collaborative Filtering?

• Input Data for Collaborative Filtering

• User-item interaction data

• Collaborative Filtering Algorithms

• Measuring similarity
• Predicting ratings

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 2 ]

Classification of Recommender Systems
Recommender Systems

Collaborative Content-based Knowledge-based Hybrid

Filtering

Memory-based Model-based

User- Item- Matrix Deep

based based Clustering Factorization Learning

In this lecture, we will focus primarily on Memory-Based Collaborative Filtering (CF).

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 3 ]

Basic Models of Recommender Systems
Recommender systems work with two kinds of data!

• User-item interactions data:

Data of actions performed by users in relation to items
ex. ratings, buying behavior User-item interactions data example

• Attribute information data:

Data related to the attributes of users or items
ex. textual profiles, relevant keywords

Recommender systems are categorized based on the utilized data!

Collaborative filtering models are a type of

recommender system that uses “user-item interaction data”.

Attribute information data example

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 4 ]
Collaborative Filtering Models
Collaborative Filtering (CF) Model
1. Viewed by
• Utilize the collaborative power of the both users
user-item interactions provided by multiple users

• Three steps in CF recommender system (User-based)

1. Get User-item interactions data

Obtain user viewing history data
2. Identify
similar users
2. Identify similar users
For each user, identify similar users who share
similar video-watching patterns.
3. Viewed by her,
recommended to him
3. Make recommendations
recommend videos that similar users have watched,
Example of collaborative filtering model using
but the target user hasn't seen yet
YouTube watching history data

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 5 ]

Tabular Representation of CF
Video 1 Video 2 Video 3 Video 4 Video 5

User: Andy O O

User: Jessica O O

User: Bob O O O

User: Sophia O O O

( ※ The "O" symbol indicates that the user has already viewed that video. )
Questions

• Who is the user with the most similar video preferences to user Andy?

• Which video is the most recommended to user Andy?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 6 ]

CF based on Ratings
Movie 1 Movie 2 Movie 3 Movie 4 Movie 5

User: Andy 4 5 1 ??

User: Jenny 4 4

User: Mike 3 4 1 2

User: Sally 4 1 2 4

( ※ Users rate movies on a scale of 1 to 5. )

Questions

• Who is the user with the most similar video preferences to user Andy?

• What is the expected rating from user Andy for video 5? (Is video 5 worth recommending?)

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 7 ]

User-Item Interaction Data
Types of user-item interaction

• Explicit ratings:
Users provide direct feedback on individual items through numerical ratings
ex. liking/disliking videos on YouTube, movie ratings

• Implicit feedback:
User preferences are indirectly estimated through user behavior
ex. Buying behavior, time spent on a website, clicked on detailed information

Noises in Implicit feedback

• There is a possibility of misinterpretation

ex. accidental clicks, purchases made as gifts (not reflecting the user's own preferences)

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 8 ]

Types of Ratings
Ratings can be defined in a variety of ways, depending on the application at hand!

• Unary ratings:
specify a positive preference for an item, but there is no mechanism to specify negative preference.
ex. “like” button on Facebook

• Binary ratings:
only two options are present, corresponding to positive or negative response.
ex. either like or dislike on YouTube Music

• Interval-based ratings:
representing preferences through discrete numbers within a specific range.
ex. 5-point, 10-point rating system for movies

• Continuous ratings:
representing preferences through continuous numbers within a specific range.

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 9 ]

Notations of Rating Matrices
Rating

• 𝑟 , : rating of user 𝑢 for item 𝑗

Ratings matrix

• 𝑚 × 𝑛 matrix, containing 𝑚 users and 𝑛 items, denoted by 𝑅

𝑟, ⋯ 𝑟,
• 𝑅 = 𝑟 , = ⋮ ⋱ ⋮
𝑟 , … 𝑟 ,

• The 𝑢 row contains a collection of ratings for individual items

by user 𝑢

• The 𝑗 column is a collection of ratings by users for item 𝑗

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 10 ]

Problem Formulation of CF
The problem formulation of collaborative filtering (CF)

• Assumption:
incomplete 𝑚 × 𝑛 matrix 𝑅 = 𝑟 ,
→ only a small subset of the rating matrix is specified (or observed)

• Primitive problem:
predicting the missing (unobserved) rating values of a user-item rating matrix.

• Advanced problem:
Determining the top-k items or top-k users.
→ equivalent to the problem of selecting the top-k from the expected ratings.

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 11 ]

Predictions for Unobserved Ratings
There are two types of methods used in CF recommender system.

• Memory-based methods:
directly utilize the ratings matrix for predicting unobserved ratings
(also referred to as neighborhood-based methods)

1. User-based methods:
aggregating ratings or preferences from similar users
→ the goal is to find users similar to the target user

2. Item-based Methods:
aggregating ratings or preferences from similar items
→the goal is to find items similar to the ones the target user has interacted with

• Model-based methods:
Indirectly utilize the ratings matrix to estimate a representative model

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 12 ]

Similarity Measures
Ratings database for collaborative recommendation
6 User: Andy User: Jenny User: Bob
Item 1 Item 2 Item 3 Item 4
5

User: 4
5 3 4 4

Ratings
Andy 3

User: 2
3 1 2 3
Jenny
1
User: 0
1 5 5 2
Bob Item 1 Item 2 Item 3 Movie 4

Questions

• Who is the user with the most similar preferences to user Andy?

• Can you guess a way to numerically compare similarity between users?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 13 ]

Euclidean Distance
Similarity measure using Euclidean distance Item 1 Item 2 Item 3 Item 4

• Preference difference (gap) Andy 5 3 4 4

The preference difference for item 𝑗 between Jenny 3 1 2 3
two users (Andy and Jenny) is as follows:
Bob 1 5 5 2
𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 = 𝑟 , −𝑟 ,

• Euclidean distance
The differences for each item can be combined into a single value using the Euclidean distance.
𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒 𝑎𝑛𝑑𝑦, 𝑗𝑒𝑛𝑛𝑦 = ∑ ∈ , , , 𝑟 , −𝑟 ,

= 5−3 + 3−1 + 4−2 + 4−3 = 13 = 3.61

Questions

• What is the similarity, using Euclidean distance, between Andy and Bob?

• In terms of similarity using Euclidean distance, who has preferences closer to Andy's?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 14 ]

Cosine Similarity in 2D
Item 2
Item 1 Item 2 Item 3 Item 4 Rating vector
Bob (1,5)
Andy 5 3 4 4 Euclidean distance
(Andy, Bob)
Jenny 3 1 2 3
Rating vector
Bob 1 5 5 2 Andy (5,3)

How to calculate 𝒄𝒐𝒔𝒊𝒏𝒆(𝜽) between the rating vectors Cosine Similarity

(Andy, Bob)
For two vectors A and B in n-dimensional space:
• Cosine similarity
𝜽
𝑐𝑜𝑠𝑖𝑛𝑒 𝐴, 𝐵 = Rating vector
Jenny (3,1)

• Dot product (𝐴 𝐵) Item 1

𝐴 𝐵 = ∑ ∈{ , ,…, }(𝐴 𝐵)
𝐴 𝐵
𝑐𝑜𝑠𝑖𝑛𝑒 𝐴𝑛𝑑𝑦, 𝐵𝑜𝑏 =
• Euclidean norm ( 𝐴 , 𝐵 ) 𝐴 𝐵
5 1 + (3 5) 20
𝐴 = ∑ 𝐴 = = = 0.67
∈{ , ,…, } 5 +3 1 +5 34 26

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 15 ]

Cosine Similarity
Cosine similarity in n-dimension Item 1 Item 2 Item 3 Item 4

• Magnitude of rating vectors Andy 5 3 4 4

The magnitude of each rating vectors is calculated Jenny 3 1 2 3
using the Euclidean norm formula.
Bob 1 5 5 2

𝑟𝑎𝑡𝑖𝑛𝑔𝑠 = ∑ ∈{ , , , } 𝑟 , = 5 + 3 + 4 + 4 = 66 = 8.12
𝑟𝑎𝑡𝑖𝑛𝑔𝑠 = 3 + 1 + 2 + 3 = 23 = 4.80

• Cosine similarity
The calculation of cosine similarity between two rating vectors is as follow:

∑ ∈{ , ,…, }( , , )
𝑐𝑜𝑠𝑖𝑛𝑒 𝑎𝑛𝑑𝑦, 𝑗𝑒𝑛𝑛𝑦 = = = = 0.97
. .

Questions

• What is the cosine similarity between Andy and Bob?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 16 ]

Pearson’s Correlation Coefficient
Pearson’s correlation coefficient Item 1 Item 2 Item 3 Item 4

• Average rating Andy 5 3 4 4

Average rating values are used to remove Jenny 3 1 2 3
user- or item-specific biases.
Bob 1 5 5 2
∑ ∈{ , , , } ,
𝜇 = ∑ ∈{ , , , }
= =4
Questions
𝜇 = = 2.25
• What is the Pearson similarity between Andy and Bob?
• Pearson similarity
The calculation of Pearson similarity between two rating vectors is as follow:

∑ ∈{ , ,…, }( , )( , )
𝑃𝑒𝑎𝑟𝑠𝑜𝑛 𝑎𝑛𝑑𝑦, 𝑗𝑒𝑛𝑛𝑦 =
∑ ∈ , ,…, , ∑ ∈ , ,…, ,
. . . ( )( . )
= = .
= 0.85
. . . .

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 17 ]

Similarity in Incomplete Ratings Matrix
Movie 1 Movie 2 Movie 3 Movie 4 Movie 5
User: Andy 4 5 1 2
User: Jenny 4 4 1
User: Mike 3 4 1 2

Similarity calculation in an incomplete ratings matrix

• Set of item indices:

denote the set of item indices for which ratings have been specified by user 𝑢.

𝐼 = 1, 2, 3, 4 ; 𝐼 = 3, 4, 5 ; 𝐼 = {1, 2, 4, 5} → 𝐼 ∩𝐼 = {3, 4}
∑ ∈
• Similarity (Pearson):
,
𝜇 =
∑ ∈ ∩ ( , )( , )
𝑃𝑒𝑎𝑟𝑠𝑜𝑛 𝑎𝑛𝑑𝑦, 𝑗𝑒𝑛𝑛𝑦 =
∑ ∈ ∩ , ∑ ∈ ∩ ,

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 18 ]

Jaccard Similarity
Movie 1 Movie 2 Movie 3 Movie 4 Movie 5
User: Andy O O O O
User: Jenny O O O
User: Mike O O O O

Similarity calculation for non-ratings matrix

• Set of item indices (repeated):

denote the set of item indices for which ratings have been specified by user 𝑢.

𝐼 = 1, 2, 3, 4 ; 𝐼 = 3, 4, 5 ; 𝐼 = 1, 2, 4, 5

• Jaccard similarity:

∩
𝐽𝑎𝑐𝑐ard andy, jenny = = = 0.4
∪

Finding Neighbors with Similar Tastes
User-user similarity computation between user 3 and other users

Item Item Item Item Item Item Avg.

𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑
1 2 3 4 5 6 rating
User 1 7 6 7 4 5 4 5.5 0.956 0.894
User 2 6 7 - 4 3 4 4.8 0.981 0.939
User 3 - 3 3 1 1 - 2 1.0 1.0
User 4 1 2 2 3 3 4 2.5 0.789 -1.0
User 5 1 - 1 2 3 3 2 0.645 -0.817

Questions

• Who is the user with similar preferences to User 3 from a cosine similarity perspective?

• Who is the user with similar preferences to User 3 from a Pearson similarity perspective?

Predicting Unobserved Ratings
Item Item Item Item Item Item
𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑
1 2 3 4 5 6
User 1 7 6 7 4 5 4 0.956 0.894 Neighbors:
user with similar
User 2 6 7 - 4 3 4 0.981 0.939 preferences
User 3 ?? 3 3 1 1 ?? 1.0 1.0
User 4 1 2 2 3 3 4 0.789 -1.0
User 5 1 - 1 2 3 3 0.645 -0.817

Predicting unobserved (missing) ratings

• Hat notation:
The hat notation “^” on top of 𝑟 , indicates a predicted rating.

𝑟̂ , : The predicted rating for User 3 on Item 1

𝑟̂ , : The predicted rating for User 3 on Item 6

Raw Rating Prediction
Item Item Item Item Item Item
𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑
1 2 3 4 5 6
User 1 7 6 7 4 5 4 0.956 0.894 Neighbors:
user with similar
User 2 6 7 - 4 3 4 0.981 0.939 preferences
User 3 ?? 3 3 1 1 ?? 1.0 1.0
User 4 1 2 2 3 3 4 0.789 -1.0
User 5 1 - 1 2 3 3 0.645 -0.817

Raw rating prediction:

predicting unrated items using the raw ratings of neighbors.
※ Notations
Based on cosine similarity: 𝑁(= 1, 2 ):
set of user with similar tastes to the target user.
∑ ∈ , ∗ , ∗ . ∗ .
𝑟̂ , = ∑ ∈ ,
= . .
= 6.49
𝑏 ∈ 𝑁:
∑ ∈ , ∗
𝑟̂ , = ,
=
∗ . ∗ .
=4 𝑏 represents an user belonging to the set N.
∑ ∈ , . .

Does something seem odd?
Unobserved rating prediction result with raw ratings of neighbors

Item 1 Item 2 Item 3 Item 4 Item 5 Item 6 𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑

User 1 7 6 7 4 5 4 0.956 0.894

User 2 6 7 - 4 3 4 0.981 0.939

User 3 6.49 3 3 1 1 4 1.0 1.0

User 4 1 2 2 3 3 4 0.789 -1.0

User 5 1 - 1 2 3 3 0.645 -0.817

Questions

• Can you guess what's strange?

Mean-Centered Prediction
Item Item Item Item Item Item Avg.
𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑
1 2 3 4 5 6 rating
User 1 7 6 7 4 5 4 5.5 0.956 0.894
User 2 6 7 - 4 3 4 4.8 0.981 0.939
User 3 ?? 3 3 1 1 ?? 2 1.0 1.0

Mean-centered prediction:
predicting unrated items using the mean-centered ratings of neighbors. ※ Average rating (recap)
∑∈ 𝑟,
• Mean-centered ratings: 𝑠 , =𝑟 , −𝜇 𝜇 =
𝐼

Based on cosine similarity:

∑ ∈ , ∗ , ( . )∗ . ( . )∗ .
𝑟̂ , =𝜇 + ∑ ∈
=2+ = 3.35
, . .
∑ ∈ , ∗ , ( . )∗ . ( . )∗ .
𝑟̂ , =𝜇 + ∑ ∈
=2+ = 0.85
, . .

Actual Recommendation
Unobserved rating prediction result with mean-centered ratings of neighbors

Item 1 Item 2 Item 3 Item 4 Item 5 Item 6 𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑

User 1 7 6 7 4 5 4 0.956 0.894

User 2 6 7 - 4 3 4 0.981 0.939

User 3 3.35 3 3 1 1 0.85 1.0 1.0

User 4 1 2 2 3 3 4 0.789 -1.0

User 5 1 - 1 2 3 3 0.645 -0.817

Questions

• What item would you recommend to User 3?

In-Class Activities
Item-based collaborative filtering case:

Item 1 Item 2 Item 3 Item 4 Item 5 Item 6

User 1 7 6 7 4 5 4
User 2 6 7 - 4 3 4
User 3 ?? 3 3 1 1 ??
User 4 1 2 2 3 3 4
User 5 1 - 1 2 3 3
Avg. rating ?? ?? ?? ?? ?? ??
𝒄𝒐𝒔𝒊𝒏𝒆 𝟏, 𝒋
?? ?? ?? ?? ?? ??
(item-item)
𝒄𝒐𝒔𝒊𝒏𝒆 𝟔, 𝒋
?? ?? ?? ?? ?? ??
(item-item)

To-do: Please fill in the correct values in the cells marked with ??.

Wrap up!
What we’ve learned In today's class ...

• Definition of Collaborative Filtering

→ Recommendation technique
that predicts a user's preferences based on the past preferences data.

• Input Data for Collaborative Filtering

• User-item interaction data
→ Records of actions performed by users in relation to items,
such as ratings or buying behavior.

• Collaborative Filtering Algorithms

• Types of CF → memory-based (user-based, item-based), model-based
• Measuring similarity → cosine similarity, Pearson similarity, …
• Predicting ratings → raw rating prediction, mean-centered prediction

Recommendations Using Collaborative Filtering
No ratings yet
Recommendations Using Collaborative Filtering
37 pages
Slides Lecture 2 RecSys
No ratings yet
Slides Lecture 2 RecSys
86 pages
Movie Recommendation System: CSN-382 Project
No ratings yet
Movie Recommendation System: CSN-382 Project
25 pages
AN OPTIMIZED ITEM-BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHM
No ratings yet
AN OPTIMIZED ITEM-BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHM
5 pages
Recommender System - New
No ratings yet
Recommender System - New
49 pages
Recommender System
No ratings yet
Recommender System
26 pages
Recommendation System
No ratings yet
Recommendation System
32 pages
Combining Memory-Based and Model-Based Collaborative Filtering in Recommender System
100% (1)
Combining Memory-Based and Model-Based Collaborative Filtering in Recommender System
4 pages
Recommender Systems & Collaborative Filtering
No ratings yet
Recommender Systems & Collaborative Filtering
14 pages
CAIM: Cerca I Anàlisi D'informació Massiva: FIB, Grau en Enginyeria Informàtica
No ratings yet
CAIM: Cerca I Anàlisi D'informació Massiva: FIB, Grau en Enginyeria Informàtica
36 pages
A Personalized Recommender Integrating Item-Based and User-Based Collaborative Filtering
No ratings yet
A Personalized Recommender Integrating Item-Based and User-Based Collaborative Filtering
4 pages
Is593-Lecture04 Recommendation Systems
No ratings yet
Is593-Lecture04 Recommendation Systems
51 pages
Module5 Recommender Systems PartB
No ratings yet
Module5 Recommender Systems PartB
57 pages
L6 Recommendation
No ratings yet
L6 Recommendation
56 pages
Module 5
No ratings yet
Module 5
8 pages
.Trashed-1724941095-Recommender Systems
No ratings yet
.Trashed-1724941095-Recommender Systems
30 pages
Recommended System [5]
No ratings yet
Recommended System [5]
33 pages
RS Part 1
No ratings yet
RS Part 1
40 pages
Recommender Week6
No ratings yet
Recommender Week6
34 pages
Collaborative Filtering: Niranjan Shah (073/BCT/545) Shekhar Khadka (073/BCT/572)
No ratings yet
Collaborative Filtering: Niranjan Shah (073/BCT/545) Shekhar Khadka (073/BCT/572)
12 pages
Unit Iii-Collaborative Filtering
No ratings yet
Unit Iii-Collaborative Filtering
34 pages
RecSys Updated
No ratings yet
RecSys Updated
37 pages
CS345A Data Mining: Recommendation Systems
No ratings yet
CS345A Data Mining: Recommendation Systems
26 pages
Module5 Recommender Systems PartA
No ratings yet
Module5 Recommender Systems PartA
54 pages
T10 Recommender System
No ratings yet
T10 Recommender System
45 pages
Recommendation System
No ratings yet
Recommendation System
17 pages
Week 6 Recommender
No ratings yet
Week 6 Recommender
17 pages
Recommender Systems Notes
No ratings yet
Recommender Systems Notes
16 pages
[2012]_sistemasderecomendacion
No ratings yet
[2012]_sistemasderecomendacion
18 pages
A Collaborative Filtering Recommendation Algorithm Based on Item Genre and Rating Similarity
No ratings yet
A Collaborative Filtering Recommendation Algorithm Based on Item Genre and Rating Similarity
4 pages
Title_obvhbResearch_Project
No ratings yet
Title_obvhbResearch_Project
7 pages
15.0 Collaborative Filtering
No ratings yet
15.0 Collaborative Filtering
13 pages
Collab Survey
No ratings yet
Collab Survey
19 pages
Recommender Systems-Unit Iii
No ratings yet
Recommender Systems-Unit Iii
9 pages
Lec15-S Sarkar
No ratings yet
Lec15-S Sarkar
12 pages
recommender_system_part4
No ratings yet
recommender_system_part4
28 pages
Unit-3
No ratings yet
Unit-3
21 pages
M03 Item-Based CF-V2 (1)
No ratings yet
M03 Item-Based CF-V2 (1)
27 pages
12-recsys-1 - converted
No ratings yet
12-recsys-1 - converted
11 pages
M02 User-Based CF V02
No ratings yet
M02 User-Based CF V02
20 pages
Incremental Collaborative Filtering For Binary Ratings: December 2008
No ratings yet
Incremental Collaborative Filtering For Binary Ratings: December 2008
5 pages
Miranda 2008 A
No ratings yet
Miranda 2008 A
5 pages
Lecture 2 Part1
No ratings yet
Lecture 2 Part1
14 pages
Recommendation System in Python
No ratings yet
Recommendation System in Python
13 pages
Collaborative Filtering
No ratings yet
Collaborative Filtering
31 pages
Recommender Systems
No ratings yet
Recommender Systems
12 pages
CS583 Recommender Systems
No ratings yet
CS583 Recommender Systems
40 pages
6CS4 ML Unit-5
No ratings yet
6CS4 ML Unit-5
33 pages
8 Recommender
No ratings yet
8 Recommender
139 pages
Advances in Artificial Intelligence - 2009 - Su - A Survey of Collaborative Filtering Techniques
No ratings yet
Advances in Artificial Intelligence - 2009 - Su - A Survey of Collaborative Filtering Techniques
19 pages
PCL Group2
No ratings yet
PCL Group2
21 pages
Movie Recommendations
No ratings yet
Movie Recommendations
12 pages
Abbdf Zhang
No ratings yet
Abbdf Zhang
10 pages
10 Recommender Systems
No ratings yet
10 Recommender Systems
35 pages
2404 16177v1
No ratings yet
2404 16177v1
6 pages
Recommender System
No ratings yet
Recommender System
20 pages
AStudyof Mathematical Modelfor Collaborative Filtering
No ratings yet
AStudyof Mathematical Modelfor Collaborative Filtering
10 pages
An Item-based Collaborative Filtering Recommendation Algorithm Using Slope
No ratings yet
An Item-based Collaborative Filtering Recommendation Algorithm Using Slope
3 pages
Chapter4 - Web Based Personalization Systems - Part2 - Collaborative Filtering - KNN
No ratings yet
Chapter4 - Web Based Personalization Systems - Part2 - Collaborative Filtering - KNN
22 pages
The YouTube Algorithm: Decoding the Mystery
From Everand
The YouTube Algorithm: Decoding the Mystery
Rowan Everhart
No ratings yet
Asynchronous Counters
No ratings yet
Asynchronous Counters
5 pages
ACH_FORM
No ratings yet
ACH_FORM
2 pages
FULLTEXT01
No ratings yet
FULLTEXT01
14 pages
Statement of Purpose
No ratings yet
Statement of Purpose
3 pages
The Enterprise Resource Planning Decade Lessons Learned and Issues for the Future 1st Edition FréDéRic Adam all chapter instant download
100% (3)
The Enterprise Resource Planning Decade Lessons Learned and Issues for the Future 1st Edition FréDéRic Adam all chapter instant download
34 pages
Sumanth Resume
No ratings yet
Sumanth Resume
8 pages
BITWeek8 - L12 - ITE2422 V1
No ratings yet
BITWeek8 - L12 - ITE2422 V1
11 pages
Infor - ERP - VISUAL - Detailed Functionality - Version9
No ratings yet
Infor - ERP - VISUAL - Detailed Functionality - Version9
78 pages
Dynamic Supply and Demand Zones [AlgoAlpha] — אינדיקטור מאת AlgoAlpha
No ratings yet
Dynamic Supply and Demand Zones [AlgoAlpha] — אינדיקטור מאת AlgoAlpha
1 page
Elavon Terminal Pre-Requisite Checklist
No ratings yet
Elavon Terminal Pre-Requisite Checklist
2 pages
Cis Controls V7.1: Center For Internet Security
No ratings yet
Cis Controls V7.1: Center For Internet Security
5 pages
GC Strategies - Expert-Level Resources With Exceptional Service
No ratings yet
GC Strategies - Expert-Level Resources With Exceptional Service
4 pages
Moralde Justine Assignment 2
No ratings yet
Moralde Justine Assignment 2
7 pages
116566764
No ratings yet
116566764
82 pages
Seller Onboarding Training Manual 12
No ratings yet
Seller Onboarding Training Manual 12
43 pages
Full download (Ebook) Introducing Microsoft WebMatrix by Laurence Moroney ISBN 0735649707 pdf docx
No ratings yet
Full download (Ebook) Introducing Microsoft WebMatrix by Laurence Moroney ISBN 0735649707 pdf docx
82 pages
XtremSW Cache User Guide 2.0.1
No ratings yet
XtremSW Cache User Guide 2.0.1
288 pages
Test in Ms Powerpoint
No ratings yet
Test in Ms Powerpoint
1 page
For Pos Developers
No ratings yet
For Pos Developers
62 pages
Minibjörn's Monitor Hunter's Fact Sheet
No ratings yet
Minibjörn's Monitor Hunter's Fact Sheet
24 pages
The Scopes and Sequences For Grade 4
No ratings yet
The Scopes and Sequences For Grade 4
3 pages
Forex Trend Classification by Machine Learning
No ratings yet
Forex Trend Classification by Machine Learning
7 pages
Q.1 Design A Program For Creating A Machine That Accept Three Consecutive Ones. Program
50% (2)
Q.1 Design A Program For Creating A Machine That Accept Three Consecutive Ones. Program
44 pages
Minutes of Meeting - 29 June
No ratings yet
Minutes of Meeting - 29 June
2 pages
A Case Study of A Virtual Power Plant VPP As A Dat
No ratings yet
A Case Study of A Virtual Power Plant VPP As A Dat
24 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Project Report
No ratings yet
Project Report
3 pages
Cubis II MCA - 21 CFR Compliance Checklist e
No ratings yet
Cubis II MCA - 21 CFR Compliance Checklist e
10 pages
FPS Shooter Games Presentation
No ratings yet
FPS Shooter Games Presentation
10 pages
Configuring and Troubleshooting Print Devices
No ratings yet
Configuring and Troubleshooting Print Devices
11 pages

Lecture 1_Collaborative Filtering

Uploaded by

Lecture 1_Collaborative Filtering

Uploaded by

DIGB368 Lecture 1:

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 1 ]

• Definition of Collaborative Filtering?

• Input Data for Collaborative Filtering

• Collaborative Filtering Algorithms

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 2 ]

Collaborative Content-based Knowledge-based Hybrid

User- Item- Matrix Deep

In this lecture, we will focus primarily on Memory-Based Collaborative Filtering (CF).

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 3 ]

• User-item interactions data:

• Attribute information data:

Recommender systems are categorized based on the utilized data!

Collaborative filtering models are a type of

Attribute information data example

• Three steps in CF recommender system (User-based)

1. Get User-item interactions data

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 5 ]

• Which video is the most recommended to user Andy?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 6 ]

( ※ Users rate movies on a scale of 1 to 5. )

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 7 ]

Noises in Implicit feedback

• There is a possibility of misinterpretation

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 8 ]

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 9 ]

• 𝑟 , : rating of user 𝑢 for item 𝑗

• 𝑚 × 𝑛 matrix, containing 𝑚 users and 𝑛 items, denoted by 𝑅

• The 𝑢 row contains a collection of ratings for individual items

• The 𝑗 column is a collection of ratings by users for item 𝑗

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 10 ]

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 11 ]

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 12 ]

• Can you guess a way to numerically compare similarity between users?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 13 ]

• Preference difference (gap) Andy 5 3 4 4

= 5−3 + 3−1 + 4−2 + 4−3 = 13 = 3.61

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 14 ]

How to calculate 𝒄𝒐𝒔𝒊𝒏𝒆(𝜽) between the rating vectors Cosine Similarity

• Dot product (𝐴 𝐵) Item 1

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 15 ]

• Magnitude of rating vectors Andy 5 3 4 4

• What is the cosine similarity between Andy and Bob?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 16 ]

• Average rating Andy 5 3 4 4

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 17 ]

Similarity calculation in an incomplete ratings matrix

• Set of item indices:

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 18 ]

Similarity calculation for non-ratings matrix

• Set of item indices (repeated):

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 19 ]

Item Item Item Item Item Item Avg.

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 20 ]

Predicting unobserved (missing) ratings

𝑟̂ , : The predicted rating for User 3 on Item 1

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 21 ]

Raw rating prediction:

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 22 ]

Item 1 Item 2 Item 3 Item 4 Item 5 Item 6 𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑

User 1 7 6 7 4 5 4 0.956 0.894

User 2 6 7 - 4 3 4 0.981 0.939

User 3 6.49 3 3 1 1 4 1.0 1.0

User 4 1 2 2 3 3 4 0.789 -1.0

User 5 1 - 1 2 3 3 0.645 -0.817

• Can you guess what's strange?

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 23 ]

Based on cosine similarity:

Copyright ⓒ 2024 by Tae-Sub Yun, Dept. of Digital Business, Korea University [ 24 ]

Item 1 Item 2 Item 3 Item 4 Item 5 Item 6 𝒄𝒐𝒔𝒊𝒏𝒆(𝒊, 𝟑) 𝑷𝒆𝒂𝒓𝒔𝒐𝒏 𝒊, 𝟑

User 1 7 6 7 4 5 4 0.956 0.894

User 2 6 7 - 4 3 4 0.981 0.939

User 3 3.35 3 3 1 1 0.85 1.0 1.0

User 4 1 2 2 3 3 4 0.789 -1.0

User 5 1 - 1 2 3 3 0.645 -0.817

• What item would you recommend to User 3?