SlideShare a Scribd company logo
FEBRUARY 22, 2018, WARSAW
Factorization Machines for building
Recommender System
Paweł Łagodziński, SAS Institute Poland
FEBRUARY 22, 2018, WARSAW
Recommender Systems
• Primary goal – recommend most relevant items to users
Plenty of success stories:
• music, movies
• retail products
• ad verbs
• news
• query results
• social links
Users
Items
Context
Recommender System
FEBRUARY 22, 2018, WARSAW
Recommender Systems
• Key challenges
• sparse data
• extremely large data
• cold start
• Other issues
• different ratings types
• difficult performance evolution
FEBRUARY 22, 2018, WARSAW
Regression models
• Linear regression
• Polynomial regression (2-d)
– input vector with p predictors
y
– predicted rating
– number of parameters
O( ҧ𝑝) – learning complexity
1+p
– observed rating
1+p+p2 – number of parameters
O( ҧ𝑝2
) – learning complexity
FEBRUARY 22, 2018, WARSAW
Regression models
• Pros
• Combines real-value and categorical features
• Useful for regression and classification problems
• Fast learning for linear model
• Cons
• Linear regression does not include user-item interactions
• Polynomial regression model includes, but only in theory for sparse data
• For sparse data pairwise interactions are very rarely observed, in consequences
parameters cannot be estimated directly from the data
FEBRUARY 22, 2018, WARSAW
Laten factor models - matrix factorization
𝑌 ∈ ℝ|𝑈|×|𝐼| matrix of observed ratings
i1 i2 i3 i4
u1 2 3 1 ?
u2 ? 5 ? 1
u3 ? ? 4 1
u4 2 3 ? 2
I - items
U-users
• If matrix Y is low-rank it can be approximated with ෠𝑌
𝑌 ≈ ෠𝑌
where k is the rank of matrix ෡𝒀
(the size of latent factors vector)෠𝑌 = 𝑊𝐻 𝑇
, W ∈ ℝ 𝑈 ×𝑘, H ∈ ℝ|𝐼|×𝑘
Rating
predictions
ො𝑦 𝑢𝑖 = ෍
𝑗=1
𝑘
𝑤 𝑢𝑗 ∙ ℎ𝑗𝑖 =< 𝒘 𝑢 𝒉𝑖 >
FEBRUARY 22, 2018, WARSAW
Nero
JuliusCaesar
Cleopatra
SleeplessinSeatle
PrettyWomen
Casablanca
1 0 0 0 0 0 0
2 0 0 0 0 0 0
3 0 0 0 0 0 0
4 0 0 -1 0 0 0
5 0 0 -1 0 0 0
6 0 0 1 0 0 0
7 0 0 -1 0 0 0
≈
History
Romance
1 1 0
2 1 0
3 1 0
4 1 1
5 -1 1
6 -1 1
7 -1 1
Nero
JuliusCaesar
Cleopatra
SleeplessinSeatle
PrettyWomen
Casablanca
History 1 1 1 0 0 0
Romance 0 0 1 1 1 1
Rating: -1- dislike, 0 – neutral, 1 - like
History
Romance
Both x
Laten factor models - matrix factorization
• correlations -> lower rank matrix
• strong correlations -> works for sparse data
after C.C. Aggarwal [1]
Nero
JuliusCaesar
Cleopatra
SleeplessinSeatle
PrettyWomen
Casablanca
1 1 1 1 0 0 0
2 1 1 1 0 0 0
3 1 1 1 0 0 0
4 1 1 1 1 1 1
5 -1 -1 -1 1 1 1
6 -1 -1 1 1 1 1
7 -1 -1 -1 1 1 1
• Intuition behind matrix factorization with low (2) rank matrix
𝑊 𝐻 𝑇𝑌 𝑌 − ෠𝑌
FEBRUARY 22, 2018, WARSAW
Matrix factorization based models
• Many specific variants of factorization models were proposed
• Examples:
• MF
• Time Senstive Factorization Models
• User-item matrix factorization
• SVD++
• Factorized kNN
• timeSVD
• timeTF
• Tensore Factorization Models
• Sequential Models
• PARAFAC
• PITF
• FMC
• FPMC
(after S. Randal, MLConf, 2014)
FEBRUARY 22, 2018, WARSAW
Matrix factorization based models
• Pros
• Enable modelling interactions (user-item-…) even if such events are not
observed in data
• Effective on sparse data
• Cons
• Learning algorithms are tailored to each specific model
• In result, it is hard to adjust model to the problem under consideration
FEBRUARY 22, 2018, WARSAW
Factorization Machines
• Concept generalizing regression and factorization models
• Introduced in 2010 (S. Randel [2])
• Very good performance in data science challenges (after S. Randal, MLConf, 2014):
• Click prediction - KDDCup 2012: Track 2 (3rd place), Criteon Display Advertising Challenge (1st place)
• Rating prediction – Netlix prize, Movielens, KDDCup 2011, EMI Music Data Science Hackathon (2nd
place)
• Social link prediction (social media) - KDDCup 2012: Track 1 (2nd place)
• Recommend given name - ECML/PKDD Discovery Challenge 2013 (1st place on-line track, 2nd place
off-line track)
• Predicting result of next exam question - KDDCup 2010, Grockit Challenge (1st place)
• Prediction of auction sale price –Kaggle, Blue Book for Bulldozers (1st place)
FEBRUARY 22, 2018, WARSAW
Factorization Machines - model
• Factorization Machines (2nd degree)
– input vector with p predictors
y – observed rating
– predicted rating
1+p+kp – number of parameters
O( ҧ𝑝𝑘) – learning complexity (lower than ҧ𝑝2 for 2-d polynomial regression)
• FM of higher degrees possible but not used in practice
• Parameters can be estimated with use of universal optimization
methods, Stochastic Gradient Decent for instance
FEBRUARY 22, 2018, WARSAW
Factorization Machines
• Features engineering enables great expressiveness of FM model
• Matrix / tensor data can be easily transformed into features vector
Input Y
i1 i2 i3 i4
u1 2 3 1 ?
u2 ? 5 ? 1
u3 ? ? 4 1
User(U)
Item (I)
U I y
1 u1 i1 2
2 u1 i2 3
3 u1 i3 1
4 u2 i2 5
5 u2 i4 1
6 u3 i3 4
7 u3 i4 1
target
y
x1 1 0 0 1 0 0 0 2 y1
x2 1 0 0 0 1 0 0 3 y2
x3 1 0 0 0 0 1 0 1 y3
x4 0 1 0 0 1 0 0 5 y4
x5 0 1 0 0 0 0 1 1 y5
x6 0 0 1 0 0 1 0 4 y6
x7 0 0 1 0 0 0 1 1 y7
u1 u2 u3 i1 i2 i3 i4
Users Items
x
Features vector
(one-hot
encoding)
FEBRUARY 22, 2018, WARSAW
Factorization Machines
• Applying FM model
target
y
x1 1 0 0 1 0 0 0 2 y1
x2 1 0 0 0 1 0 0 3 y2
x3 1 0 0 0 0 1 0 1 y3
x4 0 1 0 0 1 0 0 5 y4
x5 0 1 0 0 0 0 1 1 y5
x6 0 0 1 0 0 1 0 4 y6
x7 0 0 1 0 0 0 1 1 y7
u1 u2 u3 i1 i2 i3 i4
Users Items
x
Features vector
FM is equal to combination of MF (interaction
user-item) and biases for user and item
FEBRUARY 22, 2018, WARSAW
Factorization Machines
• Adding context categorical information gives other MF model
FM is equivalent to PITF model, one of
extension of MF model
ො𝑦 𝑢𝑖𝑐 = 𝑤0 + 𝑤 𝑢 + 𝑤𝑖 + 𝑤𝑐 +< 𝒗 𝑢 𝒗𝑖 >+< 𝒗 𝑢 𝒗 𝑐 >+< 𝒗𝑖 𝒗 𝑐 >
target
y
x1 1 0 0 1 0 0 0 0 0 1 2 y1
x2 1 0 0 0 1 0 0 1 0 0 3 y2
x3 1 0 0 0 0 1 0 1 0 0 1 y3
x4 0 1 0 0 1 0 0 0 1 0 5 y4
x5 0 1 0 0 0 0 1 1 0 0 1 y5
x6 0 0 1 0 0 1 0 0 1 0 4 y6
x7 0 0 1 0 0 0 1 0 0 1 1 y7
u1 u2 u3 i1 i2 i3 i4 c1 c2 c3
Context
Features vector
x
Users Items
FEBRUARY 22, 2018, WARSAW
Factorization Machines
• Factorization Machines offers combination of regression and factorization models
• Low rank matrix approximation enables estimation of unobserved interactions
• Effective for sparse and extremely sparse data
• Flexible through features engineering
• One of the most disruptive algorithms of recent years
FEBRUARY 22, 2018, WARSAW
FactMac – Factorization Machines in SAS Viya
FEBRUARY 22, 2018, WARSAW
FactMac – Factorization Machines in SAS Viya
• 2-nd degree Factorization Machines
• Learning with „HOGWILD!”
• Hyper-parameters optimization with Grid Search, Genetic Algorithm
• Different deployment options
• SAS Viya
• In-database
• In-stream (SAS ESP)
• Considered for the next releases
• Field-Aware Factorization Machines, FFM [3] – better predictions than FM
• Warm-up restart – faster retraining
• Other loss functions
2
3
1
1
2
3
FEBRUARY 22, 2018, WARSAW
Shot Recommender System for NBA Coaches
• Application of Factorization Machines in SAS Viya to sport data
• Presented during 2016 KDD Large-Scale Sports Analytics [5]
• Goal – recommend shot types for players
• Data - recorded shots taken during the 2015-2016 NBA games
• shots for 420 players
• for shot: player, action type, shot zone range, shot zone area, result,
and other characteristics.
• Model for shot prediction (FM 2-d, k=25, p = 404), log-odds as a target
?
Image source: Wikipedia
FEBRUARY 22, 2018, WARSAW
Shot Recommender System for NBA Coaches
• Biases can be very informative
FEBRUARY 22, 2018, WARSAW
Shot Recommender System for NBA Coaches
• Latent factors analysis
FEBRUARY 22, 2018, WARSAW
Shot Recommender System for NBA Coaches
• Shot recommendations (plogit):
FEBRUARY 22, 2018, WARSAW
Shot Recommender System for NBA Coaches
• Demonstration of the concept on NCAA* basketball data
• The National Collegiate Athletic Association (NCAA) is a non-profit organization which regulates athletes
of 1,281 institutions, conferences, and individuals (source: wikipedia).
• Recent news for Kaggle’rs,
new competation launched
2 days ago
FEBRUARY 22, 2018, WARSAW
Where to use Factorization Machines?
• Rule of Thumb
• design matrix is very sparse (mostly missing data, 90% missing or more)
• the cardinality of the nominal variables is very high and each level of the nominal variables
has only a few representatives in the data
• you have good reason to believe the design matrix has low rank
• Findings in [3], seems to confirm above. FFM was working better for CTR prediction but no so good for more
dense data (web phishing, census income)
• Our (local SAS office) experiments, still in progress, showed that FM even for not very sparse data
can produce robust set of baseline predictions. More specialized and tailored models can
outperform FM but requires more effort in development
• FactMac was tested for SAS Customer Intelligence 360 solution with good results.
FEBRUARY 22, 2018, WARSAW
Thank You !
FEBRUARY 22, 2018, WARSAW
References
• [1] Charu C. Aggrwal. „Recommender Systems, The Textbook”. Springer. 2016.
• [2] Steffen Rendle. “Factorization Machines.” In Proceedings of the 10th IEEE
International Conference on Data Mining (ICDM). Piscataway, NJ: Institute of Electrical
And Electronics Engineers. 2010.
• [3] Yuchin Juan et al., „Field-aware Factorization Machines for CTR Prediction”. RecSys
'16 Proceedings of the 10th ACM Conference on Recommender Systems. 2016.
• [4] Jason Lee et al., „Practical Large-Scale Optimization for Max-Norm Regularization”,
Advances in Neural Information Processing Systems 23 (NIPS). 2010.
• [5] Reymond E. Wrigt et al. „Shot Recommender System for NBA Coaches”. KDD. 2016
FEBRUARY 22, 2018, WARSAW

More Related Content

What's hot (20)

Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
Robin Reni
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
Trieu Nguyen
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SF
MLconf
 
Recommender system
Recommender systemRecommender system
Recommender system
Nilotpal Pramanik
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
Alexandros Karatzoglou
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
 
Calibrated Recommendations
Calibrated RecommendationsCalibrated Recommendations
Calibrated Recommendations
Harald Steck
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
Liang Xiang
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Carlos Castillo (ChaTo)
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
Chris Johnson
 
A Hybrid Recommendation system
A Hybrid Recommendation systemA Hybrid Recommendation system
A Hybrid Recommendation system
Pranav Prakash
 
How to build a recommender system?
How to build a recommender system?How to build a recommender system?
How to build a recommender system?
blueace
 
텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016
텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016
텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016
Taehoon Kim
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in Recommendations
Jaya Kawale
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
Yves Raimond
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization Machines
Pavel Kalaidin
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
NAVER Engineering
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)
HJ van Veen
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
Crossing Minds
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
Robin Reni
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
Trieu Nguyen
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SF
MLconf
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
Alexandros Karatzoglou
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
 
Calibrated Recommendations
Calibrated RecommendationsCalibrated Recommendations
Calibrated Recommendations
Harald Steck
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
Liang Xiang
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
Chris Johnson
 
A Hybrid Recommendation system
A Hybrid Recommendation systemA Hybrid Recommendation system
A Hybrid Recommendation system
Pranav Prakash
 
How to build a recommender system?
How to build a recommender system?How to build a recommender system?
How to build a recommender system?
blueace
 
텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016
텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016
텐서플로우 설치도 했고 튜토리얼도 봤고 기초 예제도 짜봤다면 TensorFlow KR Meetup 2016
Taehoon Kim
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in Recommendations
Jaya Kawale
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
Yves Raimond
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization Machines
Pavel Kalaidin
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
NAVER Engineering
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 
Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)
HJ van Veen
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
Crossing Minds
 

Similar to The Factorization Machines algorithm for building recommendation system - Paweł Łagodziński, SAS Institute (20)

And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
Codemotion
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
Liangjie Hong
 
Reading group nfm - 20170312
Reading group  nfm - 20170312Reading group  nfm - 20170312
Reading group nfm - 20170312
Shuai Zhang
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SF
MLconf
 
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender SystemsHybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Matthias Braunhofer
 
Spring 2016 Intern at Treasure Data
Spring 2016 Intern at Treasure DataSpring 2016 Intern at Treasure Data
Spring 2016 Intern at Treasure Data
Sotaro Sugimoto
 
行列分解いろいろ
行列分解いろいろ行列分解いろいろ
行列分解いろいろ
sakamikosuke
 
Matrix Factorization
Matrix FactorizationMatrix Factorization
Matrix Factorization
Yusuke Yamamoto
 
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
PyData
 
Recommender Systems Tutorial (Part 2) -- Offline Components
Recommender Systems Tutorial (Part 2) -- Offline ComponentsRecommender Systems Tutorial (Part 2) -- Offline Components
Recommender Systems Tutorial (Part 2) -- Offline Components
Bee-Chung Chen
 
Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities
Gael Varoquaux
 
Context-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix FactorisationContext-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix Factorisation
Dmitrii Ignatov
 
lecture26-mf.pptx
lecture26-mf.pptxlecture26-mf.pptx
lecture26-mf.pptx
Jadna Almeida
 
lecture244-mf.pptx
lecture244-mf.pptxlecture244-mf.pptx
lecture244-mf.pptx
Jadna Almeida
 
Building Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyBuilding Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at Spotify
Vidhya Murali
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
Lei Guo
 
ML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive Analytics
Erik Bernhardsson
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
Ding Li
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
Chris Johnson
 
And Then There Are Algorithms
And Then There Are AlgorithmsAnd Then There Are Algorithms
And Then There Are Algorithms
InfluxData
 
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
Codemotion
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
Liangjie Hong
 
Reading group nfm - 20170312
Reading group  nfm - 20170312Reading group  nfm - 20170312
Reading group nfm - 20170312
Shuai Zhang
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SF
MLconf
 
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender SystemsHybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Matthias Braunhofer
 
Spring 2016 Intern at Treasure Data
Spring 2016 Intern at Treasure DataSpring 2016 Intern at Treasure Data
Spring 2016 Intern at Treasure Data
Sotaro Sugimoto
 
行列分解いろいろ
行列分解いろいろ行列分解いろいろ
行列分解いろいろ
sakamikosuke
 
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
PyData
 
Recommender Systems Tutorial (Part 2) -- Offline Components
Recommender Systems Tutorial (Part 2) -- Offline ComponentsRecommender Systems Tutorial (Part 2) -- Offline Components
Recommender Systems Tutorial (Part 2) -- Offline Components
Bee-Chung Chen
 
Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities Simple representations for learning: factorizations and similarities
Simple representations for learning: factorizations and similarities
Gael Varoquaux
 
Context-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix FactorisationContext-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix Factorisation
Dmitrii Ignatov
 
Building Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyBuilding Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at Spotify
Vidhya Murali
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
Lei Guo
 
ML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive Analytics
Erik Bernhardsson
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
Ding Li
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
Chris Johnson
 
And Then There Are Algorithms
And Then There Are AlgorithmsAnd Then There Are Algorithms
And Then There Are Algorithms
InfluxData
 

More from Evention (20)

A/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.comA/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.com
Evention
 
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Evention
 
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Evention
 
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Evention
 
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, AdformBuilding a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Evention
 
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data ArtisansApache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Evention
 
Privacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, MapflatPrivacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, Mapflat
Evention
 
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Evention
 
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Evention
 
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Evention
 
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
Evention
 
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Evention
 
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data ArtisansStream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Evention
 
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell SpotifyScaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Evention
 
Big Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz ŚliwaBig Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz Śliwa
Evention
 
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz KołpućElastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Evention
 
H2 o deep water making deep learning accessible to everyone -jo-fai chow
H2 o deep water   making deep learning accessible to everyone -jo-fai chowH2 o deep water   making deep learning accessible to everyone -jo-fai chow
H2 o deep water making deep learning accessible to everyone -jo-fai chow
Evention
 
That won’t fit into RAM - Michał Brzezicki
That won’t fit into RAM -  Michał  BrzezickiThat won’t fit into RAM -  Michał  Brzezicki
That won’t fit into RAM - Michał Brzezicki
Evention
 
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian HueskeStream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Evention
 
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Evention
 
A/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.comA/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.com
Evention
 
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Evention
 
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Evention
 
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Evention
 
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, AdformBuilding a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Evention
 
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data ArtisansApache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Evention
 
Privacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, MapflatPrivacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, Mapflat
Evention
 
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Evention
 
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Evention
 
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Evention
 
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
Evention
 
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Evention
 
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data ArtisansStream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Evention
 
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell SpotifyScaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Evention
 
Big Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz ŚliwaBig Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz Śliwa
Evention
 
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz KołpućElastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Evention
 
H2 o deep water making deep learning accessible to everyone -jo-fai chow
H2 o deep water   making deep learning accessible to everyone -jo-fai chowH2 o deep water   making deep learning accessible to everyone -jo-fai chow
H2 o deep water making deep learning accessible to everyone -jo-fai chow
Evention
 
That won’t fit into RAM - Michał Brzezicki
That won’t fit into RAM -  Michał  BrzezickiThat won’t fit into RAM -  Michał  Brzezicki
That won’t fit into RAM - Michał Brzezicki
Evention
 
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian HueskeStream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Evention
 
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Evention
 

Recently uploaded (20)

Lec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiu
Lec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiuLec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiu
Lec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiu
saifalroby72
 
PM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptx
PM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptxPM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptx
PM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptx
afriyanrtanjung007
 
Role_Based_Permissions_Kick-off_Deck_202203.pptx
Role_Based_Permissions_Kick-off_Deck_202203.pptxRole_Based_Permissions_Kick-off_Deck_202203.pptx
Role_Based_Permissions_Kick-off_Deck_202203.pptx
SystemsBenya
 
Magician PeterMagician PeterMagician PeterMagician Peter
Magician PeterMagician PeterMagician PeterMagician PeterMagician PeterMagician PeterMagician PeterMagician Peter
Magician PeterMagician PeterMagician PeterMagician Peter
seomarket363
 
An Algorithmic Test Using The Game of Poker
An Algorithmic Test Using The Game of PokerAn Algorithmic Test Using The Game of Poker
An Algorithmic Test Using The Game of Poker
Graham Ware
 
STRABAG SE - Investor Presentation - February 2024.pdf
STRABAG SE - Investor Presentation - February 2024.pdfSTRABAG SE - Investor Presentation - February 2024.pdf
STRABAG SE - Investor Presentation - February 2024.pdf
andrianalampka
 
time_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptxtime_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptx
stefanopinto1113
 
14th International Conference on Advanced Computer Science and Information Te...
14th International Conference on Advanced Computer Science and Information Te...14th International Conference on Advanced Computer Science and Information Te...
14th International Conference on Advanced Computer Science and Information Te...
ijitcs
 
Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvh
Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvhLec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvh
Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvh
saifalroby72
 
this is the Dr. ibrahim presentations ppt.pptx
this is the Dr. ibrahim presentations ppt.pptxthis is the Dr. ibrahim presentations ppt.pptx
this is the Dr. ibrahim presentations ppt.pptx
ibrahimabdi22
 
apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)
apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)
apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)
apidays
 
Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...
Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...
Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...
Karim Baïna
 
apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)
apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)
apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)
apidays
 
Block chauin techncology by engineer saniya samreen
Block chauin techncology by engineer saniya samreenBlock chauin techncology by engineer saniya samreen
Block chauin techncology by engineer saniya samreen
Shoyeb16
 
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays
 
How Data Annotation Services Drive Innovation in Autonomous Vehicles.docx
How Data Annotation Services Drive Innovation in Autonomous Vehicles.docxHow Data Annotation Services Drive Innovation in Autonomous Vehicles.docx
How Data Annotation Services Drive Innovation in Autonomous Vehicles.docx
sofiawilliams5966
 
Cyber Security Presentation(Neon)xu.pptx
Cyber Security Presentation(Neon)xu.pptxCyber Security Presentation(Neon)xu.pptx
Cyber Security Presentation(Neon)xu.pptx
vilakshbhargava
 
Computer Applications: An International Journal (CAIJ)
Computer Applications: An International Journal (CAIJ)Computer Applications: An International Journal (CAIJ)
Computer Applications: An International Journal (CAIJ)
ijitcs
 
time_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptxtime_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptx
stefanopinto1113
 
IoT, Data Analytics and Big Data Security.pptx
IoT, Data Analytics and Big Data Security.pptxIoT, Data Analytics and Big Data Security.pptx
IoT, Data Analytics and Big Data Security.pptx
fizarcse
 
Lec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiu
Lec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiuLec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiu
Lec 11.pdfgghjuuyffhkiiiiuuiiiiiiuhffghjiu
saifalroby72
 
PM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptx
PM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptxPM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptx
PM003_SERENE-CM-PM-Training Material-EAM Maintenance Notification.pptx
afriyanrtanjung007
 
Role_Based_Permissions_Kick-off_Deck_202203.pptx
Role_Based_Permissions_Kick-off_Deck_202203.pptxRole_Based_Permissions_Kick-off_Deck_202203.pptx
Role_Based_Permissions_Kick-off_Deck_202203.pptx
SystemsBenya
 
Magician PeterMagician PeterMagician PeterMagician Peter
Magician PeterMagician PeterMagician PeterMagician PeterMagician PeterMagician PeterMagician PeterMagician Peter
Magician PeterMagician PeterMagician PeterMagician Peter
seomarket363
 
An Algorithmic Test Using The Game of Poker
An Algorithmic Test Using The Game of PokerAn Algorithmic Test Using The Game of Poker
An Algorithmic Test Using The Game of Poker
Graham Ware
 
STRABAG SE - Investor Presentation - February 2024.pdf
STRABAG SE - Investor Presentation - February 2024.pdfSTRABAG SE - Investor Presentation - February 2024.pdf
STRABAG SE - Investor Presentation - February 2024.pdf
andrianalampka
 
time_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptxtime_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptx
stefanopinto1113
 
14th International Conference on Advanced Computer Science and Information Te...
14th International Conference on Advanced Computer Science and Information Te...14th International Conference on Advanced Computer Science and Information Te...
14th International Conference on Advanced Computer Science and Information Te...
ijitcs
 
Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvh
Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvhLec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvh
Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvh
saifalroby72
 
this is the Dr. ibrahim presentations ppt.pptx
this is the Dr. ibrahim presentations ppt.pptxthis is the Dr. ibrahim presentations ppt.pptx
this is the Dr. ibrahim presentations ppt.pptx
ibrahimabdi22
 
apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)
apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)
apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)
apidays
 
Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...
Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...
Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...
Karim Baïna
 
apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)
apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)
apidays New York 2025 - From UX to AX by Karin Hendrikse (Netlify)
apidays
 
Block chauin techncology by engineer saniya samreen
Block chauin techncology by engineer saniya samreenBlock chauin techncology by engineer saniya samreen
Block chauin techncology by engineer saniya samreen
Shoyeb16
 
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays
 
How Data Annotation Services Drive Innovation in Autonomous Vehicles.docx
How Data Annotation Services Drive Innovation in Autonomous Vehicles.docxHow Data Annotation Services Drive Innovation in Autonomous Vehicles.docx
How Data Annotation Services Drive Innovation in Autonomous Vehicles.docx
sofiawilliams5966
 
Cyber Security Presentation(Neon)xu.pptx
Cyber Security Presentation(Neon)xu.pptxCyber Security Presentation(Neon)xu.pptx
Cyber Security Presentation(Neon)xu.pptx
vilakshbhargava
 
Computer Applications: An International Journal (CAIJ)
Computer Applications: An International Journal (CAIJ)Computer Applications: An International Journal (CAIJ)
Computer Applications: An International Journal (CAIJ)
ijitcs
 
time_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptxtime_series_forecasting_constructor_uni.pptx
time_series_forecasting_constructor_uni.pptx
stefanopinto1113
 
IoT, Data Analytics and Big Data Security.pptx
IoT, Data Analytics and Big Data Security.pptxIoT, Data Analytics and Big Data Security.pptx
IoT, Data Analytics and Big Data Security.pptx
fizarcse
 

The Factorization Machines algorithm for building recommendation system - Paweł Łagodziński, SAS Institute

  • 1. FEBRUARY 22, 2018, WARSAW Factorization Machines for building Recommender System Paweł Łagodziński, SAS Institute Poland
  • 2. FEBRUARY 22, 2018, WARSAW Recommender Systems • Primary goal – recommend most relevant items to users Plenty of success stories: • music, movies • retail products • ad verbs • news • query results • social links Users Items Context Recommender System
  • 3. FEBRUARY 22, 2018, WARSAW Recommender Systems • Key challenges • sparse data • extremely large data • cold start • Other issues • different ratings types • difficult performance evolution
  • 4. FEBRUARY 22, 2018, WARSAW Regression models • Linear regression • Polynomial regression (2-d) – input vector with p predictors y – predicted rating – number of parameters O( ҧ𝑝) – learning complexity 1+p – observed rating 1+p+p2 – number of parameters O( ҧ𝑝2 ) – learning complexity
  • 5. FEBRUARY 22, 2018, WARSAW Regression models • Pros • Combines real-value and categorical features • Useful for regression and classification problems • Fast learning for linear model • Cons • Linear regression does not include user-item interactions • Polynomial regression model includes, but only in theory for sparse data • For sparse data pairwise interactions are very rarely observed, in consequences parameters cannot be estimated directly from the data
  • 6. FEBRUARY 22, 2018, WARSAW Laten factor models - matrix factorization 𝑌 ∈ ℝ|𝑈|×|𝐼| matrix of observed ratings i1 i2 i3 i4 u1 2 3 1 ? u2 ? 5 ? 1 u3 ? ? 4 1 u4 2 3 ? 2 I - items U-users • If matrix Y is low-rank it can be approximated with ෠𝑌 𝑌 ≈ ෠𝑌 where k is the rank of matrix ෡𝒀 (the size of latent factors vector)෠𝑌 = 𝑊𝐻 𝑇 , W ∈ ℝ 𝑈 ×𝑘, H ∈ ℝ|𝐼|×𝑘 Rating predictions ො𝑦 𝑢𝑖 = ෍ 𝑗=1 𝑘 𝑤 𝑢𝑗 ∙ ℎ𝑗𝑖 =< 𝒘 𝑢 𝒉𝑖 >
  • 7. FEBRUARY 22, 2018, WARSAW Nero JuliusCaesar Cleopatra SleeplessinSeatle PrettyWomen Casablanca 1 0 0 0 0 0 0 2 0 0 0 0 0 0 3 0 0 0 0 0 0 4 0 0 -1 0 0 0 5 0 0 -1 0 0 0 6 0 0 1 0 0 0 7 0 0 -1 0 0 0 ≈ History Romance 1 1 0 2 1 0 3 1 0 4 1 1 5 -1 1 6 -1 1 7 -1 1 Nero JuliusCaesar Cleopatra SleeplessinSeatle PrettyWomen Casablanca History 1 1 1 0 0 0 Romance 0 0 1 1 1 1 Rating: -1- dislike, 0 – neutral, 1 - like History Romance Both x Laten factor models - matrix factorization • correlations -> lower rank matrix • strong correlations -> works for sparse data after C.C. Aggarwal [1] Nero JuliusCaesar Cleopatra SleeplessinSeatle PrettyWomen Casablanca 1 1 1 1 0 0 0 2 1 1 1 0 0 0 3 1 1 1 0 0 0 4 1 1 1 1 1 1 5 -1 -1 -1 1 1 1 6 -1 -1 1 1 1 1 7 -1 -1 -1 1 1 1 • Intuition behind matrix factorization with low (2) rank matrix 𝑊 𝐻 𝑇𝑌 𝑌 − ෠𝑌
  • 8. FEBRUARY 22, 2018, WARSAW Matrix factorization based models • Many specific variants of factorization models were proposed • Examples: • MF • Time Senstive Factorization Models • User-item matrix factorization • SVD++ • Factorized kNN • timeSVD • timeTF • Tensore Factorization Models • Sequential Models • PARAFAC • PITF • FMC • FPMC (after S. Randal, MLConf, 2014)
  • 9. FEBRUARY 22, 2018, WARSAW Matrix factorization based models • Pros • Enable modelling interactions (user-item-…) even if such events are not observed in data • Effective on sparse data • Cons • Learning algorithms are tailored to each specific model • In result, it is hard to adjust model to the problem under consideration
  • 10. FEBRUARY 22, 2018, WARSAW Factorization Machines • Concept generalizing regression and factorization models • Introduced in 2010 (S. Randel [2]) • Very good performance in data science challenges (after S. Randal, MLConf, 2014): • Click prediction - KDDCup 2012: Track 2 (3rd place), Criteon Display Advertising Challenge (1st place) • Rating prediction – Netlix prize, Movielens, KDDCup 2011, EMI Music Data Science Hackathon (2nd place) • Social link prediction (social media) - KDDCup 2012: Track 1 (2nd place) • Recommend given name - ECML/PKDD Discovery Challenge 2013 (1st place on-line track, 2nd place off-line track) • Predicting result of next exam question - KDDCup 2010, Grockit Challenge (1st place) • Prediction of auction sale price –Kaggle, Blue Book for Bulldozers (1st place)
  • 11. FEBRUARY 22, 2018, WARSAW Factorization Machines - model • Factorization Machines (2nd degree) – input vector with p predictors y – observed rating – predicted rating 1+p+kp – number of parameters O( ҧ𝑝𝑘) – learning complexity (lower than ҧ𝑝2 for 2-d polynomial regression) • FM of higher degrees possible but not used in practice • Parameters can be estimated with use of universal optimization methods, Stochastic Gradient Decent for instance
  • 12. FEBRUARY 22, 2018, WARSAW Factorization Machines • Features engineering enables great expressiveness of FM model • Matrix / tensor data can be easily transformed into features vector Input Y i1 i2 i3 i4 u1 2 3 1 ? u2 ? 5 ? 1 u3 ? ? 4 1 User(U) Item (I) U I y 1 u1 i1 2 2 u1 i2 3 3 u1 i3 1 4 u2 i2 5 5 u2 i4 1 6 u3 i3 4 7 u3 i4 1 target y x1 1 0 0 1 0 0 0 2 y1 x2 1 0 0 0 1 0 0 3 y2 x3 1 0 0 0 0 1 0 1 y3 x4 0 1 0 0 1 0 0 5 y4 x5 0 1 0 0 0 0 1 1 y5 x6 0 0 1 0 0 1 0 4 y6 x7 0 0 1 0 0 0 1 1 y7 u1 u2 u3 i1 i2 i3 i4 Users Items x Features vector (one-hot encoding)
  • 13. FEBRUARY 22, 2018, WARSAW Factorization Machines • Applying FM model target y x1 1 0 0 1 0 0 0 2 y1 x2 1 0 0 0 1 0 0 3 y2 x3 1 0 0 0 0 1 0 1 y3 x4 0 1 0 0 1 0 0 5 y4 x5 0 1 0 0 0 0 1 1 y5 x6 0 0 1 0 0 1 0 4 y6 x7 0 0 1 0 0 0 1 1 y7 u1 u2 u3 i1 i2 i3 i4 Users Items x Features vector FM is equal to combination of MF (interaction user-item) and biases for user and item
  • 14. FEBRUARY 22, 2018, WARSAW Factorization Machines • Adding context categorical information gives other MF model FM is equivalent to PITF model, one of extension of MF model ො𝑦 𝑢𝑖𝑐 = 𝑤0 + 𝑤 𝑢 + 𝑤𝑖 + 𝑤𝑐 +< 𝒗 𝑢 𝒗𝑖 >+< 𝒗 𝑢 𝒗 𝑐 >+< 𝒗𝑖 𝒗 𝑐 > target y x1 1 0 0 1 0 0 0 0 0 1 2 y1 x2 1 0 0 0 1 0 0 1 0 0 3 y2 x3 1 0 0 0 0 1 0 1 0 0 1 y3 x4 0 1 0 0 1 0 0 0 1 0 5 y4 x5 0 1 0 0 0 0 1 1 0 0 1 y5 x6 0 0 1 0 0 1 0 0 1 0 4 y6 x7 0 0 1 0 0 0 1 0 0 1 1 y7 u1 u2 u3 i1 i2 i3 i4 c1 c2 c3 Context Features vector x Users Items
  • 15. FEBRUARY 22, 2018, WARSAW Factorization Machines • Factorization Machines offers combination of regression and factorization models • Low rank matrix approximation enables estimation of unobserved interactions • Effective for sparse and extremely sparse data • Flexible through features engineering • One of the most disruptive algorithms of recent years
  • 16. FEBRUARY 22, 2018, WARSAW FactMac – Factorization Machines in SAS Viya
  • 17. FEBRUARY 22, 2018, WARSAW FactMac – Factorization Machines in SAS Viya • 2-nd degree Factorization Machines • Learning with „HOGWILD!” • Hyper-parameters optimization with Grid Search, Genetic Algorithm • Different deployment options • SAS Viya • In-database • In-stream (SAS ESP) • Considered for the next releases • Field-Aware Factorization Machines, FFM [3] – better predictions than FM • Warm-up restart – faster retraining • Other loss functions 2 3 1 1 2 3
  • 18. FEBRUARY 22, 2018, WARSAW Shot Recommender System for NBA Coaches • Application of Factorization Machines in SAS Viya to sport data • Presented during 2016 KDD Large-Scale Sports Analytics [5] • Goal – recommend shot types for players • Data - recorded shots taken during the 2015-2016 NBA games • shots for 420 players • for shot: player, action type, shot zone range, shot zone area, result, and other characteristics. • Model for shot prediction (FM 2-d, k=25, p = 404), log-odds as a target ? Image source: Wikipedia
  • 19. FEBRUARY 22, 2018, WARSAW Shot Recommender System for NBA Coaches • Biases can be very informative
  • 20. FEBRUARY 22, 2018, WARSAW Shot Recommender System for NBA Coaches • Latent factors analysis
  • 21. FEBRUARY 22, 2018, WARSAW Shot Recommender System for NBA Coaches • Shot recommendations (plogit):
  • 22. FEBRUARY 22, 2018, WARSAW Shot Recommender System for NBA Coaches • Demonstration of the concept on NCAA* basketball data • The National Collegiate Athletic Association (NCAA) is a non-profit organization which regulates athletes of 1,281 institutions, conferences, and individuals (source: wikipedia). • Recent news for Kaggle’rs, new competation launched 2 days ago
  • 23. FEBRUARY 22, 2018, WARSAW Where to use Factorization Machines? • Rule of Thumb • design matrix is very sparse (mostly missing data, 90% missing or more) • the cardinality of the nominal variables is very high and each level of the nominal variables has only a few representatives in the data • you have good reason to believe the design matrix has low rank • Findings in [3], seems to confirm above. FFM was working better for CTR prediction but no so good for more dense data (web phishing, census income) • Our (local SAS office) experiments, still in progress, showed that FM even for not very sparse data can produce robust set of baseline predictions. More specialized and tailored models can outperform FM but requires more effort in development • FactMac was tested for SAS Customer Intelligence 360 solution with good results.
  • 24. FEBRUARY 22, 2018, WARSAW Thank You !
  • 25. FEBRUARY 22, 2018, WARSAW References • [1] Charu C. Aggrwal. „Recommender Systems, The Textbook”. Springer. 2016. • [2] Steffen Rendle. “Factorization Machines.” In Proceedings of the 10th IEEE International Conference on Data Mining (ICDM). Piscataway, NJ: Institute of Electrical And Electronics Engineers. 2010. • [3] Yuchin Juan et al., „Field-aware Factorization Machines for CTR Prediction”. RecSys '16 Proceedings of the 10th ACM Conference on Recommender Systems. 2016. • [4] Jason Lee et al., „Practical Large-Scale Optimization for Max-Norm Regularization”, Advances in Neural Information Processing Systems 23 (NIPS). 2010. • [5] Reymond E. Wrigt et al. „Shot Recommender System for NBA Coaches”. KDD. 2016