A Comparing Collaborative Filtering and Hybrid Recommender System For E-Commerce
A Comparing Collaborative Filtering and Hybrid Recommender System For E-Commerce
https://doi.org/10.22214/ijraset.2021.38844
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.429
Volume 9 Issue XI Nov 2021- Available at www.ijraset.com
Abstract: Here we are building an collaborative filtering matrix factorization based hybrid recommender system to recommend
movies to users based on the sentiment generated from twitter tweets and other vectors generated by the user in their previous
activities. To calculate sentiment data has been collected from twitter using developer APIs and scrapping techniques later these
are cleaned, stemming, lemetized and generated sentiment values. These values are merged with the movie data taken and create
the main data frame.The traditional approaches like collaborative filtering and content-based filtering have limitations like it
requires previous user activities for performing recommendations. To reduce this dependency hybrid is used which combines
both collaborative and content based filtering techniques with the sentiment generated above.
Keywords: machine learning, natural language processing, movie lens data, root mean square equation, matrix factorization,
recommenders system, sentiment analysis.
I. INTRODUCTION
In the digital era understanding and creating a digital profile of a customer plays a vital role in increase of sales and attraction of
more clients. But, doing it manually for millions of customers is a tedious and error prone task. Here comes machine learning and
data science which ease this process and give better results. Recommendation of the best product to customers at amazon or given
best movie suggestion to the customer is one of the key successes to their domain. To do this machine learning has already curated
models. To do this customer past activities plays an vital role like purchases happened in the past and movies watched are the key
data inputs for the model. A vector of the latest events is created for an user to represent his basic profile. Matrix factorization is one
the popular techniques used to project the users and items into the shared latent space, using a vector of latent features to represent a
user or an item. So the interaction of a product and user is calculated as the inner product of their latent vectors.
A competition led by Netflix has uplifted the matrix factorization and most of the research is happening across the same technique.
Despite the effectiveness of matrix factorisation for collaborative filtering it is found that its performance can be enhanced using the
choice of interaction function and inner product. While it seems to be a relatively small change but the positive effect it provides of
designing a better, dedicated interaction function for modelling the latent feature interactions between users and items
Twitter API and credentials from developer console were used to extract the tweets from twitter using the movie names taken from
above datafile as hashtags. Later these tweets are loaded into csv from there to the data frame.
Using Nltk and vader sentiment analysis twitter data has been cleaned by deleting words less than 3 characters, extra spaces,
duplicates, punctuation, stemming, lemmatization and calculation of positive, negative and neutral score. In the next step these
tweets or comments have been mapped to the movie names retrieved from the above datafile.
The below curve informs that the Collaborative filtering is indeed working and with each iteration of Gradient descent the cost
function is going down and plateaued out around 200 and from there onwards the change is constant.
V. LIST OF ABBREVIATIONS
Machine learning
ML
NLP Natural language
processing
RMSE Root mean square
equation
PYTHON python
SA Sentiment analysis
VI. CONCLUSION
As we know recommender systems are playing an vital role in many industries like ecommerce, movie publisher sites etc and used
to filter and categories’ information and customers to produce higher profit and returns. In this project we used the sentiment data
from twitter alongside the user's past activities to recommend movies. Sentiment analysis is to provide data about the user in
reacting to a particular movie globally. So this project was built using hybrid methods to provide the accurate result possible even
for the user not having past activities. As per our understanding hybrid models work precisely and provide accurate results and this
helps increase the domain not limited to movies etc.
REFERENCES
[1] Thomas Tran and Robin Cohen . Hybrid Recommender Systems for Electronic Commerce. AAAI Technical Report WS-00-04. Compilation copyright © 2000,
AAAI (www.aaai.org).
[2] 8.Edjalma Queirozda SilvaCelso G.Camilo-JuniorLuiz Mario L.PascoalThierson C.Rosa, An evolutionary approach for combining results of recommender
systems techniques based on collaborative filterin
[3] Dr.C K Gomathy, Article: An Effective Innovation Technology In Enhancing Teaching And Learning Of Knowledge Using Ict Methods, International Journal
Of Contemporary Research In Computer Science And Technology (Ijcrcst) E-Issn: 2395-5325 Volume3, Issue 4,P.No-10-13, April ’2017
[4] Dr.C K Gomathy, Article: A Semantic Quality of Web Service Information Retrieval Techniques Using Bin Rank, International Journal of Scientific Research
in Computer Science Engineering and Information Technology ( IJSRCSEIT ) Volume 3 | Issue 1 | ISSN : 2456-3307, P.No:1563-1578, February-201
[5] Dr.C K Gomathy, Article: A Web Based Platform Comparison by an Exploratory Experiment Searching For Emergent Platform Properties, IAETSD Journal
For Advanced Research In Applied Sciences, Volume 5, Issue 3, P.No-213-220, ISSN NO: 2394-8442,Mar/2018
[6] Dr.C K Gomathy, Article: Supply chain-Impact of importance and Technology in Software Release Management, International Journal of Scientific Research
in Computer Science Engineering and Information Technology ( IJSRCSEIT ) Volume 3 | Issue 6 | ISSN : 2456-3307, P.No:1-4, July-2018
[7] Dr.C K Gomathy, Article: A Scheme of ADHOC Communication using Mobile Device Networks, International Journal of Emerging technologies and
Innovative Research ( JETIR ) Volume 5 | Issue 11 | ISSN : 2349-5162, P.No:320-326, Nov-2018
[8] Dr.C K Gomathy, Article: A Study on the recent Advancements in Online Surveying , International Journal of Emerging technologies and Innovative Research
( JETIR ) Volume 5 | Issue 11 | ISSN : 2349-5162, P.No:327-331, Nov-201.