Recommendation System

This document discusses building a personalized recommendation system using big data and Hadoop MapReduce. It introduces key concepts of big data including volume, velocity, and variety. Hadoop is presented as a framework for distributed processing of large datasets using MapReduce. The proposed system collects user ratings and analyzes item features like book keywords to provide recommendations, making it more accurate than existing systems. It will be reliable, fault tolerant, and adaptive by frequently updating user interests.

International Journal of Engineering Research & Technology (IJERT)

ISSN: 2278-0181
Vol. 3 Issue 4, April - 2014

Building Personalised Recommendation System With Big Data and Hadoop MapReduce

S. Vinodhini¹, B. Govindarajalu², V. Rajalakshmi³

¹Post Graduate Student, Department of Computer Science and Engineering, Sri Venkateswara College of Engineering, Chennai, India
²Professor and Head, Department of Computer Science and Engineering, Sri Venkateswara College of Engineering, Chennai, India
³Assistant Professor, Department of Computer Science and Engineering, Sri Venkateswara College of Engineering, Chennai, India

Abstract - Recommender systems are found in many e-commerce applications today. They usually provide the user with a list of recommendations that the user might prefer, or supply predictions of how much the user might prefer each item. Two common approaches for providing recommendations are collaborative filtering and content-based filtering. By combining these two approaches, hybrid recommendation systems can be developed that consider both the ratings of the user and the item's features when recommending items to the user. The features of a limited amount of data can be analyzed with existing data analysis tools, but for an e-book dataset whose size runs to terabytes, a big data analysis tool such as Hadoop is used. Hadoop is a software framework for distributed processing of large data sets. Hadoop uses the MapReduce paradigm to perform distributed processing over clusters of computers, reducing the time involved in analyzing the item's features (the keywords of a book). The proposed system is reliable and fault tolerant when compared to existing recommendation systems, as it collects ratings from the user to predict the user's interest and analyses the item to find its features. The system is also adaptive, as it updates the rating list frequently and finds the updated interest of the user. Experimental results show that the proposed system is more accurate than existing recommender systems.

Keywords: Recommendation System, Hadoop, Big Data, MapReduce, Keywords and stop words.

I. INTRODUCTION

Big data analysis is one of the upcoming disciplines in data mining, dealing with large unstructured data that is very difficult to store and retrieve in an efficient manner. Big data does not refer only to exabytes or petabytes of data. When the amount of data that needs to be processed is greater than the capacity of the system, it is referred to as big data. The three perspectives of big data are volume, velocity and variety [1]. Volume refers to the amount of data that is being processed. It has moved to zettabytes and petabytes as of 2014 and is expected to increase in future. Velocity refers to the speed at which the data can be processed with a minimal error rate. Variety refers to all types of data, from unstructured raw data to semi-structured and structured data, which can be easily analyzed and used for decision making and predictive analysis.

Fig. 1. Three Characteristics of Big Data

This exponential growth in data has led to many vital challenges in business. Existing tools have become inadequate to process such large sets of data. In order to overcome this, Google introduced a programming model called MapReduce [2]. This system was considered a great evolution in the field of data mining. Soon after, a tool called Hadoop was introduced. Hadoop is a tool used for analyzing large sets of data using distributed clusters. This tool can also be used for parallel programming. There are many big data analysis tools, but the key properties that make Hadoop distinct from the others are:

Accessible - Hadoop can run on large, distributed clusters of nodes or on cloud computing services such as Amazon's Elastic Compute Cloud (EC2).

Robust - Hadoop is architected with the capacity to withstand or tolerate hardware malfunctions such as shut

IJERTV3IS042291 www.ijert.org 2310


down or data loss. It can gracefully handle most such failures with the help of the secondary NameNode.

Scalable - Hadoop can be scaled by adding more nodes once the multi-node cluster has been set up.

Simple - Users can easily write parallel code with the help of Hadoop.

Fig. 2. Multinode Cluster

MapReduce is a programming model in which large sets of data are distributed among the nodes of a cluster and processed in parallel. There are two types of node: the master node and the slave nodes. The master node allocates tasks to the slaves, and each slave node carries out the job assigned to it. The master node then collects the results. This model has two main steps: 1) Map - distribute the job among the slaves, and 2) Reduce - collect the results.

Recommender systems have become popular over the last decade. As the number of products has grown, the need for recommender systems has also increased. A recommender system tries to predict the interest of a user and recommend products that match that interest as accurately as possible. E-commerce businesses also profit from the increase in sales that occurs when the user is presented with more items that he/she is likely to find interesting. There are two common approaches to building a recommendation system. One is collaborative filtering, which builds a model from a user's past behavior as well as similar decisions made by other users to predict items that the user may have an interest in. The other is content-based filtering, where the characteristics of an item are analyzed to recommend additional items to the user.

The remaining sections are arranged as follows. Section II covers the work related to the proposed system. Section III covers the design of the system along with a modular description of the proposed system. Section IV describes the implementation setup and the results obtained for the proposed system, along with the performance evaluation. Section V gives a brief description of the proposed system and the future extensions that can be made.

II. LITERATURE SURVEY

Existing recommendation systems recommend books to the user based on the book name and the ratings given by that user to the book, or based on the number of views for that book. Fuzhi Zhang et al (2010) proposed a two-stage algorithm that uses the location of the users to predict their interest. The K-means algorithm is used to cluster the users based on the profile collected during user sign-up. But predicting the concept of a book only from the book name reduces the accuracy of the system. V. Mohanraj et al (2012) use the concept of ontology to predict the interest of the user. The system was self-adaptive and predicted the future browsing pattern of the user. Ozgur Cakir et al (2012) developed a recommendation system using association rules. The Apriori algorithm is used to generate the rules for recommendation. The basket ratio, which is the ratio of the number of items viewed to the number of items added to the shopping cart, is increased in this method.

Boban Vesin et al (2012) developed a recommendation system termed PROTUS (PRogramming TUtoring System) that recommended courses to students. Courses are usually recommended to students based on their age and domain of study, but in this system semantic web technology concepts are used. Navigation patterns are obtained from the past history of the student, and from those patterns future recommendations are made. Konstantin Shvachko et al (2010) made a study of the Hadoop Distributed File System. The study stated that by distributing storage and computation across the machines of a cluster, the computational time for analyzing big data can be reduced when compared to single-node processing.

Emmanouil Vozalis et al analysed the types of recommendation algorithms in existence. Item-based recommendation is a method in which two users who have rated an item are separated and the similarity index is computed between them. When the similarity index is greater than the threshold, similar items are recommended to them. A model which uses a collaborative filtering algorithm for supervised learning was developed. This model classifies even new, unseen items. According to this model, there are only two classes, C1: like and C2: dislike. Content-Boosted Collaborative Filtering utilizes content-based filtering to fill in the missing ratings in the initial user-item matrix. It then employs classic collaborative filtering techniques to reach a final prediction.

Cai-Nicolas Ziegler et al (2005) proposed a recommendation system that considers a concept called topic diversification. According to this concept, the list of top-n recommendations will be balanced, as the user's extended interest will also be taken into account. Thus the user will not be bored by receiving the same kind of recommendations repeatedly. The concepts of user-based collaborative filtering and item-based collaborative filtering are combined and the recommendations are made.

Brian McFee et al (2012) developed a recommendation system for music by learning content similarity. It used a content-based similarity method initially, and then a collaborative similarity method is imposed on the results. It avoided the cold start problem and the overhead of the query-to-answer technique.
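
The threshold idea from the survey above (compute a similarity index between two rating profiles and recommend only when it exceeds a threshold) can be illustrated with a minimal Python sketch. The cosine measure, the 0.8 threshold, the function names and the sample ratings are all assumptions chosen for illustration; the surveyed work may use a different similarity measure, and the system described in this paper was written in Java.

```python
import math

def cosine_similarity(ratings_a, ratings_b):
    """Cosine similarity computed over the items both users have rated."""
    common = set(ratings_a) & set(ratings_b)
    if not common:
        return 0.0
    dot = sum(ratings_a[i] * ratings_b[i] for i in common)
    norm_a = math.sqrt(sum(ratings_a[i] ** 2 for i in common))
    norm_b = math.sqrt(sum(ratings_b[i] ** 2 for i in common))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def recommend_from_neighbour(target, neighbour, threshold=0.8):
    """Suggest items the neighbour rated that the target has not,
    but only when the two profiles are similar enough."""
    if cosine_similarity(target, neighbour) <= threshold:
        return []
    return sorted(item for item in neighbour if item not in target)

# Hypothetical ratings out of 10 (not taken from the paper's dataset)
alice = {"book_a": 8, "book_b": 6}
bob = {"book_a": 8, "book_b": 6, "book_c": 9}
carol = {"book_a": 1, "book_b": 9, "book_d": 7}

print(recommend_from_neighbour(alice, bob))
print(recommend_from_neighbour(alice, carol))
```

With these sample ratings, bob's profile matches alice's on the shared books, so book_c is suggested; carol's similarity to alice (about 0.68) falls below the assumed threshold and yields no suggestion.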

III. SYSTEM DESIGN

The idea of this system is to develop a recommendation engine that can recommend books to users with increased accuracy by analyzing the interest of the user and the features of the books. A hybrid recommender system is developed that gets its input from the user in the form of ratings. This ratings list and the profile of the user are the key inputs used to predict the interest of the user. The data set considered is a large set of books, which constitutes big data. In order to analyze the features of a book set that large, we use a tool named Hadoop.

MapReduce programs have been written to find the features. Preprocessing tasks are also performed in order to eliminate the stop words and to generate the keywords for each book. The overall architecture of the developed system is given below. It can be divided into 4 modules.

Fig. 3. Architecture diagram for proposed system

Initially the data set, which consists of ebooks, is collected from the website www.bookza.org and then preprocessed. Preprocessing involves steps such as converting the pdf format of the books to word, removing stop words, generating the word count and finally extracting keywords from the word count file. These keywords are collected for each book and used while recommending books to the users.

An application is created to do all this preprocessing work. This application is written in Java and MapReduce. The recommendation system that is developed recommends books to the user. The user must create an account in the system. During the creation of the account, a set of 10 books is presented and the user is asked to rate them. The ratings given initially are analyzed to provide further recommendations to the user. These recommendations are provided the next time the user logs in with the password.

A) Dataset Collection

Big data, i.e. a large set of books distributed among nearly 20 domains, is collected. These books are collected from the website www.bookza.com. The domains with which the website is created are listed in Table 1.

TABLE 1. DOMAINS OF THE DATASET (EBOOKS)

DATABASE MANAGEMENT SYSTEM           COMPETITIVE EXAM
DATA STRUCTURES                      WIRELESS SENSOR NETWORKS
HORROR                               IMAGE PROCESSING
CRYPTOGRAPHY AND NETWORK SECURITY    SOFTWARE ENGINEERING
COMICS                               DATA MINING
CHEMISTRY                            FANTASY
FICTION                              WEB TECHNOLOGY
SYSTEM SOFTWARE                      COOKING
OPERATING SYSTEM                     COMPUTER ARCHITECTURE


B) Preprocessing by Stop words removal

The initial input is a set of books in the form of PDF files. These PDF files must be converted into text files because the MapReduce programs used here read text files. For a single book, any PDF-to-text converter tool can be used. But here the input is a large set of books, so a program that can convert the PDF files to text in a reduced time period was written. The pseudocode of that program is given below.

The text file obtained from the above process is used to remove the stop words present in the file. The final objective is to generate keywords from the book, where the presence of irrelevant words is undesirable. Thus the stop words are removed from the text file. The pseudocode for removing stop words is given below.

C) Multi-node Cluster Setup for Hadoop

In order to run the MapReduce program in parallel on more than 2 machines, we set up a Hadoop cluster with 5 nodes. This can be done by setting up Hadoop on Ubuntu and allocating an Hduser for Hadoop. But the better option was to go with the HortonWorks Sandbox. HortonWorks is considered better because of its easy installation on Windows and because it is a complete package of all the prerequisites that need to be installed before Hadoop. The sandbox includes the core Hadoop components (HDFS and MapReduce), as well as all the tools needed for data ingestion and processing. In order to run Hadoop in the HDP (HortonWorks Data Platform) environment, some supporting tools such as Putty and WinSCP are needed.

1) Putty

Putty is an application used for transferring files between the master and slave. The master node provides the input data and instructs the slave to perform a task.

2) WinSCP

WinSCP is used for secure file transfer between the master and the slaves. In order to authenticate a slave that will connect to the master, the SSH (Secure SHell) protocol is needed. This protocol ensures secure login and logout between the master and the slaves.

The pseudocode that is written to generate the word count is given below.

D) Keywords Generation

The word count of the preprocessed book is stored in a text file. This text file is used to extract the keywords for that book. In order to do this, a threshold on the value in the <key,value> pair is chosen and the keys whose values are greater than that threshold are selected as keywords. The pseudocode is as follows.
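
The stop-word removal and word-count steps described in this section can be sketched in a few lines of Python, written in map/reduce style. This is a single-machine illustration under stated assumptions, not the authors' Java MapReduce program: the stop-word list, the function names and the sample sentences are hypothetical, and on a Hadoop cluster the map and reduce functions would run in parallel across the nodes rather than in one process.

```python
import re
from collections import defaultdict

# Assumed, deliberately tiny stop-word list; a real run would load a fuller one.
STOP_WORDS = {"a", "an", "and", "by", "in", "is", "of", "the", "to"}

def map_words(line):
    """Map step: emit a (word, 1) pair for every non-stop word in a line."""
    for word in re.findall(r"[a-z]+", line.lower()):
        if word not in STOP_WORDS:
            yield (word, 1)

def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each word."""
    totals = defaultdict(int)
    for word, count in pairs:
        totals[word] += count
    return dict(totals)

def word_count(lines):
    """Run the map step over every line, then reduce all emitted pairs."""
    pairs = (pair for line in lines for pair in map_words(line))
    return reduce_counts(pairs)

# Hypothetical input standing in for the text extracted from a book
sample = [
    "The government is elected by the citizens.",
    "Freedom of the citizens is protected by the government.",
]
counts = word_count(sample)
print(counts)
```

Filtering stop words inside the map step means they never reach the reducer, which keeps the intermediate <key,value> data small; doing the removal as a separate pass, as the section describes, gives the same final counts.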


E) Building Recommendation System

A recommendation engine is created as a GUI so that the user can interact with the system in an easy way. The user can log in to and out of the system, rate books, and view and download books from the system. This recommendation system is created with two types of privileges: 1) admin and 2) user.

The recommendation system that was developed has a special feature called Region Aggregation (RA). The user is asked to enter details about the country, state and city.

Users are clustered using the K-means clustering algorithm. The profile of the users is considered to form the clusters. For example:

TABLE 2. TABLE FOR K-MEANS CLUSTERING OF USERS

Admin is responsible for uploading new books or deleting outdated books from the database. The uploading of books can be done via the following tab of the GUI.

IV. IMPLEMENTATION RESULTS

This section explains the implementation of the system. The implementation is done with tools such as Hadoop, HortonWorks Sandbox, Putty, WinSCP and VirtualBox, and the programming is done in Java and MapReduce. Here a single book is taken as input and the respective results for each module are shown. Initially a book in PDF format is taken as input. This input file is converted into text with the help of the program for which the pseudocode is given above. Fig. 4 shows the Java application that was developed to convert a PDF file to a text file, to remove stop words and to extract keywords from the book with the help of the Hadoop MapReduce program. The path is specified and linked in the program between the various tasks.

Fig. 4. The Java application developed to generate keywords for a book

From the text file obtained, the word count is generated using the Hadoop MapReduce program. The output of the program is in the form of <key,value> pairs. A sample of the word count generated from a book on politics is given below.

<community 146>
<citizens 74>
<divided 50>
<freedom 98>
<government 157>

The keywords are extracted from the word count file by setting a threshold and are entered in the keyword field of the recommendation system while uploading a book. Thus the keywords of the book are

Keywords: Community-citizens-freedom-government
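
The keyword step can be checked against the sample word count quoted in this section. The Python sketch below keeps every word whose count is strictly greater than a threshold. The threshold value of 50 is an assumption (the paper does not state the value it used); it is simply one value that reproduces the four keywords listed for the sample book. The function name is hypothetical.

```python
def extract_keywords(word_counts, threshold):
    """Keep the words whose count is strictly greater than the threshold."""
    return [word for word, count in word_counts.items() if count > threshold]

# Sample <key,value> pairs quoted in this section
sample_counts = {
    "community": 146,
    "citizens": 74,
    "divided": 50,
    "freedom": 98,
    "government": 157,
}

# Assumed threshold; any value from 50 up to 73 gives the same result here.
keywords = extract_keywords(sample_counts, threshold=50)
print("-".join(keywords))  # community-citizens-freedom-government
```

Because "divided" has a count of exactly 50, the strict comparison is what excludes it; a lower threshold would let it through.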


Fig. 5. Upload books

The recommendations to the user will be made in the following format.

Fig. 6. View and download the recommended books

Region aggregation is implemented here, where the comic book that has rights to be distributed in India and the book that is most read in Chennai are given as recommendations. Ratings are given out of 10. If the previous rating was 8 and the new rating by a new user was 4, then the rating of the book would change to 6. The average of the previous rating and the new rating is taken.

Fig. 7. Region aggregation and search by keyword

Performance Evaluation

The performance of a recommender system can basically be measured using accuracy. In this work, the performance of the proposed system is evaluated by calculating accuracy and precision. These values can be calculated easily by forming a confusion matrix, which is also known as a contingency table. This confusion matrix contains True Positives (TP), True Negatives (TN), False Positives (FP) and False Negatives (FN). Precision is the positive predictive value. Accuracy and precision can be calculated with the following formulas.

\[ \text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \]

\[ \text{Precision} = \frac{TP}{TP + FP} \]

The following table describes the confusion matrix that is formed when considering a set of 100 books and making offline evaluations.

TABLE 3. CONFUSION MATRIX OF PROPOSED SYSTEM

CONFUSION MATRIX     Preferred    Non Preferred
Recommended          12           3
Not recommended      5            80
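
As a worked check, the two formulas can be applied to the values in Table 3. The short Python sketch below assumes the usual reading of the table: recommended and preferred is TP, recommended and non preferred is FP, not recommended and preferred is FN, and not recommended and non preferred is TN. The function names are illustrative only.

```python
def accuracy(tp, tn, fp, fn):
    """Share of all books that were classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)

def precision(tp, fp):
    """Share of recommended books that the user actually preferred."""
    return tp / (tp + fp)

# Cell values read from Table 3 (100 books); the TP/FP/FN/TN mapping is assumed
tp, fp = 12, 3   # recommended: preferred / non preferred
fn, tn = 5, 80   # not recommended: preferred / non preferred

print(f"accuracy  = {accuracy(tp, tn, fp, fn):.2f}")  # accuracy  = 0.92
print(f"precision = {precision(tp, fp):.2f}")         # precision = 0.80
```

Under that reading, 92 of the 100 books are classified correctly and 12 of the 15 recommended books were preferred.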


Fig. 8. Graph plotted to depict the accuracy variations in percentage

V. CONCLUSION

Over more than two decades of research and commercial development, recommender systems have proved to be a successful technology for overcoming the information overload that burdens users of modern online media. According to a survey, 62% of the customers who notice recommendations purchase the recommended products. The key driver for this success is providing more relevant recommendations by incorporating customer interest. These recommendations can be made more accurate by analyzing the features of the product to be recommended and matching them with the interest of the user. This recommendation system is built for recommending books to users according to their interest. This work can be extended to movie recommendation, music recommendation, website recommendation, etc. But when dealing with website recommendation, the total number of views for the website should also be considered as a metric for providing accurate recommendations.

REFERENCES

[1] Asela Gunawardana and Guy Shani, "A Survey of Accuracy Evaluation Metrics of Recommendation Tasks", Journal of Machine Learning Research, Vol. 10, pp. 2935-2962, 2009.
[2] Boban Vesin, Mirjana Ivanovic, Aleksandra Klasnja-Milic and Zoran Budimac, "Ontology-based semantic recommendation in programming tutoring system", Journal on Expert Systems with Applications, Vol. 39, pp 1229-12246, 2012.
[3] Cai-Nicolas Ziegler, Sean M. McNee, Joseph A. Konstan and Georg Lausen, "Improving Recommendation Lists Through Topic Diversification", International World Wide Web Conference Committee (IW3C2), ACM, pp. 5959-30-469, 2005.
[4] Feng Xie, Zhen Chen, Hongfeng Xu, Xiwei Feng and Qi Hou, "TST: Threshold Based Similarity Transitivity Method in Collaborative Filtering with Cloud Computing", IEEE Transactions on Tsinghua Science and Technology, Vol. 18, No. 3, pp. 318-327, 2013.
[5] V. Mohanraj, M. Chandrasekaran, J. Senthilkumar, S. Arumugam and Y. Suresh, "Ontology driven bee's foraging approach based self adaptive online recommendation system", The Journal of Systems and Software, Vol. 85, pp. 2439-2450, 2012.
[6] Ozgur Cakir and Murat Efe Aras, "Recommendation engine by using association rules", Journal of Social and Behavioral Sciences, Vol. 62, pp. 452-456, 2013.
[7] "Hadoop", http://hadoop.apache.org/core/docs/current/mapred_tutorial.html.
[8] "Google dataset for book", http://books.google.com/ngrams/graph?content=Albert+Einstein%2CSherlock+Holmes%2CFrankenstein&year_start=1800&year_end=2000&corpus=15&smoothing.
[9] Fuzhi Zhang, Huilin Liu and Jinbo Chao, "A Two-stage Recommendation Algorithm Based on K-means Clustering In Mobile E-commerce", Journal of Computational Information Systems, Vol. 6, Issue 10, pp. 3327-3334, 2010.
[10] Taek-Hun Kim, Young-Suk Ryu, Seok-In Park and Sung-Bong Yang, "An Improved Recommendation Algorithm in Collaborative Filtering", Department of Computer Science, Yonsei University.
[11] Konstantin Shvachko, Hairong Kuang, Sanjay Radia and Robert Chansler, "The Hadoop Distributed File System", IEEE, pp. 978-1-4244-7153-9/10, 2010.
[12] Emmanouil Vozalis and Konstantinos G. Margaritis, "Analysis of Recommender Systems' Algorithms", conference proceeding of IEEE.
[13] Brian McFee, Luke Barrington and Gert Lanckriet, "Learning Content Similarity for Music Recommendation", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, No. 8, 2012.
[14] Paul C. Zikopolus and Chris Eaton, "Understanding Big Data Analytics for Enterprise Class Hadoop and Streaming Data", thesis, 2013.
[15] Chuck Lam, "Hadoop in Action", thesis, 2013.
