Recommendation System
Recommendation System
ISSN: 2278-0181
Vol. 3 Issue 4, April - 2014
B. Govindarajalu²
²Professor and Head, Department of Computer Science and
Engineering, Sri Venkateswara College of Engineering,
Chennai, India
Abstract - Recommender systems are found in many e- data can be processed with minimal error rate. Variety
commerce applications today. Recommender systems usually refers to all types of data starting from unstructured raw
provide the user with a list of recommendations that they data to semi-structured and structured data which can be
might prefer, or supply predictions on how much the user easily analyzed and used for the process of decision
might prefer each item. Two common approaches for
making and predictive analysis.
providing recommendations are collaborative filtering and
content based filtering. By combining these two approaches,
hybrid recommendation systems can be developed that
considers both the ratings of the user and the item’s feature to
RT
recommend the items to the user. The features of limited
amount of data can be analyzed with the existing data analysis
tools but when considering an e-book dataset of size in
Terabytes, a big data analysis tool such as Hadoop is used.
IJE
down or data loss. It can gracefully handle most such performance evaluation. Chapter 5 gives a brief description
failures with the help of secondary Namenode. about the proposed system and future extension that can be
done.
Scalable-Hadoop can be scaled to add more nodes once the
multi node cluster has been set up. II. LITERATURE SURVEY
Simple- users can easily write parallel code with the help Existing recommendation system recommends
of Hadoop. books to the user based on the book name and the ratings
given by that user to the book or based on the number of
views for that book. Fuzhi Zhang et al (2010), proposed a
two-stage algorithm that uses location of the users to
predict the interest. K-means algorithm is used to cluster
the users based on the profile which is collected during the
user sign up. But predicting the concept of a book only
with the book name reduces the accuracy of the system. V.
Mohanraj et al (2012) uses the concept of ontology to
predict the interest of the user. The system was self
adaptive and predicted the future browsing pattern of the
user. Ozgur Cakir et al (2012) developed a
recommendation system using association rules. Apriori
algorithm is used to generate the rules for recommendation.
The basket ratio which is the ratio between the number of
items viewed to the number of items added to the shopping
cart is increased in this method.
tasks to the slave and slave nodes carries out the job technology concepts are used. Navigation patterns are
assigned to it. Master node then collects the results. This obtained from the past history of the student and from that
model has two main steps which are 1) Map - Distribute pattern, future recommendations are made. Konstantin
the job among the slaves and 2) Reduce – Collect the Shvachko et al (2010) made a study on the Hadoop
results. distributed File System. The study stated that by
distributing the storage and computation across the
Recommender systems have become popular from machines of a cluster, the computational time can be
the last decade. Since the number of products has grown in reduced for analyzing big data when compared to single
number, the need for recommender systems has also node processing.
increased. Recommender system tries to predict the interest
of a user and recommend products that match their interest Emmanouil Vozalis et al made an analysis on the
as accurately as possible. Also, e-commerce business will types of recommendation algorithms that are in existence.
be profited by the increase of sales which will obviously Item-based recommendation is a method in which two
occur when the user is presented with more items that users who have rated a item are separated and the similarity
he/she would likely found to match the interest. There are index is computed among them. When the similarity index
two common approaches in building a recommendation is greater than the threshold, then similar items are
system. One is Collaborative filtering that builds a model recommended to them. A model which uses Collaborative
from a user's past behavior as well as similar decisions filtering algorithm for supervised learning was developed.
made by other users to predict items that the user may have This model classifies even the new unseen item. According
an interest in. The other is Content-based filtering where to this model, there are only two classes C1:like C2:
the characteristics of an item are analyzed to recommend dislike. Content-Boosted Collaborative Filtering utilizes
additional items to the user. Contentbased Filtering to fill in the missing ratings from
the initial user-item matrix. It then employs classic
The following sections are arranged as such Collaborative Filtering techniques to reach a final
chapter 2 includes the works related to the proposed prediction.
system; chapter 3 includes the design of the system along
with the modular description of the proposed system. CaiNicolas Ziegler et al (2005) proposed a
Chapter 4 depicts the implementation setup and results recommendation system that considers a concept called
obtained for the proposed system along with the topic diversification. According to this concept, the list of
D) Keywords Generation
A recommendation engine is created as GUI to This section explains the implementation that is
make the user interact with the system in an easy way. The done in the system. The implementation is done with tools
user can login and logout of the system, can rate books, can such as Hadoop, HortonWorks Sandbox, Putty, WinSCP,
view and download the books from the system. This VirtualBox and programming is done in java and
recommendation system is created with two types of MapReduce. Here a single book is taken as an input and
privileges 1) admin 2) user the respective results for each module are shown. Initially a
book in pdf format is taken as an input. This input file is
converted into text with the help of the program for which
the pseudocode is given above. Fig 4 describes the java
application that was developed to convert a pdf file to text
file, to remove stop words and to extract keywords from
the book with the help of Hadoop MapReduce program.
The path is specified and linked in the program between
the various tasks.
RT
Keywords: Community-citizens-freedom-government
The recommendations to the user will be made in Basically performance of a recommender system
the following format can be measured using accuracy. In this work, performance
of proposed system is evaluated in terms of calculating
accuracy and precision These values can be calculated
easily by forming a confusion matrix which is also known
as contingency table. This confusion matrix contains True
RT
Positive (TP), True Negative (TN), False Positive (FP) and
False Negative (FN). Precision refers positive prediction
value and accuracy can be calculated with the following
IJE
formula.
(TP + TN)
Accuracy =
(TP + TN + FP + FN)
TP
Precision =
(TP + FP)
Not recommended 5 80
REFERENCES