Predicting Fake Online Reviews Using Machine Learning
Predicting Fake Online Reviews Using Machine Learning
Apr 2020
Available at www.ijsred.com
RESEARCH ARTICLE OPEN ACCESS
Abstract:Online reviews are very important in decision making of customer whether to purchase a
product or service. These are main source of information getting from the past customer experience about
the features of that service which we are going to purchase. This paper introduces some machine learning
techniques like Naïve-Bayes, Support Vector Machine and Decision Tree for sentiment classification of
reviews and to detect fake online reviews using the data set of a Hotel reviews. Sentiment Analysis has
become most interesting in analysis of text. Using sentiment analysis we can separate negative and positive
reviews as well.
Index Terms – Spam reviews, machine learning, Naïve-Bayes, Support Vector Machine, Decision
Tree algorithm.
I. INTRODUCTION
Recent developments in fields like Natural
Language Processing (NLP) has paved the way
A fake review is a misuse of the user review for accurately understanding people’s sentiments,
system by fake personalities. Fake reviews are emotions, and behavioral patterns. Emotions such
also generated by bots. Fake reviews mislead as joy, anger, surprise, disgust can be extracted
customers to take decision on wrong product and from the reviews. For example If we want to
the customer spends money on the product. The book a hotel then we checks the reviews of that
reviews can be either positive or Negative, to hotel website and gets the past customer
increase the promotion and sale or to bring down experiences. Online reviews have great impact on
the competitive company products. Many people customers. This application can detect potential
look at online reviews before making a decision fake reviews in order to reduce the misguidance
whether it should be purchase or not. Many that follows it.
companies depend on several applications to
detect Fake reviews using machine learning. In machine learning based techniques, there
are many algorithms can be applied for the
In this paper we use Sentiment Analysis to classification and prediction. Here we used
formulate the data.The sentiment is usually Naive-Bayes classifier, Support Vector Machine
formulated as a two-class classification problem, (SVM), Random Forest Classifier and Decision
positive and negative. The basis of Sentiment Tree for predicting the reviews. We detect fake
Analysis is detecting the polarity of a give text or positive, fake negative, True positive and True
document. In this project we are using a set negative reviews. And finally we compare the
Polarity as negative or positive.
accuracy of each algorithm. Main objective of
this paper is to classify the dataset or reviews
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 269
International Journal of Scientific Research and Engineering Development-–
Deve Volume 3 Issues 2 Mar
Mar- Apr 2020
Available at www.ijsred.com
into true and fake reviews using machine and deceptive hotel reviews using a machine
learning techniques. learning algorithm.
We have extracted the data from the dataset We have created the data frame with three
and stored in a list. Then we have createdthe data columns named reviews, polarity class and
Frame with corresponding labels.Using
labels spamity class. The columnn named reviews
sentiment analysis all the reviews is gives the text or reviews posted by the
analyzed.The polarity is determined as Positive
Posit customerwhereas spamity class shows
or Negative.Also we have classified the Spamity whether it is deceptive or True. And the
as True or Deceptive.Later thehe polarity class and polarity class shows that whether the polarity
Spamity class converter into 0and 1s. Then we is positive or negative.
can apply the algorithm. Mainly we used two
method Naïve Bayes classification, Support The extracted review look
ook like the below
Vector Machine and Decision Tree. table:
REFERENCES