
© 2021 JETIR April 2021, Volume 8, Issue 4, www.jetir.org (ISSN-2349-5162)

Fake Review Detection Using Machine Learning Techniques

Shilpa Yadav¹, Dr. Gulbakshee Dharmela², Khushali Mistry³
¹Post Graduate Student, Parul University, Vadodara, Gujarat
²Professor, Parul University, Vadodara, Gujarat
³Professor, Parul University, Vadodara, Gujarat

Abstract: Online reviews play a very important role in decision-making in today's e-commerce. A large part of the population, i.e. customers, reads reviews of products or stores before deciding what to buy, where to buy it from, and whether to buy at all. Since writing fake/fraudulent reviews brings monetary gain, there has been a huge increase in deceptive opinion spam on online review websites. A fake review, fraudulent review, or opinion spam is essentially an untruthful review. Positive reviews of a target object may attract more customers and increase sales; negative reviews of a target object may lower demand and decrease sales. These fake/fraudulent reviews are deliberately written to trick potential customers, either to promote/hype products or to defame their reputations. Our work aims at identifying whether a review is a fake or a truthful one.

Index Terms – Naïve Bayes (NB), Support Vector Machine (SVM), K-Nearest Neighbors (KNN-IBK), KStar (K*) and Decision Tree (DT)

I. INTRODUCTION

Reviews are statements that express someone's suggestion, opinion, or experience of a market product. On online e-commerce websites, users post reviews on a product's page to give suggestions to, or share their experience with, product providers/sellers/producers and new purchasers. This user experience can help a business improve, if it analyzes the suggestions. The polarity of reviews brings a certain financial gain or loss to any product provider.

On the other side, reviews influence new purchasers when they are deciding whether to purchase a particular product. It can be concluded that reviews affect both businesses and users, in different ways. With this in view, many firms/product providers hire agents to forge fake opinions in order to grow their business and market reputation. As a result, users make wrong product-selection decisions. Web-based shopping is growing day by day, and online e-commerce websites have opened a channel for selling and purchasing products.

E-commerce sites let users purchase products (e.g. motorbikes, headphones, laptops) or avail of services (e.g. hotel reservations, airline ticket booking). Users often post suggestions/opinions/reviews/comments on e-commerce sites to share their experience after using a product or availing of a service.


1.1.1 Importance of User Reviews

Online purchasers on e-commerce sites are increasing day by day, and they often post reviews/opinions about products they have used. In other words, opinions are content created by users on e-commerce websites to express their experience of a service or product.

The importance of user reviews can be viewed from both the user and the business perspective. From the user perspective, reviews can influence new customers/users, in a good or bad way, when they decide whether to purchase a certain product. Good or bad features, described from user experience, help other users decide whether to purchase the product. For purchasing online, users often visit e-commerce sites that are rich with user experience about products, so the quality and number of user reviews can affect user traffic on a site.

1.1.2 What is Fake Review?

Opinion spamming is an immoral activity of posting fake reviews. The goal of opinion spamming is to misguide review readers. Users involved in spamming activity are called “spammers”. The task of a spammer is to build a fake reputation (either good or bad) for a business by placing fake reviews.

Some businesses pay spammers to promote their company to attract new customers, or to demote a competing company in the same line of business. A fake review has either positive or negative polarity: a review containing praising statements about the product falls under “positive polarity”, while a review containing loathing statements falls under “negative polarity”.

The increasing need to identify fake reviews has captured the attention of researchers. Fake reviews not only mislead new customers in their purchasing decisions but also hurt the business of good-quality products. Moreover, because of false and misleading reviews on a particular e-commerce site, users will avoid visiting that site. Identifying fake reviews therefore tackles all three losses at one time.

1.1.3 Contextual and Behavioral Features

Researchers report that identifying untruthful reviews is more challenging than identifying brand-only reviews and non-reviews (D. Zhang, Zhou, Kehoe, & Kilic, 2016). Commonly, two types of features are used to identify fake
reviews: contextual and behavioral features.

1.1.4 Background Knowledge

Fake review detection is one of the challenging classification tasks in the field of knowledge discovery. For a decade, researchers have approached deception in review data from multiple angles. The focus of our research work is to investigate techniques and classification models that identify individual fake reviews by analyzing different perspectives of the review data.

1.1.5 Data Mining Techniques

Generally, DM tasks can be divided into two groups: descriptive mining and predictive mining (U. Fayyad et al., 1996; Heydari et al., 2015; Crawford et al., 2015). Descriptive mining describes the general characteristics of the information in the database, e.g. clustering and association rules, whereas predictive mining forecasts values on the basis of the currently available data, e.g. regression, classification, and outlier analysis (Berry & Linoff, 1997; J. Han, Pei, & Kamber, 2011). Our work draws on these general data mining techniques. Section 2 of this paper gives the literature survey, Section 3 the proposed work, and Section 4 the conclusion.

II. LITERATURE SURVEY

The task of fake review detection has been studied since 2007, starting with the analysis of review spamming [1]. In that work, the authors analyzed the case of Amazon, concluding that manually labeling fake reviews may prove challenging, as fake reviewers can carefully craft their reviews to make them appear more reliable to other users. Consequently, they proposed using duplicates or near-duplicates as spam in order to develop a model that detects fake reviews [1]. Research on distributional footprints has also been carried out, showing a connection between distribution anomalies and deceptive reviews of Amazon products and TripAdvisor hotels [2].

Fake review detection is a specific application of the general problem of deception detection, where both verbal and nonverbal clues can be used [3]. Fake review detection research has mainly exploited textual and behavioral features, while other approaches have taken social or temporal aspects into account. Textual features have been proposed in several papers. Ott et al. [4] employed psycholinguistic features based on LIWC [5] combined with standard word and Part of Speech (POS) n-gram features. Mukherjee et al. [6] extended that work, also including style- and POS-based features, such as deep syntax and POS sequence patterns.

Behavioral features refer to nonverbal characteristics of review activity, such as the number of reviews or the time and device from which a review was posted. They have been used to improve classification models, with encouraging results. Liu et al. [31] introduced behavioral features on Amazon reviews, distinguishing among review features (e.g. number of feedbacks, position of the review, textual features, rating features), product features (e.g. price, sales rank), and reviewer features (e.g. average rating, the ratio of the reviewer's reviews that were the first review of a product, etc.).

In another work, Zhang et al. [58] explore the effect of both textual and behavioral features in the restaurant and hotel domains, showing that non-textual features prove more relevant for the task of fake review detection. Beyond textual and behavioral features, other methodologies have been followed for the fake review detection task. Wang et al. [55] proposed a review graph aimed at capturing the relationships between reviewers, reviews, and the stores reviewed by the reviewers. Using this graph, an iterative model identifies suspicious reviewers. Also following a graph model, Akoglu et al. [1] analyzed network effects in two steps: user and review scoring for fraud detection, and grouping for visualization.

Another methodological approach focuses on temporal aspects and concerns the burstiness of reviews and its impact on businesses. Bursts of reviews can be due either to the sudden popularity of products or to spam attacks [17], which were also analyzed in [38] along with other behavioral and textual features. A deeper time series approach was taken by Heydari et al. [27], and Li et al. [7] propose other types of features, such as review density in temporal windows, along with semantic and emotion features. Spatial and temporal features were used on a Chinese site by Li et al. [36].

Regarding classification algorithms, Support Vector Machine [54] was the most used, followed by Naive Bayes [22], Decision Tree [5], Random Forest [4], and Logistic Regression [11]. Apart from supervised learning, other approaches have been followed, since collecting labeled data for experiments is a hard task. In [35], the authors propose a prediction model based on semi-supervised learning and a set of textual and behavioral features. Additionally, Hernandez et al. [23] propose a semi-supervised technique called PU-learning.

III. PROPOSED SYSTEM

The goal of this article is to analyze the fake review problem in the consumer electronics field, more precisely by studying Yelp businesses from four of the biggest cities in the USA. No prior research has been carried out in this concrete field, restaurants and hotels being the most studied cases so far. We want to show that the fake review detection problem for online consumer electronics retailers can be solved by machine learning, and to examine whether the difficulty of doing so depends on geographic location.

To achieve this goal, we have followed a principled approach. Based on the literature review and experimentation, a feature framework for fake review detection is proposed, which includes contributions such as the exploitation of the social perspective. This framework, called the Fake Feature Framework (F3), helps to organize and characterize features for fake review detection. F3 considers information coming from both the user (personal profile, reviewing activity, trusting information, and social interactions) and the review itself (review text), establishing a framework with which to categorize existing research.

To evaluate the effectiveness of the features defined in F3, a dataset covering four different cities has been collected from the Yelp social network, and a classification model has been developed and evaluated.

Figure: Architecture of the implementation

Figure: Data visualization of the confusion matrix

IV. Data preprocessing

The purpose of preprocessing is to convert raw data into a form that fits machine learning. Structured and clean data allows a data scientist to get more precise results from an applied machine learning model. The techniques include data formatting, cleaning, and sampling.
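As a minimal sketch of this step in Python with pandas, assuming the reviews sit in a CSV file; the file name and the text/label column names are illustrative assumptions, not the paper's actual schema:

```python
import pandas as pd

# Load the raw reviews; file and column names are illustrative assumptions
df = pd.read_csv("yelp_reviews.csv")

# Formatting: keep only the columns the model needs
df = df[["text", "label"]]

# Cleaning: drop rows with missing text and exact duplicate reviews
df = df.dropna(subset=["text"]).drop_duplicates(subset=["text"])

# Normalize the text: lowercase and collapse repeated whitespace
df["text"] = (df["text"].str.lower()
                        .str.replace(r"\s+", " ", regex=True)
                        .str.strip())

# Sampling: balance the two classes by downsampling the majority class
n = df["label"].value_counts().min()
df = df.groupby("label", group_keys=False).apply(
    lambda g: g.sample(n, random_state=42))
```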


V. Dataset splitting

A dataset used for machine learning should be partitioned into three subsets: training, validation, and test sets. Training set: a data scientist uses the training set to train a model and determine the optimal parameters it has to learn from the data. Test set: the test set is needed to evaluate the trained model and its capability for generalization, i.e. the model's ability to identify patterns in new, unseen data after having been trained on the training data. It is crucial to use different subsets for training and testing to avoid model overfitting, which is the incapacity for generalization mentioned above.
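A minimal splitting sketch with scikit-learn, continuing from the DataFrame above, might look as follows; the 60/20/20 ratios and the use of stratification are assumptions, not values fixed by the paper:

```python
from sklearn.model_selection import train_test_split

# Hold out 20% of the data as the test set
X_trainval, X_test, y_trainval, y_test = train_test_split(
    df["text"], df["label"], test_size=0.2,
    stratify=df["label"], random_state=42)

# Split the remaining 80% into training (60%) and validation (20%) sets
X_train, X_val, y_train, y_val = train_test_split(
    X_trainval, y_trainval, test_size=0.25,
    stratify=y_trainval, random_state=42)
```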

VI. Model training

After a data scientist has preprocessed the collected data and split it into train and test sets, they can proceed with model training. This process entails “feeding” the algorithm with training data. The algorithm processes the data and outputs a model that is able to find a target value (attribute) in new data, which is the answer you want to get from predictive analysis. The purpose of model training is to develop such a model.
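As a sketch of this step, assuming count features and the multinomial variant of Naïve Bayes (the paper does not fix either choice), training and prediction could look like:

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Turn the training review text into term-count features
vectorizer = CountVectorizer()
X_train_vec = vectorizer.fit_transform(X_train)

# "Feed" the algorithm the training data to obtain a fitted model
model = MultinomialNB()
model.fit(X_train_vec, y_train)

# The fitted model predicts the target label for new, unseen reviews
print(model.predict(vectorizer.transform(
    ["great product, totally recommend it"])))
```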

VII. Implementation Methodology

The proposed work is implemented in Python 3.6.4 with the scikit-learn, pandas, matplotlib, and other required libraries. We downloaded the dataset from yelp.com. The downloaded data contains separate train and test sets with two classes of labels, namely fake and real. The train dataset is used as the training set and the test dataset as the test set. Machine learning algorithms such as Naive Bayes, SVM, Logistic Regression, and Random Forest are then applied.
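A sketch of this comparison is given below; LinearSVC and MultinomialNB are assumed concrete variants of SVM and Naive Bayes, since the paper does not specify which implementations were used:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import LinearSVC
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

# Vectorize once, then reuse the features for every classifier
vectorizer = TfidfVectorizer()
X_train_vec = vectorizer.fit_transform(X_train)
X_test_vec = vectorizer.transform(X_test)

classifiers = {
    "Naive Bayes": MultinomialNB(),
    "SVM": LinearSVC(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
}

# Train each classifier and report its test-set accuracy
for name, clf in classifiers.items():
    clf.fit(X_train_vec, y_train)
    acc = accuracy_score(y_test, clf.predict(X_test_vec))
    print(f"{name}: {acc:.4f}")
```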

VIII. Processing

Many real-world databases contain conflicting and noisy data, because the data is often collected from numerous heterogeneous sources. Inconsistency in data leads to inaccurate outcomes in the data mining process, so one of the vital steps before initiating data mining is preprocessing. There are various preprocessing methods (Y. Sun, Kamel, Wong, & Wang, 2007) to handle a variety of data: cleansing, attribute reduction, tokenization, stop word removal, lemmatization, and stemming. Two types of preprocessing are used in this research work: text and data preprocessing.

IX. Text Preprocessing

Text preprocessing includes data mining techniques used to transform unstructured text. A few of the text preprocessing techniques applied to our selected dataset are defined as follows (see the sketch after these definitions):

Tokenization: Tokenization is the task of splitting up the review text into words (tokens), i.e. the review content is tokenized into tokens. For calculating RCS and capital diversity, tokenization is a vital step to separate each word in a review.

Lemmatization: The task of a lemmatizer is to reduce a word to its morphological root, e.g. ’bought’ is lemmatized into ’buy’.
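The sketch below illustrates both steps; NLTK is an assumption here, as the paper does not name the tokenizer or lemmatizer it used:

```python
import nltk
from nltk.tokenize import word_tokenize
from nltk.stem import WordNetLemmatizer

# One-time downloads of the tokenizer model and the WordNet lexicon
nltk.download("punkt", quiet=True)
nltk.download("wordnet", quiet=True)

review = "I bought these headphones and loved them"

# Tokenization: split the review text into individual word tokens
tokens = word_tokenize(review.lower())

# Lemmatization: reduce each token to its morphological root
# (pos="v" lemmatizes verbs, so "bought" -> "buy", "loved" -> "love")
lemmatizer = WordNetLemmatizer()
lemmas = [lemmatizer.lemmatize(t, pos="v") for t in tokens]
print(lemmas)
```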

X. Result Analysis

Experiments conducted with variations of the behavioral and contextual feature sets explored the importance of the selected features for training the fake review detection model. We compared the results of different feature sets, including three different term weighting schemes, on Naive Bayes and RF. From the initial experiments exploring the importance of “Review Deviation” alongside the other behavioral and contextual features, we find that the new feature improves accuracy. Our experimental results also show that scaling the dataset can improve the classification accuracy and F1-score. The literature on classifier comparison (D. Zhang et al., 2016) likewise reports that RF outperformed other classifiers.
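For reference, the accuracy, F1-score, and confusion matrices such as those in Figures 5.2.2 to 5.2.4 can be computed with scikit-learn as sketched below; the "fake"/"real" label names and a scikit-learn version recent enough to provide ConfusionMatrixDisplay are assumptions:

```python
import matplotlib.pyplot as plt
from sklearn.metrics import (accuracy_score, f1_score,
                             confusion_matrix, ConfusionMatrixDisplay)

# clf and X_test_vec come from the comparison sketch in Section VII
y_pred = clf.predict(X_test_vec)

print("accuracy:", accuracy_score(y_test, y_pred))
print("f1-score:", f1_score(y_test, y_pred, pos_label="fake"))

# Visualize the confusion matrix, as in Figures 5.2.2 to 5.2.4
cm = confusion_matrix(y_test, y_pred, labels=["fake", "real"])
ConfusionMatrixDisplay(cm, display_labels=["fake", "real"]).plot()
plt.show()
```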

Figure 5.2.2: Confusion matrix for Random Forest with CountVectorizer features

Figure 5.2.3: Confusion matrix for Naïve Bayes with TF-IDF features

Figure 5.2.4: Confusion matrix for Naïve Bayes with N-gram features


XI. CONCLUSION

We have implemented fake review detection on the collected dataset by applying three feature extraction techniques, namely CountVectorizer, the N-gram model, and TfidfVectorizer. The extracted features are trained and predicted using four machine learning algorithms, namely Naïve Bayes, Random Forest, Logistic Regression, and SVM; a sketch of the three extractors follows.
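As a sketch, the three feature extraction techniques can be instantiated with scikit-learn as below; the uni- plus bigram setting for the N-gram model is an assumption, since the paper does not state the value of n:

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

# CountVectorizer: raw term counts over unigrams
count_features = CountVectorizer()

# N-gram model: word uni- and bigrams via CountVectorizer's ngram_range
ngram_features = CountVectorizer(ngram_range=(1, 2))

# TfidfVectorizer: term counts re-weighted by inverse document frequency
tfidf_features = TfidfVectorizer()

# Each extractor produces a feature matrix the classifiers are trained on
for name, vec in [("count", count_features),
                  ("ngram", ngram_features),
                  ("tfidf", tfidf_features)]:
    X = vec.fit_transform(X_train)
    print(name, X.shape)
```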
The following table shows the results obtained from our implementation for the N-gram feature extraction and the prediction models.
Algorithm            Accuracy (%)
Naïve Bayes          66.67
Random Forest        70.37
Logistic Regression  69.13
SVM                  74.07
Table: Experimental analysis of the N-gram model
The following table shows the results obtained from our implementation for the CountVectorizer feature extraction and the prediction models.
Algorithm            Accuracy (%)
Naïve Bayes          70.7
Random Forest        76.54
Logistic Regression  70.37
SVM                  80.24
Table: Experimental analysis of the CountVectorizer model
The following table shows the results obtained from our implementation for the TF-IDF feature extraction and the prediction models.
Algorithm            Accuracy (%)
Naïve Bayes          69.13
Random Forest        76.54
Logistic Regression  74.07
SVM                  67.90
Table: Experimental analysis of the TF-IDF model


From the above results, we can see that SVM with CountVectorizer features gives the best accuracy (80.24%), while Random Forest performs consistently well across all three feature extraction techniques.


References

[1] Nitin Jindal and Bing Liu. Review spam detection. In Proceedings of the 16th international conference on World Wide Web,
pages 1189–1190. ACM, 2007.

[2] Song Feng, Longfei Xing, Anupam Gogar, and Yejin Choi. Distributional footprints of deceptive product reviews. ICWSM,
12:98–105, 2012.

[3] Eileen Fitzpatrick, Joan Bachenko, and Tommaso Fornaciari. Automatic detection of verbal deception. Synthesis Lectures on
Human Language Technologies, 8(3):1–119, 2015.
[4] Myle Ott, Yejin Choi, Claire Cardie, and Jeffrey T Hancock. Finding deceptive opinion spam by any stretch of the imagination.
In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-
Volume 1, pages 309–319. Association for Computational Linguistics, 2011.

[5] Yla R Tausczik and James W Pennebaker. The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology, 29(1):24–54, 2010.

[6] Arjun Mukherjee, Vivek Venkataraman, Bing Liu, and Natalie S Glance. What Yelp fake review filter might be doing? In
ICWSM, pages 409–418, 2013.

[7] Yuejun Li, Xiao Feng, and Shuwu Zhang. Detecting fake reviews utilizing semantic and emotion model. In Information Science
and Control Engineering (ICISCE), 2016 3rd International Conference on, pages 317–320. IEEE, 2016.

[8] Rupesh Kumar Dewang and AK Singh. Identification of fake reviews using new set of lexical and syntactic features. In
Proceedings of the Sixth International Conference on Computer and Communication Technology 2015, pages 115–119. ACM,
2015.

[9] Snehasish Banerjee, Alton YK Chua, and Jung-Jae Kim. Using supervised learning to classify authentic and fake online reviews.
In Proceedings of the 9th International Conference on Ubiquitous Information Management and Communication, page 88. ACM,
2015.

[10] Nitin Jindal and Bing Liu. Opinion spam and analysis. In Proceedings of the 2008 International Conference on Web Search and
Data Mining, pages 219–230. ACM, 2008.
[11] Michael Luca and Georgios Zervas. Fake it till you make it: Reputation, competition, and yelp review fraud. Management
Science, 62(12):3412–3427, 2016.
[12] G. Wang, S. Xie, B. Liu, and P. S. Yu. Review graph based online store review spammer detection. In 2011 IEEE 11th
International Conference on Data Mining, pages 1242–1247, Dec 2011.

[13] Leman Akoglu, Rishi Chandy, and Christos Faloutsos. Opinion fraud detection in online reviews by network effects. ICWSM,
13:2–11, 2013.

[14] Atefeh Heydari, Mohammadali Tavakoli, and Naomie Salim. Detection of fake opinions using time series. Expert Systems with
Applications, 58:83–92, 2016.