Detection of Fake Online Reviews Using Semi-Supervised and Supervised Learning

This paper introduces semi-supervised and supervised text mining models to detect fake online reviews. It compares the efficiency of these techniques on a dataset containing hotel reviews. The proposed system uses tokenization, feature extraction including word frequency, sentiment polarity, and review length to create feature vectors for semi-supervised and supervised classification of reviews as fake or real. The system aims to more accurately detect fake reviews compared to prior work using only semi-supervised learning or sentiment analysis alone.

Uploaded by

Websoft Tech-Hyd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

100% found this document useful (2 votes)

954 views

Detection of Fake Online Reviews Using Semi-Supervised and Supervised Learning

Uploaded by

Websoft Tech-Hyd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Detection of fake online reviews using semi-

supervised and supervised learning

ABSTRACT

Online reviews have great impact on today’s business and commerce. Decision
making for purchase of online products mostly depends on reviews given by the
users. Hence, opportunistic individuals or groups try to manipulate product reviews
for their own interests. This paper introduces some semi-supervised and supervised
text mining models to detect fake online reviews as well as compares the efficiency
of both techniques on dataset containing hotel reviews.

EXISTING SYSTEM

 Content based methods focus on what is the content of the review. That is the
text of the review or what is told in it. Heydari et al. [2] have attempted to
detect spam review by analyzing the linguistic features of the review. Ott et al.
[3] used three techniques to perform classification. These three techniques are-
genre identification, detection of psycholinguistic deception and text
categorization.
 Behavior feature based study focuses on the reviewer that includes
characteristics of the person who is giving the review. Lim et al. [7] addressed
the problem of review spammer detection, or finding users who are the source
of spam reviews. People who post intentional fake reviews have significantly
different behavior than the normal user. They have identified the following
deceptive rating and review behaviors.
 Deceptive online review detection is generally considered as a classification
problem and one popular approach is to use supervised text classification
techniques [5]. These techniques are robust if the training is performed using
large datasets of labeled instances from both classes, deceptive opinions
(positive instances) and truthful opinions (negative examples) [8]. Some
researchers also used semi-supervised classification techniques.
Disadvantages
 In the existing work, the system uses only to semi-supervised learning.
 Only Text Classification as sentiment text and it never finds fake review.

PROPOSED SYSTEM

 In the proposed system, each review goes through tokenization process first.
Then, unnecessary words are removed and candidate feature words are
generated.
 Each candidate feature words are checked against the dictionary and if its entry
is available in the dictionary then its frequency is counted and added to the
column in the feature vector that corresponds the numeric map of the word.
Alongside with counting frequency, the length of the review is measured and
added to the feature vector.
 Finally, sentiment score which is available in the data set is added in the
feature vector. We have assigned negative sentiment as zero valued and
positive sentiment as some positive valued in the feature vector.

Advantages

 The system is very fast and effective due to semi-supervised and supervised
learning.
 Focused on the content of the review based approaches. As feature we have
used word frequency count, sentiment polarity and length of review.

SYSTEM REQUIREMENTS
➢ H/W System Configuration:-

➢ Processor - Pentium –IV

➢ RAM - 4 GB (min)
➢ Hard Disk - 20 GB
➢ Key Board - Standard Windows Keyboard
➢ Mouse - Two or Three Button Mouse
➢ Monitor - SVGA

Software Requirements:
 Operating System - Windows XP
 Coding Language - Java/J2EE(JSP,Servlet)
 Front End - J2EE
 Back End - MySQL

Mechanistic Interpretability For AI Safety A Review: Leonard Bereska Efstratios Gavves
No ratings yet
Mechanistic Interpretability For AI Safety A Review: Leonard Bereska Efstratios Gavves
41 pages
Analysis of An Interview Based On Emotion Detection Using Convolutional Neural Networks
No ratings yet
Analysis of An Interview Based On Emotion Detection Using Convolutional Neural Networks
25 pages
IBA-Report Sales Superstore
No ratings yet
IBA-Report Sales Superstore
9 pages
Predicting Stock Price Direction Using Support Vector Machines
No ratings yet
Predicting Stock Price Direction Using Support Vector Machines
14 pages
Final Report
100% (1)
Final Report
20 pages
Self Learning and Efficient Health Status Analysis For A Core Router System
No ratings yet
Self Learning and Efficient Health Status Analysis For A Core Router System
35 pages
Major 2 Report
No ratings yet
Major 2 Report
41 pages
Diabetic Retinopathy Thesis Report Edited Nge Weng Sheng
No ratings yet
Diabetic Retinopathy Thesis Report Edited Nge Weng Sheng
55 pages
Litreature On Automatic Dipper Circuit For Vehicle-2
No ratings yet
Litreature On Automatic Dipper Circuit For Vehicle-2
10 pages
190A1E0336 - Project
No ratings yet
190A1E0336 - Project
67 pages
Currency Counting Machine With Fake Note Detection
100% (6)
Currency Counting Machine With Fake Note Detection
19 pages
Any Time Medicine Report
100% (1)
Any Time Medicine Report
34 pages
Deep Representation Based Feature Extraction and Recovering For Finger-Vein Veri Cation
100% (1)
Deep Representation Based Feature Extraction and Recovering For Finger-Vein Veri Cation
15 pages
A Report On "Material Storage Layout and Inventory Management"
No ratings yet
A Report On "Material Storage Layout and Inventory Management"
37 pages
Report On PC To PC Laser Communication
No ratings yet
Report On PC To PC Laser Communication
20 pages
KanOCR - TRANSLATION OF KANNADA TEXT IMAGE TO ENGLISH TEXT BY PROCESSING OF IMAGE USING OCR
No ratings yet
KanOCR - TRANSLATION OF KANNADA TEXT IMAGE TO ENGLISH TEXT BY PROCESSING OF IMAGE USING OCR
65 pages
Music Recommendation Using Facial Emotion Recognition
No ratings yet
Music Recommendation Using Facial Emotion Recognition
4 pages
Online Crime Management System: Sri Ramakrishna College of Arts and Science
No ratings yet
Online Crime Management System: Sri Ramakrishna College of Arts and Science
34 pages
Project Photo Share)
No ratings yet
Project Photo Share)
58 pages
Name of The Project: Seminar Report ON
No ratings yet
Name of The Project: Seminar Report ON
52 pages
Big Data
No ratings yet
Big Data
30 pages
Detection and Mitigation of DDoS Attack in Cloud
No ratings yet
Detection and Mitigation of DDoS Attack in Cloud
9 pages
Functional and Non Functional Requirements
No ratings yet
Functional and Non Functional Requirements
10 pages
Internship Report Anthony and Joshil PDF
No ratings yet
Internship Report Anthony and Joshil PDF
20 pages
Report On VAS Feasibility in Rural India
No ratings yet
Report On VAS Feasibility in Rural India
61 pages
Artificial Intelligence Dietitian Synopsis
No ratings yet
Artificial Intelligence Dietitian Synopsis
7 pages
Crime Management Report
No ratings yet
Crime Management Report
71 pages
Btech Final Year Project
No ratings yet
Btech Final Year Project
47 pages
Group-Project Final Documentation2
No ratings yet
Group-Project Final Documentation2
59 pages
Exam Cell Automation Project
0% (1)
Exam Cell Automation Project
17 pages
Kidney Stone Detection Using Ultrasound
No ratings yet
Kidney Stone Detection Using Ultrasound
26 pages
Tech Seminar Report
No ratings yet
Tech Seminar Report
5 pages
EVS Marine Pollution Final Report PDF
50% (6)
EVS Marine Pollution Final Report PDF
49 pages
Ration Card Management System
No ratings yet
Ration Card Management System
3 pages
MDAZMATHULLA 4JN16MCA24 Modified1 PDF
No ratings yet
MDAZMATHULLA 4JN16MCA24 Modified1 PDF
42 pages
A Synopsis On Mini Project: "Criminal Face Identification System"
No ratings yet
A Synopsis On Mini Project: "Criminal Face Identification System"
5 pages
Skenit - QR Code Ordering System: A Project Report
No ratings yet
Skenit - QR Code Ordering System: A Project Report
31 pages
Org Chart BHEL
0% (2)
Org Chart BHEL
1 page
Internship Project Report - PGDMHCM
No ratings yet
Internship Project Report - PGDMHCM
48 pages
Finger Vein Report
100% (1)
Finger Vein Report
31 pages
Chronic Kidney Disease Prediction Using Machine Learning Techniques (Documentation)
No ratings yet
Chronic Kidney Disease Prediction Using Machine Learning Techniques (Documentation)
48 pages
Industry Profile Port
100% (1)
Industry Profile Port
14 pages
Mini Project: Diploma in Computer Engineering
No ratings yet
Mini Project: Diploma in Computer Engineering
30 pages
Emotion Based Music Player (Manchester Univ)
No ratings yet
Emotion Based Music Player (Manchester Univ)
43 pages
Heart Disease Prediction: Submitted For Partial Fulfillment of The Degree
No ratings yet
Heart Disease Prediction: Submitted For Partial Fulfillment of The Degree
38 pages
Forest Fire Detection
No ratings yet
Forest Fire Detection
8 pages
Efficient Priority Based Load Balancing in Cloud Computing Environment
No ratings yet
Efficient Priority Based Load Balancing in Cloud Computing Environment
62 pages
Leaf Health Detection Using Python and Open Computer Vision
No ratings yet
Leaf Health Detection Using Python and Open Computer Vision
1 page
Synopsis, PDF
100% (1)
Synopsis, PDF
11 pages
Robo Revolution Seminar Report
No ratings yet
Robo Revolution Seminar Report
20 pages
Medical Shop Management A Project Report: Submitted by
No ratings yet
Medical Shop Management A Project Report: Submitted by
42 pages
Mca Project
100% (1)
Mca Project
35 pages
Project Report
No ratings yet
Project Report
42 pages
Final Year Report PDF
No ratings yet
Final Year Report PDF
56 pages
Automatic Gear Changer in Two Wheelers - DC Gun Model
50% (6)
Automatic Gear Changer in Two Wheelers - DC Gun Model
66 pages
Virtual Mirror - A Hassle Free Approach To The Use of Trial Room
No ratings yet
Virtual Mirror - A Hassle Free Approach To The Use of Trial Room
38 pages
Touchpad Plus Ver. 1.1 Class 7
From Everand
Touchpad Plus Ver. 1.1 Class 7
Nisha Batra
No ratings yet
20bf1f0033 - 2nd
No ratings yet
20bf1f0033 - 2nd
29 pages
Department of Masters of Comp. Applications
No ratings yet
Department of Masters of Comp. Applications
16 pages
Fake Product Review Monitoring and Removal For Genuine Product Using Opinion Mining
No ratings yet
Fake Product Review Monitoring and Removal For Genuine Product Using Opinion Mining
23 pages
Department of Masters of Comp. Applications
No ratings yet
Department of Masters of Comp. Applications
12 pages
Consumer Buying Behavior Tata Motors Concord Motor Dealer
No ratings yet
Consumer Buying Behavior Tata Motors Concord Motor Dealer
61 pages
Customer Satisfaction Asianpaints
No ratings yet
Customer Satisfaction Asianpaints
47 pages
Financial Statement Analysis-Axis
No ratings yet
Financial Statement Analysis-Axis
8 pages
Consumer Buying Pattern Towards Maruti Suzuki
No ratings yet
Consumer Buying Pattern Towards Maruti Suzuki
57 pages
Online Marketing in India - Amazon India
100% (1)
Online Marketing in India - Amazon India
87 pages
HR Payroll Management - Cogzinant
No ratings yet
HR Payroll Management - Cogzinant
80 pages
FAKEDETECTOR Effective Fake News Detection With Deep Diffusive Neural Network
No ratings yet
FAKEDETECTOR Effective Fake News Detection With Deep Diffusive Neural Network
2 pages
Static and Dynamic Analysis of Al-7075
No ratings yet
Static and Dynamic Analysis of Al-7075
71 pages
Tori Spherical
0% (1)
Tori Spherical
62 pages
Self Healing Composite Materials
No ratings yet
Self Healing Composite Materials
55 pages
Traffic Sign Board Recognition and Voice Alert System Using Convolutional Neural Network
No ratings yet
Traffic Sign Board Recognition and Voice Alert System Using Convolutional Neural Network
1 page
Radome Full Project
No ratings yet
Radome Full Project
54 pages
Modelling and Fabrication of Multipurpose Agricultural Equipment
No ratings yet
Modelling and Fabrication of Multipurpose Agricultural Equipment
1 page
Piston and Connecting Rod
No ratings yet
Piston and Connecting Rod
54 pages
Heat Transfer Computer Design
No ratings yet
Heat Transfer Computer Design
62 pages
Modeling and Analysis of Solid Vessel and Multilayered Composite Pressure Vessels
No ratings yet
Modeling and Analysis of Solid Vessel and Multilayered Composite Pressure Vessels
62 pages
Wa0000
No ratings yet
Wa0000
2 pages
Design and Analysis of Rocker Arm: Tools Were Used
No ratings yet
Design and Analysis of Rocker Arm: Tools Were Used
52 pages
Crank Shaft
No ratings yet
Crank Shaft
65 pages
Https/ev - Turnitin.com/student/paper/1744121734/queue PDF/sas23e6e8
No ratings yet
Https/ev - Turnitin.com/student/paper/1744121734/queue PDF/sas23e6e8
80 pages
A Novel Data Embedding Method Using Adaptive Pixel Pair Matching
No ratings yet
A Novel Data Embedding Method Using Adaptive Pixel Pair Matching
4 pages
Car Bumper
No ratings yet
Car Bumper
41 pages
Financial Analysis of Reliance Industry Limited
100% (1)
Financial Analysis of Reliance Industry Limited
69 pages
19wj1e0022 Financial Statment Analysis Reliance
No ratings yet
19wj1e0022 Financial Statment Analysis Reliance
67 pages
A Machine Learning Methodology For Diagnosing Chronic Kidney Disease
No ratings yet
A Machine Learning Methodology For Diagnosing Chronic Kidney Disease
2 pages
Empoyee's Retention at Balaji Formulation PVT - LTD
No ratings yet
Empoyee's Retention at Balaji Formulation PVT - LTD
53 pages
Simple Appointment Letter Format 1 1
100% (1)
Simple Appointment Letter Format 1 1
1 page
Python Ieee Projects 2021 - 22 JP
No ratings yet
Python Ieee Projects 2021 - 22 JP
3 pages
Customer Satisfaction Royal-Enfield
100% (3)
Customer Satisfaction Royal-Enfield
74 pages
Employee Training and Development - Websoft
No ratings yet
Employee Training and Development - Websoft
80 pages
DevOps Bootcamp Course Resource (1)-1-99
No ratings yet
DevOps Bootcamp Course Resource (1)-1-99
99 pages
CHEM 1101 End of 2020 1st Sem Exam
No ratings yet
CHEM 1101 End of 2020 1st Sem Exam
5 pages
Short Path Pattren in Excel
No ratings yet
Short Path Pattren in Excel
10 pages
Calculation of U Value Simple Construction
No ratings yet
Calculation of U Value Simple Construction
5 pages
Simpletron Java
No ratings yet
Simpletron Java
3 pages
Fenotipo TDAH
No ratings yet
Fenotipo TDAH
14 pages
Chinese Physics Olympiad 2017 Finals Theoretical Exam: Translated By: Wai Ching Choi Edited By: Kushal Thaman
100% (1)
Chinese Physics Olympiad 2017 Finals Theoretical Exam: Translated By: Wai Ching Choi Edited By: Kushal Thaman
7 pages
Unit 2 Materials Technology
No ratings yet
Unit 2 Materials Technology
78 pages
UN Numbers of Chemicals
No ratings yet
UN Numbers of Chemicals
62 pages
Senior Simulation Engineer
No ratings yet
Senior Simulation Engineer
3 pages
Market Making
100% (1)
Market Making
7 pages
E Book Image File Types Explained
No ratings yet
E Book Image File Types Explained
10 pages
Sea Math Skills Checklist
No ratings yet
Sea Math Skills Checklist
8 pages
s Tr Civil Work (Rev.0 2015) Piling Work
No ratings yet
s Tr Civil Work (Rev.0 2015) Piling Work
9 pages
DLL - Science 5 - Q3 - W2
No ratings yet
DLL - Science 5 - Q3 - W2
10 pages
4V400 Series: Solenoid Valve, Air Piloted Valve
No ratings yet
4V400 Series: Solenoid Valve, Air Piloted Valve
3 pages
B737 Autothrottle
100% (2)
B737 Autothrottle
103 pages
Spectrum Report: 1 Cover Page
No ratings yet
Spectrum Report: 1 Cover Page
15 pages
OM_Lesson 7_ Seven Basic Quality tools
No ratings yet
OM_Lesson 7_ Seven Basic Quality tools
33 pages
INTENSIVE CARE SCHOOLS WANDI P.4 MATHEMATICS END OF TERM I EXAMS 2024 BY Tr. BABEL.
No ratings yet
INTENSIVE CARE SCHOOLS WANDI P.4 MATHEMATICS END OF TERM I EXAMS 2024 BY Tr. BABEL.
11 pages
Name of Candidate (Block Letters)
No ratings yet
Name of Candidate (Block Letters)
1 page
Questions of Maximality: A. Lastname
No ratings yet
Questions of Maximality: A. Lastname
12 pages
BRAHMA SR3V Control de Llama
No ratings yet
BRAHMA SR3V Control de Llama
4 pages
Khayyam & His Solutions of The Cubic
No ratings yet
Khayyam & His Solutions of The Cubic
4 pages
Standards For Radiation Thermometry: 2012 NCSL International Workshop and Symposium
No ratings yet
Standards For Radiation Thermometry: 2012 NCSL International Workshop and Symposium
10 pages
8326 17 3 09 Flyer 98700-564E L22SR
No ratings yet
8326 17 3 09 Flyer 98700-564E L22SR
2 pages
How To Properly Complete An IIAR 6 System Safety Inspection Checklist Form?
No ratings yet
How To Properly Complete An IIAR 6 System Safety Inspection Checklist Form?
4 pages
Á1229Ñ Sterilization of Compendial Articles: Accessed From 10.6.1.1 by mvpstn3kts On Wed Apr 05 03:53:30 EDT 2017
No ratings yet
Á1229Ñ Sterilization of Compendial Articles: Accessed From 10.6.1.1 by mvpstn3kts On Wed Apr 05 03:53:30 EDT 2017
6 pages
Template Gfa24biz29 Fa24biz29 Report
No ratings yet
Template Gfa24biz29 Fa24biz29 Report
19 pages

Detection of Fake Online Reviews Using Semi-Supervised and Supervised Learning

Uploaded by

Detection of Fake Online Reviews Using Semi-Supervised and Supervised Learning

Uploaded by

Detection of fake online reviews using semi-

supervised and supervised learning

➢ Processor - Pentium –IV

You might also like