To Find Out The Quality and Popularity of A Product by Using User Comments

This document proposes a project to develop a model that can analyze user comments on products to determine the quality and popularity of a product. It will use text mining and sentiment analysis techniques to classify reviews on smartphones from sources like YouTube. The objectives are to build a dataset from user comments and develop a model that can identify quality and popularity. The outcome will be software that analyzes reviews to help organizations understand customer satisfaction and preferences. It will benefit organizations by helping them evaluate customer wants and enhance products. The methodology involves feature extraction from reviews, weighting features, machine learning classification, and model validation. The project will be completed by December 2020.


To Find Out the Quality and Popularity of a Product by Using User Comments

AYESHA SAJJAD
ADNAN AHMED
WALEED AMIR

A project proposal submitted for

Final Year Project

Department of Computer Science

Bahria University, Karachi Campus

January 31, 2020

INTRODUCTION
User reviews on social platforms have a great influence on a product's reputation. Other
customers read them before deciding to purchase, and organisations can also benefit from them
by identifying which parameters are satisfying customers and which are not. Because of the
huge number of user reviews on different platforms, it is a challenging task for
organisations to identify which parameters are satisfying their customers. [1]

Text mining is the process of examining large amounts of unstructured data (e.g. user
reviews) and converting it into structured data in order to observe the emotions and
behaviours of reviewers.

Therefore, in this project we will design a model capable of identifying which features of a
product customers found good, bad or neutral, as well as the popularity of the product, by
using text mining and sentiment analysis to classify reviews.

We will take the smartphone as the product in this project.

OBJECTIVES

• To prepare a dataset based on user comments posted on videos of a product (smartphone)
uploaded to YouTube etc.
• To build a model capable of identifying the quality and popularity of a product.

OUTCOME

• A software component that includes all of the given functionality.
• An evaluation of the built model against the dataset developed from user comments.
• A complete software application that includes this component and presents all results
visually.

Final Deliverable of the Project:

• Software System

Benefit of the project:
Organizations can evaluate what customers want, which products were successful and which
products need more enhancement. They can also determine market strategies to target the
maximum number of customers.
By assessing user reviews, organizations will be able to identify which features a product
lacks and can work on those features in future to satisfy customers.
BACKGROUND/LITERATURE REVIEW

Zhang and Hua [1] compared two methods, Naïve Bayes and Support Vector Machine (SVM), for
analysing user reviews, to find out which method predicts users' behaviour from reviews more
accurately. They concluded that the Naïve Bayes algorithm is more effective than SVM. They
also reported that the average shortest reviews had 17 words, the shortest review had only 1
word and the longest review had at most 6000 words. Moreover, they stated that the text
length of reviews follows a power-law distribution and that the accuracy of sentiment
polarity classification rises as the word count decreases.

Chrystal and Joseph [2] used a Structured Support Vector Machine to perform text mining on
reviews of electronic gadgets. They developed a model to analyse the performance and
flexibility of the structured support vector machine by creating a confusion matrix to
measure the degree of prediction and classification of text documents. This model had four
modules: pre-processing, learning, classification and evaluation. Their system achieved an
overall accuracy of 80.4%.

Jack and Tsai [3] worked only on Amazon reviews. They found that high-quality reviews are
those that subjectively comment on several product features. Their paper reviews a method of
applying text mining techniques to compare and highlight top customer opinions of a product.
The research focused primarily on understanding what was really important to users, what
positively or negatively affected product reviews, and what specifically users chose as
highlights or pain points when reviewing laptops and tablets. Their model applied text mining
to understand consumer feedback about purchased products. They concluded that crowdsourced
data in the form of online reviews can inform a company about how customers think about and
react to products, and about what is most important to them and most urgent to fix. It is
thus a method of feedback to manufacturers.

According to Wahyudi and Kristiyanti [4], Support Vector Machine has difficulty selecting
appropriate parameters or features. In their research, they combined it with a feature
selection method, namely Particle Swarm Optimization (PSO), in order to increase the
classification accuracy of the Support Vector Machine. Their data set consisted of 100
positive and 100 negative smartphone reviews and 4 words related to the sentiment of
products, namely bad, fail, good and premium. The data set was pre-processed in 3 steps:
tokenization, stop word removal and stemming. The accuracy of sentiment analysis using SVM
alone was 82.00%, and with the addition of PSO it reached 94.50%.
PROJECT METHODOLOGY
Feature Selection & Extraction
It is further divided into:
1) Tokenization: A text document is a collection of statements. This step divides the
statements into words by removing blank spaces, commas etc.
2) Stop word removal: This step removes stop words such as 'a', 'is', 'of', 'an' and so on
from the documents.
3) Stemming: Stemming is the process of reducing words to their root. For example,
presented, presenting and presentation are all converted to the root word present. The most
commonly used stemming algorithm is Porter's algorithm. [5]
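The three pre-processing steps can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the stop word list is a tiny hand-picked sample, and the `crude_stem` function is a simple suffix stripper written for this example, not Porter's algorithm (a real implementation would more likely rely on an existing stemming library). All function names here are hypothetical.

```python
import re

# Tiny illustrative stop word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "of", "and", "was", "it"}

# Suffixes for a crude stemmer. This is NOT Porter's algorithm, only a stand-in.
SUFFIXES = ("ation", "ing", "ed", "s")


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stop_words(tokens):
    """Drop every token that appears in the stop word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def crude_stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Run tokenization, stop word removal and stemming in sequence."""
    return [crude_stem(t) for t in remove_stop_words(tokenize(text))]


if __name__ == "__main__":
    print(preprocess("The camera is presented, presenting a presentation of colours."))
```

With this sketch, presented, presenting and presentation all reduce to present, which matches the stemming example given in step 3.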
Feature Weighting
Feature weighting will be done using techniques such as Term Frequency (TF) and Term
Frequency-Inverse Document Frequency (TF-IDF).
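As a rough illustration of feature weighting, the sketch below computes TF and TF-IDF weights for already tokenized reviews. It assumes one common variant (TF as the relative frequency of a term within a document, IDF as the natural logarithm of the number of documents divided by the number of documents containing the term); the proposal does not say which variant will be used, and the function names are hypothetical.

```python
import math
from collections import Counter


def term_frequency(tokens):
    """Relative frequency of each term within one document."""
    counts = Counter(tokens)
    total = len(tokens)
    return {term: count / total for term, count in counts.items()}


def inverse_document_frequency(documents):
    """log(N / df) for each term, where df is the number of documents containing it."""
    n_docs = len(documents)
    doc_freq = Counter(term for doc in documents for term in set(doc))
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


def tf_idf(documents):
    """Return one {term: weight} dictionary per document."""
    idf = inverse_document_frequency(documents)
    return [
        {term: tf * idf[term] for term, tf in term_frequency(doc).items()}
        for doc in documents
    ]


if __name__ == "__main__":
    reviews = [["good", "camera"], ["bad", "camera"]]
    for weights in tf_idf(reviews):
        print(weights)
```

In this variant a term that occurs in every review, such as "camera" in the example, receives a weight of zero, so the weights emphasise the words that distinguish one review from another.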
Machine Learning & Classification
Text classification is the task of sorting a set of documents into categories from a
predefined set of categories. It assigns a label to each document and is based on supervised
learning. Classification techniques such as the Nearest Neighbor classifier, Naïve Bayesian
classifier, Decision Tree and Support Vector Machine can be used to categorize text. [5]
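To make the supervised classification step concrete, here is a minimal sketch of one of the listed techniques, a Naïve Bayes classifier over word counts. It assumes the multinomial form with add-one (Laplace) smoothing and skips words never seen in training; the class name, the toy training reviews and the labels are invented for illustration and are not part of the project's dataset.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes with add-one smoothing (illustrative sketch)."""

    def fit(self, documents, labels):
        # Count documents per class and word occurrences per class.
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(documents, labels):
            self.word_counts[label].update(tokens)
        self.vocabulary = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            # Log prior of the class.
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                if token not in self.vocabulary:
                    continue  # words never seen in training are skipped
                count = self.word_counts[label][token]
                # Smoothed log likelihood of the word given the class.
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


if __name__ == "__main__":
    train_docs = [["good", "battery"], ["great", "camera"],
                  ["bad", "screen"], ["poor", "battery"]]
    train_labels = ["positive", "positive", "negative", "negative"]
    model = NaiveBayesTextClassifier().fit(train_docs, train_labels)
    print(model.predict(["good", "camera"]))
```

Working in log space avoids numerical underflow when many word probabilities are multiplied together, which matters for long reviews.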
Validation & Evaluation
Finally, the model will be validated and evaluated to determine which class each
review/opinion falls into.
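One possible way to carry out this evaluation is to compare predicted labels with known labels using accuracy and a confusion matrix, the measure mentioned in the literature review. The sketch below assumes a held-out set of labelled reviews; the labels and helper names are hypothetical.

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Count how often each (actual, predicted) label pair occurs."""
    return Counter(zip(actual, predicted))


def accuracy(actual, predicted):
    """Fraction of reviews whose predicted label matches the actual label."""
    correct = sum(a == p for a, p in zip(actual, predicted))
    return correct / len(actual)


if __name__ == "__main__":
    actual = ["good", "good", "bad", "bad", "neutral"]
    predicted = ["good", "bad", "bad", "bad", "neutral"]
    print(accuracy(actual, predicted))
    for (a, p), n in sorted(confusion_matrix(actual, predicted).items()):
        print(f"actual={a} predicted={p} count={n}")
```

The off-diagonal entries of the confusion matrix, where the actual and predicted labels differ, show which classes the model confuses with each other.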

Figure 1- Block Diagram

PROJECT SCHEDULE
KEY MILESTONES
Key Milestones of the Project with dates
S. No   Elapsed time since start of the project   Milestone                                            Deliverable
1       20th January 2020                         Preparation & submission of proposal                 24th Jan 2020
2       8th Feb 2020                              Literature survey & Study/Understanding of project   20th Feb 2020
3       21st Feb 2020                             Prototype and design                                 25th April 2020
4       26th April 2020                           Experiment                                           December 2020
GANTT CHART
Weeks 1-18 (Spring 2020)
Activity                                          Date
Title Submission                                  2-1-2020
Preparation & Submission of Proposal              24-1-2020
Literature Survey & Study/Understanding Project   20-2-2020
Prototype and Design                              25-3-2020
Experiment                                        Till final exam
Mid Viva / Progress
(The chart also marks the Midterm Exam Week, Study Week and Exam Weeks.)

Weeks 1-18 (Fall 2020)
Activity
Experiment
Analysis
Writing Report
Viva
(The chart also marks the Midterm Exam Week, Study Week and Exam Weeks.)
REFERENCES

[1] Lin Zhang, Kun Hua, Honggang Wang, Guanqun Qian, Li Zhang, “Sentiment Analysis on
Reviews of Mobile Users”, The 11th International Conference on Mobile Systems and
Pervasive Computing, 2014. Online. Available at:
https://www.sciencedirect.com/science/article/pii/S1877050914008680

[2] Jincy B. Chrystal and Stephy Joseph, “Text Mining and Classification of Product Reviews
Using Structured Support Vector Machine”, 2015. Online. Available at:
https://www.researchgate.net/publication/300665247_Text_Mining_and_Classification_of_Product_Reviews_Using_Structured_Support_Vector_Machine

[3] L. Jack and Y.D. Tsai, “Using Text Mining of Amazon Reviews to Explore User-Defined
Product Highlights and Issues”, November 20, 2015. Online. Available at:
https://www.researchgate.net/publication/284188657_Using_Text_Mining_of_Amazon_Reviews_to_Explore_User-Defined_Product_Highlights_and_Issues

[4] Mochamad Wahyudi, Dinar Ajeng Kristiyanti, “Sentiment Analysis of Smartphone Product
Review Using Support Vector Machine Algorithm-Based Particle Swarm Optimization”,
Journal of Theoretical and Applied Information Technology Vol.91 No.1, 2016. Online.
Available at:
https://www.academia.edu/33152344/SENTIMENT_ANALYSIS_OF_SMARTPHONE_PRODUCT_REVIEW_USING_SUPPORT_VECTOR_MACHINE_ALGORITHM-BASED_PARTICLE_SWARM_OPTIMIZATION

[5] Yugandhara Bapurao Dasri, Bhagyashree Vyankatrao Barde, Nalwade Prakash Shivajirao,
Anant Madhavrao Bainwad, “Text Mining Framework, Methods and Techniques”,
IOSR Journal of Computer Engineering (IOSR-JCE) Vol. 19 ver. II, 2017. Online. Available at:
https://www.semanticscholar.org/paper/Mining-Framework-%2C-Methods-and-Techniques-Dasri-Barde/22496c8251735204fcf66cceb0feedb946a68e25
