Department of Computer Science and Engineering Coding Assignment For Deep Learning CSE754
Department of Computer Science and Engineering Coding Assignment For Deep Learning CSE754
Engineering
Introduction:
Analysing the customer reviews to judge the quality of the product by it often
helps in making the better decisions on choosing the right product. But, the
problem arises when there are a large number of reviews. There are 2
approaches to resolve the problem. The first one, is using the number associated
with the review(called rating), which expresses the amount of satisfaction. The
latter one is by using the text classification technique, which classifies the
review into 2 classes namely- positive and negative respectively. In this work,
we have developed a text classification model which is built using
CNN(Convolutional Neural Network).
Problem Statement:
The task is to build a text classification model using CNN and classify the text
using the model. Finally we are required to evaluate the accuracy of the text
classification against the sample data.
Structure Chart:
Input:
The two seperate of input files each containing negative and positive customer
reviews respectively.
Output:
The output is the model generated by the training part of the data. It’s accuracy
is then evaluated against the remaining test part of the data.
1. Step
2. Loss
3. Accuracy
Algorithm:
Steps:
1. Labelling
The datasets are labelled to identify the raw data to add more meaning and
context to the model so that it can help efficiently in supervised learning.
The data was split into 90% and 10% fractions. Assigning data points to the
former and remaining to the latter.
3. Training
The datasets are trained to help and understand how sophisticated results are
produced in order to make predictions and fulfill the desired task.
The datasets are compared based on the algorithm’s flow of work. The
comparison is both qualitative and quantitative and investigates object detection
labels with respect to size, location, and contextual information.
Results:
The tests are carried out through these metrics:
Metric scores:
1.accuracy obtained by the author = 0.798
2.accuracy obtained by us = 0.7824
The results were mixed as the work on regularizing the fine-tuning process is
expected.
Metric scores:
1. Accuracy obtained by the author = 0.761
2. Accuracy obtained by us = 0.756
Epilogue:
The most troublesome parts during the implementation of the project was the
version incompatibilities between tensorflow 1 and tensorflow 2 and some other
python libraries as well. All in all we learnt to use version specific tensorflow
containers for addressing this problem and it will be useful when we try to
execute older tensorflow repositories.
Attachments:
Github link:
https://github.com/tharunedara/cnn-sentence-classification