Mini Project

Uploaded by

Puthin Dungala

Mini Project Review-1

Batch No: 02
Project Title: Sentiment Analysis using NLP
MINI PROJECT REVIEW I-CONTENTS
ABSTRACT

Sentiment analysis is a text analysis technique that automatically detects the polarity of text.
Also called opinion mining, it is one of the major tasks of NLP (Natural Language Processing) and has
become very popular in recent years. Researchers aim to develop systems that can identify and classify
the opinion or sentiment expressed in electronic text. Consumers regularly face trade-offs in purchase
decisions, so someone who wants to buy a consumer product nowadays often turns to user reviews and
discussions about the product in public web forums. Many consumers use reviews posted by other
consumers before making their decisions. People tend to express their opinions on various entities,
and as a result opinion mining has gained importance. Sentiment analysis evaluates whether an opinion
expressed about an entity has a positive or a negative orientation. Consumers need to decide what
subset of the available information to use. The process of identifying and extracting subjective
information from raw data is known as sentiment analysis. An accurate method for predicting sentiment
could enable us to extract opinions from the internet and predict online customers' preferences, which
could prove valuable for economic or marketing research. So far, a few problems predominate in this
research community, namely sentiment classification, feature-based classification, and handling
negations. This paper presents a survey covering the techniques and methods used in sentiment analysis
and the challenges that appear in the field.
INTRODUCTION

● Sentiment analysis is a natural language processing (NLP) technique used to automatically identify and
extract subjective information from text data. It involves analyzing the text to determine the author's attitude,
emotions, opinions, and sentiments towards a particular topic or product. Sentiment analysis has become
increasingly important in recent years due to the explosion of user-generated content on social media
platforms and the need for businesses to understand the sentiments of their customers towards their products
and services. This review provides an overview of sentiment analysis using NLP, including the different
approaches used, the challenges involved, and the applications of sentiment analysis in various industries.
Literature Survey
Sno. | Author | Published Year | Title | Technique

1 | T. Nasukawa | 2003 | "Sentiment Analysis: Capturing Favorability Using Natural Language Processing" | Natural Language Processing, Definition of Sentiment Expressions
2 | Y. Fang, H. Tan, and J. Zhan | 2018 | "Multi-Strategy Sentiment Analysis of Consumer Reviews Based on Semantic Fuzziness" | Semantic Fuzziness, Sentiment Analysis Strategies
3 | Saad and Yang | 2019 | "Complete Tweet Sentiment Analysis on the Basis of Ordinal Regression with Machine Learning Algorithms" | Ordinal Regression, Machine Learning (SVR, RF, Multinomial logistic regression, DTs), Pre-processing
Problem Statement in Existing System

● The problem addressed by sentiment analysis using NLP is the need to automatically identify and
extract subjective information from large volumes of text data. With the growth of social media and
online reviews, there is a vast amount of user-generated content that contains valuable information
about people's attitudes, emotions, opinions, and sentiments towards products, services, or events.
Sentiment analysis aims to analyze this unstructured text data and classify it into positive, negative,
or neutral sentiments, which can provide valuable insights to businesses, policymakers, and
researchers. However, sentiment analysis using NLP poses several challenges, including handling
sarcasm, irony, and ambiguity in text, dealing with language and cultural differences, and ensuring
accuracy and consistency in classification. Addressing these challenges is essential for developing
effective sentiment analysis models that can provide reliable and actionable insights.
Proposed System

⮚ As a preliminary step, the dataset, obtained from an open source, is pre-processed to remove duplicate values
and features.

⮚ We first download the NLTK stopwords corpus and define a set of stopwords and punctuation marks to remove
from the text data. We then define a preprocess_text function that takes a text string as input, converts it to
lowercase, removes punctuation and stopwords using list comprehensions, and returns the preprocessed text.

⮚ The dataset will be divided into training and testing sets.
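
The preprocessing step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `load_stopwords` helper and the `FALLBACK_STOPWORDS` set are assumptions added so the sketch still runs when NLTK or its stopwords corpus cannot be loaded.

```python
import string

# Assumption: a tiny hand-written stopword set used only when NLTK or its
# stopwords corpus is unavailable (for example, without network access).
FALLBACK_STOPWORDS = {
    "a", "an", "the", "is", "are", "was", "this", "that", "it", "and", "of", "to",
}


def load_stopwords():
    """Return the NLTK English stopwords, or the fallback set on failure."""
    try:
        import nltk
        from nltk.corpus import stopwords

        nltk.download("stopwords", quiet=True)
        return set(stopwords.words("english"))
    except Exception:
        return set(FALLBACK_STOPWORDS)


STOPWORDS = load_stopwords()
PUNCTUATION = set(string.punctuation)


def preprocess_text(text):
    """Lowercase the text, strip punctuation, and drop stopwords."""
    text = text.lower()
    # Remove punctuation character by character.
    text = "".join([ch for ch in text if ch not in PUNCTUATION])
    # Split on whitespace and keep only non-stopword tokens.
    tokens = [tok for tok in text.split() if tok not in STOPWORDS]
    return " ".join(tokens)


print(preprocess_text("This phone is GREAT, and the battery is amazing!"))
```

Splitting on whitespace keeps the sketch dependency-light; a tokenizer from NLTK could be swapped in if finer handling of contractions is needed.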

Proposed System

⮚ We first define the training set as two lists, X_train and y_train, where X_train contains the text data and
y_train contains the corresponding sentiment labels. We then initialize a TfidfVectorizer object with English
stopwords, and fit it to and transform the training set. This converts the text data into a sparse
matrix of numerical features using TF-IDF weighting. We then initialize a LogisticRegression object and
train the model on the training set using the fit method.
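
A minimal sketch of this training step, using scikit-learn, might look like the following. The nine toy reviews and their labels are hypothetical stand-ins for the project dataset, and `test_size`, `random_state`, and `max_iter` are illustrative choices rather than values taken from the project.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical toy reviews; a real run would load the project dataset instead.
texts = [
    "I love this phone, the camera is excellent",
    "Great value and fast delivery",
    "Fantastic sound quality, very happy",
    "Terrible battery, it died in a day",
    "The screen cracked, very disappointed",
    "Awful support and slow shipping",
    "The package arrived on Tuesday",
    "It comes in black and white colours",
    "The box contains a charger and a cable",
]
labels = ["positive"] * 3 + ["negative"] * 3 + ["neutral"] * 3

# Hold out three reviews for testing, one per class (stratified split).
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=3, stratify=labels, random_state=0
)

# Convert the training text into a sparse TF-IDF feature matrix.
vectorizer = TfidfVectorizer(stop_words="english")
X_train_tfidf = vectorizer.fit_transform(X_train)

# Train a logistic regression classifier on the TF-IDF features.
model = LogisticRegression(max_iter=1000)
model.fit(X_train_tfidf, y_train)

# Reuse the fitted vectorizer (transform only) on unseen text.
X_test_tfidf = vectorizer.transform(X_test)
predictions = model.predict(X_test_tfidf)
print(list(zip(X_test, predictions)))
```

The vectorizer is fitted on the training set only, so the test set does not influence the vocabulary or the IDF weights. With so few examples the predictions themselves are not meaningful; the sketch only shows how the pieces connect.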
Objectives

-> To study and analyze people’s opinions, sentiments, appraisals, attitudes, and emotions toward entities
and their attributes as expressed in written text.

-> To enable companies to monitor social media sentiment around their brand among their
audience.
Project domain

• Machine Learning:
• A machine learning system learns from historical data, builds prediction models, and predicts the output
for new data as it arrives. The accuracy of the predicted output depends on the amount of data, since a
large amount of data helps build a better model that predicts the output more accurately.
• Machine learning is a method of data analysis that automates analytical model building. It is a branch of
artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions
with minimal human intervention.

• Natural Language Processing:


• Natural Language Processing (NLP) is a subfield of computer science and artificial intelligence that deals
with the interaction between computers and human languages. The primary goal of NLP is to enable
computers to understand, interpret, and generate natural language the way humans do.
Requirement analysis

Functional Requirements:
The main requirement of our project is to take a text review of a product as input and report whether
the review is positive, negative, or neutral.
Non-Functional Requirements:
Accuracy: The model must correctly identify the opinion expressed about the product.
System Requirements:

Software Requirements:
Operating System: Windows
Coding Language: Python and various modules

Hardware Requirements:
Processor: i5 or above
Speed: 2.4 GHz
RAM: 4 GB/8 GB
Hard Disk: 20 GB
Keyboard: Standard Keyboard
System Architecture
Methodology

• Input:
• Any text as input. The data was collected from Kaggle and various social
media platforms.

• NumPy:
• NumPy arrays can be created in multiple ways, with various ranks (numbers of dimensions) and sizes.
Arrays can also be created from other Python data types such as lists and tuples.

• Pandas:
• Pandas is an open-source Python library made mainly for working with relational or labeled data
both easily and intuitively. It provides various data structures and operations for manipulating numerical data
and time series. The library is built on top of NumPy.
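
As a small illustration of these two libraries, the sketch below creates NumPy arrays from a list, a tuple, and a nested list, then uses pandas to drop duplicate rows from a review table. The column names and sample rows are hypothetical.

```python
import numpy as np
import pandas as pd

# NumPy arrays created from a list, a tuple, and a nested list.
from_list = np.array([1, 2, 3])
from_tuple = np.array((1.5, 2.5, 3.5))
matrix = np.array([[1, 2, 3], [4, 5, 6]])

# ndim is the rank (number of dimensions); shape gives the size per dimension.
print(from_list.ndim, matrix.ndim, matrix.shape)

# A small labeled table of reviews (hypothetical rows with one duplicate).
reviews = pd.DataFrame(
    {
        "review": ["Great phone", "Bad battery", "Great phone"],
        "sentiment": ["positive", "negative", "positive"],
    }
)

# Drop exact duplicate rows and renumber the index.
deduplicated = reviews.drop_duplicates().reset_index(drop=True)
print(deduplicated)
```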
Methodology

• Seaborn:
• Python provides numerous libraries for data visualization. Seaborn is a visualization library for
statistical graphics plotting in Python, built on top of Matplotlib.
• Matplotlib:
• Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.
It can create publication-quality plots and interactive figures that zoom, pan, and update, and it lets you
customize visual style and layout.
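
A typical use of these plotting libraries in a sentiment project is to check the class balance of the labels. The following is a rough sketch under assumed data: the `sentiment` column, its values, and the output file name are hypothetical.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen so no display is required

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical sentiment labels; a real run would use the dataset's label column.
df = pd.DataFrame(
    {"sentiment": ["positive", "negative", "positive", "neutral", "positive", "negative"]}
)
order = ["positive", "negative", "neutral"]

# Seaborn draws the bar chart onto a Matplotlib axes object.
fig, ax = plt.subplots(figsize=(5, 3))
sns.countplot(data=df, x="sentiment", order=order, ax=ax)

# Matplotlib handles titles, axis labels, layout, and saving.
ax.set_title("Reviews per sentiment class")
ax.set_xlabel("Sentiment")
ax.set_ylabel("Number of reviews")
fig.tight_layout()
fig.savefig("sentiment_counts.png")
```

A strongly imbalanced chart would suggest stratifying the train/test split or resampling before training.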
• NLTK:
• Natural language processing (NLP) is a field that focuses on making natural human language usable by
computer programs. NLTK, or Natural Language Toolkit, is a Python package that you can use for NLP.
Project plan

TASK NAME DURATION(IN WEEKS)

LITERATURE SURVEY 1 Week

METHODOLOGY AND DESIGN 2 Weeks

CODING 4 Weeks

IMPLEMENTATION 2 Weeks

TESTING 2 Weeks

OBSERVATION 1 Week
