SlideShare a Scribd company logo
Fast and Accurate Sentiment Classification
Using NLTK And Naive Bayes Model
Presented By -
Abhisek Sahoo (ID - B516001)
Computer Engineering(2016-20)
Under the guidance of
Prof. Sabyasachi Patra
oj
Department of Computer Science And Engineering
International Institute of Information Technology, Bhubaneswar
Sentiment Analysis ?
Aim to determine the attitude of a
speaker/writer with respect to some text
Positive or Negative ?
This is a good book ! Positive
This is a good book ! I like it ! Positive (more)
This is a bad book! Negative
The first chapter is good,
But the rest is terrible. Negative
Features of Sentiment Analysis
➢ Identifying trends of public opinion in social media
➢ Marketing and consumer research
➢ Customer feedback on new product launches, political
campaigns
➢ To facilitate smarter business decisions
➢ Enhanced product recommendation
How to
compute it ?
Proposed System Workflow
Natural
Language
Processing
➢ Field of computer science, artificial intelligence,
and computational linguistics concerned with the
interactions between computers and human (natural)
languages.
➢ Ability to draw insights from data contained in
emails, videos, and other unstructured material.
➢ The various aspects of NLP include Parsing,
Machine Translation, Language Modelling,
Machine Learning, Semantic Analysis etc.
Natural
Language
Toolkit
➢ NLTK is a leading platform for building Python
programs to work with human language data.
➢ It provides easy-to-use interfaces to over 50 corpora
and lexical resources such as WordNet.
➢ It provides a suite of text processing libraries for
classification, tokenization, stemming, tagging,
parsing, and semantic reasoning.
Dataset :
NLTK
Corpora
➢ A publicly available dataset of tweets from the
Natural Language Toolkit Corpus Library is used.
➢ Tweets dataset comprised of a ample collection of
individual emotions and captures most of the
adjectives important to sentiment classification.
➢ It consists of 30,000 extremely polar tweets for
training dataset and 10,000 for testing dataset.
➢ Both the training and test sets comprises of an
identical number of negative and positive tweets.
Data
Preprocessing
Tokenization
The process of breaking a stream of text up into words, phrases, symbols, or other
meaningful elements called tokens.
Normalization
➢ Normalization in NLP is the process of converting a word to its canonical form.
➢ Stemming is a process of removing affixes from a word.
➢ Lemmatization is a process in which a word normalizes with the context of vocabulary and
morphological analysis of words in text.
➢ Wordnet is a lexical database available in NLTK for the English language that helps the script
determine the base word.
Removal of Noise/Stop Words
➢ Noise is any part of the text that does not add any meaning to data.
➢ It is necessary to remove all hyperlinks, @ symbol of tweeter handles ,punctuation and
special characters.
➢ Also required to remove the stop words like “is”, “a” and “the” from the sentence.
Determine Word Density
➢ The most basic form of analysis on textual data is to take out the word frequency.
➢ After compiling all words in the sample of tweets, the most common words can be found out
using the FreqDist class of NLTK.
Separate The Data
Training data to train the model
Testing data to compute the accuracy
Training Data
(With Labels)
Model
Real Data Trained Model Prediction
Train The Data
Compute The Accuracy
Accuracy = No. of correct predictions / Total no. of statements
Result
Future Work
A web based application can
implemented for better access
Web scraping can be
implemented for direct retrieval
of tweets or reviews from
Twitter or any other platform
More classification categories
can be added to determine the
sentiment more specifically
Supervised with multiple
languages to make it more local
Conclusion
We conclude that by using various NLTK modules for
preprocessing and NLTK Naive Bayes classifier it is easier to
classify the tweets and get better accuracy.
References
➢ https://www.researchgate.net/publication/220482883_NLTK_the_Natural_Language_
Toolkit
➢ Basic Sentiment Analysis using NLTK - Towards Data Science
➢ https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/
➢ https://www.kaggle.com/lakshmi25npathi/sentiment-analysis-of-imdb-movie-reviews
➢ http://ijcsit.com/docs/Volume%206/vol6issue06/ijcsit20150606134.pdf
➢ https://pdfs.semanticscholar.org/c151/dfad8c1bf88b0afc716758c77d533ded7dd0.pdf
Thank You !
Ad

More Related Content

What's hot (20)

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
VeenaSKumar2
 
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
Countants
 
An overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support SystemAn overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support System
Gan Keng Hoon
 
Short story presentation
Short story presentationShort story presentation
Short story presentation
StutiAgarwal36
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine Learning
Karthik Sankar
 
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGEUNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
Prasadu Peddi
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Ali Habeeb
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis System
IJERA Editor
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
PhD Assistance
 
Sentiment Analysis of Feedback Data
Sentiment Analysis of Feedback DataSentiment Analysis of Feedback Data
Sentiment Analysis of Feedback Data
ijtsrd
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
PhD Assistance
 
Presentation1
Presentation1Presentation1
Presentation1
Ritikesh Bhaskarwar
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
ijfcstjournal
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET Journal
 
IRJET - Text Optimization/Summarizer using Natural Language Processing
IRJET - Text Optimization/Summarizer using Natural Language Processing IRJET - Text Optimization/Summarizer using Natural Language Processing
IRJET - Text Optimization/Summarizer using Natural Language Processing
IRJET Journal
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion
dannyijwest
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
Zahid Azam
 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysis
Akhila
 
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
PhD Assistance
 
Report for Speech Emotion Recognition
Report for Speech Emotion RecognitionReport for Speech Emotion Recognition
Report for Speech Emotion Recognition
Dongang (Sean) Wang
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
VeenaSKumar2
 
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
Countants
 
An overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support SystemAn overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support System
Gan Keng Hoon
 
Short story presentation
Short story presentationShort story presentation
Short story presentation
StutiAgarwal36
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine Learning
Karthik Sankar
 
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGEUNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
Prasadu Peddi
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Ali Habeeb
 
Tutorial - Speech Synthesis System
Tutorial - Speech Synthesis SystemTutorial - Speech Synthesis System
Tutorial - Speech Synthesis System
IJERA Editor
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
PhD Assistance
 
Sentiment Analysis of Feedback Data
Sentiment Analysis of Feedback DataSentiment Analysis of Feedback Data
Sentiment Analysis of Feedback Data
ijtsrd
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
PhD Assistance
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
ijfcstjournal
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET Journal
 
IRJET - Text Optimization/Summarizer using Natural Language Processing
IRJET - Text Optimization/Summarizer using Natural Language Processing IRJET - Text Optimization/Summarizer using Natural Language Processing
IRJET - Text Optimization/Summarizer using Natural Language Processing
IRJET Journal
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion
dannyijwest
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
Zahid Azam
 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysis
Akhila
 
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
PhD Assistance
 
Report for Speech Emotion Recognition
Report for Speech Emotion RecognitionReport for Speech Emotion Recognition
Report for Speech Emotion Recognition
Dongang (Sean) Wang
 

Similar to Fast and accurate sentiment classification us and naive bayes model b516001 (20)

INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
socarem879
 
An Overview of Natural Language Processing.pptx
An Overview of Natural Language Processing.pptxAn Overview of Natural Language Processing.pptx
An Overview of Natural Language Processing.pptx
Softxai
 
NATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptxNATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptx
saivinay93
 
Top 10 Must-Know NLP Techniques for Data Scientists
Top 10 Must-Know NLP Techniques for Data ScientistsTop 10 Must-Know NLP Techniques for Data Scientists
Top 10 Must-Know NLP Techniques for Data Scientists
Xavor Corporation - Redefining Health Technology
 
Natural Language Processing .pdf
Natural Language Processing .pdfNatural Language Processing .pdf
Natural Language Processing .pdf
Anime196637
 
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
rudolf eremyan
 
NLP Msc Computer science S2 Kerala University
NLP Msc Computer science S2 Kerala UniversityNLP Msc Computer science S2 Kerala University
NLP Msc Computer science S2 Kerala University
vineethpradeep50
 
Natural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfNatural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdf
Moar Digital 360
 
Mining Opinion Features in Customer Reviews
Mining Opinion Features in Customer ReviewsMining Opinion Features in Customer Reviews
Mining Opinion Features in Customer Reviews
IJCERT JOURNAL
 
Technical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonTechnical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with Python
Michelle Purnama
 
Natural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxNatural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptx
AlyaaMachi
 
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
QuantInsti
 
AIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with PythonAIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with Python
Nhi Nguyen
 
Networking lesson 4 chaoter 1 Module 4-1.pptx
Networking lesson 4 chaoter 1 Module 4-1.pptxNetworking lesson 4 chaoter 1 Module 4-1.pptx
Networking lesson 4 chaoter 1 Module 4-1.pptx
MAHERMOHAMED27
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
Benjaminlapid1
 
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET -  	  Twitter Sentiment Analysis using Machine LearningIRJET -  	  Twitter Sentiment Analysis using Machine Learning
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET Journal
 
Text-to-Speech-presentation2(punjabi).pptx
Text-to-Speech-presentation2(punjabi).pptxText-to-Speech-presentation2(punjabi).pptx
Text-to-Speech-presentation2(punjabi).pptx
MuskanSaini50
 
Cognitive computing
Cognitive computing Cognitive computing
Cognitive computing
Pratap Dangeti
 
Sentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfSentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdf
OmSatpathy
 
Machine Learning for Natural Language Processing| ashokveda . pdf
Machine Learning for Natural Language Processing| ashokveda . pdfMachine Learning for Natural Language Processing| ashokveda . pdf
Machine Learning for Natural Language Processing| ashokveda . pdf
df2608021
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
socarem879
 
An Overview of Natural Language Processing.pptx
An Overview of Natural Language Processing.pptxAn Overview of Natural Language Processing.pptx
An Overview of Natural Language Processing.pptx
Softxai
 
NATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptxNATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptx
saivinay93
 
Natural Language Processing .pdf
Natural Language Processing .pdfNatural Language Processing .pdf
Natural Language Processing .pdf
Anime196637
 
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
rudolf eremyan
 
NLP Msc Computer science S2 Kerala University
NLP Msc Computer science S2 Kerala UniversityNLP Msc Computer science S2 Kerala University
NLP Msc Computer science S2 Kerala University
vineethpradeep50
 
Natural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfNatural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdf
Moar Digital 360
 
Mining Opinion Features in Customer Reviews
Mining Opinion Features in Customer ReviewsMining Opinion Features in Customer Reviews
Mining Opinion Features in Customer Reviews
IJCERT JOURNAL
 
Technical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with PythonTechnical Development Workshop - Text Analytics with Python
Technical Development Workshop - Text Analytics with Python
Michelle Purnama
 
Natural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxNatural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptx
AlyaaMachi
 
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
QuantInsti
 
AIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with PythonAIS Technical Development Workshop 2: Text Analytics with Python
AIS Technical Development Workshop 2: Text Analytics with Python
Nhi Nguyen
 
Networking lesson 4 chaoter 1 Module 4-1.pptx
Networking lesson 4 chaoter 1 Module 4-1.pptxNetworking lesson 4 chaoter 1 Module 4-1.pptx
Networking lesson 4 chaoter 1 Module 4-1.pptx
MAHERMOHAMED27
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
Benjaminlapid1
 
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET -  	  Twitter Sentiment Analysis using Machine LearningIRJET -  	  Twitter Sentiment Analysis using Machine Learning
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET Journal
 
Text-to-Speech-presentation2(punjabi).pptx
Text-to-Speech-presentation2(punjabi).pptxText-to-Speech-presentation2(punjabi).pptx
Text-to-Speech-presentation2(punjabi).pptx
MuskanSaini50
 
Sentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfSentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdf
OmSatpathy
 
Machine Learning for Natural Language Processing| ashokveda . pdf
Machine Learning for Natural Language Processing| ashokveda . pdfMachine Learning for Natural Language Processing| ashokveda . pdf
Machine Learning for Natural Language Processing| ashokveda . pdf
df2608021
 
Ad

Recently uploaded (20)

What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Ad

Fast and accurate sentiment classification us and naive bayes model b516001

  • 1. Fast and Accurate Sentiment Classification Using NLTK And Naive Bayes Model Presented By - Abhisek Sahoo (ID - B516001) Computer Engineering(2016-20) Under the guidance of Prof. Sabyasachi Patra oj Department of Computer Science And Engineering International Institute of Information Technology, Bhubaneswar
  • 2. Sentiment Analysis ? Aim to determine the attitude of a speaker/writer with respect to some text
  • 3. Positive or Negative ? This is a good book ! Positive This is a good book ! I like it ! Positive (more) This is a bad book! Negative The first chapter is good, But the rest is terrible. Negative
  • 4. Features of Sentiment Analysis ➢ Identifying trends of public opinion in social media ➢ Marketing and consumer research ➢ Customer feedback on new product launches, political campaigns ➢ To facilitate smarter business decisions ➢ Enhanced product recommendation
  • 7. Natural Language Processing ➢ Field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages. ➢ Ability to draw insights from data contained in emails, videos, and other unstructured material. ➢ The various aspects of NLP include Parsing, Machine Translation, Language Modelling, Machine Learning, Semantic Analysis etc.
  • 8. Natural Language Toolkit ➢ NLTK is a leading platform for building Python programs to work with human language data. ➢ It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet. ➢ It provides a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning.
  • 9. Dataset : NLTK Corpora ➢ A publicly available dataset of tweets from the Natural Language Toolkit Corpus Library is used. ➢ Tweets dataset comprised of a ample collection of individual emotions and captures most of the adjectives important to sentiment classification. ➢ It consists of 30,000 extremely polar tweets for training dataset and 10,000 for testing dataset. ➢ Both the training and test sets comprises of an identical number of negative and positive tweets.
  • 11. Tokenization The process of breaking a stream of text up into words, phrases, symbols, or other meaningful elements called tokens.
  • 12. Normalization ➢ Normalization in NLP is the process of converting a word to its canonical form. ➢ Stemming is a process of removing affixes from a word. ➢ Lemmatization is a process in which a word normalizes with the context of vocabulary and morphological analysis of words in text. ➢ Wordnet is a lexical database available in NLTK for the English language that helps the script determine the base word.
  • 13. Removal of Noise/Stop Words ➢ Noise is any part of the text that does not add any meaning to data. ➢ It is necessary to remove all hyperlinks, @ symbol of tweeter handles ,punctuation and special characters. ➢ Also required to remove the stop words like “is”, “a” and “the” from the sentence.
  • 14. Determine Word Density ➢ The most basic form of analysis on textual data is to take out the word frequency. ➢ After compiling all words in the sample of tweets, the most common words can be found out using the FreqDist class of NLTK.
  • 15. Separate The Data Training data to train the model Testing data to compute the accuracy
  • 16. Training Data (With Labels) Model Real Data Trained Model Prediction
  • 18. Compute The Accuracy Accuracy = No. of correct predictions / Total no. of statements
  • 20. Future Work A web based application can implemented for better access Web scraping can be implemented for direct retrieval of tweets or reviews from Twitter or any other platform More classification categories can be added to determine the sentiment more specifically Supervised with multiple languages to make it more local
  • 21. Conclusion We conclude that by using various NLTK modules for preprocessing and NLTK Naive Bayes classifier it is easier to classify the tweets and get better accuracy.
  • 22. References ➢ https://www.researchgate.net/publication/220482883_NLTK_the_Natural_Language_ Toolkit ➢ Basic Sentiment Analysis using NLTK - Towards Data Science ➢ https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/ ➢ https://www.kaggle.com/lakshmi25npathi/sentiment-analysis-of-imdb-movie-reviews ➢ http://ijcsit.com/docs/Volume%206/vol6issue06/ijcsit20150606134.pdf ➢ https://pdfs.semanticscholar.org/c151/dfad8c1bf88b0afc716758c77d533ded7dd0.pdf

Editor's Notes

  • #9: NLTK is intended to support research and teaching in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning.
  • #10: I choose tweets as my dataset since it comprise of a ample collection of individual emotions and captures most of the adjectives important to sentiment classification.