Craft Presentations

The document presents a project on sentiment analysis using machine learning and deep learning techniques, focusing on analyzing text data for sentiment understanding. It outlines the problem of unstructured data growth and the objectives of fine-tuning large language models. The findings indicate that DistilBERT outperforms traditional models like SVM, Naive Bayes, and Logistic Regression in accuracy and F1-score.

Uploaded by

sakshamdura

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Craft Presentations

Uploaded by

sakshamdura

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

SENTIMENT ANALYSIS

USING MACHINE LEARNING

SUBMITTED BY:
SAKSHAM DURA: 27444/077
SANDESH REGMI: 27447/077
RENJIL SUNAR: 27436/077
INTRODUCTION:
Sentiment analysis has become an important task in natural
language processing (NLP), finding its way into various areas like
business intelligence, social media monitoring, and customer
feedback reviews. This project aims to put machine learning and
deep learning techniques to work for understanding sentiments
in text.
PROBLEM STATEMENT
Exponential growth of unstructured data
Domain Specific complexities for analyzing text data

OBJECTIVES
Analyze text data based on sentiment
FineTune large language models
SYSTEM OVERVIEW
Data Collection
DataSet
link:https://www.kaggle.com/datasets/abhi8923shriv/senti
ment-analysis-dataset/data

Exploratory Data Analysis (EDA)

value counts
data samples
descriptive summary
statistical summary
generating wordclouds
Figure: WordCloud for positive text
Figure: WordCloud for negative text
Figure: WordCloud for neutral text
Preprocessing
normalizing casing
removing stopwords
stemming / lemmatization
tokenizer / vectorizer
drop null values
How CountVectorizer works

Figure: CountVectorizer [1].

How DistilBERT Tokenizer works

Figure: DistilBERT Tokenizer [2].

Statistical Machine Learning Models
Logistic Regression
Support Vector Machines
Multinomial Naive Bayes

Transformer Based Models

BERT
DistilBERT
Figure: Transformer vs BERT [3].
Figure: Transformer Model [4].
Figure: BERT Model [4].
Evaluation Metrics

Figure: SVM metrics Figure: Multinomial Naive Bayes

metrics
Evaluation Metrics

Figure: Logistic Regression metrics Figure: DistilBERT Model

Findings:
DistilBERT consistently outperformed traditional models like SVM,
Naive Bayes, and Logistic Regression in terms of accuracy and F1-score.
Refernces:
1. https://www.ronaldjamesgroup.com/article/grab-your-wine-its-time-to-
demystify-ml-and-nlp
2. https://www.cnblogs.com/emanlee/p/17521830.html
3. https://www.linkedin.com/pulse/overview-transformer-bert-sanjay-
kumar-mba-ms-phd/
4. https://arize.com/blog-course/unleashing-bert-transformer-model-nlp/
THANK
YOU VERY
MUCH!

BPMN: the Business Process Modeling Notation Pocket Handbook
From Everand
BPMN: the Business Process Modeling Notation Pocket Handbook
Patrice Briol
No ratings yet
The Applied SQL Data Analytics Workshop - Second Edition: Develop your practical skills and prepare to become a professional data analyst, 2nd Edition
From Everand
The Applied SQL Data Analytics Workshop - Second Edition: Develop your practical skills and prepare to become a professional data analyst, 2nd Edition
Matt Goldwasser
No ratings yet
Learn SAP BI in 24 Hours
From Everand
Learn SAP BI in 24 Hours
Alex Nordeen
3/5 (1)
Big Data Visualization
From Everand
Big Data Visualization
James D. Miller
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
DATA ANALYSIS AND DATA SCIENCE: Unlock Insights and Drive Innovation with Advanced Analytical Techniques (2024 Guide)
From Everand
DATA ANALYSIS AND DATA SCIENCE: Unlock Insights and Drive Innovation with Advanced Analytical Techniques (2024 Guide)
WINTON CLEM
No ratings yet
Microsoft Power BI Performance Best Practices: Learn practical techniques for building high-speed Power BI solutions
From Everand
Microsoft Power BI Performance Best Practices: Learn practical techniques for building high-speed Power BI solutions
Thomas LeBlanc
No ratings yet
Spreadsheets To Cubes (Advanced Data Analytics for Small Medium Business): Data Science
From Everand
Spreadsheets To Cubes (Advanced Data Analytics for Small Medium Business): Data Science
alasdair gilchrist
No ratings yet
15 Math Concepts Every Data Scientist Should Know: Understand and learn how to apply the math behind data science algorithms
From Everand
15 Math Concepts Every Data Scientist Should Know: Understand and learn how to apply the math behind data science algorithms
David Hoyle
No ratings yet
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
From Everand
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
Brian Knight
3/5 (1)
A Natural Language Processing For Sentiment Analysis From Text Using Deep Learning Algorithm
No ratings yet
A Natural Language Processing For Sentiment Analysis From Text Using Deep Learning Algorithm
7 pages
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
From Everand
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
Marije Brummel
No ratings yet
Book Series Increasing Productivity of Software Development, Part 2: Management Model, Cost Estimation and KPI Improvement
From Everand
Book Series Increasing Productivity of Software Development, Part 2: Management Model, Cost Estimation and KPI Improvement
Stefan Luckhaus
No ratings yet
Data Analytics for Marketing: A practical guide to analyzing marketing data using Python
From Everand
Data Analytics for Marketing: A practical guide to analyzing marketing data using Python
Guilherme Diaz-Bérrio
No ratings yet
Business Dashboards: A Visual Catalog for Design and Deployment
From Everand
Business Dashboards: A Visual Catalog for Design and Deployment
Nils H. Rasmussen
4/5 (1)
Analyzing The Performance of Sentiment Analysis Using BERT DistilBERT and RoBERTa
No ratings yet
Analyzing The Performance of Sentiment Analysis Using BERT DistilBERT and RoBERTa
6 pages
Practical Full Stack Machine Learning: A Guide to Build Reliable, Reusable, and Production-Ready Full Stack ML Solutions
From Everand
Practical Full Stack Machine Learning: A Guide to Build Reliable, Reusable, and Production-Ready Full Stack ML Solutions
Alok Kumar
No ratings yet
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Pragmatic Machine Learning with Python: Learn How to Deploy Machine Learning Models in Production
From Everand
Pragmatic Machine Learning with Python: Learn How to Deploy Machine Learning Models in Production
Avishek Nag
No ratings yet
Mastering Lead Generation with DeepSeek AI: Unlocking the Future of Customer Acquisition
From Everand
Mastering Lead Generation with DeepSeek AI: Unlocking the Future of Customer Acquisition
Robert Cullen
No ratings yet
The Kimball Group Reader: Relentlessly Practical Tools for Data Warehousing and Business Intelligence Remastered Collection
From Everand
The Kimball Group Reader: Relentlessly Practical Tools for Data Warehousing and Business Intelligence Remastered Collection
Ralph Kimball
No ratings yet
The MSP’s Guide to the Ultimate Client Experience: Optimizing service efficiency, account management productivity, and client engagement with a modern digital-first approach.
From Everand
The MSP’s Guide to the Ultimate Client Experience: Optimizing service efficiency, account management productivity, and client engagement with a modern digital-first approach.
Jeff Farris
No ratings yet
ASP.NET 3.5 Application Architecture and Design
From Everand
ASP.NET 3.5 Application Architecture and Design
Vivek Thakur
No ratings yet
Manufacturing: Engineering, Management and Marketing
From Everand
Manufacturing: Engineering, Management and Marketing
S.O.T Ogaji
No ratings yet
Data Entry Operator: Skills, Software, Career Tips, and Interview Q&A
From Everand
Data Entry Operator: Skills, Software, Career Tips, and Interview Q&A
Sumitra Kumari
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Data Analytics with SAS: Explore your data and get actionable insights with the power of SAS (English Edition)
From Everand
Data Analytics with SAS: Explore your data and get actionable insights with the power of SAS (English Edition)
Nishant Sidana
No ratings yet
Oracle CRM On Demand Administration Essentials
From Everand
Oracle CRM On Demand Administration Essentials
Padmanabha Rao
No ratings yet
Creating your MySQL Database: Practical Design Tips and Techniques
From Everand
Creating your MySQL Database: Practical Design Tips and Techniques
Marc Delisle
3/5 (1)
The Analytics Lifecycle Toolkit: A Practical Guide for an Effective Analytics Capability
From Everand
The Analytics Lifecycle Toolkit: A Practical Guide for an Effective Analytics Capability
Gregory S. Nelson
No ratings yet
Getting Started with SQL Server 2012 Cube Development
From Everand
Getting Started with SQL Server 2012 Cube Development
Simon Lidberg
No ratings yet
Optimizing Salesforce Industries Solutions on the Vlocity OmniStudio Platform: Implementing OmniStudio best practices for achieving maximum performance
From Everand
Optimizing Salesforce Industries Solutions on the Vlocity OmniStudio Platform: Implementing OmniStudio best practices for achieving maximum performance
Dmitri Khanine
No ratings yet
Machine Learning with Python: Foundations and Applications: ML, #1
From Everand
Machine Learning with Python: Foundations and Applications: ML, #1
Mohammed Nurudeen
No ratings yet
Data Science with R: Beginner to Expert
From Everand
Data Science with R: Beginner to Expert
Narayana Nemani
No ratings yet
Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects
From Everand
Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects
Neal Fishman
No ratings yet
Be Data Curious!: Be Data Curious!, #1
From Everand
Be Data Curious!: Be Data Curious!, #1
Nick Jewell
No ratings yet
Effective Analytics for Marketing
From Everand
Effective Analytics for Marketing
Sucheta Kakkar
No ratings yet
Sentiment_Analysis_Using_Bert_Model
No ratings yet
Sentiment_Analysis_Using_Bert_Model
8 pages
Business Intelligence and Data Mining Techniques
From Everand
Business Intelligence and Data Mining Techniques
Dwaipayan Sethi
No ratings yet
Data Mining with Microsoft SQL Server 2008
From Everand
Data Mining with Microsoft SQL Server 2008
Jamie MacLennan
4/5 (1)
Microsoft Dynamics NAV Administration
From Everand
Microsoft Dynamics NAV Administration
Amit Sachdev
No ratings yet
Data Cleaning with Power BI: The definitive guide to transforming dirty data into actionable insights
From Everand
Data Cleaning with Power BI: The definitive guide to transforming dirty data into actionable insights
Gus Frazer
No ratings yet
Microsoft Dynamics AX 2012 Reporting Cookbook
From Everand
Microsoft Dynamics AX 2012 Reporting Cookbook
Kamalakannan Elangovan
No ratings yet
Mastering Classification Algorithms for Machine Learning: Learn how to apply Classification algorithms for effective Machine Learning solutions (English Edition)
From Everand
Mastering Classification Algorithms for Machine Learning: Learn how to apply Classification algorithms for effective Machine Learning solutions (English Edition)
PARTHA MAJUMDAR
No ratings yet
Enterprise Process Orchestration: A Hands-on Guide to Strategy, People, and Technology That Will Transform Your Business
From Everand
Enterprise Process Orchestration: A Hands-on Guide to Strategy, People, and Technology That Will Transform Your Business
Bernd Ruecker
No ratings yet
IT Interview Guide for Freshers: Crack your IT interview with confidence
From Everand
IT Interview Guide for Freshers: Crack your IT interview with confidence
Sameer S Paradkar
No ratings yet
Sentiment__Analysis
No ratings yet
Sentiment__Analysis
12 pages
DeepSeek for Data Analysis: The Future of Data Analysis for Business Professionals
From Everand
DeepSeek for Data Analysis: The Future of Data Analysis for Business Professionals
Mohammod Shaharuzzaman
No ratings yet
Book Series: Increasing Productivity of Software Development, Part 1: Productivity and Performance Measurement - Measurability and Methods
From Everand
Book Series: Increasing Productivity of Software Development, Part 1: Productivity and Performance Measurement - Measurability and Methods
Stefan Luckhaus
No ratings yet
AWS Certified Machine Learning - Specialty (MLS-C01) Certification Guide: The ultimate guide to passing the MLS-C01 exam on your first attempt
From Everand
AWS Certified Machine Learning - Specialty (MLS-C01) Certification Guide: The ultimate guide to passing the MLS-C01 exam on your first attempt
Somanath Nanda
No ratings yet
Microsoft Dynamics NAV 2009: Professional Reporting
From Everand
Microsoft Dynamics NAV 2009: Professional Reporting
Steven Renders
No ratings yet
Introduction to Business Analytics
From Everand
Introduction to Business Analytics
Dwaipayan Sethi
No ratings yet
Expert Cube Development with SSAS Multidimensional Models
From Everand
Expert Cube Development with SSAS Multidimensional Models
Marco Russo
No ratings yet
Mastering Lead Generation with DeepSeek AI/ A Comprehensive Guide to Transforming Your Sales Strategy
From Everand
Mastering Lead Generation with DeepSeek AI/ A Comprehensive Guide to Transforming Your Sales Strategy
Robert Cullen
No ratings yet
Learn Professional Programming in .Net Using C#, Visual Basic, and Asp.Net
From Everand
Learn Professional Programming in .Net Using C#, Visual Basic, and Asp.Net
Adalat Khan
No ratings yet
Professional Microsoft SQL Server 2012 Reporting Services
From Everand
Professional Microsoft SQL Server 2012 Reporting Services
Paul Turley
1/5 (1)
Business Analytics with SAS Studio: Deliver Business Intelligence by Combining SQL Processing, Insightful Visualizations, and Various Data Mining Techniques
From Everand
Business Analytics with SAS Studio: Deliver Business Intelligence by Combining SQL Processing, Insightful Visualizations, and Various Data Mining Techniques
Rajinder Kr. Chitoria
No ratings yet
Manufacturing Secret : Product Development and Intelligent Manufacturing For Flexible Automation With Odoo 17: odoo consultations, #1.1
From Everand
Manufacturing Secret : Product Development and Intelligent Manufacturing For Flexible Automation With Odoo 17: odoo consultations, #1.1
DR.Abdelghany.fouad
No ratings yet
Learning Hunk: A quick, practical guide to rapidly visualizing and analyzing your Hadoop data using Hunk
From Everand
Learning Hunk: A quick, practical guide to rapidly visualizing and analyzing your Hadoop data using Hunk
Dmitry Anoshin
No ratings yet

Craft Presentations

Uploaded by

Craft Presentations

Uploaded by

SENTIMENT ANALYSIS

USING MACHINE LEARNING

Exploratory Data Analysis (EDA)

Figure: CountVectorizer [1].

Figure: DistilBERT Tokenizer [2].

Transformer Based Models

Figure: SVM metrics Figure: Multinomial Naive Bayes

Figure: Logistic Regression metrics Figure: DistilBERT Model

You might also like