0% found this document useful (0 votes)
29 views

Python Portfolio Project For Data Analyst

Uploaded by

Richard Smith
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views

Python Portfolio Project For Data Analyst

Uploaded by

Richard Smith
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 13

Python

Project
FOR DATA ANALYST

Sentiment
Analysis
linkedin.com/in/ileonjose
Overview: Analyze Twitter sentiment
regarding climate change to gauge public
perception. This project combines data
collection, text preprocessing, advanced
sentiment analysis, and data visualization.

Dataset:
kaggle.com/datasets/kazanova/sentiment140

linkedin.com/in/ileonjose
Step 1: Gather Data

Utilize the Tweepy library to collect tweets


related to climate change over a specific
period.

1. Set Up Twitter API Access:

Create a Twitter Developer account


and obtain API keys.

2. Fetch Tweets:

Collect tweets using specific keywords


or hashtags like #ClimateChange or
#GlobalWarming.

linkedin.com/in/ileonjose
linkedin.com/in/ileonjose
Step 2: Preprocess Text Data

Clean and prepare the text for analysis,


focusing on meaningful content.

1. Text Cleaning Functions:

Remove URLs, special characters, and


stopwords.

2. Tokenization:

Break the text into words for further


analysis.

linkedin.com/in/ileonjose
linkedin.com/in/ileonjose
Step 3: Advanced Sentiment
Analysis

Use VADER (Valence Aware Dictionary and


sentiment Reasoner) for nuanced
sentiment analysis, particularly suitable
for social media text.

1. Apply VADER:

Get sentiment scores, which include


positive, negative, and neutral values.

linkedin.com/in/ileonjose
linkedin.com/in/ileonjose
Step 4: Aggregate and
Visualize Results

Create visualizations to present sentiment


distributions and trends over time.

1. Aggregate Scores:

Classify tweets based on their


compound scores into positive,
negative, and neutral categories.

2. Bar and Line Graphs:

Use Matplotlib to visualize the overall


sentiment and temporal trends.

linkedin.com/in/ileonjose
linkedin.com/in/ileonjose
Step 5: Provide Insights

Write a comprehensive report detailing your


findings:

Overall Sentiment: Discuss the balance


of positive, negative, and neutral tweets.

Trends Over Time: If data spans several


days, analyze how sentiment fluctuated,
especially during significant climate
events.

Public Opinion: Reflect on what the


sentiment suggests about public
awareness and concern regarding
climate change.
Recruiters POV:

This project highlights your ability to


collect and analyze social media data,
apply NLP techniques, and present your
findings visually.

It demonstrates critical thinking and data


storytelling, valuable skills for any data
analyst role.
Found this helpful? Repost!

linkedin.com/in/ileonjose

You might also like