0% found this document useful (0 votes)
33 views

Sahil Int577 Ca1

The document summarizes a student's analysis of social media data from Twitter and YouTube. 1) An analysis of COVID-related Twitter data found the tweet with the highest retweets and that positive emotion like trust was most common. 2) An analysis of comments on a YouTube video found over 125,000 likes and over 10,000 positive comments, indicating overall positive user feedback. 3) Word cloud visualizations of Twitter and YouTube data showed the most frequent words used were related to COVID, political leaders, and medical issues like oxygen.

Uploaded by

hemanth kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views

Sahil Int577 Ca1

The document summarizes a student's analysis of social media data from Twitter and YouTube. 1) An analysis of COVID-related Twitter data found the tweet with the highest retweets and that positive emotion like trust was most common. 2) An analysis of comments on a YouTube video found over 125,000 likes and over 10,000 positive comments, indicating overall positive user feedback. 3) Word cloud visualizations of Twitter and YouTube data showed the most frequent words used were related to COVID, political leaders, and medical issues like oxygen.

Uploaded by

hemanth kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

Academic Task 1

INT577

WEB AND SOCIAL MEDIA ANALYTICS

Section: Q2041
Student Name: Sahil Madaan
Registration Number: 12010685
Roll Number: RQ8E26A48
Signature: Sahil

Submitted to: Mrinalini Rana (22138)


Section: Q8E26
Reg No: 12010685
Q1. Collect the data from twitter using “COVID” word, perform the pre-processing
and analyze the data.
• Find the tweet with highest retweet number.

Output:
• Find which emotions represent max sum.

Output:
• Display top words used using WORDCLOUD 2 only.

Output:
• Top words displayed using WORDCLOUD2 Star shape.

Output:
• Analyze the data and mention 3-4 points to conclude the complete analysis.

➢ In the first answer we have specified the highest number of tweets that have been retweeted
using the arrange function (), where we get retweet count as 61644 in the covid analysis.

➢ In the second point we have performed a sentimental analysis to find the emotion with the
max sum which tends to come out positive with emotion of trust being the topmost, through
the help of bar chart.

➢ Eventually in the third point we have used the wordcloud2 package. With the help of this
function wordcloud2() we have created a cloud of words representing the most used words
such as Covid, Modi, People, Vaccine, pandemic, patient, media etc.

Q2. Collect the data from YouTube using “OpO_mVvtkP0” video id


Output:
• Perform the pre-processing.

• Convert text to lowercase

• Punctuation is removed
• How many numbers of likes and dislike.

Output:

No of Likes = 125082
No of Dislikes = 11241
• What is the user feedback positive or negative.

Output:

As we can see that no of positive comments are more,10217 which are more as compared to the
negative comments,6621 so the result or feedback from the audience is positive
• Display using bar diagram count of top 10 words used in the comments.
Output:

From the above bar graph, we can see that most used words in the comments are “hai”, “Kejriwal”,
“Oxygen”, “delhi”, “modi” etc.

You might also like