Name: Dhruv Jayant Tillu Roll No.: 6107
Subject: 510302 - BDS

ASSIGNMENT: 02
Aim: Take a sample dataset (The lab teacher may provide it). Plot the data using appropriate graphs (e.g.
scatter diagram). Perform normality and symmetry tests on it using at least one graph method and at least
one statistical test. Analyse the results. Then evaluate Spearman’s Rank Correlation for this data.

Requirements:
• Software: PyCharm Professional
• Libraries: Pandas, NumPy, SciPy, Seaborn, and Matplotlib
• Dataset: StudentsPerformance.csv from Kaggle

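Theory: This program analyzes the relationships among the math, reading, and writing scores. A scatter plot
and a correlation heatmap show how the scores vary together. Normality is checked by standardizing each
score to z-scores and comparing the share of values within one and two standard deviations of the mean
against the roughly 68% and 95% expected under a normal distribution. Symmetry is assessed with Pearson's
second coefficient of skewness, 3 * (mean - median) / standard deviation. Histograms are used to visually
examine each distribution, while Spearman’s Rank Correlation measures the strength and direction of the
monotonic relationship between two variables.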

Code:
import pandas as pd

data = pd.read_csv('StudentsPerformance.csv')
data.head()

   gender race/ethnicity parental level of education         lunch \
0  female        group B           bachelor's degree      standard
1  female        group C                some college      standard
2  female        group B             master's degree      standard
3    male        group A          associate's degree  free/reduced
4    male        group C                some college      standard

  test preparation course  math score  reading score  writing score
0                    none          72             72             74
1               completed          69             90             88
2                    none          90             95             93
3                    none          47             57             44
4                    none          76             78             75

import matplotlib.pyplot as plt

import seaborn as sns

plt.figure(figsize=(8, 6))
sns.scatterplot(x='math score', y='reading score', data=data)
plt.title('Scatter Plot of Math Score vs Reading Score')
plt.xlabel('Math Score')
plt.ylabel('Reading Score')
plt.grid(True)
plt.show()

corr = data[['math score', 'reading score', 'writing score']].corr()

sns.heatmap(corr, annot=True, cmap='seismic')
plt.title('Correlation Heatmap')
plt.show()

import numpy as np
math_scores = data['math score'].values
reading_scores = data['reading score'].values
writing_scores = data['writing score'].values

z_scores_math = (math_scores - np.mean(math_scores)) / np.std(math_scores)

z_scores_reading = (reading_scores - np.mean(reading_scores)) / np.std(reading_scores)
z_scores_writing = (writing_scores - np.mean(writing_scores)) / np.std(writing_scores)

within_one_std_math = np.mean(np.abs(z_scores_math) <= 1)

within_two_std_math = np.mean(np.abs(z_scores_math) <= 2)

within_one_std_reading = np.mean(np.abs(z_scores_reading) <= 1)

within_two_std_reading = np.mean(np.abs(z_scores_reading) <= 2)

within_one_std_writing = np.mean(np.abs(z_scores_writing) <= 1)

within_two_std_writing = np.mean(np.abs(z_scores_writing) <= 2)

print(f'Percentage of Math Scores within 1 std: {within_one_std_math * 100}%')

print(f'Percentage of Math Scores within 2 std: {within_two_std_math * 100}%')

print(f'Percentage of Reading Scores within 1 std: {within_one_std_reading * 100}%')


print(f'Percentage of Reading Scores within 2 std: {within_two_std_reading * 100}%')

print(f'Percentage of Writing Scores within 1 std: {within_one_std_writing * 100}%')

print(f'Percentage of Writing Scores within 2 std: {within_two_std_writing * 100}%')

Percentage of Math Scores within 1 std: 69.6%

Percentage of Math Scores within 2 std: 95.39999999999999%
Percentage of Reading Scores within 1 std: 66.4%
Percentage of Reading Scores within 2 std: 95.39999999999999%
Percentage of Writing Scores within 1 std: 68.8%
Percentage of Writing Scores within 2 std: 95.8%
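
As a rough cross-check of these shares, the sketch below computes the coverage a perfectly normal distribution would give within k standard deviations, using the identity P(|Z| <= k) = erf(k / sqrt(2)). The helper name normal_coverage is introduced here for illustration, and the observed values are copied from the output above, rounded to one decimal place.

```python
import math

def normal_coverage(k: float) -> float:
    """P(|Z| <= k) for a standard normal variable Z."""
    return math.erf(k / math.sqrt(2))

# Observed shares (in percent) copied from the printed output, rounded.
observed = {
    "math":    {1: 69.6, 2: 95.4},
    "reading": {1: 66.4, 2: 95.4},
    "writing": {1: 68.8, 2: 95.8},
}

for subject, shares in observed.items():
    for k, pct in shares.items():
        expected = 100 * normal_coverage(k)
        print(f"{subject:8s} within {k} std: observed {pct:.1f}%, "
              f"normal {expected:.1f}%, gap {pct - expected:+.1f}")
```

Small gaps are consistent with approximate normality, but this comparison is only descriptive; it is not a formal hypothesis test.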

def calculate_skewness(data):
    mean = np.mean(data)
    median = np.median(data)
    std_dev = np.std(data)

    # Pearson's second coefficient of skewness
    skewness = 3 * (mean - median) / std_dev
    return skewness

skew_math = calculate_skewness(math_scores)
skew_reading = calculate_skewness(reading_scores)
skew_writing = calculate_skewness(writing_scores)

print(f'Skewness for Math Score: {skew_math}')

print(f'Skewness for Reading Score: {skew_reading}')
print(f'Skewness for Writing Score: {skew_writing}')

Skewness for Math Score: 0.01761737051555966

Skewness for Reading Score: -0.1708366195714668
Skewness for Writing Score: -0.18685734108808663
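
The aim also asks for at least one statistical test. One option, which is not part of the original program, is the Jarque-Bera test: it combines moment-based skewness and excess kurtosis into a single statistic. The sketch below is a minimal pure-Python version under stated assumptions: the function name jarque_bera and both samples are hypothetical, the samples are synthetic stand-ins for the score columns, and the p-value relies on the large-sample chi-square approximation with 2 degrees of freedom.

```python
import math
import random

def jarque_bera(values):
    """Return (skewness, excess kurtosis, JB statistic, asymptotic p-value)."""
    n = len(values)
    mean = sum(values) / n
    # Central moments with the population (divide-by-n) convention.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skew = m3 / m2 ** 1.5
    excess_kurt = m4 / m2 ** 2 - 3.0
    jb = n / 6.0 * (skew ** 2 + excess_kurt ** 2 / 4.0)
    # Under normality JB is asymptotically chi-square with 2 degrees of
    # freedom, whose survival function is exp(-x / 2).
    p_value = math.exp(-jb / 2.0)
    return skew, excess_kurt, jb, p_value

rng = random.Random(0)
# Hypothetical samples, NOT the assignment data.
normal_like = [rng.gauss(66, 15) for _ in range(1000)]
right_skewed = [rng.expovariate(1 / 15) for _ in range(1000)]

for name, sample in [("normal-like", normal_like), ("right-skewed", right_skewed)]:
    skew, kurt, jb, p = jarque_bera(sample)
    print(f"{name}: skew={skew:.3f}, excess kurtosis={kurt:.3f}, "
          f"JB={jb:.2f}, p={p:.4f}")
```

To apply it to the real data, the arrays from the program (for example math_scores) can be passed in directly. A small p-value suggests a departure from normality. SciPy also provides scipy.stats.jarque_bera and scipy.stats.shapiro as library alternatives.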

plt.figure(figsize=(12, 6))

plt.subplot(1, 3, 1)
sns.histplot(math_scores, kde=True, bins=20)
plt.title('Histogram of Math Score')

plt.subplot(1, 3, 2)
sns.histplot(reading_scores, kde=True, bins=20)
plt.title('Histogram of Reading Score')

plt.subplot(1, 3, 3)
sns.histplot(writing_scores, kde=True, bins=20)
plt.title('Histogram of Writing Score')

plt.show()
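
A normal Q-Q plot is another common graph method for judging normality. The sketch below only computes the points for such a plot, without drawing it; the function name qq_points, the plotting positions (i + 0.5) / n, and the ten sample scores are assumptions made for illustration and are not taken from the original program.

```python
from statistics import NormalDist, mean, pstdev

def qq_points(values):
    """Pair standard-normal quantiles with the standardized, sorted sample."""
    n = len(values)
    mu, sigma = mean(values), pstdev(values)
    ordered = sorted((v - mu) / sigma for v in values)
    std_normal = NormalDist()
    # Plotting positions (i + 0.5) / n keep every probability inside (0, 1).
    theoretical = [std_normal.inv_cdf((i + 0.5) / n) for i in range(n)]
    return list(zip(theoretical, ordered))

# Hypothetical scores used only to demonstrate the function.
sample = [47, 57, 69, 72, 72, 74, 76, 78, 90, 95]
for t, s in qq_points(sample):
    print(f"theoretical {t:+.2f}   sample {s:+.2f}")
```

Plotting the pairs as a scatter (theoretical on the x-axis, sample on the y-axis) gives the Q-Q plot; points lying close to the line y = x suggest an approximately normal distribution.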

from scipy.stats import spearmanr

spearman_corr, p_value = spearmanr(math_scores, reading_scores)

print(f"Spearman's Rank Correlation: {spearman_corr}")

print(f"P-value: {p_value}")

Spearman's Rank Correlation: 0.8040638885551747

P-value: 1.3538514946746025e-227

spearman_corr, p_value = spearmanr(reading_scores, writing_scores)

print(f"Spearman's Rank Correlation: {spearman_corr}")

print(f"P-value: {p_value}")

Spearman's Rank Correlation: 0.9489525187100921

P-value: 0.0
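
To make the computation behind spearmanr less opaque, the sketch below ranks each variable (tied values share the average rank) and then takes the Pearson correlation of the ranks. The helper names average_ranks, pearson, and spearman are introduced here for illustration, and the input is just the five rows shown by data.head(), so the result differs from the full-dataset value above.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of equal values starting at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# First five rows of the dataset, as shown by data.head().
math_head = [72, 69, 90, 47, 76]
reading_head = [72, 90, 95, 57, 78]
print(spearman(math_head, reading_head))
```

For these five rows the rank differences are 1, -2, 0, 0, 1, so the sum of squared differences is 6 and the coefficient is 1 - 6 * 6 / (5 * 24), about 0.7.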

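Conclusion: The scatter plot gives a visual view of the relationship between math and reading scores, and the
histograms show the shape of each distribution. For all three scores, about 66% to 70% of values lie within
one standard deviation of the mean and about 95% to 96% within two, close to the 68% and 95% expected under
a normal distribution, so the scores appear approximately normal. Pearson's second skewness coefficient is
near zero for math (0.018) and slightly negative for reading (-0.171) and writing (-0.187), indicating roughly
symmetric distributions with a mild left skew in reading and writing. Spearman’s Rank Correlation is 0.804
for math versus reading and 0.949 for reading versus writing, both with p-values that are effectively zero,
indicating strong positive monotonic relationships between the scores.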
