0% found this document useful (0 votes)
13 views3 pages

Practice for Probability theory (2)

The document discusses various statistical measures of dispersion, including variance, standard deviation, Mean Absolute Deviation (MAD), skewness, kurtosis, covariance, and correlation. It includes calculations and interpretations of these measures for given datasets, as well as explanations of their significance in statistical modeling. The document also addresses the differences between MAD and standard deviation, the implications of skewness and kurtosis on data distribution, and the relationships between different variables through correlation analysis.

Uploaded by

h0ver251206
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views3 pages

Practice for Probability theory (2)

The document discusses various statistical measures of dispersion, including variance, standard deviation, Mean Absolute Deviation (MAD), skewness, kurtosis, covariance, and correlation. It includes calculations and interpretations of these measures for given datasets, as well as explanations of their significance in statistical modeling. The document also addresses the differences between MAD and standard deviation, the implications of skewness and kurtosis on data distribution, and the relationships between different variables through correlation analysis.

Uploaded by

h0ver251206
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Measures of Dispersion (MAD,Variance, Standard Deviation)

Dataset: {5, 10, 15, 18, 22, 25, 30, 35, 40, 45, 50}

1.​ Calculate the variance and standard deviation.


2.​ Why is MAD not commonly used in statistical modeling despite being easy to interpret?
3.​ What are the challenges associated with using variance as a measure of dispersion?
4.​ Explain the key differences between Mean Absolute Deviation (MAD) and Standard
Deviation (SD) in terms of their calculation and interpretation.Discuss the pros and cons
of using MAD over SD, and vice versa, in various statistical applications.
5.​ Under what circumstances would you prefer MAD over SD, and why?
6.​ Two datasets have the same mean (75), but one has a higher standard deviation. What
does this imply ?
7.​ A dataset has a variance of 9. If each value in the dataset is multiplied by 2, what
happens to the variance?
4.​ The exam scores of 12 students are {72, 85, 90, 88, 76, 95, 89, 84, 91, 78, 82, 94}.
Compute the variance and standard deviation, and interpret the spread of the scores.
5.​ A school has two classes with similar average test scores (75%), but one class has a
higher standard deviation.
○​ What does this suggest about student performance?


Skewness & Kurtosis
1.​ What do you understand by Skewness. How can you classify it further ?
2.​ Define kurtosis and how it helps analyze data distribution. Discuss various types of
kurtosis and their criteria of classification .
3.​ If a dataset has mean > median > mode, is it positively or negatively skewed?
4.​ What does it mean if a dataset has positive skewness?
5.​ A dataset has the following mean, median, and mode:

Mean = 80, Median = 75, Mode = 70 . Describe its skewness

6.​ A company tracks daily sales. Most days, sales are consistent, but a few days see
huge spikes (e.g., Black Friday). How would kurtosis help analyze this trend ?
7.​ The ages of 10 students in a class are as follows:​
40, 5, 12, 150 , 20, 12, 90, 8, 15, 12
8.​ Determine whether the distribution of ages is: Positively skewed , Negatively skewed, or
Symmetric.Justify your conclusion.

Covariance & Correlation


1.​ What is correlation? How does it differ from covariance?
2.​ Compare Pearson correlation (rrr) and Spearman correlation (ρ)—when should each
be used?
3.​ A data analyst is studying the relationship between hours studied (X) and exam scores
(Y) for 6 students. The dataset is as follows:

Hours Studied (X): {2, 3, 5, 7, 8, 10}

Exam Scores (Y): {50, 55, 65, 70, 85, 95}

●​ Compute the covariance between Hours Studied (X) and Exam Scores (Y).
●​ Interpret the result—does a positive covariance indicate a strong relationship between
study hours and exam scores?

4.​ A researcher is analyzing the relationship between study hours per week (X) and final
exam scores (Y) among a group of students. The dataset is as follows:

Study Hours (X): {5, 10, 15, 20, 25, 30}

Final Exam Scores (Y): {50, 60, 70, 75, 85, 90}

Determine the correlation(r) between study hours and final exam scores.

5.​ A business analyst is studying the relationship between monthly ice cream sales (X) and
average temperature (Y)°C in a beach town. The dataset for the entire year is given
below:

Calculate the correlation coefficient between Ice Cream Sales (X) and Average Temperature (Y) and
interpret the result.

6.​ A sports analyst studies player training hours and their performance rating:
●​ Training Hours (X): {10, 8, 6, 4, 2}
●​ Performance Rating (Y): {95, 85, 80, 70, 60}

Compute Spearman’s rank correlation coefficient (ρ) and interpret the strength of the relationship.

You might also like