Descriptive Analysis
Descriptive Analysis
Munmun Biswas
July 9, 2019
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 1 / 37
Introduction
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 2 / 37
Application of Statistics in Real World Problem
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 3 / 37
Outline:
Descritive Statistics
Data Type
Data Representation
Data Summery
Data Shape
Bivariate Data
Probability Distributions
Probability
Random Variable
Probability Distribution Function
Expectation and Variance
Moments
Binomial, Poisson
Normal Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 4 / 37
Descriptive Statistics
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 5 / 37
Descritive Statistics: Data Type
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 6 / 37
Quantitave data has numerical values. This can also be called
variables
Examples: Height, Weight, Number of defectives in a particular lot of
products, Number of accidents in a traffic signal per week
Variables can be discrete when it takes only discrete values in a
certain interval
Variables can be continuous when it may take all the values of an
interval
Qualitative (or Categorical) data do not have numerical values. It can
also be called attribute.
Examples: Blood groups, gender, exam grades, income groups
Categories which can be ordered can be classified as ordinal data, if
not they are called as nominal data.
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 7 / 37
Questions
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 8 / 37
Graphical Representation
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 9 / 37
Graphical Representation
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 9 / 37
Graphical Representation
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 9 / 37
Bar Diagrams
Bar Diagram: To represent Discrete variable, Categorical data
Examples:
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 10 / 37
Bar Diagrams
Bar Diagram: To represent Discrete variable, Categorical data
Examples:
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 10 / 37
Bar Diagrams
Bar Diagram: To represent Discrete variable, Categorical data
Examples:
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 10 / 37
Bar Diagrams
Bar Diagram: To represent Discrete variable, Categorical data
Examples:
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 10 / 37
Continuous data: Histogram
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 11 / 37
Continuous data: Histogram
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 11 / 37
Histogram
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 12 / 37
Frequency Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 13 / 37
Data Summary
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 14 / 37
Measures of Location
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 15 / 37
Spread Measures
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 17 / 37
Inter Quartile Range
The lower quartile Q1 is the value such that 1/4th of the observations
fall below it and 3/4ths fall above it.
The middle quartile Q2 is the median
The third quartile Q3 is the value such that 3/4ths of the
observations fall below it and 1/4th above it.
The Inter Quartile Range IQR is the difference between the third
quartile and he first quartile.
Thus IQR = Q3 − Q1
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 18 / 37
Box Plot
Box Plots are simple means of providing a useful picture of how the
data are distributed.
To draw Box Plot
Determine Q1 , Q3 and IQR
A line is drawn at the median to divide the box
Two lines, known as Whiskers are drawn outward from the box.
One line extends the top edge of the box at Q3 to either xmax or
Q3 + 1.5 × IQR whichever is lower. Another line from the bottom
edge of the box at Q1 extends downward to a value that is either the
xmin or Q1 –1.5 × IQR whichever is greater.
The end points of the whiskers are known as upper and lower adjacent
values
Values that fall outside the adjacent values are candidates for
consideration as outliers. They are plotted as bullets (◦).
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 19 / 37
Box Plot
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 20 / 37
Probability Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 21 / 37
Probability
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 22 / 37
Definition of Probability
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 23 / 37
Few Properties of Probability
P(φ) = 0
A ⊂ B ⇒ P(A) ≤ P(B)
0 ≤ P(A) ≤ 1
P(Ac ) = 1 − P(A)
A ∩ B = φ ⇒ P(A ∪ B) = P(A) + P(B)
For any two events A and B, P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
|A|
If Ω is finite and if each outcome is equally likely then P(A) = |Ω| ,
where |.| denotes cardinality of a set
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 24 / 37
Independent Events, Conditional Probabity
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 25 / 37
Random Variables
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 26 / 37
Probability Distribution of a RV
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 27 / 37
Probability Distribution of a RV
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 27 / 37
Probability Distribution of a RV
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 27 / 37
Probability Distribution of a RV
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 27 / 37
Probability Distribution of a RV
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 27 / 37
Expectation and Variance
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 28 / 37
Expectation and Variance
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 28 / 37
Expectation and Variance
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 28 / 37
Expectation and Variance
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 28 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution: Example
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 29 / 37
Binomial Distribution
X ∼ Binomial(n,
p), where n is an integer and 0 < p < 1 if
P(X = x) = xn p x (1 − p)n−x for x ∈ {0, 1, . . . , n}
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 30 / 37
Binomial Distribution
X ∼ Binomial(n,
p), where n is an integer and 0 < p < 1 if
P(X = x) = xn p x (1 − p)n−x for x ∈ {0, 1, . . . , n}
Check E(X ) = ni=0 x xn p x (1 − p)n−x = np
P
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 30 / 37
Binomial Distribution
X ∼ Binomial(n,
p), where n is an integer and 0 < p < 1 if
P(X = x) = xn p x (1 − p)n−x for x ∈ {0, 1, . . . , n}
Check E(X ) = ni=0 x xn p x (1 − p)n−x = np
P
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 30 / 37
Binomial Distribution
X ∼ Binomial(n,
p), where n is an integer and 0 < p < 1 if
P(X = x) = xn p x (1 − p)n−x for x ∈ {0, 1, . . . , n}
Check E(X ) = ni=0 x xn p x (1 − p)n−x = np
P
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 30 / 37
Binomial Distribution
X ∼ Binomial(n,
p), where n is an integer and 0 < p < 1 if
P(X = x) = xn p x (1 − p)n−x for x ∈ {0, 1, . . . , n}
Check E(X ) = ni=0 x xn p x (1 − p)n−x = np
P
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 30 / 37
Binomial Distribution
X ∼ Binomial(n,
p), where n is an integer and 0 < p < 1 if
P(X = x) = xn p x (1 − p)n−x for x ∈ {0, 1, . . . , n}
Check E(X ) = ni=0 x xn p x (1 − p)n−x = np
P
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 30 / 37
Poisson Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 31 / 37
Poisson Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 31 / 37
Poisson Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 31 / 37
Poisson Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 31 / 37
−λ λx
P∞
Check E(X ) = x=0 xe x! =λ
V(X ) = E(X − λ)2 = λ
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 32 / 37
Normal Distribution
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 33 / 37
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 34 / 37
Standard Normal Distribution
X −µ
If X ∼ N(µ, σ 2 ), then Z = σ ∼ N(0, 1). Z is then called the Standard
Normal Variable
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 35 / 37
Central Limit Theorem
√
X̄n − µ n(X̄n − µ)
Zn = p = Z ∼ N(0, 1)
V(X̄n ) σ
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 36 / 37
Thank You
M.Biswas (BKC College) Descriptive Statistics and Probability Distributions July 9, 2019 37 / 37