2.1 Descriptive Statistics Contd
2.1 Descriptive Statistics Contd
2.2 Probability
7 Hypothesis Testing
8.1 Hypothesis Testing Contd..
11 Forecasting + ANOVA
12 Multivariate Analytics
13 PCA
Branches of statistics
Descriptive statistics
Inferential statistics
Terminologies
Variable
Types of variable
Data Sets
Data Measurement
Data Visualization
Pie charts and bar chart for a categorical data
Histograms for quantitative data
Shape of histograms
Center
Spread
Stem and Leaf Plot
Dotplot
Measures of Location/Central Tendency: Ungrouped Data
Mean
Median
Mode
Comparing the Mean and the Median
Quartiles
Percentile
Measures of variability: Ungrouped Data
Range
Inter-Quartile Range (IQR)
Using the IQR to Detect Outliers
Boxplot
Side by side Box plot
Variance
Standard Deviation
The Empirical Rule
Coefficient of variation
Measures of central tendency: Grouped Data
Mean
Median
Mode
Measures of variability: Grouped Data
Population Variance and Standard deviation
Sample Variance and Standard deviation
Relation between mean, median and mode
Probability
Probability as a Numerical Measure of the Likelihood of Occurrence
Sample Space
Event
Complement of an Event
Union of Two Events
Intersection of Two Events
Set Theory Example
Disjoint events
Non-disjoint events
Independent events
Dependent events
Basic properties of probability
Determining Probability
Classical method
Limitation of Classical definition
Random experiment
Relative frequency
Limitations of relative frequency
Subjective Probability
P(A or B)
P(A and B)
The mn Rule
Sampling from a Population with Replacement
Permutation
Combination: Sampling from a Population Without Replacement
Combination
Probability table
Conditional probability
Exercise: Marginal Probabilities
Marginal, Union, Joint, and Conditional Probabilities
Rule of total Probability
Bayes’ Theorem
Random variable
Types of random variable
Probability distribution
Probability distribution histogram
Valid Probability Model
Cumulative distribution function
Expected Value of X
Expected Value of a Function
Rules of Expected Value
Normal distribution
Standard Deviation Rule for Normal Random Variables
The Normal Distribution
Standard Normal Distribution
Non-standard Normal Distributions
Probabilities using z value
z - score
Approximate Binomial Distribution Problems
Continuity correction factor
Sampling
Reasons for Sampling
Random Versus Non-random Sampling
Random Sampling
Simple random sampling
Stratified Random Sampling
Systematic Random Sampling
Non-random Sampling
Convenience Sampling
Quota Sampling
Snowball Sampling
Sampling Error
Sampling Variation
Sampling Variation
Sampling Variability
Sampling Distribution
Sampling Distribution of x bar
Conclusions
Shapes of the Distributions of Sample Means
Central Limit Theorem
Z score for sample means
Linear Regression
Covariance
Variance Vs Covariance
Sampled variance
Problem with Covariance
Example of how covariance value relies on variance
Correlation
Questions a Pearson correlation answers
Assumptions
Pearson product-moment correlation coefficient
Correlation does not have units but Covariance always has units
Advantages of the Correlation Coefficient
Correlation Vs Covariance
Regression
Regression Analysis
Dependent and independent variable
Scatter Plot
Equation of regression line
Forecasting + ANOVA
Python example: Minimum Daily Temperatures Dataset
Load the dataset as a Pandas Series
Quick Check for Autocorrelation
Pearson correlation coefficient
Autocorrelation Plots
Autoregression Model
Predictions from fixed AR model
Moving Averages model
Theoretical Properties of a Time Series with an MA(1) Model
Theoretical Properties of a Time Series with an MA(2) Model
ACF for General MA(q) Models
Important conclusion for MA(q) Process
Box-Jenkins models
Autoregressive Moving Average (ARMA)
Identifying the Model
Estimating & diagnosing a possible model
What if more than one model looks OK !!
So how do we decide the values of p and q ?
Autoregressive Integrated Moving Average (ARIMA)
ANOVA
Design Of Experiments
Terms
THE COMPLETELY RANDOMIZED DESIGN
Need for ANOVA
One Way ANOVA
Hypothesis in ANOVA
Testing hypotheses
Total Sum of Squares of Variation
Assumptions
Comparison of F and t Values
Multivariate Analytics
Introduction
Joint Distribution
Joint PMF
Joint probability table
Properties of joint probability mass function
Joint PDF
Properties of joint probability distribution function
Joint cumulative distribution function
Properties of the joint cdf
Independence
Expected Value
Covariance
Correlation
Multivariate normal distribution
Univariate Normal Distribution
Multivariate normal distribution
PDF of the Bivariate Normal Distribution
Probability for Bivariate
Conditional distribution
Statistical Independence
The standard multivariate normal distribution
Expected value
Covariance matrix
The multivariate normal distribution in general
Multivariate Normal probabilities
Multivariate conditional distribution
Conditional Normal Multivariate
Statistical Independence
0.95
0.01
0.99 0.0095 0.0198 0.0293 0.32423208
0.02
1.56115108