
Unit I

The basic aim of statistical inference is to form a conclusion about a population parameter from a statistic computed on a sample drawn from that population.
A population is the complete set of observations about which an investigator
wishes to draw conclusions. A sample is a part of that population.
A population is defined in terms of observations rather than people.
A population is defined by the interest of the investigator.

Parametric Tests
A parametric test is a statistical test in which specific assumptions are made about the population parameters.
Parametric tests assume that the data follow a specific distribution, usually a normal distribution; that variances are homogeneous (equal across groups); and that the data are measured on an interval or ratio scale.
The observations must be independent: the inclusion or exclusion of any case in the sample should not unduly affect the results of the study.
The meaningfulness of the results of a parametric test depends on the validity of these assumptions.
Parametric tests are sensitive to outliers and non-normality, which can affect the validity of the results.
Parametric tests are useful because, when their assumptions hold, they are the most powerful tests of the significance of the computed sample statistics.
Examples: t-test (compares means between two groups), ANOVA (compares
means across multiple groups)
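As a sketch of the two examples above, the code below runs both tests in Python with SciPy; the group scores, sample sizes, and seed are made-up assumptions, not values from this unit:

```python
# Illustrative sketch (assumed data): two parametric tests with SciPy.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)                    # fixed seed for reproducibility
group_a = rng.normal(loc=50, scale=10, size=30)   # hypothetical scores
group_b = rng.normal(loc=55, scale=10, size=30)
group_c = rng.normal(loc=52, scale=10, size=30)

# t-test: compares the means of two groups
t_stat, p_t = stats.ttest_ind(group_a, group_b)
print(f"t = {t_stat:.3f}, p = {p_t:.4f}")

# ANOVA: compares means across multiple groups
f_stat, p_f = stats.f_oneway(group_a, group_b, group_c)
print(f"F = {f_stat:.3f}, p = {p_f:.4f}")
```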

Non-Parametric Tests
Non-parametric tests are distribution-free tests: statistical tests that are not based on a normal distribution of the data or on any other assumption about population parameters.
The distribution of the test statistic does not depend on the shape of the population distribution, and such tests can be applied to data measured on ordinal or nominal scales, as well as interval and ratio scales.
Results are interpreted in terms of ranks or medians.
Non-parametric tests are less sensitive to outliers and can be used for skewed distributions and small sample sizes.
The use of non-parametric tests is recommended in the following situations:
The sample size is quite small, as small as N = 5 or N = 6
Assumptions such as normality of the distribution of scores in the population are doubtful
The measurements are available only in the form of ordinal or nominal scales, or the data can be expressed in the form of ranks
Non-parametric tests typically make fewer assumptions about the data and may therefore be the more appropriate choice in such situations.
Examples: Spearman's rank correlation (measures the strength and direction
of the association between two ranked variables), chi-square test (tests the
association between categorical variables)
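A minimal sketch of the two examples, again with SciPy; the ranks and the contingency table are invented for illustration:

```python
# Illustrative sketch (assumed data): two non-parametric tests with SciPy.
import numpy as np
from scipy import stats

# Spearman's rank correlation between two sets of ranks
ranks_1 = [1, 2, 3, 4, 5, 6]
ranks_2 = [2, 1, 4, 3, 6, 5]
rho, p_rho = stats.spearmanr(ranks_1, ranks_2)
print(f"rho = {rho:.3f}, p = {p_rho:.4f}")

# Chi-square test of association on a 2x2 contingency table
table = np.array([[20, 10],
                  [15, 25]])
chi2, p_chi, dof, expected = stats.chi2_contingency(table)
print(f"chi2 = {chi2:.3f}, df = {dof}, p = {p_chi:.4f}")
```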

Z-test

Random Sampling
A random sample of a given population is a sample so drawn that each
possible sample of that size has an equal probability of being selected from
the population.
The method of selection, not the particular sample outcome, defines a
random sample.
There are two sampling plans that yield a random sample:
sampling with replacement- in which an element may appear more
than once in a sample
sampling without replacement- in which no element may appear more
than once
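The two plans can be sketched in a few lines of Python with NumPy; the ten-score population here is a made-up example:

```python
# Illustrative sketch (assumed population): the two sampling plans.
import numpy as np

rng = np.random.default_rng(42)
population = np.arange(1, 11)   # hypothetical population of 10 scores

# Sampling with replacement: an element may appear more than once
with_replacement = rng.choice(population, size=5, replace=True)

# Sampling without replacement: no element may appear more than once
without_replacement = rng.choice(population, size=5, replace=False)

print(with_replacement, without_replacement)
```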
A random sampling distribution of the mean is the relative frequency distribution of the mean (X-bar) obtained from all possible random samples of a given size that could be drawn from a given population.
Characteristics of the Random Sampling Distribution of the Mean
The expected value of the sample mean is the same as the mean of the population of scores from which the samples were drawn: 𝜇X̄ = 𝜇X.

The standard deviation of the random sampling distribution of the mean, called the standard error of the mean, depends on the standard deviation of the population, 𝜎X, and the sample size, n: 𝜎X̄ = 𝜎X / √n.

If the population of scores is normally distributed, the sampling distribution of the mean will also be normally distributed, regardless of sample size.

Central Limit Theorem


The central limit theorem states that the random sampling distribution of the mean tends toward a normal distribution irrespective of the shape of the population of observations sampled; the approximation to the normal distribution improves as sample size increases.

The central limit theorem also states that the sampling distribution will have
the following properties:
The mean of the sampling distribution will be equal to the mean of the population distribution: 𝜇X̄ = 𝜇X.
The variance of the sampling distribution will be equal to the variance of the population distribution divided by the sample size: 𝜎²X̄ = 𝜎²X / n.


The central limit theorem guarantees that the sampling distribution of the mean will be approximately normal under the following conditions:
The sample size is sufficiently large. This condition is usually met if the
sample size is n ≥ 30.
The samples are independent and identically distributed random
variables. This condition is usually met if the sampling is random.
The population’s distribution has finite variance. Central limit theorem
doesn’t apply to distributions with infinite variance, such as the Cauchy
distribution.
Practical applications of the central limit theorem:
Quality control: Monitoring manufacturing processes.
Economics: Analyzing average income, expenditure, etc.
Finance: Modeling stock returns and risks.
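The theorem is easy to check by simulation. The sketch below, under the assumption of an exponential (strongly skewed) population with mean 1 and variance 1, shows the mean and variance of X-bar matching 𝜇X and 𝜎²X / n as the sample size grows:

```python
# Illustrative simulation of the central limit theorem (assumed population).
import numpy as np

rng = np.random.default_rng(0)
pop_mean, pop_var = 1.0, 1.0   # exponential(scale=1) has mean 1, variance 1

for n in (2, 5, 30):
    # 10,000 sample means, each computed from a sample of size n
    means = rng.exponential(scale=1.0, size=(10_000, n)).mean(axis=1)
    print(f"n={n:2d}: mean of X-bar = {means.mean():.3f} (expect {pop_mean}), "
          f"var of X-bar = {means.var():.3f} (expect {pop_var / n:.3f})")
```

As n increases, a histogram of these sample means also looks increasingly normal, despite the skewed population.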

Testing Hypothesis
A hypothesis is a statement about a population parameter that is subjected to test and, on the outcome of the test, retained or rejected.
The key to any problem in statistical inference is to discover what sample
values will occur by chance in repeated sampling and with what probability.
Sampling Distribution: a theoretical relative frequency distribution of the
values of a statistic that would be obtained by chance from an infinite
number of samples of a particular size drawn from a given population.
Probability Samples: samples for which the probability of inclusion in the
sample of each element in the population is known.
The goal of hypothesis testing is to make inferences about a population
based on a sample.

Null Hypothesis
The hypothesis that a researcher tests is called the null hypothesis, symbolized Ho. It is the hypothesis that he or she will decide to retain or reject.
The null hypothesis is simply whatever hypothesis we choose to test.
The null hypothesis is a statement that expects no difference or effect.
Level of Significance

In Statistics, “significance” means “not by chance” or “probably true”.


The level of significance is the probability value used as a criterion to decide that an obtained sample statistic (X-bar) has a low probability of occurring by chance if the null hypothesis is true (resulting in rejection of the null hypothesis).
It determines whether the null hypothesis is rejected or retained.
𝜶 (alpha) is the symbol for the level of significance.
The level of significance is generally chosen as 0.01 or 0.05

Conclusion

Region of Rejection- the portion of the sampling distribution of the mean (consisting of values of the mean that are unlikely to have occurred by chance if Ho is true) that leads to rejection of Ho.
Region of Retention- the portion of the sampling distribution of the mean that leads to retention of Ho.
Critical value(s)- the value(s) that separate the region of rejection from the region of retention.
To determine the position of the obtained X-bar, it must be expressed as a z score:
z = (X̄ − 𝜇) / 𝜎X̄, where X̄ is the obtained sample mean, 𝜇 is the population mean stated in Ho, and 𝜎X̄ = 𝜎X / √n
Rejecting the null hypothesis- the obtained sample statistic (X-bar) has a
low probability of occurring by chance if the value of the population
parameter stated in Ho is true
Retaining the null hypothesis- we do not have sufficient evidence to reject
Ho
Large samples increase the precision by reducing sampling variation.

Alternative Hypothesis
Alternative hypothesis- a hypothesis about a population parameter that
contradicts the null hypothesis
Ha is the symbol for the alternative hypothesis.
The alternative hypothesis is one that expects some difference or effect.
The alternative hypothesis may be directional or non-directional
The time to decide on the nature of the alternative hypothesis is at the
beginning of the study, before the data are collected.

One-Tailed (Directional) Hypothesis
The alternative hypothesis states that the population parameter differs from the value stated in Ho in one particular direction (and the critical region is located in only one tail of the sampling distribution).
A directional alternative hypothesis is appropriate only when there is no
practical difference in meaning between retaining the null hypothesis and
concluding that a difference exists in a direction opposite to that stated in
the directional alternative hypothesis.

Two-Tailed (Non-Directional) Hypothesis
The alternative hypothesis states that the population parameter may be either less than or greater than the value stated in Ho (and the critical region is divided between both tails of the sampling distribution).
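The difference between the two alternatives shows up in the critical values. A brief sketch, assuming a z statistic and 𝜶 = 0.05:

```python
# Illustrative sketch: one-tailed vs two-tailed critical values at alpha = 0.05.
from scipy import stats

alpha = 0.05
z_one = stats.norm.ppf(1 - alpha)       # one-tailed (upper): all of alpha in one tail
z_two = stats.norm.ppf(1 - alpha / 2)   # two-tailed: alpha split between both tails

print(f"one-tailed critical z = {z_one:.3f}")    # about 1.645
print(f"two-tailed critical z = ±{z_two:.3f}")   # about 1.960
```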

Assumptions of Z-test
A random sample has been drawn from the population. This ensures that
each member of the population has an equal chance of being included in the
sample, which helps in making the sample representative of the population.
The sample has been drawn by the with-replacement sampling plan.
The sampling distribution of the mean follows the normal curve. When the scores in the population are not normally distributed, the central limit theorem comes to the rescue, provided the sample size is 30 or larger.
The standard deviation of the population of scores is known.

The observations in the sample must be independent of each other. This means that the value of one observation should not influence the value of another observation.
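Under these assumptions, a one-sample z-test is a few lines of arithmetic. The numbers below (the mean under Ho, 𝜎X, X-bar, and n) are hypothetical:

```python
# Illustrative sketch (assumed numbers): one-sample z-test.
import math
from scipy import stats

mu_0 = 100       # population mean stated in Ho
sigma = 15       # known population standard deviation
x_bar = 104.5    # obtained sample mean
n = 36           # sample size

se = sigma / math.sqrt(n)    # standard error of the mean
z = (x_bar - mu_0) / se      # z = (X-bar - mu) / (sigma / sqrt(n))
p_two_tailed = 2 * stats.norm.sf(abs(z))

print(f"z = {z:.3f}, two-tailed p = {p_two_tailed:.4f}")
```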
t-test for single mean (Deviational Formula)
Unbiased Estimator- the mean of the estimates made from all possible samples of the same size equals the value of the parameter estimated (X-bar is an unbiased estimator of 𝜇X; sX is not an unbiased estimator of 𝜎X).
Estimated Standard Error of the Mean- an estimate of the standard deviation of the random sampling distribution of means: sX̄ = s / √n, where s is computed from the deviations, s = √( Σ(X − X̄)² / (n − 1) ).
t-test (Raw Score Method)
The same statistic, t = (X̄ − 𝜇) / sX̄, can be computed directly from raw scores by using s = √( (ΣX² − (ΣX)² / n) / (n − 1) ).
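A brief sketch of the single-mean t-test with hypothetical scores, computed first from the formula above and then checked against SciPy's ttest_1samp:

```python
# Illustrative sketch (assumed scores): t-test for a single mean.
import math
import numpy as np
from scipy import stats

scores = np.array([12, 15, 11, 14, 13, 16, 12, 15])  # hypothetical sample
mu_0 = 12                                             # value stated in Ho

n = scores.size
x_bar = scores.mean()
s = scores.std(ddof=1)       # sample SD with n - 1 in the denominator
se = s / math.sqrt(n)        # estimated standard error of the mean
t = (x_bar - mu_0) / se

t_check, p_check = stats.ttest_1samp(scores, mu_0)
print(f"by hand: t = {t:.3f}; scipy: t = {t_check:.3f}, p = {p_check:.4f}")
```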

Student's Distribution of t

Because the denominator contains sX̄, which is itself a variable that changes from sample to sample, this statistic does not follow the normal distribution.
The British statistician William S. Gosset, writing under the pen name "Student", presented the proper distribution for it, which has been referred to as Student's distribution of t.
Student’s distribution of t- a theoretical relative frequency distribution of all
the values of Means converted to t that would be obtained by chance from
an infinite number of samples of a particular size drawn from a given
population

Characteristics
Student’s distribution of t is not a single distribution, but rather a family of
distributions. They differ in their degree of approximation to the normal
curve
The mean of the t-distribution is zero.
t-distributions are symmetrical.
t-distributions are unimodal.
The t-distribution is platykurtic compared to the normal distribution (i.e., it is flatter at the peak and has a greater concentration in the tails than does a normal curve).
The shape of the t-distribution depends on the degrees of freedom, which
are typically related to the sample size, df = n - 1
As the degrees of freedom increase, the t-distribution approaches the
normal distribution. For df > 30, the t-distribution is very close to the normal
distribution.
The t-distribution has a standard deviation larger than that of the standard normal distribution (for which 𝜎Z = 1).
As the degrees of freedom increase, the standard deviation approaches 1,
which is the standard deviation of the standard normal distribution.
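This convergence is easy to see in the two-tailed 5% critical values, sketched below with SciPy:

```python
# Illustrative sketch: critical t values approach the normal value as df grows.
from scipy import stats

for df in (5, 10, 30, 120):
    print(f"df = {df:3d}: critical t = {stats.t.ppf(0.975, df):.3f}")
print(f"normal curve: critical z = {stats.norm.ppf(0.975):.3f}")   # 1.960
```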

Assumptions of t-test
The t-test is a statistical test used to compare a sample mean against a hypothesized value, or the means of two groups.

The data should be collected using a random sampling method. This ensures that the sample is representative of the population.
The data should be approximately normally distributed. This assumption is particularly important for small sample sizes (typically n < 30).
Independence: For independent two-sample t-tests, the samples should be
independent of each other.
The standard deviation of the population of scores is unknown.
Equal Variances: For independent two-sample t-tests, the variances of the
two populations should be equal (homogeneity of variances).
The scale of measurement applied to the data collected follows a
continuous or ordinal scale
Paired Samples: For paired-sample t-tests, the observations should be
paired and dependent.

Differences and Similarities between z and t
The z-test and t-test are both statistical methods used for hypothesis testing, particularly for comparing means between two groups or for testing the significance of a sample mean against a known or hypothesized population mean.
Similarities:

Both tests are used to assess whether there is a significant difference between the means of two groups or between a sample mean and a population mean.
Both tests are parametric, meaning they make assumptions about the
underlying distribution of the data to make inferences.
Both tests generate a test statistic that is used to determine the probability
(p-value) of observing the sample data if the null hypothesis is true.
Both tests assume that all data points are independent.

Differences:

In the t-test, the standard deviation of the population is unknown, whereas in the z-test the standard deviation of the population is known.
The t-test is based on Student's t-distribution; the z-test, in contrast, relies on the assumption that the distribution of sample means is normal.
The z-test is used when the sample size is large (n > 30), while the t-test is appropriate when the sample size is small (n < 30).

Degrees of Freedom
The degrees of freedom associated with s, the estimated standard deviation
of a population, corresponds to the number of observations that are
completely free to vary.
df = n − 1
The degrees of freedom of a test statistic determine the critical value of the hypothesis test.
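For example, under assumed values n = 15 and 𝜶 = 0.05 (two-tailed):

```python
# Illustrative sketch (assumed n and alpha): df determines the critical value.
from scipy import stats

n, alpha = 15, 0.05
df = n - 1
t_crit = stats.t.ppf(1 - alpha / 2, df)   # two-tailed critical value
print(f"df = {df}, critical t = ±{t_crit:.3f}")
```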

Statistical conclusion- a conclusion about the numerical property of the data (reject or retain Ho).
Research conclusion- a conclusion about the subject matter.
Levels of Significance versus p-values
Level of Significance
The level of significance, denoted by 𝜶 (alpha), is the threshold set by the researcher to determine when to reject the null hypothesis.
It represents the probability of committing a Type I error, which is rejecting the null hypothesis when it is actually true.
Typical values for 𝜶 are 0.05, 0.01, and 0.1.
For example, an 𝜶 of 0.05 indicates a 5% risk of concluding that a difference exists when there is no actual difference.
Before conducting a test, the researcher decides on the level of significance. If the p-value obtained from the test is less than or equal to 𝜶, the null hypothesis is rejected.

P-value
The p-value is the probability, when Ho is true, of observing a sample mean as deviant as or more deviant than (in the direction specified in Ha) the obtained value of the mean.
It measures the strength of evidence against the null hypothesis.
It is not established in advance and is not a statement of risk; it simply
describes the rarity of the sample outcome if Ho is true.
If the p-value is less than or equal to the level of significance, reject the null hypothesis and declare the findings statistically significant.
If the p-value is greater than 𝜶, do not reject the null hypothesis; there is not enough evidence to conclude that the effect is statistically significant.
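The decision rule is mechanical once 𝜶 is fixed in advance. A minimal sketch with a hypothetical sample and a one-sample t-test:

```python
# Illustrative sketch (assumed data): comparing a p-value to alpha.
from scipy import stats

alpha = 0.05                                        # chosen before the test
sample = [5.1, 4.9, 5.6, 5.2, 4.8, 5.4, 5.3, 5.0]   # hypothetical measurements
t_stat, p_value = stats.ttest_1samp(sample, popmean=5.0)

if p_value <= alpha:
    print(f"p = {p_value:.4f} <= {alpha}: reject Ho")
else:
    print(f"p = {p_value:.4f} > {alpha}: retain Ho")
```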
