Statistical Analysis Data Treatment and Evaluation

The document discusses statistical analysis methods, focusing on confidence intervals, ANOVA, and detection of gross errors. It explains how to calculate confidence intervals for means using sample data, the principles of ANOVA for comparing population means, and the Q test for identifying outliers in datasets. Several examples illustrate the application of these statistical techniques in practical scenarios.

Uploaded by

leilashania loon

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Statistical Analysis Data Treatment and Evaluation

Uploaded by

leilashania loon

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Statistical Analysis Data Treatment and Evaluation

Confidence Interval

 In most quantitative chemical analyses, the true value of the mean, µ, cannot be
determined because a huge number of measurements (approaching infinity) would be
required.
 However, the interval surrounding the experimentally determined mean, x, can be
determined within which the population mean µ is expected to lie with a certain degree of
probability. This interval is known as the confidence interval. The limits of the interval
are called confidence limits.
 The probability that a result is outside the confidence interval is often called the
significance level.
 If we make a single measurement x from a distribution of known σ, we can say that the true
mean should lie in the interval x ± zσ with a probability dependent on z.
 However, we rarely estimate the true mean from a single measurement. Instead, we use
the experimental mean of N measurements as a better estimate of µ.

 If population standard deviation (σ) is unknown, the given is sample standard deviation (s)
and the sample (n) is more than 30. Use the following formula:

( )
Cl for μ= x̄ ± z
s
√n
Values of z at various confidence levels:

Example: (follow these solutions so we can have uniform answers, for the final answers use the unit of
the mean)
1. I randomly select 25 students’ Math SAT scores and find =600. I know that σ from this
population is 50. Find a 95% Confidence Interval and interpret.
1.96 ( 50 )
95 % Cl=600± =600 ±19.6
√ 25
Upper limit :
95 %Cl=600+ 19.6=619.6
Lower limit :
95 % Cl=600−19.6=580.4
580.4< μ <619.6 Math SAT scores
Interpretation:
Therefore, from the experimental mean, we conclude that there is a 95% chance that the μ lies
between the interval of 619.6 and 580.4 Math SAT scores.

2. You sample 12 bugs and find the sample mean is 2.40 cm. You are told that σ=0.2 cm. Find a
95% Confidence Interval and interpret.
1.96 ( 0.2 )
95 % Cl=2.40± =2.40 ±0.11 cm
√ 12
Upper limit :
95 %Cl=2.40+ 0.11=2.51cm
Lower limit :
95 % Cl=2.40−0.11=2.29 cm
2.29< μ< 2.51cm

Interpretation:
Therefore, from the experimental mean, we conclude that there is a 95% chance that the μ lies
between the interval of 2.29 and 2.51 cm.

3. The National Center for Education Statistics surveyed 4400 college graduates about the lengths
of time required to earn their bachelor’s degrees. The mean was 5.15 years and the standard
deviation was 1.68 years. Based on the above information, construct a 98% confidence interval for
the mean time required to earn a bachelor’s degree by all college students.
Given: n= 4400 x̄ = 5.15 s= 1.68 z value for 98% Cl= 2.33

( √sn )
Cl for μ= x̄ ± z

98 % Cl=5.15 ± 2.33
( √1.68
4400 )
=5.15 ± 0.06

upper limit :
¿ 5.15+0.06=5.21
lower limit :
¿ 5.15−0.06=5.09
5.09< μ<5.21 years
Interpretation:
Therefore, we conclude that from the survey mean, there is a 98% chance that the µ lies between
the interval of 5.09 and 5.21 years.
Analysis of Variance (ANOVA)

 Analysis of Variance (ANOVA) is a method for testing the hypothesis that there is no
difference between two or more population means.
 The ANOVA technique enables us to perform the simultaneous test and as such is
considered to be an important tool of analysis in the hands of a researcher.
 The significance of the difference pf the means of the two samples can be judged through
either z-test or t-test.
 Z-test is applied to find out the degree of reliability of a statistics in case of large sample.
 Z-test is based on the normal probability distribution and is used for judging the
significance of several statistical measures, particularly means.
 T-test is used to test the null hypothesis that the population means of to groups are the same
 t-test with two samples is commonly used with small sample sizes, testing the difference
between the samples when the variance of two norma distributions are not known.
 T-test is also used for judging the significance of the coefficients of simple and partial
correlations.
ANOVA Concepts
In ANOVA procedures, difference in several population means is obtained by comparing the
variances. For comparing I population means m 1, m2,…mI, the null hypothesis H0 is of the form
H0= μ1 = μ2 = μ3 = …. = μI
The alternative hypothesis Ha: at least two of the mi’s are different.

Assumptions in ANOVA
 The experimental errors of the data are normally distributed.
 Equal variances between treatments (i.e. Homogeneity of variances)
 Independence of sample (i.e. each sample is randomly selected and independent
ANOVA Techniques:
One-way ANOVA
 Is the simplest type of ANOVA, in which only one source of variation, or factor is
investigated.
 It is an extension to three or more samples of the t-test procedure for use with two
independent samples.
Two-way ANOVA
 Is used when the data are classified on the basis of two factors. For example, the
agricultural output may be classified on the basis of different varieties of seeds and also on
the basis of different varieties of fertilizers used.
 A statistical test used to determine the effect of two normal predictor variables on a
continuous outcome variable.
 Two-way ANOVA test analyzes the effect of the independent variables on the expected
outcome long with the relationship to the outcome itself.
Detection of gross errors
 An outlier is a result that is quite different from the others in the data set.
 It is important to develop a criterion to decide whether to retain or reject the outlying data
point.
 The choice of criterion for the rejection of a suspected result has its perils. If the standard is
too strict so that it is quite difficult to reject a questionable result, there is a risk of retaining a
spurious value that has an inordinate effect on the mean.
 If we set a lenient limit and make the rejection of a result easy, we are likely to discard a
value that rightfully belongs in the set, thus introducing bias to the data.
 While there is no universal rule to settle the question of retention or rejection, the Q
test is generally acknowledged to be an appropriate method for making the decision.

Q-test
 The Q test is a simple, widely used statistical test for deciding whether a suspected result
should be retained or rejected.
 In this test, the absolute value of the difference between the questionable result x q and its
nearest neighbor xn is divided by the spread w of the entire set to give the quantity Q:

Example:
The analysis of a city drinking water for arsenic yielded vales of 5.60, 5.64, 5.70, 5.69, and 5.81 ppm. The
last value appears anomalous, should it be rejected at the 95% confidence level?

First, arrange the given values: 5.60, 5.64, 5.69, 5.70, 5.81 ppm
To solve for Q, calculate the absolute value of the difference between the questionable result (xq)(which is the
last value) and its nearest neighbor (xn) divided by the spread (w) (w= maximum value – minimum value) of
the entire set to give the quantity Q:
5.60, 5.64, 5.69, 5.70, 5.81 ppm
Nearest neighbor Questionable value

Questionable value ( x q )−Nearest neighbor ( X n )

Q=
Spread ( w )=Maximum value−minimum value

x q− X n
Q=
w
5.81−5.70 0.11
¿ = =0.52
5.81−5.60 0.21

Interpretation:
For 5 measurements, Qcrit at the 95% confidence level is 0.710. Because 0.52 < 0.710, we must retain the
outlier at the 95% confidence level.

Note: If Q is less than the Qcrit, retain the outlier.

If Q is greater than the Qcrit, reject the outlier.

CH 5 HP Testing
100% (1)
CH 5 HP Testing
29 pages
Video Games Market Research Project
No ratings yet
Video Games Market Research Project
16 pages
STAT
No ratings yet
STAT
40 pages
Lecture 3 PDF
100% (2)
Lecture 3 PDF
77 pages
Basic Statistics and Data Handling
No ratings yet
Basic Statistics and Data Handling
53 pages
CH7 - Statistical Data Treatment and Evaluation
No ratings yet
CH7 - Statistical Data Treatment and Evaluation
56 pages
Statistical Analysis Data Treatment and Evaluation
No ratings yet
Statistical Analysis Data Treatment and Evaluation
55 pages
Ch7 0922 2023
No ratings yet
Ch7 0922 2023
103 pages
IE-525-Chapter 2
No ratings yet
IE-525-Chapter 2
48 pages
Ac 07 PDF
No ratings yet
Ac 07 PDF
3 pages
Unit 5 Mba 1ST
No ratings yet
Unit 5 Mba 1ST
197 pages
Some Statistical Methods in Anachem
No ratings yet
Some Statistical Methods in Anachem
39 pages
Chapter 2-1
No ratings yet
Chapter 2-1
55 pages
Techniques of Annova_20241103_232802_0000
No ratings yet
Techniques of Annova_20241103_232802_0000
32 pages
Chapter 7
No ratings yet
Chapter 7
31 pages
T (Ea) For Two
No ratings yet
T (Ea) For Two
31 pages
Lecture 3 2014 Statistical Data Treatment and Evaluation
No ratings yet
Lecture 3 2014 Statistical Data Treatment and Evaluation
44 pages
Probability and Hypothesis Testing
No ratings yet
Probability and Hypothesis Testing
28 pages
Annova
0% (1)
Annova
19 pages
Mor 7 FR Icd 05-00-1t Test 2z Test 3analysis of Variance Anova 4regression
No ratings yet
Mor 7 FR Icd 05-00-1t Test 2z Test 3analysis of Variance Anova 4regression
24 pages
CHEM-205 Analytical Chemistry-I: Anova
No ratings yet
CHEM-205 Analytical Chemistry-I: Anova
20 pages
Week 11
No ratings yet
Week 11
22 pages
Basic Statistics: There Are Three Types of Error
No ratings yet
Basic Statistics: There Are Three Types of Error
21 pages
ONEWAYANOVA
No ratings yet
ONEWAYANOVA
40 pages
Things To Know PDF
No ratings yet
Things To Know PDF
56 pages
APP601S Chapter 4- Data Handling in Anal Chem
No ratings yet
APP601S Chapter 4- Data Handling in Anal Chem
42 pages
APP601S Chapter 4 - Data Handling in Analytical Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Analytical Chem
42 pages
SSCK 1203 Data Analysis 090214 Students 02
No ratings yet
SSCK 1203 Data Analysis 090214 Students 02
36 pages
One Way Analysis of Variance (ANOVA) : "Slide 43-45)
No ratings yet
One Way Analysis of Variance (ANOVA) : "Slide 43-45)
15 pages
Activity 10 + 11 - Anova
No ratings yet
Activity 10 + 11 - Anova
22 pages
Anova and Design of Experiments
No ratings yet
Anova and Design of Experiments
35 pages
Chapter 2 - Data Analysis II
No ratings yet
Chapter 2 - Data Analysis II
56 pages
Analyze
No ratings yet
Analyze
194 pages
SMuR Complete
No ratings yet
SMuR Complete
114 pages
4.1analysis of Varianc Eand Covariance
No ratings yet
4.1analysis of Varianc Eand Covariance
41 pages
Lab 4 .
No ratings yet
Lab 4 .
6 pages
Correlation Regression Hypo ANOVA
No ratings yet
Correlation Regression Hypo ANOVA
22 pages
Research Methods Course Work 2
No ratings yet
Research Methods Course Work 2
15 pages
Lecture7 Confidence
No ratings yet
Lecture7 Confidence
45 pages
C22 Inferential Statistics Dxb (3)
No ratings yet
C22 Inferential Statistics Dxb (3)
66 pages
Lecture 6-7 Significance Testing 30-08-2024
No ratings yet
Lecture 6-7 Significance Testing 30-08-2024
16 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Ie352l1 Labmanual
No ratings yet
Ie352l1 Labmanual
90 pages
5_Stat Lecture..
No ratings yet
5_Stat Lecture..
44 pages
Lecture 7
No ratings yet
Lecture 7
7 pages
Statistics For Business: Analysis of Variance
No ratings yet
Statistics For Business: Analysis of Variance
51 pages
AD3411 - 6 To11
No ratings yet
AD3411 - 6 To11
15 pages
Confidence Revised Online
No ratings yet
Confidence Revised Online
14 pages
Basic Concepts of One Way Analysis of Variance (ANOVA)
No ratings yet
Basic Concepts of One Way Analysis of Variance (ANOVA)
38 pages
Statistics (Autosaved)
No ratings yet
Statistics (Autosaved)
75 pages
Basic Concepts of One Way Analysis of Variance (ANOVA)
No ratings yet
Basic Concepts of One Way Analysis of Variance (ANOVA)
38 pages
Spss Tutorials: One-Way Anova
No ratings yet
Spss Tutorials: One-Way Anova
12 pages
Data Anlalysis
No ratings yet
Data Anlalysis
6 pages
Business Analytics & Machine Learning: Regression Analysis
No ratings yet
Business Analytics & Machine Learning: Regression Analysis
58 pages
Lecture 13 ANOVA
No ratings yet
Lecture 13 ANOVA
36 pages
Anova
No ratings yet
Anova
34 pages
One-Way ANOVA Two-Way ANOVA
No ratings yet
One-Way ANOVA Two-Way ANOVA
31 pages
Data Preparation & Analysis
No ratings yet
Data Preparation & Analysis
27 pages
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet
Foundations of Elementary Analysis
From Everand
Foundations of Elementary Analysis
Roshan Trivedi
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Linear Regression Analysis For Survey Data
No ratings yet
Linear Regression Analysis For Survey Data
28 pages
HIT-6 and EQ-5D-5L in Patients With Migraine Asses
No ratings yet
HIT-6 and EQ-5D-5L in Patients With Migraine Asses
12 pages
Lecture 5 - Interval Estimation
No ratings yet
Lecture 5 - Interval Estimation
76 pages
Unit No. 2- Research Design
No ratings yet
Unit No. 2- Research Design
15 pages
Group 3-Report Mgt555
No ratings yet
Group 3-Report Mgt555
19 pages
Measures of Positionsadas
No ratings yet
Measures of Positionsadas
11 pages
Perceptual Data: Average Score Each Brand Achieves On Each Attribute From Your Sample of Respondents
No ratings yet
Perceptual Data: Average Score Each Brand Achieves On Each Attribute From Your Sample of Respondents
34 pages
ECN3322 - Panel Data-1
No ratings yet
ECN3322 - Panel Data-1
56 pages
Exit Exam Stat and Proba 11
No ratings yet
Exit Exam Stat and Proba 11
2 pages
Stat Q4 Mod 3 Week3
No ratings yet
Stat Q4 Mod 3 Week3
25 pages
Simple-Linear-Regression-Model-3 24
No ratings yet
Simple-Linear-Regression-Model-3 24
87 pages
m1.9 Tutorial
No ratings yet
m1.9 Tutorial
12 pages
Unit 3 Ids Notes
No ratings yet
Unit 3 Ids Notes
31 pages
Quiz (Mean&var)
No ratings yet
Quiz (Mean&var)
2 pages
Pengaruh Rasio Keuangan Terhadap Harga Saham Pada Perusahaan PT. Indofood Sukses Makmur Yang Terdaftar Di Bursa Efek Indonesia
No ratings yet
Pengaruh Rasio Keuangan Terhadap Harga Saham Pada Perusahaan PT. Indofood Sukses Makmur Yang Terdaftar Di Bursa Efek Indonesia
8 pages
Stats Chapter 5
No ratings yet
Stats Chapter 5
10 pages
Random Forest Presentation
No ratings yet
Random Forest Presentation
37 pages
Unit 5.2 Testing Two Population Means
No ratings yet
Unit 5.2 Testing Two Population Means
24 pages
Chapter 8
No ratings yet
Chapter 8
30 pages
Lesson2 Shs
No ratings yet
Lesson2 Shs
4 pages
Data Management
No ratings yet
Data Management
7 pages
Mas202 - 2022
No ratings yet
Mas202 - 2022
53 pages
Hypothesis Testing 1
No ratings yet
Hypothesis Testing 1
61 pages
Last Minute Statistics Livestream
No ratings yet
Last Minute Statistics Livestream
10 pages
Week 5 Correlation Analysis
No ratings yet
Week 5 Correlation Analysis
14 pages
MTP 22 56 Questions 1716557591
No ratings yet
MTP 22 56 Questions 1716557591
19 pages
Metropolitan Research Inc. Case Study
No ratings yet
Metropolitan Research Inc. Case Study
6 pages
BUAN6359-Fall2024 HW3
No ratings yet
BUAN6359-Fall2024 HW3
3 pages

Statistical Analysis Data Treatment and Evaluation

Uploaded by

Statistical Analysis Data Treatment and Evaluation

Uploaded by

Statistical Analysis Data Treatment and Evaluation

Questionable value ( x q )−Nearest neighbor ( X n )

Note: If Q is less than the Qcrit, retain the outlier.

You might also like