0% found this document useful (0 votes)

42 views48 pages

Statistics - Lying Without Sinning?: - "Lies, Damned Lies, and Statistics"

The document discusses a statistic reported by an activist against drunk driving regarding the number of beer bottles and cans found along roadsides in South Dakota. It notes that according to the activist's estimates, each South Dakota resident would have to throw over 70 beer bottles or cans onto the road each year for the statistic to be accurate. However, the document questions the methodology and math used to arrive at this figure. It suggests the statistic may be an exaggeration and notes the large number of bottles and cans it implies each resident is littering annually seems unlikely.

Uploaded by

αγαπημένη του Χριστού

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

42 views48 pages

Statistics - Lying Without Sinning?: - "Lies, Damned Lies, and Statistics"

Uploaded by

αγαπημένη του Χριστού

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 48

Statistics - Lying without sinning?

• "Lies, damned lies, and statistics"

1954
Statistics - Lying without sinning?
In North Dakota, 54 Million Beer Bottles by the side of the Road
April 01 2002

South Dakota's Pierre Capital Journal reports (Mar. 1) that "an average of 650
beer cans and bottles are tossed per mile of road annually." The statistic is
attributed to Dennis W. Brezina, an activist against drunk-driving.

But how did he come up with his data? According to the Journal, Brezina traveled
"highways across the nation to determine whether the problem he perceived was For more
widespread. He made two trips to South Dakota, one in 1998 and another in Check out
2000." He counted "cans and bottles in ditches in May of both years" and claimed
to have found an average of "one beer can or bottle every 16 feet when walking www.STATS.org
randomly selected stretches of ditch."

But the math appears a little blurry. The web site of the South Dakota Department
of Transportation claims that the state "has 83,472 miles of highways, roads and
streets." Assuming Brezina's estimate is correct, South Dakotans appear to be
world-class litterbugs, tossing aside approximately 54,256,800 bottles or cans
every year. According to the Census Bureau there are 754,844 people in South
Dakota. So, according to Brezina, the average resident throws at least 71 beer
bottles or cans on the side of the road every year.
Statistics for Quantitative Analysis

• Statistics: Set of mathematical tools used to describe

and make judgments about data
• Type of statistics we will talk about in this class has
important assumption associated with it:

Experimental variation in the population from which samples

are drawn has a normal (Gaussian, bell-shaped) distribution.
Normal distribution
• Infinite members of group:
population
• Characterize population by taking
samples
• The larger the number of samples,
the closer the distribution becomes to
normal
• Equation of normal distribution:
1  ( x   ) 2 / 2 2
y e
 2
Normal distribution

• Estimate of mean value

of population = 
• Estimate of mean value
of samples = x

x i
Mean = x  i

n
Normal distribution
• Degree of scatter (measure of central tendency)
of population is quantified by calculating the
standard deviation

• Std. dev. of population = 

• Std. dev. of sample = s

 ( xi  x ) 2
s i

n 1
• Characterize sample by calculating xs
Standard deviation and the
normal distribution
• Standard deviation defines
the shape of the normal
distribution (particularly
width)

• Larger std. dev. more

scatter about the mean,
worse precision.

• Smaller std. dev. means

less scatter about the
mean, better precision.
Standard deviation and the
normal distribution
• There is a well-defined relationship between the std. dev. of a population
and the normal distribution of the population.

• (May also consider these percentages of area under the curve)

Amount of Data

Standard deviations
68 %
95 %
99.7 %
Total % of the data covered by distribution
Example of mean and standard
deviation calculation

Consider Cu data: 5.23, 5.79, 6.21, 5.88, 6.02 nM

x = 5.826 nM  5.82 nM

s = 0.368 nM  0.36 nM

Answer: 5.82 ± 0.36 nM or 5.8 ± 0.4 nM

Learn how to use the statistical functions on your
calculator. Do this example by longhand calculation
once, and also by calculator to verify that you’ll get
exactly the same answer. Then use your calculator for
all future calculations.
Learn to use your calculator’s statistical functions to
calculate
mean and standard deviation. You’ll save yourself a lot of
work.

http://www.willamette.edu/~mjaneba/help/TI-85-stats.htm

http://www2.ohlone.edu/people2/joconnell/ti/
Relative standard deviation (rsd)
or coefficient of variation (CV)

s
rsd or CV =  100
x

From previous example,

rsd = (0.36 nM/5.82 nM) 100 = 6.1% or 6%

Standard error
• Tells us that standard deviation of set of samples should decrease if we take more
measurements

• Standard error =
s
sx 
•
n
Take twice as many measurements, s decreases by

• Take 4x as many measurements, s decreases by

2  1.4
• There are several quantitative ways to determine the sample size required to achieve a
4 2
desired precision for various statistical applications. Can consult statistics textbooks for further
information; e.g. J.H. Zar, Biostatistical Analysis
Variance

Used in many other statistical calculations and tests

Variance = s2

From previous example, s = 0.36

s2 = (0.36)2 = 0. 129 (not rounded because it is usually
used in further calculations)
Average deviation
• Another way to express
degree of scatter or
uncertainty in data. Not as ( x  x )
i
statistically meaningful as d i

standard deviation, but n

useful for small samples.

Using previous data:

5.23  5.82  5.79  5.82  6.21 5.82  5.88  5.82  6.02  5.82
d
5
d  0.25  0.25 or 0.2 nM

Answer : 5.82  0.25 nM or 5.8  0.2 nM

Relative average deviation (RAD)
d 
RAD   100 (as percentage )
x
d 
RAD   1000 (as parts per thousand , ppt )
x
Using previous data,

RAD = (0. 25/5.82) 100 = 4.2 or 4%

RAD = (0. 25/5.82) 1000 = 42 ppt

 4.2 x 101 or 4 x 101 ppt (0/00)
Some useful statistical tests

• To characterize or make judgments about data

• Tests that use the Student’s t distribution
– Confidence intervals
– Comparing a measured result with a “known” value
– Comparing replicate measurements (comparison of
means of two sets of data)
From D.C. Harris (2003) Quantitative Chemical Analysis, 6th Ed.
Confidence intervals

• Quantifies how far the true mean () lies from the
measured mean, x. Uses the mean and standard
deviation of the sample.

ts
x
n
where t is from the t-table and n = number of
measurements.
Degrees of freedom (df) = n - 1 for the CI.
Example of calculating a
confidence interval
Consider measurement of dissolved Ti
in a standard seawater (NASS-3):
Data: 1.34, 1.15, 1.28, 1.18, 1.33,
1.65, 1.48 nM
DF = n – 1 = 7 – 1 = 6
x = 1.34 nM or 1.3 nM ts
s = 0.17 or 0.2 nM x
95% confidence interval n
t(df=6,95%) = 2.447
CI95 = 1.3 ± 0.16 or 1.3 ± 0.2 nM
50% confidence interval
t(df=6,50%) = 0.718
CI50 = 1.3 ± 0.05 nM
Interpreting the confidence interval
• For a 95% CI, there is a 95% probability that the true
mean () lies between the range 1.3 ± 0.2 nM, or
between 1.1 and 1.5 nM

• For a 50% CI, there is a 50% probability that the true

mean lies between the range 1.3 ± 0.05 nM, or between
1.25 and 1.35 nM

• Note that CI will decrease as n is increased

• Useful for characterizing data that are regularly obtained;

e.g., quality assurance, quality control
Nitrate Concentrations (g/mL)
Trial 1 Trial 2 Trial 3 Trial 4 Trial 5 Trial 6 Trial 7 Trial 8 Trial 9 Trial 10
0.51 0.51 0.51 0.5 0.51 0.49 0.52 0.53 0.5 0.47
0.51 0.52 0.53 0.48 0.49 0.5 0.52 0.49 0.49 0.5
0.49 0.48 0.46 0.49 0.49 0.48 0.49 0.49 0.51 0.47
0.51 0.51 0.51 0.48 0.5 0.47 0.5 0.51 0.49 0.48
0.51 0.5 0.5 0.53 0.52 0.52 0.5 0.5 0.51 0.51
0.506 0.504 0.502 0.496 0.502 0.492 0.506 0.504 0.5 0.486 mean

average 0.4998
mg/mL frequency
stdev 0.01647
0.53 3
0.52 5
0.51 13
0.5 10
0.49 10 Let’s Graph the Data!
0.48 5
0.47 3
0.46 1
nitrate concentration

14
outlier
12
10
frequency

8
6 ± 1

4
± 2
2
0
0.44 0.46 0.48 0.5 0.52 0.54
 g/mL
Confidence Interval Exercise
s
x    t  sm    t 
n
Calculate the 95, 98 and 99 % confidence intervals

For the nitrate concentration data

95 % 0.500 ± 0.005
98 % 0.500 ± 0.006
99 % 0.500 ± 0.006
50 % 0.500 ± 0.002
0.500 ± 0.006 0.500± 0.006

0.500 ± 0.005 0.500 ± 0.002

Testing a Hypothesis (Significance Tests)

Carry out measurements on an accurately known standard.

Experimental value is different from the true value.

Is the difference due to a systematic error (bias) in the method - or simply to random error?

Assume that there is no bias

(NULL HYPOTHESIS),
and calculate the probability
that the experimental error
is due to random errors.

Figure shows (A) the curve for

the true value (A = t) and
(B) the experimental curve (B)
Comparing a measured result
with a “known” value

• “Known” value would typically be a certified value

from a standard reference material (SRM)
• Another application of the t statistic

known value  x
t calc  n
s
Will compare tcalc to tabulated value of t at appropriate
df and CL.

df = n -1 for this test

Comparing a measured result
with a “known” value--example
Dissolved Fe analysis verified using NASS-3 seawater SRM
Certified value = 5.85 nM
Experimental results: 5.76 ± 0.17 nM (n = 10)

known value  x 5.85  5.7 6

tcalc  n  10  1.674
s 0.17
(Keep 3 decimal places for comparison to table.)
Compare to ttable; df = 10 - 1 = 9, 95% CL
ttable(df=9,95% CL) = 2.262

If |tcalc| < ttable, results are not significantly different at the 95% CL.
If |tcalc|  ttable, results are significantly different at the 95% CL.
For this example, tcalc < ttest, so experimental results are not significantly
different at the 95% CL. THE NULL HYPOTHESIS IS MAINTAINED and no BIAS
at the 95 % confidence level.
Comparing replicate measurements or
comparing means of two sets of data
• Another application of the t statistic
• Example: Given the same sample analyzed by two
different methods, do the two methods give the “same”
result?
x1  x 2 n1 n2
t calc 
s pooled n1  n2

s12 (n1 1)  s 22 (n2 1)

s pooled 
n1  n2  2
Will compare tcalc to tabulated value of t at appropriate df
and CL.
df = n1 + n2 – 2 for this test
Comparing replicate measurements or
comparing means of two sets of data—
example
Ewww!

Determination of nickel in sewage sludge

using two different methods
Method 1: Atomic absorption Method 2: Spectrophotometry
spectroscopy
Data: 3.91, 4.02, 3.86, 3.99 mg/g Data: 3.52, 3.77, 3.49, 3.59 mg/g

x1 = 3.945 mg/g x2 = 3.59 mg/g

s1 = 0.073 mg/g = 0.12 mg/g

s2
n1 =4 n2 =4
Comparing replicate measurements or
comparing means of two sets of data—example

s12 (n1 1)  s22 (n2 1) (0.07 3 ) 2 (4 1)  (0.12 ) 2 (4 1)
s pooled    0.0993
n1  n2  2 442

x1  x2 n1 n2 3.945  3.59 (4)(4)

tcalc    5.056
s pooled n1  n2 0.0993 44

Note: Keep 3 decimal places to compare to ttable.

Compare to ttable at df = 4 + 4 – 2 = 6 and 95% CL.

ttable(df=6,95% CL) = 2.447

If |tcalc|  ttable, results are not significantly different at the 95%. CL.
If |tcalc|  ttable, results are significantly different at the 95% CL.

Since |tcalc| (5.056)  ttable (2.447), results from the two methods are
significantly different at the 95% CL.
Comparing replicate measurements or
comparing means of two sets of data

Wait a minute! There is an important assumption

associated with this t-test:

It is assumed that the standard deviations (i.e., the

precision) of the two sets of data being compared are not
significantly different.

• How do you test to see if the two std. devs. are

different?

• How do you compare two sets of data whose std. devs.

are significantly different?
t-tests and the Law

Clearly, the meanings of 1.083 ± 0.007 and 1.0 ± 0.4 are very different. As a
person who will either derive or use analytical results, you should be aware of
this warning published in a report entitled “Principles of Environmental Analysis”:

Analytical chemists must always emphasize to the public that the single most
important characteristic of any result obtained from one or more analytical
measurements is an adequate statement of its uncertainty interval. Lawyers
usually attempt to dispense with uncertainty and try to obtain unequivocal
statements: therefore, an uncertainty interval must be defined in cases
involving litigation and or enforcement proceedings. Otherwise, a value of
1.001 without a specified uncertainty, for example may be views as legally
exceeding a permissible level of 1.

L. K. Keith, W. Crummett, J. Deegan Jr., R. A. Libby, J. K. Taylor, and G. Wentler,
Analytical Chemistry, 55, 2210 (1983).
F-test to compare standard deviations

• Used to determine if std. devs. are significantly

different before application of t-test to compare
replicate measurements or compare means of two
sets of data

• Also used as a simple general test to compare the

precision (as measured by the std. devs.) of two sets
of data

• Uses F distribution
F-test to compare standard deviations

Will compute Fcalc and compare to Ftable.

s12
Fcalc  where s1  s2
s22

DF = n1 - 1 and n2 - 1 for this test.

Choose confidence level (95% is a typical CL).

From D.C. Harris (2003) Quantitative Chemical Analysis, 6th Ed .
F-test to compare standard deviations
From previous example:
Let s1 = 0.12 and s2 = 0.073

s12 (0.12 ) 2
Fcalc    2.70
s22 (0.07 3 ) 2
Note: Keep 2 or 3 decimal places to compare with F table.

Compare Fcalc to Ftable at df = (n1 -1, n2 -1) = 3,3 and 95% CL.
If Fcalc  Ftable, std. devs. are not significantly different at 95% CL.
If Fcalc  Ftable, std. devs. are significantly different at 95% CL.
Ftable(df=3,3;95% CL) = 9.28
Since Fcalc (2.70) < Ftable (9.28), std. devs. of the two sets of data are
not significantly different at the 95% CL. (Precisions are similar.)
Comparing replicate measurements or
comparing means of two sets of data-
revisited

The use of the t-test for comparing means was

justified for the previous example because we
showed that standard deviations of the two sets of
data were not significantly different.

If the F-test shows that std. devs. of two sets of data

are significantly different and you need to compare
the means, use a different version of the t-test 
Comparing replicate measurements or
comparing means from two sets of data when
std. devs. are significantly different

x1  x2
tcalc 
s12 / n1  s22 / n2

 
 
 ( s1 / n1  s2 / n2 )
2 2 2

DF   2   2

 1 1( s / n ) 2
( s 2
/ n ) 2

 2 2

  n1  1 n2  1  
Flowchart for comparing means of two
sets of data or replicate measurements
Use F-test to see if std.
devs. of the 2 sets of
data are significantly
different or not

Std. devs. are Std. devs. are not

significantly different significantly different

Use the 2nd version Use the 1st version of the

of the t-test () t-test (see previous, fully
worked-out example)
One last comment on the F-test

Note that the F-test can be used to simply test whether

or not two sets of data have statistically similar
precisions or not.

Can use to answer a question such as: Do method one

and method two provide similar precisions for the
analysis of the same analyte?
Statistics in the News
Outliers Disrupt the Mean
January 01 1999

In 1984, according to Larry Gonick and Woollcott Smith, the University of

Virginia announced that the mean starting salary of its graduates from the
Department of Rhetoric and Communications was a very hefty $55,000 per
year. But before you abandon your computer science training for speech
classes, you should know that the graduating class contained a significant
"outlier," or extreme data point not typical of the rest of the data set - Ralph
Sampson, future NBA All-Star, who majored in speech. It would have been
better to learn the median salary, the data point in the middle of the set.
Evaluating questionable data points
using the Q-test
• Need a way to test questionable data points (outliers) in an
unbiased way.
• Q-test is a common method to do this.
• Requires 4 or more data points to apply.

Calculate Qcalc and compare to Qtable

Qcalc = gap/range

Gap = (difference between questionable data pt. and its

nearest neighbor)

Range = (largest data point – smallest data point)

Evaluating questionable data points
using the Q-test--example
Consider set of data; Cu values in sewage sample:
9.52, 10.7, 13.1, 9.71, 10.3, 9.99 mg/L

Arrange data in increasing or decreasing order:

9.52, 9.71, 9.99, 10.3, 10.7, 13.1

The questionable data point (outlier) is 13.1

gap (13.1  10.7)
Calculate Qcalc    0.670
range (13.1  9.52)
Compare Qcalc to Qtable for n observations and desired CL (90% or
95% is typical). It is desirable to keep 2-3 decimal places in
Qcalc so judgment from table can be made.

Qtable (n=6,90% CL) = 0.56

From G.D. Christian (1994) Analytical Chemistry, 5th Ed.
Evaluating questionable data points
using the Q-test--example
If Qcalc < Qtable, do not reject questionable data point at stated CL.

If Qcalc  Qtable, reject questionable data point at stated CL.

From previous example,

Qcalc (0.670) > Qtable (0.56), so reject data point at 90% CL.

Subsequent calculations (e.g., mean and standard deviation)

should then exclude the rejected point.

Mean and std. dev. of remaining data: 10.04  0.47 mg/L

Q or G outlier test?
G (95 % confidence) Number of Observations
1.463 4
1.672 5

questionable _ value  x
1.822 6
1.938 7

G calc  2.032 8

s
2.11 9
2.176 10
2.234 11
2.285 12
2.409 15
2.557 20
reject if Gcalc > G table

Q (90 % confidence) Number of Observations

gap 0.76 4
Q calc  0.64 5

range 0.56
0.51
6
7
0.47 8
0.44 9
0.41 10

reject if Qcalc > Q table

No. of observations 90% 95% 99% confidencelevel

3 0.941 0.970 0.994

4 0.765 0.829 0.926
5 0.642 0.710 0.821
6 0.560 0.625 0.740
7 0.507 0.568 0.680
8 0.468 0.526 0.634
9 0.437 0.493 0.598
10 0.412 0.466 0.568
Rejection of outlier recommended if Qcalc> Qtable for the desired confidence level.

Note:1. The higher the confidence level, the less likely is

rejection to be recommended.
2. Rejection of outliers can have a marked effect on mean
and standard deviation, esp. when there are only a few
data points. Always try to obtain more data.
The following values were obtained for
Q Test for Rejection the concentration of nitrite ions in a sample
of Outliers of river water: 0.403, 0.410, 0.401, 0.380 mg/l.
Should the last reading be rejected?
Qcalc  0.380  0.401 (0.410  0.380)  0.7
But Qtable = 0.829 (at 95% level) for 4 values
Therefore, Qcalc < Qtable, and we cannot reject the suspect value.
Suppose 3 further measurements taken, giving total values of:
0.403, 0.410, 0.401, 0.380, 0.400, 0.413, 0.411 mg/l. Should
0.380 still be retained?

Qcalc  0.380  0.400 (0.413  0.380)  0.606

But Qtable = 0.568 (at 95% level) for 7 values
Therefore, Qcalc > Qtable, and rejection of 0.380 is recommended.

But note that 5 times in 100 it will be wrong to reject this suspect value!
Also note that if 0.380 is retained, s = 0.011 mg/l, but if it is rejected,
s = 0.0056 mg/l, i.e. precision appears to be twice as good, just by
rejecting one value.

Charpter 2
No ratings yet
Charpter 2
26 pages
QBM 101 Business Statistics: Department of Business Studies Faculty of Business, Economics & Accounting HE LP University
No ratings yet
QBM 101 Business Statistics: Department of Business Studies Faculty of Business, Economics & Accounting HE LP University
62 pages
Lecture 2.2 - Statistics - Desc Stat and Distrib
No ratings yet
Lecture 2.2 - Statistics - Desc Stat and Distrib
48 pages
Statistics
No ratings yet
Statistics
46 pages
APP601S Chapter 4 - Data Handling in Analytical Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Analytical Chem
42 pages
APP601S Chapter 4- Data Handling in Anal Chem
No ratings yet
APP601S Chapter 4- Data Handling in Anal Chem
42 pages
Prob & Stats (Slides) PDF
No ratings yet
Prob & Stats (Slides) PDF
101 pages
Cmda2005 Review
No ratings yet
Cmda2005 Review
65 pages
Statistics For Business and Economics: Module 1:probability Theory and Statistical Inference Spring 2010
No ratings yet
Statistics For Business and Economics: Module 1:probability Theory and Statistical Inference Spring 2010
20 pages
Statistics 101
100% (1)
Statistics 101
20 pages
Essentials of Statistics
No ratings yet
Essentials of Statistics
272 pages
1 - 3 Descriptive Measures
No ratings yet
1 - 3 Descriptive Measures
33 pages
004 Statistics
No ratings yet
004 Statistics
4 pages
Key of Week1 - Lecture Notes
No ratings yet
Key of Week1 - Lecture Notes
10 pages
Statistical Analysis and Calibration
No ratings yet
Statistical Analysis and Calibration
54 pages
CHAPTERS
No ratings yet
CHAPTERS
17 pages
Ch04le 1
No ratings yet
Ch04le 1
59 pages
Basic Statistics
No ratings yet
Basic Statistics
105 pages
Lecture 3
No ratings yet
Lecture 3
14 pages
Biostatistics 140127003954 Phpapp02
No ratings yet
Biostatistics 140127003954 Phpapp02
47 pages
Statistics in Biology Part 1 Mean and Standard Deviation Web
No ratings yet
Statistics in Biology Part 1 Mean and Standard Deviation Web
24 pages
Lecture Notes Ma12003 PDF
100% (1)
Lecture Notes Ma12003 PDF
105 pages
D2 Basic Stat
No ratings yet
D2 Basic Stat
53 pages
Basic Statistical Techniques
No ratings yet
Basic Statistical Techniques
23 pages
OPMT 1005 - Week Three - Data and Statistics
No ratings yet
OPMT 1005 - Week Three - Data and Statistics
50 pages
Error and Uncertainty: General Statistical Principles
No ratings yet
Error and Uncertainty: General Statistical Principles
8 pages
Evaluating Analytical Data PDF
No ratings yet
Evaluating Analytical Data PDF
8 pages
Book IntroStatistics PDF
No ratings yet
Book IntroStatistics PDF
263 pages
Basic - Statistics 30 Sep 2013 PDF
100% (1)
Basic - Statistics 30 Sep 2013 PDF
20 pages
L4 Statistics
No ratings yet
L4 Statistics
38 pages
Introduction To Statistics: Measures of Central Tendency
No ratings yet
Introduction To Statistics: Measures of Central Tendency
35 pages
Chapter 6
No ratings yet
Chapter 6
37 pages
Inbound 588667172330667162
No ratings yet
Inbound 588667172330667162
30 pages
Random Errors in Chemical Analysis: CHM028 Analytical Chemistry For Teachers
No ratings yet
Random Errors in Chemical Analysis: CHM028 Analytical Chemistry For Teachers
38 pages
Classification of Data: Objectives: Understand How Data Are Classified. Recognize The Different Types of Data
No ratings yet
Classification of Data: Objectives: Understand How Data Are Classified. Recognize The Different Types of Data
39 pages
MT233 October 2019-1
No ratings yet
MT233 October 2019-1
39 pages
2NUBIONormalCurve2T24-25
No ratings yet
2NUBIONormalCurve2T24-25
50 pages
Che 411 L2
No ratings yet
Che 411 L2
22 pages
Numerical Measures HANDOUT With Answers
No ratings yet
Numerical Measures HANDOUT With Answers
8 pages
Basic Statistics Concepts: 1 Frequency Distribution
No ratings yet
Basic Statistics Concepts: 1 Frequency Distribution
7 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Math
No ratings yet
Math
6 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Week 01 Introduction
No ratings yet
Week 01 Introduction
33 pages
STAT6101 Coursenotes 1516 PDF
No ratings yet
STAT6101 Coursenotes 1516 PDF
73 pages
Chapter 4
No ratings yet
Chapter 4
8 pages
07. Notes_ Application of Statistical Tools
No ratings yet
07. Notes_ Application of Statistical Tools
9 pages
RP Notes Unit 4 - Distribution Fucntions
No ratings yet
RP Notes Unit 4 - Distribution Fucntions
5 pages
Stats 2 Notes
No ratings yet
Stats 2 Notes
17 pages
Chapter 4
No ratings yet
Chapter 4
48 pages
Counting Statistics: Presented by Vikas Lakhwani
No ratings yet
Counting Statistics: Presented by Vikas Lakhwani
21 pages
L2-More On Describing Data
No ratings yet
L2-More On Describing Data
154 pages
Introduction To Statistics2312
No ratings yet
Introduction To Statistics2312
34 pages
Basic Statistics: Populations and Samples
No ratings yet
Basic Statistics: Populations and Samples
10 pages
DSML
No ratings yet
DSML
510 pages
Statistics 1 (Final) / Orthodontic Courses by Indian Dental Academy
No ratings yet
Statistics 1 (Final) / Orthodontic Courses by Indian Dental Academy
15 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
High-Dimensional Covariance Estimation: With High-Dimensional Data
From Everand
High-Dimensional Covariance Estimation: With High-Dimensional Data
Mohsen Pourahmadi
No ratings yet
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
From Everand
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
Stuart A. Klugman
4/5 (1)
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
From Everand
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
S. Deviant
4.5/5 (6)
Pitch
No ratings yet
Pitch
17 pages
Pamalandong MC
No ratings yet
Pamalandong MC
12 pages
Lord I Stretch FULL
No ratings yet
Lord I Stretch FULL
8 pages
Come and See
No ratings yet
Come and See
3 pages
Date: August 17, 2022 (Youth Choir in Charge) Venue: Iligan City SDA Central Church Ay Program
No ratings yet
Date: August 17, 2022 (Youth Choir in Charge) Venue: Iligan City SDA Central Church Ay Program
1 page
Devotional
No ratings yet
Devotional
10 pages
Undergraduate Teaching Syllabus in Analytical Chemistry
No ratings yet
Undergraduate Teaching Syllabus in Analytical Chemistry
7 pages
Transcriber Guide
No ratings yet
Transcriber Guide
14 pages
Feel The Nails
No ratings yet
Feel The Nails
10 pages
Third Angel's Message Study Guide 1 of 3
No ratings yet
Third Angel's Message Study Guide 1 of 3
5 pages
A Coffee Enema Is The Injection of Coffee Into The Rectum and Colon Via The Anus
No ratings yet
A Coffee Enema Is The Injection of Coffee Into The Rectum and Colon Via The Anus
3 pages
2) Biodiesel Production From Jatropha Oil by Catalytic and Non-Catalytic Approaches: An Overview 2010
No ratings yet
2) Biodiesel Production From Jatropha Oil by Catalytic and Non-Catalytic Approaches: An Overview 2010
3 pages
Caspase-Glo 1 Inflammasome Assay: Technical Manual
No ratings yet
Caspase-Glo 1 Inflammasome Assay: Technical Manual
27 pages
Promega Timing Apoptosis Assay Publication
No ratings yet
Promega Timing Apoptosis Assay Publication
4 pages
Reactions in Aqueous Solution: Lecture Presentation
No ratings yet
Reactions in Aqueous Solution: Lecture Presentation
37 pages
CH01 Analy Chem
No ratings yet
CH01 Analy Chem
29 pages
HUAWEI Band 4e User Guide - (AW70,01, En-Us)
No ratings yet
HUAWEI Band 4e User Guide - (AW70,01, En-Us)
33 pages
ROS-Glo™ H O Assay: Technical Manual
No ratings yet
ROS-Glo™ H O Assay: Technical Manual
22 pages
Dissertation Proposal Sample Quantitative
67% (3)
Dissertation Proposal Sample Quantitative
8 pages
Example of Master Dissertation Proposal
100% (2)
Example of Master Dissertation Proposal
6 pages
Co DHB 3042 Pta 1 - Edisi 10 Mac 2021
No ratings yet
Co DHB 3042 Pta 1 - Edisi 10 Mac 2021
9 pages
Marketing-Research Test 02
No ratings yet
Marketing-Research Test 02
5 pages
Activity 5
No ratings yet
Activity 5
1 page
Veg Carving 2
No ratings yet
Veg Carving 2
9 pages
Eyes On The Truth: Assessing The Use of The Body-Worn Cameras in South Cotabato Provincial Police Office
No ratings yet
Eyes On The Truth: Assessing The Use of The Body-Worn Cameras in South Cotabato Provincial Police Office
29 pages
CritiqueofNursingResearchfinal Adjusted
No ratings yet
CritiqueofNursingResearchfinal Adjusted
47 pages
Science, Technology and Society: Dr. Cresente D. Delatado Professor
No ratings yet
Science, Technology and Society: Dr. Cresente D. Delatado Professor
24 pages
Passmedicine Statistics Note 2021: Prepared by DR - Abohaneen Mrcpase Telegram Group
No ratings yet
Passmedicine Statistics Note 2021: Prepared by DR - Abohaneen Mrcpase Telegram Group
25 pages
Cyclical Metamorphosis: The Accreditation Experience of Middle Managers in Local Colleges in CALABARZON, Philippines
No ratings yet
Cyclical Metamorphosis: The Accreditation Experience of Middle Managers in Local Colleges in CALABARZON, Philippines
9 pages
Introduction To Econometrics, 5 Edition
No ratings yet
Introduction To Econometrics, 5 Edition
33 pages
Quiz in Science 7 Components of A Scientific Investigation
100% (1)
Quiz in Science 7 Components of A Scientific Investigation
4 pages
Guidelines Qualitative
No ratings yet
Guidelines Qualitative
5 pages
Unit-5 Anova
No ratings yet
Unit-5 Anova
12 pages
Spencer Foundation Dissertation Fellowship Program
100% (1)
Spencer Foundation Dissertation Fellowship Program
7 pages
May, 2012 Jimma, Ethiopia
No ratings yet
May, 2012 Jimma, Ethiopia
5 pages
Research Report 1
No ratings yet
Research Report 1
13 pages
Observational Study Ntcc Noida PM
No ratings yet
Observational Study Ntcc Noida PM
28 pages
Educ 1 Module 4
No ratings yet
Educ 1 Module 4
29 pages
DLL, 1st Week, Research 2
No ratings yet
DLL, 1st Week, Research 2
4 pages
Stat Finals
No ratings yet
Stat Finals
2 pages
Ch 3 Research Proposal and its components.docx
No ratings yet
Ch 3 Research Proposal and its components.docx
1 page
Paper 16635
No ratings yet
Paper 16635
6 pages
Qualitative Research
No ratings yet
Qualitative Research
34 pages
Finalpr1 Defense 1
No ratings yet
Finalpr1 Defense 1
16 pages
Teachers
No ratings yet
Teachers
44 pages
(eBook PDF) Biological Science, Third Canadian Edition 3rd Edition pdf download
100% (1)
(eBook PDF) Biological Science, Third Canadian Edition 3rd Edition pdf download
55 pages
Statistics and Probability 2nd Quarter
100% (1)
Statistics and Probability 2nd Quarter
398 pages
Literature Review 2500 Words
100% (3)
Literature Review 2500 Words
6 pages

Statistics - Lying Without Sinning?: - "Lies, Damned Lies, and Statistics"

Uploaded by

Statistics - Lying Without Sinning?: - "Lies, Damned Lies, and Statistics"

Uploaded by

Statistics - Lying without sinning?

• "Lies, damned lies, and statistics"

• Statistics: Set of mathematical tools used to describe

Experimental variation in the population from which samples

• Estimate of mean value

• Std. dev. of population = 

• Std. dev. of sample = s

• Larger std. dev. more

• Smaller std. dev. means

• (May also consider these percentages of area under the curve)

Consider Cu data: 5.23, 5.79, 6.21, 5.88, 6.02 nM

Answer: 5.82 ± 0.36 nM or 5.8 ± 0.4 nM

From previous example,

rsd = (0.36 nM/5.82 nM) 100 = 6.1% or 6%

• Take 4x as many measurements, s decreases by

Used in many other statistical calculations and tests

From previous example, s = 0.36

standard deviation, but n

Using previous data:

Answer : 5.82  0.25 nM or 5.8  0.2 nM

RAD = (0. 25/5.82) 100 = 4.2 or 4%

RAD = (0. 25/5.82) 1000 = 42 ppt

• To characterize or make judgments about data

• For a 50% CI, there is a 50% probability that the true

• Note that CI will decrease as n is increased

• Useful for characterizing data that are regularly obtained;

For the nitrate concentration data

0.500 ± 0.005 0.500 ± 0.002

Carry out measurements on an accurately known standard.

Experimental value is different from the true value.

Assume that there is no bias

Figure shows (A) the curve for

• “Known” value would typically be a certified value

df = n -1 for this test

known value  x 5.85  5.7 6

s12 (n1 1)  s 22 (n2 1)

Determination of nickel in sewage sludge

x1 = 3.945 mg/g x2 = 3.59 mg/g

s1 = 0.073 mg/g = 0.12 mg/g

x1  x2 n1 n2 3.945  3.59 (4)(4)

Note: Keep 3 decimal places to compare to ttable.

Compare to ttable at df = 4 + 4 – 2 = 6 and 95% CL.

Wait a minute! There is an important assumption

It is assumed that the standard deviations (i.e., the

• How do you test to see if the two std. devs. are

• How do you compare two sets of data whose std. devs.

• Used to determine if std. devs. are significantly

• Also used as a simple general test to compare the

Will compute Fcalc and compare to Ftable.

DF = n1 - 1 and n2 - 1 for this test.

Choose confidence level (95% is a typical CL).

The use of the t-test for comparing means was

If the F-test shows that std. devs. of two sets of data

Std. devs. are Std. devs. are not

Use the 2nd version Use the 1st version of the

Note that the F-test can be used to simply test whether

Can use to answer a question such as: Do method one

In 1984, according to Larry Gonick and Woollcott Smith, the University of

Calculate Qcalc and compare to Qtable

Gap = (difference between questionable data pt. and its

Range = (largest data point – smallest data point)

Arrange data in increasing or decreasing order:

The questionable data point (outlier) is 13.1

Qtable (n=6,90% CL) = 0.56

If Qcalc  Qtable, reject questionable data point at stated CL.

From previous example,

Subsequent calculations (e.g., mean and standard deviation)

Mean and std. dev. of remaining data: 10.04  0.47 mg/L

Q (90 % confidence) Number of Observations

reject if Qcalc > Q table

3 0.941 0.970 0.994

Note:1. The higher the confidence level, the less likely is

Qcalc  0.380  0.400 (0.413  0.380)  0.606

You might also like