Centrall - Tendency - IV
Centrall - Tendency - IV
Central tendency
A statistical measure that identifies a single
score as representative for an entire
distribution. The goal of central tendency is
to find the single score that is most typical
or most representative of the entire group.
Measure of central tendency
Measure of central tendency
The mean
Population mean vs. sample mean
ΣX ΣX
μ= x=
N n
N=4: 3,7,4,6
ΣX 20
x= = =5
n 4
Measure of central tendency
The weighted mean
Group A: x = 6 n=12
Group B: x = 7 n=8
72 + 56
Weighted mean = ΣX 1 + ΣX 2 = 12 + 8
= 6.4
n1 + n2
Imagine that you received the following data on the vocabulary test mentioned earlier:
20 22 23 23 23
23 23 23 24 25
28 29 30 30 30
30 30 30 31 32
32 33 33 34 35
35 36 36 37 37
2. Compute the mean, mode, and median of the data and decide which of the three you
believe to be best for the central tendency of the data.
Measure of variability
Variability provides a quantitative
measure of the degree to which scores
in a distribution are spread out or
clustered together.
Measure of variability
Range
range=Xhighest – Xlowest
Quartile:
A statistical term describing a division of observations into four defined intervals
based upon the values of the data and how they compare to the entire set of
observations.
Each quartile contains 25% of the total observations. Generally, the data is
ordered from smallest to largest with those observations falling below 25% of all
the data analyzed allocated within the 1st quartile, observations falling between
25.1% and 50% and allocated in the 2nd quartile, then the observations falling
between 51% and 75% allocated in the 3rd quartile, and finally the remaining
observations allocated in the 4th quartile.
Interquartile: The interquartile range is a measure of spread or dispersion. It
is the difference between the 75th percentile (often called Q3) and the 25th
percentile (Q1). The formula for interquartile range is therefore: Q3-Q1.
Semi-interquartile: The semi-interquartile range is a measure of spread or
dispersion. It is computed as one half the difference between the 75th
percentile [often called (Q3)] and the 25th percentile (Q1). The formula for
semi-interquartile range is therefore: (Q3-Q1)/2.
TOEFL: (560-470)/2=45
Measure of variability
Measure of variability
Variance
Deviation: deviation of one score from the
mean
Variance: taking the distribution of all
scores into account.
Sum of
square (SS)
n=24
Measure of variability
Standard deviation
squar ed
scor e mean devi at i on* devi at i on
8 9. 67 - 1. 67 2. 79
25 9. 67 +15. 33 235. 01
7 9. 67 - 2. 67 7. 13
5 9. 67 - 4. 67 21. 81
8 9. 67 - 1. 67 2. 79
3 9. 67 - 6. 67 44. 49
10 9. 67 + . 33 . 11
12 9. 67 + 2. 33 5. 43
9 9. 67 - . 67 . 45
sum of squar ed dev= 320. 01
Type of instrument
Listening Watching
Mean SD Mean SD
Males 15.72 4.43 6.94 2.26
Females 3.47 1.12 2.61 0.98
Measure of variability
Standard deviation and normal distribution
Homework
1. Calculate the mean, median, mode, range and standard
deviation for the following sample:
Midterm Exam
X X
100 85
88 82
83 96
105 107
78 102
98 113
126 94
85 119
67 91
88 100
88 72
77 88
114 85
Homework
2. Suppose that the following scores were obtained on administering a
language proficiency test to ten aphasics who had undergone a course
of treatment, and ten otherwise similar aphasics who had not
undergone the treatment:
Experimental group Control group
15 31
28 34
62 47
17 41
31 28
58 54
45 36
11 38
76 45
43 32
Calculate the mean score and standard deviation for each group, and
comment on the results.
Locating scores and finding
scales in a distribution
Percentiles, quartiles, deciles
Mind work
Imagine that you conducted an in-service course for ESL teachers. To receive university credit for the
course, the teachers must take examinations--in this case, a midterm and a final. The midterm was a
multiple-choice test of 50 items and the final exam presented teachers with 10 problem situations to
solve. Sue, like most teachers, was a whiz at taking multiple-choice exams, but bombed out on the
problem-solving final exam. She received a 48 on the midterm and a 1 on the final. Becky didn't do so
well on the midterm. She kept thinking of exceptions to answers on the multiple-choice exam. Her score
was 39. However, she really did shine on the final, scoring a 10. Since you expect students to do well on
both exams, you reason that Becky has done a creditable job on each and Sue has not. Becky gets the
higher grade. Yet, if you add the points together, Sue has 49 and Becky has 49. The question is whether
the points are really equal.
Should Sue also do this bit of arithmetic, she might come to your office to complain of the injustice of it
all. How will you show her that the value of each point on the two tests is different?
Locating scores and finding
scales in a distribution
Standard score (z-scores) X −x
z=
s
Locating scores and finding
scales in a distribution
Mind work
Suppose that we have measured the times taken by a very large number of
people to utter a particular sentence, and have shown these times to be
normally distributed with a mean of 3.45 sec and a standard deviation of
0.84 sec. Armed with this information, we can answer various questions.
1. What proportion of the (potentially infinite) population of utterance
times would be expected to fall below 3 sec?
2. What proportion would lie between 3 and 4 sec?
3. What is the time below which only 1 per cent of the times would be
expected to fall?
Mind work
某外语学院在其研究生教学中规定,只要有一门课程的考试
成绩低于75分,即取消其撰写论文的资格。显然,这是不科
学的。因为这实质上也是把不同质的考试硬拉在一起进行比
较。同是75分,在不同考试中的意义是不一样的。在一个非
常容易的考试中,它可能是比较低的分数,而在一个难度较
大的考试中,它却可能是比较高的考分。如果凡是低于该分
数的都不让写论文,这是不科学的,也是不公平的。科学的
做法是把各科的考试分数换算成标准分,然后规定多少标准
分以下的没有资格写论文。同上例一样,有了标准分之后,
也可以把各科的成绩合成一个总分,或求平均分,排出名次,
再制定一个标准,以确定总分或平均分为多少的人才有资格
撰写论文
Locating scores and finding
scales in a distribution
Distributions with nominal data
Implicational scaling (Guttman scaling)
Coefficient of scalability
Homework
I. The following scores are obtained by 50 subjects on a language aptitude test:
42 62 44 32 47 42 52 76 36 43
55 27 46 55 47 28 53 44 15 61
18 59 58 57 49 55 88 49 50 62
61 82 66 80 64 50 40 53 28 63
63 25 58 71 82 52 73 67 58 77
.2−x
z=58 .2−x
58 51.7≤ x≤64.7
3.3 −1.96
≤ ≤1.9
3.3
Sample statistics and population
parameter: estimation
Confidence limits for proportions
Standard error = p(1 − p )
N
Confidence limits=proportion in sample
±(critical value x standard error)
Sample statistics and population
parameter: estimation
Suppose that we have taken a random sample of 500 finite verbs from a text,
and found that 150 of them have present tense form. How can we set
confidence limits for the proportion of present tense finite verbs in the whole
text, the population from which the sample is taken?
Calculate (i) the mean, (ii) the standard deviation, (iii) the standard error
of the mean, (iv) the 99 per cent confidence limits for the mean.
II. A random sample of 300 finite verbs is taken from a text, and it is found
that 63 of these are auxiliaries. Calculate the 95 per cent confidence
limits for the proportion of finite verbs which are auxiliaries in the text
as a whole.
III. Using the data in question II, calculate the size of the sample of finite
verbs which would. be required in order to estimate the proportion of
auxiliaries to within an accuracy of 1 per cent, with 95 per cent
confidence.
Probability and Hypothesis
Testing
Null hypothesis (H0)
The null hypothesis states that in the general
population there is no change, no difference, or no
relationship. In the context of an experiment, H0
predicts that the independent variable (treatment)
will have no effect on the dependent variable for
the population. H0: μA- μB=0 or μA= μB
Alternative hypothesis (H1)
The alternative hypothesis (H1) states that there is
a change, a difference, or a relationship for the
general population. H1: μA≠ μB
Probability and Hypothesis
Testing
Null hypothesis (H0)
When we reject the null hypothesis, we want the probability to be
very low that we are wrong. If, on the other hand, we must accept
the null hypothesis, we still want the probability to be very low that
we are wrong in doing so.
Type I error and Type II error
A type I error is made when the researcher rejected the null
hypothesis when it should not have been rejected.
A type II error is made when the null hypothesis is accepted when
it should have been rejected.
In research, we test our hypothesis by finding the probability of
our results. Probability is the proportion of times that any
particular outcome would happen if the research were repeated
an infinite number of times.
Probability and Hypothesis
Testing
Two-tailed and one-tailed hypothesis
When we specify no direction for the null hypothesis (i.e.,
whether our score will be higher or lower than more typical
scores), we must consider both tails of the distribution. This
is called two-tailed hypothesis.
If we have good reason to believe that we will find a
difference (e.g., previous studies or research findings
suggest this is so), then we will use a one-tailed hypothesis.
One-tailed tests specify the direction of the predicted
difference. We use previous findings to tell us which
direction to select.
.05 .01
1-tailed 1.64 2.33
2-tailed 1.96 2.57
Probability and Hypothesis
Testing
Steps in hypothesis testing
1. State the null hypothesis.
2. Decide whether to test it as a one- or two-tailed hypothesis. If there is
no research evidence on the issue, select a two-tailed hypothesis. This
will allow you to reject the null hypothesis in favor of an alternative
hypothesis. If there is research evidence on the issue, select a
one-tailed hypothesis. This will allow you to reject the null
hypothesis in favor of a directional hypothesis.
3. Set the probability level (α level). Justify your choice.
4. Select the appropriate statistical test(s) for the data.
5. Collect the data and apply the statistical test(s).
6. Report the test results and interpret them correctly.
Probability and Hypothesis
Testing
Parametric vs. nonparametric
Parametric procedures
Make strong assumptions about the distribution of the
data
Assume the data are NOT frequencies or ordinal scales
but interval data
Data are normally distributed
Nonparametric procedures
Do not make strong assumptions about the shape of the
distribution of the data
Work with frequencies and rank-ordered scales
Used when the sample size is small
Homework