2 Statistical Theory and Methods: Measures of Central Tendency (Averages)

This document discusses key statistical concepts for workplace learning and performance professionals. It defines and compares three measures of central tendency (mean, median, mode), and explains how each is calculated and when each is most appropriate to use. It also introduces frequency distributions and how they can be used to visualize data dispersion, clusters, skewness, outliers, and normal distribution. Key terms like confounding variable and continuous variable are also defined. The goal is to help professionals understand essential statistical concepts to correctly apply, interpret, and draw appropriate inferences from data.


Statistical Theory and Methods

Statistics allow workplace learning and performance (WLP) professionals to quantitatively describe and draw inferences about people, things, or events. In other words, statistics allow data to be organized and summarized and make it possible to draw generalizations and inferences. Statistics enable WLP professionals to document current levels of performance (individual, group, or organizational), measure the impact of their programs, and offer well-grounded feedback for change.

For many WLP professionals, the use of statistics is an onerous task, but it shouldn’t be. Several software applications can do the number crunching for trainers; however, practitioners must understand how to use statistics. The selection and interpretation of statistics still rest in their hands. To use statistics properly, a statistics consumer needs to understand some essential concepts and principles. Although this chapter is designed to serve as a primer for statistics, occasionally calculations are used to help understanding.

WLP professionals must have a broad understanding of how data falls into distributions (for example, variance and normal distribution) and how data relates to other data (for example, correlation and regression). In addition, from an inferential standpoint, WLP professionals must understand concepts related to hypothesis testing, such as effect sizes and confidence intervals. It’s easy to misuse or misinterpret statistics. Having a real understanding of statistics means that people can apply them correctly, represent findings accurately, and draw appropriate inferences.

Learning Objectives:
• Define and illustrate the three measures of central tendency.
• Define and compare the various types of frequency distributions.
• Express how measurement scales and statistical implications are used in the collection of measurement data.
• Explain how the measures of variance are used in statistics.
• Describe how distributions are used with standard scores.
• Identify the correct usage of correlation versus causation in data.
• List the five steps in the hypothesis-testing process.
• Demonstrate knowledge related to effect sizes and confidence intervals.
• Recognize the appropriate use of statistical information.

Measures of Central Tendency (Averages)

Besides using descriptive statistics for charts and graphs, trainers can perform several numeric calculations on them. The most common are called measures of central tendency, or averages, of which there are three: mean, median, and mode. Each type of average serves a unique purpose.

Mean
The mean takes every number in the data set into account, so each value has an impact on it. For the same reason, it is the measure of central tendency most affected by the presence of extreme values (outliers).

The mean is represented by the following formula:

Mean = Sum of all numbers divided by the number of values that make up the sum

The mean is a good measure of central tendency for roughly symmetric distributions but can be misleading in skewed, or nonsymmetric, distributions because it can be influenced a great deal by extreme scores. Therefore, other statistics, such as the median, may be more informative and appropriate for distributions that are often quite skewed, such as reaction time or family income.

Median
The median is the middle of a distribution arranged by magnitude: Half the scores are above the median, and half are below the median. The median is less sensitive to extreme scores than the mean, which makes it a better measure than the mean for highly skewed distributions. The median income is usually more informative than the mean income, for example.

When a distribution has an even number of values, as in Table 2-1, the median is the average of the two values on either side of the position given by the formula (n + 1) ÷ 2. When the number of values is odd, that position falls on a single middle value, which is the median.

Which of these two measures, the sample mean or the median, should trainers use? It depends. Usually medians are a better measure for skewed distributions than means, but this guideline must be tempered by common sense. In a dispute between the American Medical Association and the American Bar Association about the rising costs of malpractice insurance for doctors, the doctors used means to show a sharp rise in costs in the period 1980 to 1984, and the lawyers used medians to show that there was no rise at all (Schwarz 1998).

Mode
The mode, the most frequently occurring score in a distribution, is also used as a measure of central tendency. The advantage of the mode as a measure of central tendency is that its meaning is obvious. Further, it’s the only measure of central tendency that can be used with nominal data.
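As a quick check on the three averages, the following minimal sketch computes them for the twelve pretest scores listed in Table 2-1, using Python's standard `statistics` module. The helper name `central_tendency` and the extreme score of 400 are illustrative assumptions, not part of the chapter; the extra score is added only to show how differently the mean and the median react to an outlier.

```python
from statistics import mean, median, multimode

# Pretest scores as listed in Table 2-1 (12 values).
scores = [29, 35, 37, 37, 46, 52, 56, 59, 61, 73, 77, 82]


def central_tendency(values):
    """Return (mean, median, modes) for a list of numbers."""
    # multimode returns every most-frequent value, so a multimodal
    # distribution is reported in full rather than hidden.
    return mean(values), median(values), multimode(values)


avg, mid, modes = central_tendency(scores)
print(f"mean={avg:.2f} median={mid} modes={modes}")

# Hypothetical extreme score, for illustration only: it shifts the
# mean far more than it shifts the median.
with_outlier = scores + [400]
avg_out, mid_out, _ = central_tendency(with_outlier)
print(f"with outlier: mean={avg_out:.2f} median={mid_out}")
```

With the original scores the results agree with the values worked out in Table 2-1. Adding the single extreme score raises the mean by more than 25 points, while the median moves only from 54 to 56.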
The mode is greatly subject to sample fluctuations, so it’s not recommended for use as the only measure of central tendency. A further disadvantage of the mode is that many distributions have more than one mode; these distributions are called multimodal.

The mean is the only one of the three measures that reflects every number in the data set; the median and the mode can be unaffected by changes in individual numbers. This also makes the mean the most sensitive of the three to extreme values.

Table 2-1 shows a sample set of data collected by a trainer. These scores represent the pretest scores on a knowledge test a trainer administered before the start of a training program. This data is used to show the three types of measures of central tendency as well as variance.

Table 2-1. Pretest Scores (Arranged From Low to High)

Scores (x): 29, 35, 37, 37, 46, 52, 56, 59, 61, 73, 77, 82
∑x = 644
The number of values, n, equals 12.

Mode (mo): The mode is the most frequently occurring number. mo = 37

Median (mdn)a: position of mdn = (n + 1) ÷ 2 = (12 + 1) ÷ 2 = 6.5th number, midway between the 6th score (52) and the 7th score (56). mdn = (52 + 56) ÷ 2 = 54

Mean: The mean = ∑x ÷ n = 644 ÷ 12 = 53.67

∑x = sum of values in the list
n = number of values in the list

a Note that the averaging step in the example applies only to a case with an even number of values, where (n + 1) ÷ 2 falls between two positions. For an odd number of values it is not necessary to average the two middle values to come up with the “virtual median.” In the case of an odd number, (n + 1) ÷ 2 is a whole number, and the value at that position is the median.

Frequency Distributions

A set of numbers may be summarized in two major ways: using pictures and using summary numbers. Each method has advantages and disadvantages, and the use of one method need not exclude the use of the other. This section describes drawing pictures of data called frequency distributions.

A frequency distribution can show the actual number of observations falling in each range or the percentage of observations. With percentage of observations, the distribution is called a relative frequency distribution.

Some conditions that a frequency distribution might illustrate include dispersion, clusters, skewness, outliers, and normal distribution. The next section explores these concepts in more depth.

Definition of Terms

Confounding variable is an unknown or uncontrolled variable that produces an effect in an experimental setting. A confounding variable is an “independent variable” that the evaluator didn’t somehow recognize or control. It becomes a variable that confounds the experiment.

Continuous variable is a variable whose quantification can be broken down into extremely small units (for example, time, speed, distance).

Control group is a group of participants in an experiment that’s equal in all ways to the experimental group except that it didn’t receive the experimental treatment.

Covariates are additional variables, other than the independent variables of interest, that may also affect the dependent variable.

Dependent variable is frequently thought of as the “outcome” variable. The dependent variable’s outcome depends on the independent variable and covariates.

Dichotomous variable is a variable that falls into one of two possible classifications (for example, gender [male or female]). An artificially dichotomous variable is imposed for classification purposes (for example, age classified as retired [>65] or not retired [<65]).

Discrete variable is a variable in which the units are in whole numbers, or “discrete” units (for example, number of children, number of defects).

Experimental group is the treatment group; those participants who receive the “treatment,” for example, the training program.

Independent variable is the variable that influences the dependent variable. Age, seniority, gender, shift, level of education, and so on may all be factors (independent variables) that influence a person’s performance (the dependent variable).
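To make the idea of a frequency distribution and a relative frequency distribution concrete, the sketch below groups the Table 2-1 pretest scores into ranges and prints a simple text histogram. The 10-point range width and the function names are assumptions chosen for illustration; the chapter does not prescribe a particular grouping.

```python
from collections import Counter

# Pretest scores as listed in Table 2-1.
scores = [29, 35, 37, 37, 46, 52, 56, 59, 61, 73, 77, 82]

WIDTH = 10  # assumed width of each score range (20-29, 30-39, ...)


def frequency_distribution(values, width=WIDTH):
    """Count how many values fall in each range, keyed by the range's lower bound."""
    counts = Counter((v // width) * width for v in values)
    return dict(sorted(counts.items()))


def relative_frequencies(counts):
    """Convert raw counts into proportions of all observations."""
    total = sum(counts.values())
    return {start: n / total for start, n in counts.items()}


freq = frequency_distribution(scores)
rel = relative_frequencies(freq)

# One row per range: a bar of '#' marks, the count, and the percentage.
for start, n in freq.items():
    print(f"{start}-{start + WIDTH - 1}: {'#' * n} {n} ({rel[start]:.0%})")
```

The raw counts form the frequency distribution, and the percentages form the relative frequency distribution. Even in this small sample the picture shows two clusters, one in the 30s and one in the 50s, which the mean of 53.67 alone would not reveal.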
random error. The test can’t consider biases resulting from nonrandom error (for example, a badly selected sample).

These are some key concepts about statistical significance:
• In statistical terms, significant does not necessarily mean important.
• Probability values should be read in reverse.
• Running too many significance tests will turn up some falsely significant relationships.
• It’s important to check the sampling procedure to avoid bias.

Effect Sizes

Effect size is a way of quantifying the difference between two groups. For example, if one group (the treatment group) has had an experimental treatment and the other (the control group) has not, the effect size is a measure of the difference in outcomes between the two groups and therefore of the effectiveness of the treatment. Effect size uses standard deviation to contextualize the difference between the two groups.

Confidence Intervals

The confidence interval is the range where something is expected to be. Saying “expected” leaves open the possibility of being wrong. The degree of confidence measures the probability of that expectation being true.

The degree of confidence is linked with the width of the confidence interval. It’s easy to be very confident that something will be within a very wide range, and vice versa. Also, the amount of information (typically related to the sample size) has an influence on the degree of confidence and the width of the confidence interval. With more information, there can be more confidence that what’s being measured will be within a given interval. Also, with more information and keeping a given degree of confidence, the interval can be narrowed.

For example, say a survey is conducted in Alexandria, Virginia. The question is “Do you prefer Coca-Cola or Pepsi?” Of the responses, 60 percent answer Coca-Cola, and 40 percent answer Pepsi. So the estimation is that, in this city, 60 percent prefer Coca-Cola. This doesn’t mean that 60 percent of the population in this city prefers Coca-Cola, unless everyone in the population answered the survey. However, there’s some “confidence” that the actual proportion of people choosing Coca-Cola will be within some interval around the 60 percent found in the sample. The amount of confidence depends on how wide the interval is. If the survey is based on a sample of 100 people, there can be 90 percent confidence that the actual proportion of those preferring Coca-Cola will be between 52 percent and 68 percent. Also, there can be 99 percent confidence that the actual proportion will be between 48 percent and 72 percent (for the same sample size, with more confidence and a wider interval). If the survey had been on a sample of 1,000 people instead of 100, there could be 90 percent confidence that the actual proportion is between 57.5 percent and 62.5 percent (compared with 52 percent and 68 percent for the same confidence with a sample of 100). Keep in mind that the larger the sample, the higher the degree of confidence for a given interval, or the narrower the interval for a given degree of confidence.

Appropriate Use of Statistical Information and Data

As Mark Twain said, “Collecting data is like collecting garbage. Pretty soon, we have to do something with it.” If not used properly, statistical information and evaluation data are useless. Improper use of evaluation data can lead to four major problems:
• Too many organizations don’t use evaluation data at all. In these situations, data is collected, tabulated, catalogued, filed, and never used by any particular group other than the person who initially collected the data.
• Data is not provided to the appropriate groups. Different groups need different types of data and often in very different formats. Analyzing target audiences and determining the specific data needed for each group are important for communicating data.
• Data isn’t used to drive improvement. Most evaluation data uncovers process improvement opportunities and identifies features that could be adjusted or changes that should be made to make the program more effective. If it’s not part of the feedback cycle, evaluation falls short of what it’s intended to do.
• Data is used for the wrong reasons: to take action against a person or a group or to withhold funds rather than improve processes. Sometimes the data is used in political ways to gain power or advantage over another person.

These problems represent dysfunctional activities that can destroy evaluation processes. They must be addressed if evaluation is to add value.
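The intervals quoted in the Coca-Cola survey example under Confidence Intervals can be approximated with a short calculation. The sketch below assumes the usual normal approximation for a sample proportion; the chapter does not state which method produced its figures, and the function name `proportion_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def proportion_interval(p_hat, n, confidence):
    """Normal-approximation confidence interval for a sample proportion.

    p_hat: proportion observed in the sample (0.60 in the survey example)
    n: sample size
    confidence: degree of confidence as a fraction, e.g. 0.90
    """
    # Two-sided critical value from the standard normal distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    # Standard error shrinks as the sample size grows.
    margin = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin


for n, confidence in [(100, 0.90), (100, 0.99), (1000, 0.90)]:
    low, high = proportion_interval(0.60, n, confidence)
    print(f"n={n}, {confidence:.0%} confidence: {low:.1%} to {high:.1%}")
```

Under this assumption the 90 percent intervals match the 52 to 68 percent and 57.5 to 62.5 percent figures in the example after rounding, and the 99 percent interval comes out slightly wider than the quoted 48 to 72 percent. The pattern is the one the text describes: more confidence widens the interval, and a larger sample narrows it.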
Chapter 2 Knowledge Check

1. Which of the following best describes a situation where the mode < median < mean?
a. Negative skewness
b. Positive skewness
c. Outlier
d. Normal distribution

2. Which of the following best describes normal distribution?
a. An observation in a data set that’s far removed in value from others in the data set
b. The symmetry in the distribution of the same data values
c. The way in which observations tend to pile up around the mean, also known as the bell-shaped curve
d. Variation in values that could be widely scattered or tightly clustered

3. Which of the following best describes dispersion?
a. An observation in a data set that’s far removed in value from the others in the data set
b. The symmetry in the distribution of the same data values
c. The way in which observations tend to pile up around the mean, also known as the bell-shaped curve
d. Variation in values that could be widely scattered or tightly clustered

4. Which of the following best describes an outlier?
a. An observation in a data set that’s far removed in value from the others in the data set
b. The symmetry in the distribution of the same data values
c. The way in which observations tend to pile up around the mean, also known as the bell-shaped curve
d. Variation in values that could be widely scattered or tightly clustered

5. Which of the following types of data make it possible to rank order items measured in terms of which has less or more of the quality represented?
a. Nominal
b. Ordinal
c. Interval
d. Ratio

6. Which of the following types of data include the feature of identifying an absolute zero point?
a. Nominal
b. Ordinal
c. Interval
d. Ratio

7. Variance is defined as how spread out a distribution of data points is, whereas the standard deviation is the measure of how spread out the data points are when the mean is used to calculate central tendency.
a. True
b. False

8. The reason that practitioners convert raw scores to standard scores includes which of the following?
a. To indicate the number of correct answers to allow scores to be compared
b. To reflect where they fall with respect to the mean to allow scores to be compared and interpreted
c. To understand the cause-and-effect connections between variables
d. Because they are always expressed as a number between –1.00 and +1.00

9. An example of a relational study statistic that measures the relationship between two or more variables includes
a. Correlation coefficient
b. Cause-and-effect connection
c. Normal distribution
d. Skewness

10. The primary goal of hypothesis testing is to test a hypothesis and then accept or reject the hypothesis based on the findings.
a. True
b. False
