statistical_method__1 (3)
statistical_method__1 (3)
By
Wilhemina Adoma Pels
FIRST LECTURE
1 / 46
Weekly Content
Week 1:
1 Course introduction, provision of course outline and
recommended textbooks
2 Introduction to Statistics.
3 Uses of Statistics.
4 Basic terms in Statistics
5 Variable and Data
6 Measurement scales
Week 2:
1 Stages of statistical investigation
2 Data Collection (Primary and Secondary data)
Week 3:
1 Questionnaire Design
2 Quiz 1
Week 4, 5, 6
1 Summarizing and describing data
2 Using numerical and graphical summaries to characterize
sample data
2 / 46
Week 7: Midsem Exams
Week 8
1 Introduction to Probability
2 Axioms, Sets, Sample space, Measure of probability of
events
3 Mutually exclusive
4 Independent events
5 Conditional probability, Bayes’ theorem
Week 9: Counting techniques: combination and
permutations
Week 10: Random variables and some discrete probability
distribution
Week 11: Some Continuous Probability Distributions
Week 12: Revision
3 / 46
Week 1 - 3
4 / 46
Course Introduction, Provision of Course Outline and
Recommended Textbooks
5 / 46
What is Statistics?
Statistics is the science concerned with developing and studying
methods for collecting, organizing, analyzing, interpreting and
presenting empirical data.
Statistics is the science of learning from data.
Types of Statistics
1 Descriptive statistics
6 / 46
Types of Inferential Statistics
Inductive: Generalization for the population based on
knowledge of the sample.
Deductive: Generalization for the sample based on
knowledge of the population.
7 / 46
USE OF STATISTICS
8 / 46
BASIC TERMS
9 / 46
10 / 46
What is a variable?
A variable is any attribute, characteristic, or measurable
property that can vary from one observation to another.
Example: weight, milk yield, sex, and eye color, number of
pregnancies, number of hospitalizations, Height, Weight,
Gender, Age and Temperature of pregnant patient
TYPES OF VARIABLES
1 Qualitative or Categorical Variables
11 / 46
Figure: Illustration of the relationship among qualitative, quantitative,
discrete, and continuous variables.
12 / 46
DATA VRS VARIABLE
The list of observed values for a variable is data.
Example; gender is a variable; the observations male or
female are data.
Qualitative data are observations corresponding to a
qualitative variable.
Quantitative data are observations corresponding to a
quantitative variable.
Discrete data are observations corresponding to a discrete
variable
Continuous data are observations corresponding to a
continuous variable
14 / 46
MEASUREMENT
What is measurement?
Measurement is the application of mathematics to things or
events.
A system of measurement is a crucial component of research.
Simple example: How tall is a patient? More complex example:
How obese is an animal?
15 / 46
Scales of measurement
Nominal Scale
Data that represent categories or names or labels. There is
no implied order to the categories of nominal data.
Observations are classified into mutually exclusive
categories
16 / 46
Scales of Measurement
Ordinal Scale:
This scale has a logical ordering of the categories.
designates an ordering (greater than, less than). It does
not assume that the intervals between numbers are equal.
17 / 46
18 / 46
MEASUREMENT
Scales of Measurement
Interval Scale:
An important point to make about interval scales is that
the zero point is simply another point on the scale; it does
not represent the starting point of the scale or the total
absence of the characteristic being measured.
Designates an equal-interval ordering. For example,
Temperature in Fahrenheit or Celsius is an interval scale
measurement. The difference in temperature between 20
degrees F and 25 degrees F is the same as the difference
between 76 degrees F and 81 degrees F.
Likert scale is another example of interval scale
measurement.
19 / 46
Example: Temperature
20 / 46
MEASUREMENT
Scales of Measurement
Ratio Scale: designates an equal-interval ordering with a true
zero point (i.e. the zero implies an absence of the thing being
measured).
21 / 46
22 / 46
MEASUREMENT
23 / 46
Summary
24 / 46
STAGES OF STATISTICAL INVESTIGATIONS
STEPS
1. Statement of problem and objectives: We must
identify the cause for concern and state explicitly what the
problem is, characteristics to be measured, collection,
processing and publishing methods
2. Target population and the use of sample or entire
population: Define in clear unambiguous terms the population
of interest, define the sample units to make them distinct,
non-overlapping and recognizable and select an appropriate
sampling design
25 / 46
STAGES OF STATISTICAL INVESTIGATIONS
26 / 46
STAGES OF STATISTICAL INVESTIGATIONS
27 / 46
DATA COLLECTION METHODS
28 / 46
DATA COLLECTION METHODS
Secondary Data
Secondary data is data someone else has collected
EXAMPLES OF SOURCES
Vital Statistics – birth, death certificates
Hospital, clinic, school nurse records
Private and foundation databases
City and regional governments
Surveillance data from state government programs
Federal agency statistics - Census, NHIS, etc
29 / 46
DATA COLLECTION METHODS
30 / 46
DATA COLLECTION METHODS
Secondary Data - LIMITATIONS
When was it collected? For how long? Maybe out of date
for what you want to analyze. May not have been collected
long enough for detecting trends.
Is the data set complete? There may be missing
information on some observations. Unless such missing
information is seen and corrected for, analysis will be
biased.
Are there confounding problems? Sample selection bias?
Source choice bias? In time series, did some observations
drop out over time?
Are the data consistent/reliable? Did variables drop out
over time? Did variables change in definition over time?
For example, number of years of education versus highest
degree obtained
31 / 46
DATA COLLECTION METHODS
32 / 46
DATA COLLECTION METHODS
33 / 46
DATA COLLECTION METHODS
34 / 46
DATA COLLECTION METHODS
PRIMARY DATA
Primary data is data you collect.
35 / 46
DATA COLLECTION METHODS
PRIMARY DATA - EXAMPLES
Surveys
Focus groups
Questionnaires
Personal interviews
Experiments and observational study
36 / 46
DATA COLLECTION METHODS
PRIMARY DATA - LIMITATIONS
Do you have the time and money for:
Designing your collection instrument?
Selecting your population or sample?
Pretesting/piloting the instrument to work out sources of
bias?
Administration of the instrument?
Entry/collation of data?
Uniqueness. May not be able to compare to other
populations
Researcher error (Sample bias, Other confounding factors)
38 / 46
DATA COLLECTION METHODS
Questionnaire design
A survey is only as good as the questions it asks
Key Criteria
Questionnaire relevancy: No unnecessary information is
collected and only information needed to solve the problem
is obtained. Be specific about your data needs; tie each
question to an objective
Questionnaire accuracy: Information is both reliable
and valid
Phrasing Questions
Open ended response versus fixed alternative questions
Decision criteria: type of research; time; method of
delivery; budget; concerns regarding researcher bias
40 / 46
DATA COLLECTION METHODS
AVOID
Leading questions
Overly complex questions
Use of jargon
Loaded questions (can use a counter-biasing statement)
Ambiguity
Double barreled questions
Making assumptions
DECISIONS
Ranking, sorting, rating or choice
How many categories or response positions
Balanced or unbalanced
Forced choice or non-forced choice
41 / 46
DATA COLLECTION METHODS
Types of questions
Types of fixed alternative questions
Single dichotomy or dichotomous-alternative questions
Example: Do any patient show sign of coughing?
Yes No
Respondent chooses one of two alternatives (yes/no;
male/female)
What scale would this data create?
42 / 46
DATA COLLECTION METHODS
43 / 46
DATA COLLECTION METHODS
Frequency determination
Asks for an answer about frequency of occurrence
Example: In a typical week, how often do you purchase
animal feed?
Never
Once
Two or more times
Group Assignment
In groups of ten (10), design a questionnaire on the topic,
”Content of Introductory Chemistry Courses for Different
Degree Programs at Knust”
45 / 46
46 / 46