statistical_method__1 (3)

The document outlines the course structure and content for a Statistics class (Math 153) taught by Wilhemina Adoma Pels at KNUST, covering topics such as data collection, measurement scales, and statistical methods. It includes a weekly breakdown of topics, definitions of key statistical terms, and the importance of both primary and secondary data in research. The document emphasizes the significance of careful planning in statistical investigations and the design of questionnaires for effective data collection.

Statistical Methods: Math 153

By
Wilhemina Adoma Pels

Department of Statistics and Actuarial Science


KNUST

FIRST LECTURE

January 25, 2023

Weekly Content
Week 1:
1 Course introduction, provision of course outline and
recommended textbooks
2 Introduction to Statistics.
3 Uses of Statistics.
4 Basic terms in Statistics
5 Variable and Data
6 Measurement scales
Week 2:
1 Stages of statistical investigation
2 Data Collection (Primary and Secondary data)
Week 3:
1 Questionnaire Design
2 Quiz 1
Week 4, 5, 6
1 Summarizing and describing data
2 Using numerical and graphical summaries to characterize
sample data
Week 7: Midsem Exams
Week 8
1 Introduction to Probability
2 Axioms, Sets, Sample space, Measure of probability of
events
3 Mutually exclusive events
4 Independent events
5 Conditional probability, Bayes’ theorem
Week 9: Counting techniques: combinations and
permutations
Week 10: Random variables and some discrete probability
distributions
Week 11: Some Continuous Probability Distributions
Week 12: Revision

Week 1 - 3

Course Introduction, Provision of Course Outline and
Recommended Textbooks

Schaum’s Outlines of Probability and Statistics


etc.

What is Statistics?
Statistics is the science concerned with developing and studying
methods for collecting, organizing, analyzing, interpreting and
presenting empirical data.
Statistics is the science of learning from data.

Types of Statistics
1 Descriptive statistics
Summarizing and describing the data
Uses numerical and graphical summaries to characterize
sample data
2 Inferential statistics
Uses sample data to draw conclusions about a broader
group of individuals (a population) than just those who
are observed (a sample)
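
The contrast between the two types can be sketched in a few lines of Python. This is only an illustration: the weights are made-up values, and the interval uses the normal critical value 1.96 as a rough approximation rather than a method prescribed by the course.

```python
import math
import statistics

# Hypothetical sample: weights (kg) of 8 animals drawn from a larger herd.
sample_weights = [52.0, 48.5, 50.2, 55.1, 49.8, 51.3, 53.6, 47.9]

# Descriptive statistics: summarize the sample itself.
sample_mean = statistics.mean(sample_weights)
sample_sd = statistics.stdev(sample_weights)  # sample standard deviation (n - 1)

# Inferential statistics: use the sample to say something about the population.
# Rough 95% interval for the population mean using the normal value 1.96
# (an approximation; a t critical value would suit such a small sample better).
standard_error = sample_sd / math.sqrt(len(sample_weights))
ci_low = sample_mean - 1.96 * standard_error
ci_high = sample_mean + 1.96 * standard_error

print(f"Sample mean: {sample_mean:.2f} kg, sample SD: {sample_sd:.2f} kg")
print(f"Approximate 95% interval for the population mean: ({ci_low:.2f}, {ci_high:.2f})")
```

The first two quantities describe only the eight animals observed; the interval is a statement about the herd they came from.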

Types of Inferential Statistics
Inductive: Generalization for the population based on
knowledge of the sample.
Deductive: Generalization for the sample based on
knowledge of the population.

USES OF STATISTICS

1 To Present Facts in Definite Form
2 Comparisons
3 Policy Making
4 Forecasting
5 It Enlarges Knowledge

BASIC TERMS

Population and Sample


A population is the collection of all possible individual
units whose characteristics are to be studied
A sample is a subset of the population that is studied in
order to make inference about the population
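
As a small illustration of the two definitions, the sketch below builds a hypothetical population of patient heights and draws a simple random sample from it. The numbers, the seed and the sample size are assumptions chosen only for the example.

```python
import random
import statistics

random.seed(153)  # fixed seed so the illustration is repeatable

# Hypothetical population: heights (cm) of 1,000 patients.
population = [random.gauss(165, 8) for _ in range(1000)]

# A sample is a subset of the population, here chosen by simple random sampling.
sample = random.sample(population, 50)

population_mean = statistics.mean(population)
sample_mean = statistics.mean(sample)

print(f"Population mean: {population_mean:.1f} cm")
print(f"Sample mean:     {sample_mean:.1f} cm")
```

The sample mean is used to make an inference about the population mean; in practice the population mean is usually unknown.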

What is a variable?
A variable is any attribute, characteristic, or measurable
property that can vary from one observation to another.
Examples: weight, milk yield, sex, eye color, number of
pregnancies, number of hospitalizations, and the height, age
and temperature of a pregnant patient

TYPES OF VARIABLES
1 Qualitative or Categorical Variables
Take on values that are names or labels
Allow for classification of individuals based on some
attribute or characteristic.
Examples: gender, eye color, whether or not a patient
or animal is ill, name of pregnant patient
2 Quantitative Variables
Numeric
Represent a measurable quantity of an individual.
Examples: weight of animals, litter size, temperature,
time, height of pregnant patients.
Types of Quantitative Variables
Discrete Variable: a quantitative variable that has either a
finite number of possible values or a countable number of
possible values. The term countable means that the values
result from counting, such as 0, 1, 2, 3, and so on. Examples:
litter size, number of eggs laid per month, number of animals
alive, number of pregnant patients received, number of needle
punctures, number of pregnancies, and number of
hospitalizations

Continuous Variable: one that can take on any value within
some range or interval (i.e., within a specified lower and upper
limit). Examples: milk yield, weight, body mass, height,
blood pressure and cholesterol
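
The classification above can be written down as a small lookup table. The sketch below is only one way to organise it; the dictionary and the `describe` helper are hypothetical names, and the entries are taken from the examples on these slides.

```python
# Each variable maps to (type, subtype); qualitative variables have no subtype.
VARIABLE_TYPES = {
    "gender": ("qualitative", None),
    "eye color": ("qualitative", None),
    "litter size": ("quantitative", "discrete"),
    "number of hospitalizations": ("quantitative", "discrete"),
    "milk yield": ("quantitative", "continuous"),
    "blood pressure": ("quantitative", "continuous"),
}


def describe(variable: str) -> str:
    """Return a readable description of a variable's type."""
    kind, subtype = VARIABLE_TYPES[variable]
    return kind if subtype is None else f"{kind}, {subtype}"


for name in VARIABLE_TYPES:
    print(f"{name}: {describe(name)}")
```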

Figure: Illustration of the relationship among qualitative, quantitative,
discrete, and continuous variables.

DATA VS VARIABLE
The list of observed values for a variable is data.
Example: gender is a variable; the observations male or
female are data.
Qualitative data are observations corresponding to a
qualitative variable.
Quantitative data are observations corresponding to a
quantitative variable.
Discrete data are observations corresponding to a discrete
variable
Continuous data are observations corresponding to a
continuous variable

Univariate vs Bivariate data
Univariate data: only one variable is involved in the study
Bivariate data: two variables are involved
Multivariate data: more than two variables are involved
MEASUREMENT

“If a thing exists, it exists in some amount; and
if it exists in some amount, it can be measured”
E. L. Thorndike (1914)


What is measurement?
Measurement is the application of mathematics to things or
events.
A system of measurement is a crucial component of research.
Simple example: How tall is a patient? More complex example:
How obese is an animal?

Scales of measurement

Nominal Scale
Data that represent categories, names or labels. There is
no implied order to the categories of nominal data.
Observations are classified into mutually exclusive
categories.

Examples: identification number (1, 2, 3, 4, ...), color (brown,
black, white, ...), gender (male, female)

Sometimes numbers are used to designate category membership. Here,
the numbers do not have numeric implications; they are simply
convenient labels.
Example: Eye Color
Blue = 1 Brown = 2 Green = 3 Other = 4
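
A short sketch of what "convenient labels" means in practice: with nominal codes it makes sense to count categories and report the most frequent one, but not to average the codes. The ten coded observations below are invented for the illustration.

```python
from collections import Counter

# Coding scheme from the eye color example above.
EYE_COLOR_CODES = {1: "Blue", 2: "Brown", 3: "Green", 4: "Other"}

# Hypothetical coded observations for ten people.
observations = [2, 2, 1, 3, 2, 4, 1, 2, 3, 2]

# Meaningful for nominal data: frequencies and the mode.
counts = Counter(EYE_COLOR_CODES[code] for code in observations)
modal_color, modal_count = counts.most_common(1)[0]
print(counts)
print(f"Most common eye color: {modal_color} ({modal_count} of {len(observations)})")

# Not meaningful: the arithmetic mean of the codes. It changes if the
# labels are renumbered, so it says nothing about eye color.
meaningless_mean = sum(observations) / len(observations)
```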

Scales of Measurement

Ordinal Scale:
This scale has a logical ordering of the categories. It
designates an ordering (greater than, less than) but does
not assume that the intervals between numbers are equal.

1 For example, calving ease score (normal calving, calving
with little intervention, calving with considerable
intervention, very difficult calving, Caesarean section)
2 A veterinary scientist or a medical laboratory scientist
may, for example, rate an animal’s or patient’s pain level on
a ’no’, ’mild’, ’moderate’ or ’severe’ scale and use the
numbers 0, 1, 2 and 3 to label the categories, with lower
numbers indicating less pain.
3 Pain Level [no PL = 0, mild PL = 1, moderate PL = 2,
severe PL = 3]
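
The pain level coding can be sketched with an ordered enumeration. The class name `PainLevel` and the five ratings are assumptions made for the example; the point is that comparisons and the median respect the order, while the gaps between codes are not assumed equal.

```python
from enum import IntEnum
import statistics


class PainLevel(IntEnum):
    """Ordinal codes: the order matters, the spacing between codes does not."""
    NO = 0
    MILD = 1
    MODERATE = 2
    SEVERE = 3


# Hypothetical ratings for five patients.
ratings = [
    PainLevel.MILD,
    PainLevel.SEVERE,
    PainLevel.NO,
    PainLevel.MODERATE,
    PainLevel.MILD,
]

# Order-based summaries are valid for ordinal data.
worst = max(ratings)
median_rating = PainLevel(statistics.median_low(ratings))

print(f"Worst rating: {worst.name}, median rating: {median_rating.name}")
```

Using `median_low` keeps the result on an actual category instead of averaging two codes.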


Scales of Measurement
Interval Scale:
Designates an equal-interval ordering. For example,
temperature in Fahrenheit or Celsius is an interval scale
measurement. The difference in temperature between 20
degrees F and 25 degrees F is the same as the difference
between 76 degrees F and 81 degrees F.
An important point to make about interval scales is that
the zero point is simply another point on the scale; it does
not represent the starting point of the scale or the total
absence of the characteristic being measured.
A Likert scale is often treated as another example of interval
scale measurement, although individual Likert items are
strictly ordinal.
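
A quick numerical check of the interval property, using the temperatures from the slide. The conversion helper is a standard formula; the 50 and 25 degree values are extra illustrative numbers. Differences stay comparable when the unit changes, but ratios do not, because zero is arbitrary.

```python
def fahrenheit_to_celsius(deg_f: float) -> float:
    """Convert a temperature from degrees Fahrenheit to degrees Celsius."""
    return (deg_f - 32.0) * 5.0 / 9.0


# Equal differences in Fahrenheit...
diff_low = 25 - 20
diff_high = 81 - 76

# ...remain equal to each other after converting to Celsius.
diff_low_c = fahrenheit_to_celsius(25) - fahrenheit_to_celsius(20)
diff_high_c = fahrenheit_to_celsius(81) - fahrenheit_to_celsius(76)

# Ratios are not meaningful: 50 F looks like "twice" 25 F,
# but the same two temperatures in Celsius give a different ratio.
ratio_f = 50 / 25
ratio_c = fahrenheit_to_celsius(50) / fahrenheit_to_celsius(25)

print(diff_low, diff_high, round(diff_low_c, 3), round(diff_high_c, 3))
print(ratio_f, round(ratio_c, 3))
```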

Example: Temperature


Scales of Measurement
Ratio Scale: designates an equal-interval ordering with a true
zero point (i.e. the zero implies an absence of the thing being
measured).

Examples: temperature in Kelvin (zero is the absence of heat;
it cannot get colder), measurements of the weight of an animal
(zero means complete lack of weight) and measurements of the
heights of patients (zero means complete lack of height).
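
With a true zero, ratios become meaningful and do not depend on the unit. The sketch below uses made-up weights and temperatures to show this.

```python
# Hypothetical weights of two animals, in kilograms.
heavy_kg, light_kg = 40.0, 20.0

# The same two weights expressed in grams.
heavy_g, light_g = heavy_kg * 1000, light_kg * 1000

# The ratio is the same in either unit: one animal is twice as heavy.
ratio_kg = heavy_kg / light_kg
ratio_g = heavy_g / light_g

# Kelvin has a true zero, so 300 K is twice 150 K in a physical sense.
kelvin_ratio = 300.0 / 150.0

print(ratio_kg, ratio_g, kelvin_ratio)
```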


Summary of Measurement Scales


Measurement scales differ by order, equal intervals between
adjacent units and absolute zero point.
Nominal: None
Ordinal: Order
Interval: Order + Equal intervals
Ratio: Order + Equal intervals + True zero
Nominal or ordinal scaled data – Use Bar Charts (simple,
multiple, compound, etc.) or Pie Charts
Interval or ratio scaled data – Use Histogram, polygon,
ogive, etc.
Scatter plot to assess association between quantitative
variables. Note: No inference is drawn at this point; the
objective is to convey information.
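
The summary can be encoded as a small lookup, which makes the cumulative structure of the four scales explicit. The names `SCALE_PROPERTIES` and `suggested_charts` are hypothetical; the contents follow the slide.

```python
# Which properties each measurement scale has, as summarized above.
SCALE_PROPERTIES = {
    "nominal": {"order": False, "equal_intervals": False, "true_zero": False},
    "ordinal": {"order": True, "equal_intervals": False, "true_zero": False},
    "interval": {"order": True, "equal_intervals": True, "true_zero": False},
    "ratio": {"order": True, "equal_intervals": True, "true_zero": True},
}


def suggested_charts(scale: str) -> list[str]:
    """Return the chart types the slide suggests for a measurement scale."""
    if scale in ("nominal", "ordinal"):
        return ["bar chart", "pie chart"]
    if scale in ("interval", "ratio"):
        return ["histogram", "frequency polygon", "ogive"]
    raise ValueError(f"unknown measurement scale: {scale}")


for scale, properties in SCALE_PROPERTIES.items():
    held = [name for name, present in properties.items() if present]
    print(f"{scale}: properties={held or ['none']}, charts={suggested_charts(scale)}")
```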

STAGES OF STATISTICAL INVESTIGATIONS

If the investigation is to optimize the use of the available
resources, expertise and time, it is essential to carefully examine
all aspects of the design and application of statistical
investigations (experiments and surveys) at the planning level.

STEPS
1. Statement of problem and objectives: We must
identify the cause for concern and state explicitly what the
problem is, the characteristics to be measured, and the
collection, processing and publishing methods
2. Target population and the use of sample or entire
population: Define in clear, unambiguous terms the population
of interest, define the sample units to make them distinct,
non-overlapping and recognizable, and select an appropriate
sampling design


3. Design of Questionnaire or Schedule: Construction of the
questionnaire or schedule is extremely important since both the
respondent and the data collector must interpret it
4. Method of data collection: You have to decide whether
data will be collected by personal interview, online, physical
observation or some other method. Cost is a major factor here.
Personnel must be thoroughly trained to correctly locate
sampling units and take measurements.
5. Required data: The data to be collected should be guided
by the objective of the investigation.


6. List of available resources: A wide variety of resources is
likely to be required for the operation of the investigation and
the analysis of the results. These include the following:
Physical resources: Sampling frame, maps, etc.
Human resources: Data collectors, data analysts
Financial resources
7. Conducting a pilot survey: This must be carried out
before the main survey.
8. Collection, editing, storage and organization of data
9. Interpretation and presentation of results

DATA COLLECTION METHODS

Pros and Cons of Primary and Secondary Data

Where do data come from?
We often see our data already cleaned and collated in database
form:
Results of product and process improvement experiments
Firms/Hospitals (demographic data, patient recovery data,
drug efficacy data, etc.)
Take a step back: if we’re starting from scratch, how do we
collect or find data?
Secondary data
Primary data


Secondary Data
Secondary data is data someone else has collected.
EXAMPLES OF SOURCES
Vital Statistics – birth, death certificates
Hospital, clinic, school nurse records
Private and foundation databases
City and regional governments
Surveillance data from state government programs
Federal agency statistics - Census, NHIS, etc


Secondary Data - LIMITATIONS
Finding secondary data can sometimes be frustrating.
When was it collected? For how long? It may be out of date
for what you want to analyze, or may not have been collected
for long enough to detect trends.
Is the data set complete? There may be missing
information on some observations. Unless such missing
information is detected and corrected for, the analysis may be
biased.
Are there confounding problems? Sample selection bias?
Source choice bias? In time series, did some observations
drop out over time?
Are the data consistent/reliable? Did variables drop out
over time? Did variables change in definition over time?
For example, number of years of education versus highest
degree obtained
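
A completeness check of this kind can be sketched as follows. The records, field names and the `count_missing` helper are all hypothetical; a missing value is represented here by `None`.

```python
# Hypothetical secondary data set with some missing values (None).
records = [
    {"patient_id": 1, "age": 34, "years_of_education": 12},
    {"patient_id": 2, "age": None, "years_of_education": 16},
    {"patient_id": 3, "age": 51, "years_of_education": None},
    {"patient_id": 4, "age": 29, "years_of_education": 9},
]


def count_missing(rows):
    """Count missing (None) values per field across all rows."""
    counts = {}
    for row in rows:
        for field, value in row.items():
            counts[field] = counts.get(field, 0) + (1 if value is None else 0)
    return counts


missing = count_missing(records)

# Rows with no missing values at all.
complete_rows = [r for r in records if all(v is not None for v in r.values())]

print(f"Missing values per field: {missing}")
print(f"Complete rows: {len(complete_rows)} of {len(records)}")
```

Knowing how much is missing, and in which fields, is the first step before deciding how to correct for it.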


Is the information exactly what you need? In some cases, you
may have to use “proxy variables”: variables that
approximate something you really wanted to measure. Are
they reliable? Are they correlated with what you actually
want to measure?

USES OF SECONDARY DATA


As an alternative to a survey
As a source of supplementary information
As a check on possible survey biases
As a means of improving survey estimates


Secondary Data - ADVANTAGES


No need to reinvent the wheel. If someone has already
found the data, take advantage of it.


It will save you money. Even if you have to pay for access,
it is often cheaper than collecting your
own data.
It will save you time. Primary data collection is very time
consuming.
It may be very accurate. Especially when a government
agency has collected the data, considerable time
and money went into it, so it is probably highly accurate.
It has great exploratory value: exploring research
questions and formulating hypotheses to test.


PRIMARY DATA
Primary data is data you collect.

PRIMARY DATA - EXAMPLES
Surveys
Focus groups
Questionnaires
Personal interviews
Experiments and observational studies

PRIMARY DATA - LIMITATIONS
Do you have the time and money for:
Designing your collection instrument?
Selecting your population or sample?
Pretesting/piloting the instrument to work out sources of
bias?
Administration of the instrument?
Entry/collation of data?
Uniqueness: results may not be comparable to other
populations
Researcher error (sample bias, other confounding factors)

DATA COLLECTION CHOICE


What you must ask yourself:
WILL THE DATA ANSWER MY RESEARCH
QUESTION?


To answer that, you must first decide what your research
question is. Then you need to decide what data/variables are
needed to scientifically answer the question.
If the data exist in secondary form, then use them to the
extent you can, keeping in mind their limitations.
But if they do not, and you are able to fund primary
collection, then it is the method of choice. For example:
Direct Observation/Experiments
Telephone
Postal or electronic mails
Documents and reports
Interviewing

Questionnaire design
A survey is only as good as the questions it asks

What should you ask?


The questions asked are a function of previous decisions
The questions asked are a function of future decisions (such
as statistical analysis)

Key Criteria
Questionnaire relevancy: No unnecessary information is
collected and only information needed to solve the problem
is obtained. Be specific about your data needs; tie each
question to an objective
Questionnaire accuracy: Information is both reliable
and valid
Phrasing Questions
Open ended response versus fixed alternative questions
Decision criteria: type of research; time; method of
delivery; budget; concerns regarding researcher bias

AVOID
Leading questions
Overly complex questions
Use of jargon
Loaded questions (can use a counter-biasing statement)
Ambiguity
Double-barreled questions
Making assumptions

DECISIONS
Ranking, sorting, rating or choice
How many categories or response positions
Balanced or unbalanced
Forced choice or non-forced choice


Types of questions
Types of fixed alternative questions
Single dichotomy or dichotomous-alternative questions
Example: Does any patient show signs of coughing?
Yes No
Respondent chooses one of two alternatives (yes/no;
male/female)
What scale would this data create?


Multi-choice alternative questions


Multi-choice alternative (Respondent chooses from several
alternatives)
1. Determinant choice
Choose only one from several possible responses
Example: Which College are you currently registered in
at the University?
Engineering
Science
Arts/Soc. Science
Health sciences
Planning and Architecture

2. Frequency determination
Asks for an answer about frequency of occurrence
Example: In a typical week, how often do you purchase
animal feed?
Never
Once
Two or more times

3. Check list
Provide multiple answers to a single question
Options should be mutually exclusive and exhaustive
Example: What brands of animal feed have you, to the best of
your memory, purchased in the past month (check all that
apply)?
Cargill Purina New Hope Liuh
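
Because a check list allows several answers per respondent, each response is naturally stored as a set. The sketch below tallies such responses; the option labels and the five responses are placeholders, not data from a real survey.

```python
from collections import Counter

# Placeholder answer options; the labels are illustrative only.
OPTIONS = ["Brand A", "Brand B", "Brand C"]

# Each respondent may tick any number of options, so a response is a set.
responses = [
    {"Brand A", "Brand B"},
    {"Brand B"},
    set(),                      # ticked nothing
    {"Brand A", "Brand C"},
    {"Brand B", "Brand C"},
]

# Count how many respondents ticked each option.
tally = Counter(option for response in responses for option in response)

for option in OPTIONS:
    share = tally[option] / len(responses)
    print(f"{option}: {tally[option]} of {len(responses)} respondents ({share:.0%})")
```

The percentages can add up to more than 100% because one respondent can tick several options, unlike in a determinant choice question.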
END OF LECTURE ASSIGNMENT

Group Assignment
In groups of ten (10), design a questionnaire on the topic
“Content of Introductory Chemistry Courses for Different
Degree Programs at KNUST”
