01 - Introduction To Biostatistics
01 - Introduction To Biostatistics
2 04/22/2020
What is Statistics ?
Statistics is a science that deals with the
collection,
organizing,
analysis,
interpretation, and
presentation of information that can be stated
numerically.
(Daniel
WW, 1999)
Biostatistics is the application of statistics in
biological sciences & medicine
3 04/22/2020
Why should I study
Statistics?
A tool for research
Every research involves data collection, analysis
and interpretation of results
Easier to communicate with
Statisticians/Biostatisticians
Understanding medical literature (improve
literature appraisal‐skills)
Enable to think analytically and facilitate decision‐
making process
Epidemiology and statistics go hand in hand
They both are instrumental in quantifying medical
sciences
4 04/22/2020
Data and Data Sets
Data are the facts and figures that are
collected, summarized, analyzed, and
interpreted.
The data collected in a particular study are
referred to as the data set.
Data are entered into computers and saved
as electronic database.
Statistical software are used to archive and
analyze data
5 04/22/2020
Elements, Variables, and
Observations
The elements are the entities on which data are
collected. E.g. all subjects in a population, all heights
or weights etc
A variable is a characteristic of interest for the
elements.
The set of measurements collected for a particular
element is called an observation.
The total number of data values in a data set is the
number of elements multiplied by the number of
variables.
E.g. # elements = 10 students; # variables = 3 (age,
wt., ht.)
Total # data values (bits) = 10 (ements)*3 (vars.) = 30
6 04/22/2020
Discrete Vs. Continuous Random
Variables?
8 04/22/2020
Classification of variables
Characteristics that are Characteristics that are
merely categorized. measurable.
Unit of measurement is not Variable Unit of measurement is either
numeric. discrete or continuous.
Categorica Numerical
l (Quantitat
(Qualitativ ive)
Known as
‘scale’ in
e) SPSS
9 04/22/2020
more informative
Measurement Scales
Assignment of numerals to objects or events
according to a set of rules (Steven, 1946).
Knowledge of knowing types of variables helps
in deciding how to organize, analyze and present
the data.
10 04/22/2020
Categorical variables
NOMINAL
classify characteristics of variables into
categories e.g. sex, race, colour, blood group; or
dichotomous variables: e.g. high or low BP, alive
or dead
data is mutually exclusive and not in rank order.
assigned by numeric values in statistical
software;
Sex: 1 = male Race: 1 = Malay
2 = female 2 = Chinese
3 = Indian
11
4 = Others
04/22/2020
Categorical variables
ORDINAL
classifies data into categories that can be ranked
(ordered categories)
differences/distances between ranks don't exist.
properties of ordinal data:
- data are mutually exclusive
- data categories have some logical order
13 04/22/2020
Numerical variables
RATIO
Very similar to interval variables
However, zero point is NOT arbitrary, e.g:
temperature in Kelvin scale (0 point is physically
zero/no value).
Able to multiple or divide the values.
Other examples of ratio variables:
Blood pressure
Height
NOTES:
Intervals and Ratio variables sometime are
indistinguishable and handled the same way in
analyzing data.
14
Both can be converted to categorical variables04/22/2020
(but
with lost of information (Cohen, 1983).
Exercise!
State the data type of each of the following:
Homicide rate (deaths/100,000)
High school graduate (Y/N)
Hair color
Hospital expenditures (yearly, in dollars)
Smoking status (none, light, heavy)
Coronary heart disease (Y/N)
15 04/22/2020
16 04/22/2020