Lab - Unit 1 Lesson 1
Lab - Unit 1 Lesson 1
Lesson 1 Variables
1. Definition of Variable
2. Types of Variables and Data
3. Other types of Variables
Assessment
Overview
Statistics is the science of conducting studies to collect, organize, summarize, analyze, and draw
conclusions from data. You need to be able to read and comprehend the numerous statistical studies carried
out in your subjects, just like professionals do. You must be familiar with the terms, symbols, ideas, and
statistical techniques utilized in these investigations in order to comprehend this.
Definition of Variable
Data are the values (measurements or observations) that the variables can assume. a sets describe values
for each variable for unknown quantities such as height, weight, temperature, volume, etc., of an object or
values of random numbers. The values in this set are known as a datum. The data set consists of data of one
or more members corresponding to each row. Each value in the data set is called a data value or a datum.
Source: https://www.saedsayad.com/data_preparation.htm
Source: https://www.saedsayad.com/data_preparation.htm
A numerical or continuous variable is one that can accept any value within a finite or infinite interval
(e.g., height, weight, temperature, blood glucose ...). There are two types of numerical
data, interval and ratio. Data on an interval scale can be added and subtracted but cannot be meaningfully
multiplied or divided because there is no true zero. For example, we cannot say that one day is twice as hot
as another day. On the other hand, data on a ratio scale has true zero and can be added, subtracted, multiplied
or divided (e.g., weight).
A categorical or discrete variable is one that can accept two or more values (categories). There are two
types of categorical data, nominal and ordinal. Nominal data does not have an intrinsic ordering in the
categories. For example, "gender" with two categories, male and female. In contrast, ordinal data does have
an intrinsic ordering in the categories. For example, "level of energy" with three orderly categories (low,
medium and high).
https://medium.com/@dharmanathpatil/types-of-random-variables-
Four common types of scales are used: nominal, ordinal, interval, and ratio.
1. Ratio
The ratio level of measurement possesses all the characteristics of interval measurement, and there exists
a true zero. In addition, true ratios exist when the same variable is measured on two different members of
the population.
For example, some researchers classify IQ data as ratio data rather than interval. Also, data can be altered
so that they fit into a different category. For instance, if the incomes of all professors of a college are
classified into the three categories of low, average, and high, then a ratio variable becomes an ordinal
variable.
2. Interval
The interval level of measurement ranks data, and precise differences between units
of measure do exist; however, there is no meaningful zero.
For example, many standardized psychological tests yield values measured on an interval scale. IQ is an
example of such a variable. There is a meaningful difference of 1 point between an IQ
of 109 and an IQ of 110. Temperature is another example of interval measurement, since
there is a meaningful difference of 1F between each unit, such as 72 and 73F. One
property is lacking in the interval scale: There is no true zero. For example, IQ tests do
not measure people who have no intelligence. For temperature, 0F does not mean no
heat at all.
3. Nominal
The nominal level of measurement classifies data into mutually exclusive (nonoverlapping),
exhausting categories in which no order or ranking can be imposed on the data.
A sample of college instructors classified according to subject taught (e.g., English, history, psychology, or
mathematics) is an example of nominal-level measurement. Classifying survey subjects as male or female
is another example of nominal-level measurement. No ranking or order can be placed on the data.
Classifying residents according to zip codes is also an example of the nominal level of measurement. Even
though numbers are assigned as zip codes, there is no meaningful order or ranking.
4. Ordinal
The ordinal level of measurement classifies data into categories that can be ranked;
however, precise differences between the ranks do not exist.
Data measured at this level can be placed into categories, and these categories can be ordered, or ranked.
For example, from student evaluations, guest speakers might be ranked as superior, average, or poor. Floats
in a homecoming parade might be ranked as first place, second place, etc. Note that precise measurement
of differences in the ordinal level of measurement does not exist. For instance, when people are classified
according to their build (small, medium, or large), a large variation exists among the individuals in each
class.
2. Mediating Variable
Mediating variables help us understand the relationship between the independent variable, and
dependent variable when there is no such direct relationship between both. When independent variables
cannot influence the dependent variable, a mediating variable works as a referee between the two and help
us navigate the relationship between independent variables (IV) and dependent variables (DV). Mediating
variables are also called intervening variables.
3. Moderating Variable
A moderating variable is a variable that affects the strength or direction of the relationship between
two other variables. It is also referred to as an interactive variable or a moderator.In social science
research, a moderating variable is often used to understand how the relationship between two
variables changes depending on the level of a third variable.
Moderating Variable Examples
Here are a few examples of moderating variables:
Age as a moderating variable: Suppose a study examines the relationship between exercise and heart
health. Age may act as a moderating variable, influencing the relationship between exercise and heart
health. For example, the relationship between exercise and heart health may be stronger for younger
adults compared to older adults.
Gender as a moderating variable: Consider a study examining the relationship between salary and
job satisfaction. Gender may act as a moderating variable, influencing the relationship between salary
and job satisfaction. For example, the relationship between salary and job satisfaction may be stronger
for men than for women.
Social support as a moderating variable: Suppose a study examines the relationship between stress
and mental health. Social support may act as a moderating variable, influencing the relationship
between stress and mental health. For example, the relationship between stress and mental health may
be stronger for individuals with low social support compared to those with high social support.
Education level as a moderating variable: Consider a study examining the relationship between
technology use and academic performance. Education level may act as a moderating variable,
influencing the relationship between technology use and academic performance. For example, the
relationship between technology use and academic performance may be stronger for individuals with
higher education levels compared to those with lower education levels.
Lighting
Background noise
Room temperature
Visual distractions
2. Participant variables
Participant variables, or personal variables, occur when a participant’s personal feelings or
character traits affect the experiment. Examples include participants’:
Natural intelligence
Demographics
Mood
Physical capabilities
Level of understanding of the study
3. Experimenter variables
This type of extraneous variable occurs when the researcher or experimenter unintentionally
influences how participants should behave. Examples include:
The age, gender and mannerisms of the experimenter
Using certain phrasing to hint at the way experimenters want participants to behave
Speaking in a positive or negative tone when providing experiment instructions
4. Demand characteristic variables
These variables convey information or clues to the participant about the purpose of the study. They
can also let the participant know the outcome the researcher desires, which can affect the results of
the study. Examples include:
Restriction example
You want to study whether a low-carb diet can cause weight loss. Since you know that age, sex, level of
education and exercise intensity are all factors that may be associated with weight loss, as well as with the
diet your subjects choose to follow, you choose to restrict your subject pool to 45-year-old women with
bachelor’s degrees who exercise at moderate levels of intensity between 100–150 minutes per week.
2. Matching
In this method, you select a comparison group that matches with the treatment group. Each member of the
comparison group should have a counterpart in the treatment group with the same values of potential
confounders, but different independent variable values.
This allows you to eliminate the possibility that differences in confounding variables cause the variation in
outcomes between the treatment and comparison group. If you have accounted for any potential
confounders, you can thus conclude that the difference in the independent variable must be the cause of the
variation in the dependent variable.
Matching example
In your study on low-carb diet and weight loss, you match up your subjects on age, sex, level of education
and exercise intensity. This allows you to include a wider range of subjects: your treatment group includes
men and women of different ages with a variety of education levels.
3.Statistical control
If you have already collected the data, you can include the possible confounders as control variables in your
regression models; in this way, you will control for the impact of the confounding variable.
Any effect that the potential confounding variable has on the dependent variable will show up in the results
of the regression and allow you to separate the impact of the independent variable.