
VARIABLES

Kerlinger (1986): "A variable is a property that takes on different values. It is a symbol to which numerals or values are assigned."

Best and Kahn (2003): "A variable is any characteristic or quality that varies among the members of a particular group."

In biostatistics, variables are classified into different types based on their characteristics and the kind of data they represent. These classifications help determine the appropriate statistical methods for analysis. The main types of variables in biostatistics are:

### 1. **Categorical Variables**

Categorical variables represent data that can be divided into distinct groups
or categories. They can be further classified into nominal and ordinal
variables.

- Nominal Variables: These variables have categories without any
intrinsic ordering. Examples include blood type (A, B, AB, O), gender
(male, female), and race (Caucasian, Asian, African American, etc.).
- Ordinal Variables: These variables have categories with a meaningful
order, but the intervals between the categories are not necessarily
equal. Examples include stages of cancer (Stage I, Stage II, Stage III,
Stage IV) and satisfaction ratings (very satisfied, satisfied, neutral,
dissatisfied, very dissatisfied).

### 2. **Numerical Variables**

Numerical variables represent data that can be measured on a numerical
scale. They can be further classified into discrete and continuous variables.

- Discrete Variables: These variables represent countable data,
usually integers. Examples include the number of patients in a study,
the number of infections, and the number of hospital visits.
- Continuous Variables: These variables represent data that can take
any value within a range and can be measured with high precision.
Examples include height, weight, blood pressure, cholesterol levels,
and age.

### 3. **Binary Variables**

Binary variables, also known as dichotomous variables, are a special type of
categorical variable with only two categories or levels. Examples
include the presence or absence of a disease (yes/no), gender (male/female),
and survival status (alive/deceased).

### 4. **Interval Variables**

Interval variables are numerical variables that have equal intervals between
values, but no true zero point. Examples include temperature in Celsius or
Fahrenheit, where the difference between 20°C and 30°C is the same as
between 30°C and 40°C, but 0°C does not indicate an absence of
temperature.

### 5. **Ratio Variables**

Ratio variables are numerical variables that have equal intervals between
values and a true zero point, which allows for meaningful ratios. Examples
include height, weight, age, and income. For instance, a weight of 0 means
no weight, and a weight of 10 kg is twice as much as 5 kg.
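The interval/ratio distinction can be illustrated in a few lines of Python; the temperatures and weights below are made-up values for illustration:

```python
# Interval scale (Celsius): equal intervals, but no true zero,
# so ratios of values are not meaningful.
c1, c2 = 10.0, 20.0
print(c2 / c1)  # 2.0, yet 20 °C is not "twice as hot" as 10 °C

# Converting to Kelvin (a true zero) exposes the physically meaningful ratio.
k1, k2 = c1 + 273.15, c2 + 273.15
print(round(k2 / k1, 3))  # ≈ 1.035

# Ratio scale (weight in kg): a true zero makes ratios meaningful.
w1, w2 = 5.0, 10.0
print(w2 / w1)  # 2.0: 10 kg really is twice as heavy as 5 kg
```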

### 6. **Confounding Variables**

- Variables that affect both the independent and dependent variables (e.g., age, gender).

### 7. **Moderator Variables**

- Variables that interact with the independent variable to affect the dependent variable (e.g., drug interactions).

### 8. **Time Variables**

- Variables that measure time (e.g., follow-up time, survival time).

### 9. **Repeated Measures Variables**

- Variables measured multiple times for each individual (e.g., in longitudinal studies).
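These classifications map directly onto data types in analysis software. A minimal sketch using pandas (the patient records below are hypothetical):

```python
import pandas as pd

# Hypothetical patient records illustrating the variable types above.
df = pd.DataFrame({
    "blood_type": ["A", "O", "B", "AB"],        # nominal (no order)
    "cancer_stage": ["I", "III", "II", "IV"],   # ordinal (ordered categories)
    "hospital_visits": [2, 0, 5, 1],            # discrete (counts)
    "weight_kg": [70.5, 82.1, 64.0, 90.3],      # continuous (ratio scale)
})

# Nominal: unordered categorical
df["blood_type"] = pd.Categorical(df["blood_type"])
# Ordinal: ordered categorical, so comparisons like < and > are meaningful
df["cancer_stage"] = pd.Categorical(
    df["cancer_stage"], categories=["I", "II", "III", "IV"], ordered=True
)

print(df.dtypes)
```

Encoding the ordering explicitly matters because it tells downstream methods (e.g., ordinal regression or sorting) that Stage IV > Stage I, while blood types remain unranked labels.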

NORMAL DISTRIBUTION
Normal distribution, also known as the Gaussian distribution, is a probability
distribution that appears as a “bell curve” when graphed. The normal
distribution describes a symmetrical plot of data around its mean value,
where the width of the curve is defined by the standard deviation.
Formula:

f(x) = (1 / (σ√(2π))) · e^(−(x − μ)² / (2σ²))

where

x = the value of the variable or data point being examined, and f(x) its probability density

μ = the mean

σ = the standard deviation
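Given these definitions, the density can be coded directly; a minimal sketch in Python:

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Density f(x) of a normal distribution with mean mu and SD sigma."""
    coef = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return coef * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

print(round(normal_pdf(0.0), 4))  # peak of the standard normal: 0.3989
print(round(normal_pdf(1.0), 4))  # one SD from the mean: 0.242
```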

The properties of normal distributions:

- The mean, median and mode are exactly the same.
- The distribution is symmetric about the mean: half the values fall below the mean and half above it.
- The distribution can be described by two values: the mean and the standard deviation.
- The standard normal distribution has a mean of 0 and a standard deviation of 1. Every normal distribution has zero skew and a kurtosis of 3.

Empirical rule

- Around 68.3% of values lie within 1 standard deviation of the mean.
- Around 95.4% of values lie within 2 standard deviations of the mean.
- Around 99.7% of values lie within 3 standard deviations of the mean.
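The empirical rule is easy to check by simulation; a sketch with NumPy (the mean, SD, sample size, and seed are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
sample = rng.normal(loc=100, scale=15, size=100_000)  # arbitrary mean and SD

mean, sd = sample.mean(), sample.std()
for k in (1, 2, 3):
    share = np.mean(np.abs(sample - mean) <= k * sd)
    print(f"within {k} SD: {share:.1%}")  # ≈ 68.3%, 95.4%, 99.7%
```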

Central Limit Theorem:

The Central Limit Theorem (CLT) states that the sum (or average) of a large
number of independent, identically distributed variables will be
approximately normally distributed, regardless of the original distribution of
the variables. This is crucial for inferential statistics, as it allows for the use
of normal distribution assumptions in hypothesis testing and confidence
interval estimation.
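A quick simulation makes the CLT concrete: averages of draws from a strongly right-skewed exponential distribution come out approximately normal (the sample sizes and seed below are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
# 10,000 samples, each the mean of n = 50 exponential draws (mean 1, SD 1).
draws = rng.exponential(scale=1.0, size=(10_000, 50))
sample_means = draws.mean(axis=1)

# Despite the skewed parent distribution, the means cluster symmetrically
# around 1.0 with spread ≈ 1 / sqrt(50) ≈ 0.141, as the CLT predicts.
print(round(sample_means.mean(), 3), round(sample_means.std(), 3))
```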

Applications in Biostatistics:

- Summarizing data using the mean and standard deviation.

- Conducting hypothesis tests and constructing confidence intervals for population parameters.

- Assuming normally distributed errors to make inferences about relationships between variables (e.g., in linear regression).

- Using normal distribution properties to monitor and control processes.

Examples

- **Height and Weight**: Often, the distribution of heights and weights in a population approximates a normal distribution.

- **Blood Pressure**: The distribution of blood pressure readings in a large population can often be modeled as a normal distribution.
SKEWNESS

Skewness is a statistical measure that describes the asymmetry of a
distribution. It indicates whether the data points are spread out more to the
left or the right of the mean. There are three types of skewness:

Positive Skewness (Right Skewness): When the tail on the right side of
the distribution is longer or fatter than the left side. The bulk of the values lie
to the left of the mean, and the mean is typically greater than the median.

Negative Skewness (Left Skewness): When the tail on the left side of the
distribution is longer or fatter than the right side. The bulk of the values lie to
the right of the mean, and the mean is typically less than the median.

Zero Skewness (Symmetry): When the distribution is symmetrical, the
tails on both sides are balanced, indicating that the mean and median are
equal.
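All three cases can be verified numerically; a sketch using scipy.stats (the exponential sample is just a convenient skewed example):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
right_skewed = rng.exponential(scale=2.0, size=50_000)  # long right tail
left_skewed = -right_skewed                             # mirrored: long left tail
symmetric = rng.normal(size=50_000)

print(stats.skew(right_skewed))  # > 0: mean pulled above the median
print(stats.skew(left_skewed))   # < 0: mean pulled below the median
print(stats.skew(symmetric))     # ≈ 0
```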
**Importance of Skewness:**

1. Understand the shape of the data distribution

2. Identify outliers and their impact

3. Make informed decisions in finance, risk assessment, and quality control

**Limitations of Skewness:**

1. Skewness does not provide information about the peakedness or tail
heaviness of the distribution.
2. Skewness can be highly affected by outliers, leading to potential
misinterpretation.
3. It only describes the direction of asymmetry, not its impact on data
interpretation.

POSITIVE VS NEGATIVE SKEWNESS

1. **Direction of the Tail**:

- **Positive Skewness (Right-Skewed)**: The tail extends to the right, indicating the presence of outliers that are larger than most of the data.

- **Negative Skewness (Left-Skewed)**: The tail extends to the left, indicating the presence of outliers that are smaller than most of the data.

2. **Shape of the Distribution**:

- **Positive Skewness**: The peak (mode) is to the left of the center, with the right tail being longer.

- **Negative Skewness**: The peak (mode) is to the right of the center, with the left tail being longer.

3. **Impact on Descriptive Statistics**:

- **Positive Skewness**: High-value outliers in the right tail inflate the standard deviation and variance.

- **Negative Skewness**: Low-value outliers in the left tail likewise inflate the standard deviation and variance.

Here are nine key differences between positive and negative skewness:

1. Direction of the tail:

- Positive skew: Long tail extends to the right

- Negative skew: Long tail extends to the left

2. Position of the mean:

- Positive skew: Mean is greater than median

- Negative skew: Mean is less than median

3. Relation to mode:

- Positive skew: Mode < Median < Mean

- Negative skew: Mean < Median < Mode

4. Concentration of data:

- Positive skew: Most data concentrated on the left

- Negative skew: Most data concentrated on the right

5. Outliers:

- Positive skew: Outliers tend to be on the high end

- Negative skew: Outliers tend to be on the low end

6. Common examples:

- Positive skew: Income distributions, reaction times

- Negative skew: Age at death, exam scores on easy tests

7. Shape analogy:

- Positive skew: "Right-tailed" or "right-leaning"

- Negative skew: "Left-tailed" or "left-leaning"

8. Mathematical representation:

- Positive skew: Skewness coefficient > 0

- Negative skew: Skewness coefficient < 0

9. Effect on standard deviation:

- Positive skew: High-end outliers tend to inflate the standard deviation

- Negative skew: Low-end outliers likewise inflate the standard deviation

KURTOSIS
Kurtosis is a statistical measure that describes the shape of a distribution’s
tails in relation to its overall shape, specifically the “tailedness” or the
propensity for producing outliers. It provides insight into the data’s
distribution, indicating how much of the data is in the tails and the peak of
the distribution.

1. **Types of Kurtosis**:

- Mesokurtic (Kurt = 3.0): This is the kurtosis of a normal distribution, with an excess kurtosis of zero. It represents a moderate level of tail thickness and peak height.

- Leptokurtic (Kurt > 3.0): Distributions with positive excess kurtosis are called leptokurtic. These have fatter tails and a sharper peak than the normal distribution, indicating more frequent extreme values. While a leptokurtic distribution may be "skinny" in the center, it also features "fat tails."

- Platykurtic (Kurt < 3.0): Distributions with negative excess kurtosis are called platykurtic. These have thinner tails and a flatter peak than the normal distribution, indicating fewer extreme values.
Excess Kurtosis

Excess kurtosis compares the kurtosis of a distribution against the kurtosis of a normal distribution, which equals 3. It is therefore found using the formula below:

Excess Kurtosis = Kurtosis − 3
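Many software packages report excess kurtosis directly; for example, scipy.stats.kurtosis subtracts 3 by default. A short numerical check (the distributions below are chosen purely for illustration):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
normal = rng.normal(size=100_000)            # mesokurtic: excess ≈ 0
heavy = rng.standard_t(df=5, size=100_000)   # leptokurtic: excess > 0 (fat tails)
flat = rng.uniform(-1, 1, size=100_000)      # platykurtic: excess ≈ -1.2

# stats.kurtosis returns EXCESS kurtosis (kurtosis - 3) by default.
for name, x in [("normal", normal), ("t(5)", heavy), ("uniform", flat)]:
    print(name, round(stats.kurtosis(x), 2))
```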

**Importance of Kurtosis:**

1. Tail Extremity Analysis

2. Risk Assessment
3. Distribution Shape Insight

**Limitations of Kurtosis:**

1. Insensitive to Mean and Variance

2. Complex Interpretation

3. Outlier Sensitivity
