3RD Quarter Statistics and Probability
3RD Quarter Statistics and Probability
STATISTICS
- Is the science of collecting, organizing, analyzing, and interpreting data in order to
make decisions.
DATA
- Consists of information coming from observation, counts, measurements, or
responses.
POPULATION
- Is the collection of all outcomes, responses, measurements, or counts that are of
interest.
SAMPLE
- Is the subset of a population.
- Like representative of a population.
PARAMETER
- Is a numerical description of a population's characteristics.
STATISTIC
- Is a numerical description of a sample's characteristics.
- !!!!STATISTICS AND STATISTIC ARE DIFFERENT.!!!
BRANCHES OF STATISTICS
DESCRIPTIVE STATISTICS
- Aims to provide information about the collected data and does not attempt to draw
conclusions about anything that pertains to more than the data collected or a larger
set of data.
INFERENTIAL STATISTICS
- Is concerned with making generalizations based on information gathered from a small
group of observations (sample) to a bigger group of observations (population).
- It leads to prediction, inferences, interpretation or conclusion about the entire
population.
VARIABLE
- Is a characteristic that changes or varies over a period of time and/or different
individuals or objects under consideration. (Ex. Age, body temperature, height,
educational attainment, numbers of errors in a test.)
- When a variable is actually measured on individuals or objects of interest to the
investigator, a set of measurements or data is obtained.
UNIVARIATE DATA
- Result when a single variable is measured on a single individual or object.
BIVARIATE DATA
- Result when two variables are measured on a single individual or object.
MULTIVARIATE DATA
- Result when more than two variables are measured on a single individual or object.
TYPES OF DATA
QUALITATIVE DATA
- Consists of attributes, labels, or non-numerical entries. (Ex. Sex of a person, school
type, favorite color, rice variety.)
QUANTITATIVE DATA
- Consists of numerical measurements or counts. (Ex. Age, height, length of time to
complete a test, performance ratings.)
● CONTINUOUS VARIABLE
- Is one for which, within the limits the variable ranges, any value is
possible. (Ex. Weight, time to complete a task, height, salary, amount.)
● DISCRETE VARIABLE
- Is one that cannot take on all values within the limits of the variable
- Can be attained by counting. (Ex. Responses to a five-point rating scale,
number of provinces in region 12, number of children.)
LEVELS OF MEASUREMENT
( LOWEST TO HIGHEST )
NOMINAL
- Calculated using names, labels, or qualities. No mathematical computations can be
made at this level.
- Only characteristics.
- Data is only qualitative at this level.
ORDINAL
- Arranged in order, but differences between data entries are not meaningful.
- Numbers are involved.
- In order and has characteristics.
- Data at this level is qualitative or quantitative.
INTERVAL
- Arranged in order, the differences between the data entries can be calculated.
- Has characteristics, in order, has no true meaning of 0.
- Data at this level is quantitative.
RATIO
- Data at this level are similar to the interval level, but a zero entry is meaningful.
- A ratio of two data values can be formed so one data value can be expressed as a
ratio.
- Has characteristics, in order, has a true meaning of 0.
WEEK 2 : MEASURES OF CENTRAL TENDENCY
MEAN
- Most commonly referred to as the average.
- To find the mean, add up all of the numbers in your list and divide by the number of
numbers.
- Really good when the data is fairly close together.
- Most commonly used.
- In statistics, it is important to distinguish between the mean of a population and the
mean of a sample of that population.
● POPULATION
- The greek letter mu, µ - represents a population mean.
𝑥1 + 𝑥2 +...+ 𝑥𝑛
- µ = 𝑁
∑𝑥
- µ = 𝑁
● SAMPLE
- 𝑥 read as “x-bar” represents a Sample mean
𝑥1+𝑥2+...+𝑥𝑛
- 𝑥 = 𝑛
∑𝑥
- 𝑥 = 𝑛
Ex. 1
If odd
Median = 93
Ex. 2
If even
MODE
- Is the most frequent number in a data set.
- There can be no mode as well as more than one mode.
- Good when the value of the number is the most important information. (ex. Shoe size).
- Only choice with categorical data.
Ex.
VARIANCE
- Is a measure of how data points vary from the mean
- Is represented in squared units.
● POPULATION VARIANCE
𝑁
2 1 2
- σ = 𝑁
∑ (𝑋𝑖 − µ)
𝑖=1
2
- σ = Population variance
- N = Number of observations in population
- 𝑋𝑖= ith Observation in the population
- µ = Population mean
● SAMPLE VARIANCE
𝑛
2 1 2
- 𝑠 = 𝑛−1
∑ (𝑥𝑖 − 𝑥)
𝑖=1
2
- 𝑠 = Sample variance
- n = number of observation in sample
- 𝑥𝑖 = ith observation in the sample
- 𝑥 = sample mean
STANDARD DEVIATION
- Is the measure of the distribution of statistical data.
- Is represented in the same units as the mean of data.
● POPULATION VARIANCE
𝑁
1 2
- σ = 𝑁
∑ (𝑋𝑖 − µ)
𝑖=1
𝑛
1 2
- 𝑠 = 𝑛−1
∑ (𝑥𝑖 − 𝑥)
𝑖=1
- 𝑥 = sample mean
EXCEL APPLICATION