Statistics Midterm Study Guide
Statistics Midterm Study Guide
Unit 1:
Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to
assist in making more effective decisions.
Descriptive Statistics: Study of the behavior of the data of an experiment based on the
measures of dispersion, centralization and shape and their graphical representation.
Inferential Statistics: Analysis of data from a randomized experiment with the aim of drawing
consequences and making predictions about the entire population.
Probability: Formal (mathematical) study of randomness. Construction of models
Data: Facts, information or figures that are collected, analyzed and summarized for their
presentation or interpretation.
Individuals/elements: Entities on which data is collected.
Variable: A characteristic of interest of the element which we want to analyze
Sample: A portion, or part, of the population of interest. We use it to draw conclusions about the
population. The number of individuals in a sample is expressed as n.
Population: The entire set of individuals or objects of interest or the measurements obtained
from all individuals or objects of interest. We want to draw conclusions about a characteristic of
the population. The number of individuals is typically expressed as N.
INFERENTIAL STATISTICS: We want to estimate the real value of the population parameter
using a sample
Parameter ← Statistic
Parameter Statistic
Simple Random Sample: A sample selected so that each item or person in the population has
the same probability or chance of being included.
SystematicEditar
RandomconSample: A random
la aplicación starting point is selected, and then every kth member
Documentos
of the population is selected.
Stratified Retoca
Random el archivo,
Sample:deja comentarios
A population y
is divided into subgroups, called strata, and a sample
compártelo con otras personas
is randomly selected from each stratum. para editarlo
de forma conjunta.
Cluster Sampling: A population is divided into clusters using naturally occurring geographic or
other boundaries. Then, clusters are randomly selected, and a sample is collected by randomly
selecting from each cluster.
NO, GRACIAS
Convenience Sampling: Instead of picking people at random from a certain population,
convenience sampling involves picking the people who are easiest for the researcher to get
information from
DESCARGAR LA APLICACIÓN
https://docs.google.com/document/u/1/d/15aEvu8RoTZPs5HsnFqy18WbuiR93rOQGHoHP5qUoTFA/mobilebasic 6/5/25, 18 32
Página 1 de 6
:
Unit 2:
Median: Value in the center of the data distribution; Value in the middle
- 1+2/2=1.5
Mode: Most repeated value
- Unimodal: 1 1 1 1 1 1 2 2 6 → Mode = 1
- Amodal: 1 2 3 4 5 6 7 → Mode = ? (Non-existent)
- Bimodal: 1 1 1 4 4 4 6 → Mode = 1 and 4
- Unimodal: 1 1 2 2 6 → Mode = 1,5
Mean: Average value
- The (arithmetic) mean is one way of averaging the data. We add all the values and
divide by the number of values.
- 1+5+6/3=12/3=4
https://docs.google.com/document/u/1/d/15aEvu8RoTZPs5HsnFqy18WbuiR93rOQGHoHP5qUoTFA/mobilebasic 6/5/25, 18 32
Página 2 de 6
:
Range: Interval in which we find the data (Greatest Value - Lowest value)
- (Range = XMax - XMin)
Variance: Magnifies values which are far from the mean (s2 or σ2)
- Example: 1+2+4+7+9=23/5=4.6 → (4.6-1)^2+(4.6-2)^2+(4.6-4)^2+(4.6-7)^2+(4.6-
9)^2= 9.04
Standard deviation: Similar to variance, but it has the same units as the data (s or σ)
- Example: s= √9.04 = 3.01
Coefficient of variation: Ratio which indicates if the mean is representative of the data
Example:
https://docs.google.com/document/u/1/d/15aEvu8RoTZPs5HsnFqy18WbuiR93rOQGHoHP5qUoTFA/mobilebasic 6/5/25, 18 32
Página 3 de 6
:
Measures of positions:
- Percentiles: Divide data in 100 parts containing 1% of the data. There are 99 centiles.
- Decicles: Divide data in 10 parts containing 10% of the data. There are 9 deciles.
- Quartiles: Divide data in 4 parts containing 25% of the data. There are 3 quartiles.
Example:
“Find Q1 and P33 of this set of numbers → 1 1 1 2 2 3 5 6 6 6 6 7 7 8 8 9 12”
P33
L33 = (17 +1) * (33/100) = 6 (Position 6)
P33 = 3
https://docs.google.com/document/u/1/d/15aEvu8RoTZPs5HsnFqy18WbuiR93rOQGHoHP5qUoTFA/mobilebasic 6/5/25, 18 32
Página 4 de 6
:
Sample space (Ω): The set of all possible outcomes.
- Example → Rolling a die: Ω = {1,2,3,4,5,6}
Sure event: The result is always verified and coincides with the sample space. The probability
of it happening is 1 (out of 100%).
- Example → Get a 1, 2, 3, 4, 5 or 6 in a die
Impossible event: The result is never checked and matches the empty space. It does not contain
any possible results. The probability of it happening is 0 (0%).
- Example → Get a 7 in a die
Random variables:
- Discrete Random Variables → A discrete random variable is a random variable that can
take a finite or countable number of values (which cannot take values between two
consecutive numbers), where each of these values has a probability greater than 0 of
being observed.
- Examples: (Number of students who pass a course, number of customers in one
day,…)
- Continuous Random Variables →
- Examples: (Temperature in this room, height of a person, contents of a
bottle,…)
https://docs.google.com/document/u/1/d/15aEvu8RoTZPs5HsnFqy18WbuiR93rOQGHoHP5qUoTFA/mobilebasic 6/5/25, 18 32
Página 5 de 6
:
Experimental (or statistical) probability:
- Intuition: it tells us that when drawing a ball, the ball that we will draw will surely be
red. How sure can we be?
- Experimental definition: If we repeat the experiment, we will calculate fA, that is, we
calculate the times that the event A occurred. The more we repeat the experiment, the
more fA tends to the same number, which we call probability of the event A, P(A).
Theoretical probability:
Laplace rule:
- Properties:
event:
https://docs.google.com/document/u/1/d/15aEvu8RoTZPs5HsnFqy18WbuiR93rOQGHoHP5qUoTFA/mobilebasic 6/5/25, 18 32
Página 6 de 6
: