0% found this document useful (0 votes)

6 views

Statistical Methods

The document discusses statistical methods and probability. It covers topics like sampling, variables, data types, distributions, and probability rules. Statistical analysis techniques are presented for summarizing and describing data distributions through graphical and numerical methods.

Uploaded by

lizahxm

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Statistical Methods

Uploaded by

lizahxm

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Statistical Methods

Lecture 1
Census: collection of data from every member of population, usually too large to collect.
Sample: sub-collection from the population.
Different sample → different data → different conclusions about population.
A sample should be repress entative and unbiased.

Parameter: numerical measurement describing a population’s characteristic, often Greek symbols.

Statistic: numerical measurement describing a sample’s characteristic, small letters.

Sampling methods:
o Voluntary response sample: subjects decide themselves to be included in sample.
o Random sample: each member of population has equal probability of being selected.

o Simple random sample: each sample of size n has equal probability of being chosen
o Systematic sampling: after starting point, select every k-th member.
o Stratified sampling: divide population into subgroups such that subjects within groups have
same characteristics, then draw a (simple) random sample from each group.
o Cluster sampling: divide population into clusters, then randomly select some of these
clusters.
o Convenience sampling: easily available results.

Variable: varying quantity

o Response (dependent) variable: representing the effect to study
o Explanatory (independent) variable: possibly causing that effect
o Confounding: mixing influence of several explanatory variables on response
Types of studies
o Observational study: characteristics of subjects are observed, subjects are not modified.
• Retrospective (case-control): data from past
• Cross-sectional: data from one point in time
• Prospective (longitudinal): data are to be collected
o Experiment: some subject treatment
• Sometimes control and treatment group; single-blind or double-blind
• To measure placebo effect or experimenter effect

Types of data
o Qualitative (categorical): names or labels represent counts/measurements
• Nominal: names, labels, categories (no ordering)
➔ Gender, eye color
• Ordinal: categories with ordering, but no (meaningful) differences
➔ U.S. grades, opinions
o Quantitative (numerical): numbers represent counts/measurements
• Interval: ordering possible and differences between numbers are meaningful. No
natural zero starting point
➔ Year of birth, temperatures
• Ratio: ordering possible, differences are meaningful and there is a natural starting
point.
→ body length, marathon times

• Discrete: countable number of possible values

• Continuous: uncountably many possible values

Summarizing data:
o Graphical: tables, graphs, other figures
o Descriptive:
• Qualitative: describe shape, location and dispersion/variation
• Quantitative: numerical summaries of location and variation

Graphical summaries:
o Frequency distribution (table)
Count occurrences of category
o Bar chart
Spaces in between the categories
o Pareto bar chart
Bar chart, but categories are ordered w.r.t. frequency.
Data of nominal measurement level is required
o Pie chart
Pie piece sizes determined by relative frequency of category
o Histogram
Bar areas are proportional to frequency in respective interval. No white space.
Only used for quantitative data
o Time series
Visualization of time-varying quantity
Qualitative description:
o Shape:
Make smooth approximation of histogram
Shape of smooth curve relates data distribution to familiar distributions.
• Symmetrical
• Left- or Right-skewed
• Uniform
o Location:
Position on x-axis
Same shape, different location
o Dispersion (spread/variation):
Measure of variation within dataset
Same shape and location; different dispersion
• Small or large dispersion

Measures of center: value at center/middle of a dataset

o Mean

Average
Every data value is used
Strongly affected by extreme values
• Sample mean denoted by x̄
• Population mean denoted by μ
o Median
Middle value after sorting
Not much affected by extreme values
o Mode
Value with highest frequency
Bimodal (2), multimodal (>2)

(sample) standard deviation: common measure of variation (or deviation from x̄ )

Measures how much the values deviate from the sample mean.

➔ Square root of sample variance (‘mean quadratic deviation from x̄ )

Population standard deviation: σ

Population variance: σ2
Range = maximum – minimum

Percentiles: measures of location and dispersion

Quartiles:
o Q1= P25
o Q2= P50 = median
o Q3= P75
5 number summary:
1. Minimum
2. Q1
3. Median, Q2
4. Q3
5. Maximum
Interquartile range = Q3 – Q1
➔ Boxplot: provide information about distribution
top value: maximum
top of box: Q3
thick line: median
bottom of box: Q1
lowest value: minimum

whiskers: lines extending from the box

outliers: all points not included between whiskers

Lecture 2
Probability experiment: production of (random) outcome.
➔ dice roll, coin toss
Sample space Ω: set of all possible outcomes
➔ Ω = { 1,2,3,4,5,6}
Event A, B, …: collection of outcomes
➔ A = {even number thrown} = { 2,4,6}
Simple event: consist of 1 outcome
Probability measure: function P(.) assigning values between 0 and 1 to events
➔ P(A) = P({2,4,6}) = ½
Interpretation of probabilities:
o P(A) = 0 → occurrence of A is impossible
o P(A) = 1 → occurrence of A is certain
o P(A) = small e.g. <0.05 → occurrence of A is unlikely

3 ways to determine probability P(A) of event A:

1. Estimate with relative frequency:
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑖𝑚𝑒𝑠 𝐴 𝑜𝑐𝑐𝑢𝑟𝑟𝑒𝑑
P(A) =𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑖𝑚𝑒𝑠 𝑡ℎ𝑒 𝑝𝑟𝑜𝑐𝑒𝑑𝑢𝑟𝑒 𝑤𝑎𝑠 𝑟𝑒𝑝𝑒𝑎𝑡𝑒𝑑

2. Classical (theoretical) approach

Make probability model (outcome space, probability measure..), compute P(A) by using
properties of P
3. Subjective approach
Estimate P(A), based on intuition/experience

With relative frequency, many trials lead to the relative frequency almost being equal to the real
value of P(A) → Law of Large numbers: suppose a procedure is repeated (independently). The
relative frequency probability of an event A tends towards true P(A)

Determining P(A) is all outcomes are equally likely:

𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑤𝑎𝑦𝑠 𝐴 𝑐𝑎𝑛 𝑜𝑐𝑐𝑢𝑟
P(A) =𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑡 𝑠𝑖𝑚𝑝𝑙𝑒 𝑒𝑣𝑒𝑛𝑡𝑠
Counting principle:
Suppose 2 probability experiments are performed
a > x possible outcomes;
b > y possible outcomes
Combined: a x b possible outcomes

How to find P(A) in discrete case

o Find sample space Ω
o Determine probabilities P(ω) for all ω in Ω
Finite case with N equally likely outcomes: P(ω) = 1/N
o Determine which outcomes belong to A
o Compute

➔ Example biased dice: probability of even number

Addition rule:
o P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
Notation:
A ∪ B = A or B: union, set of outcomes which are in A or B (or both)
A ∩ B = A and B: intersection, set of outcomes which are both in A and B

o A and B are disjoint if they exclude each other, A ∩ B = ∅

Addition rule for 2 disjoint events:
P(A ∪ B) = P(A) + P(B)

General addition rule for disjoint events

o : complement of A: outcomes which are not in A

Complement rule:

Multiplication rule
o P(B|A): conditional probability that B occurs given that A has occurred.
Conditional probability:
P(A ∩ B)
If P(A) > 0, then: P(B|A) = P(A)
BUT P(B|A) ≠ P(A|B)

o Multiplication rule:
P(A ∩ B) = P(A) · P(B|A).

o Independence:
Two events A and B are independent if P(A ∩ B) = P(A) · P(B)
➔ P(B) = P(B|A) when A and B are independent
➔ Independence ≠ disjointness
Two different sampling methods:
1. Sampling with replacement: selections are independent events
2. Sampling without replacement: selections are dependent events
➔ Drawing a small sample from a large population, then treat selections as independent
events.

o Complement of at least one:

P(≥ 1 occurrence of . . . ) = 1 − P(no occurrence of . . .)

Lecture 3
Addition rule for disjoint events

Then, multiplication rule:

➔ Simple law of total probability:

Let A and B be events, then:

Combine with multiplication rule:

➔ Bayes’ Theorem (simple):

!! , but

o Partition:
Events A1, …, Am are called partition if
• They are pairwise disjoint: Ai ∩ Aj = ∅, if i ≠ j;
• Their union is entire sample space: : A1 ∪ A2 ∪ . . . ∪ Am = Ω.

o Law of total probability

Let A1, …, Am be a partition, then:

o Bayes’ Theorem
Let A1, …, Am be partition, then for r ∈ {1, …, m}:

EXAMPLE:

A random variable is a variable that assigns a numerical value to each outcome of a probability
experiment.
Notation: X, Y, ..
X : random variable, x value of a random variable

EXAMPLE
o Probability distribution:
Determines probabilities of values of a random variable
Given by table, formula or graph

o Discrete random variable

Has finite (countably) many different values
Its probability distribution: collection of all their individual probabilities
Total sum of probabilities: 1

o Continuous random variable

Has uncountably many different values
Its probability distribution: given by probability density function; probabilities computed by
area under this function.
Total area: 1

Outcomes ω in Ω and probability measure P determine probability distribution of X:

P(X = x) = P({ω ∈ Ω : X(ω) = x}).

Find probability distribution of discrete random variable:

1. Determine sample space of underlying probability experiment and probabilities of outcomes
ω (see lecture 2)
2. List values X(ω) for all ω in Ω
3. For each value of x of X, find all simple events {ω} with value x
Unify: {X = x} = {ω : X (ω) = x}
4. Probabilities P({ω}) determine probability of {X = x}:

5. Table. Left column: all values x of X and column with probabilities P(X =x)

EXAMPLE:
o Expected value (expectation/mean)
The expected value of a discrete random variable X with possible values x1, …, xk:
Weighted average of all possible values of X:

EXAMPLE

o Variance
The variance of a discrete random variable X with values x1, …, xk:

o The standard deviation of X is

EXAMPLE

Law of Large Numbers:

Let X1, …, Xn be n independent versions of random variable X; let µ = E(X)
1
Their mean 𝑛(X1 + … + Xn) tends to approach µ.
LLN of Lect.2: random variable Xi = 1 if A occurs, Xi = 0 if A doesn’t.

Find probability distribution of discrete random variable

1. Determine sample space and probabilities of underlying probability experiment.
2. List the numerical values X(ω) for each outcome ω ∈ Ω.
3. Find the collection of outcomes which have the same numerical value x.

4. Determine
5. Tabulate the results.

Lecture 4
Probability density function
A curve p(x) such that
o p(x) ≥ 0 for all x
o total area under curve = 1
P(X ∈ [a, b]) = area under the curve p(x) between a and b

Normal distribution
Random variable X has a normal distribution if p(x) is continuous, bell-shaped and symmetric

If E(X) = µ and SD(X) = σ ,

Notation: N(µ, σ2 ) for normal distribution with mean µ, variance σ 2

Standard normal distribution: N(0,1)

Determine probabilities of normal distribution
P(X ≤ z) = area under density to the left of z

P(X ∈ [a, b]) = P(X ≤ b) − P(X ≤ a)

P(X ≥ b) = 1 − P(X ≤ b)

EXAMPLE

o Probability density function

A curve p(x) such that p(x) ≥ 0 and the total area under the curve is 1.
The probability that X takes a value between a and b: P( X ∈ [a, b]), can be obtained by
determining the area under the curve p(x) between a and b.
o Normal distribution
A random variable X has a normal distribution if its probability density p(x) is continuous,
bell-shaped and symmetric.
Notation: N(µ, σ2 ) for a normal distribution with mean µ and variance σ 2 .
The standard normal distribution has mean 0 and standard deviation 1: N(0, 1).
o Determine probabilities of standard normal distribution
Let X has N(0,1) distribution.
Probability P(X ≤ x). Use table which shows the cumulative area under the curve to the left of
a z-score, P(X ≤ z).
Z-score of value x
Let x be a (data) value of interest, related to a population distribution with mean µ and standard
x−µ
deviation σ. The z-score of x is z = σ
➔ Number of standard deviations away from the mean
x−µ
Let X ∼ N(µ, σ2 ). Since P(X ≤ x) = P(Z ≤ z), where Z = σ ∼ N(0, 1), use Table 2

EXAMPLE

The Central Limit Theorem (CLT)

Take a sample of size n > 30 from a population with mean µ and standard deviation σ.

The population can have any distribution

The Central Limit Theorem for normal population (special case)

Take a sample of size n from a normal population with mean µ and standard deviation σ.

N can be any number

EXAMPLE
A model distribution
Probability distribution for describing the unknown true population distribution
Examples (continuous variables: normal, uniform, t, χ 2 , exponential.

The variable < . . . > is (modelled as) a random variable

having a < model distribution>
with < relevant parameters >

Example: The variable ‘Date of birth - Due date’ is a random variable having a normal distribution
with mean 0 and standard deviation 10

Accessing normality
Consider dataset x1, …, xn. When is model distribution N(µ, σ2 ) reasonable?
o Shape of histogram
Bell-shaped curve
Strong deviation from bell shape? Then N(µ, σ2 ) unlikely
o Normal QQ plot
Approximately straight line
EXAMPLE

What is a QQ plot
There are QQ plots other than “normal QQ plots”: use theoretical quantiles of other continuous
distributions.

Sample size
Small n: more variation
➔ histogram / QQ plot could deviate (from bell shape / straight line), even if N(µ, σ2 ) true.
Large n: histogram and QQ plot: more reliable

A location-scale family of probability distributions

Each member is obtained by
o Shifting (change in location) and/or
o Stretching/squeezing (change in scale)

Stochasts X and Y have probability distributions that are in the same location-scale family if and only
if the QQ-plot shows a straight line Y = a + bX

Normal distributions form a location-scale family

3 types of QQ-plots
1. X-axis: theoretical quantiles of a probability distribution.
y-axis: sample quantiles of this dataset
used to asses whether the particular distribution could be used as model distribution.
2. X-axis: theoretical quantiles of a probability distribution.
y-axis: theoretical quantiles of another probability distribution
used to compare the shape of two probability distributions, for instance to verify whether
they belong to the same location-scale family.
3. X-axis: sample quantiles of a dataset
y-axis: sample quantiles of another dataset
used to compare the shape of the two data distributions and assess whether they could
possibly originate from two model distributions belonging to the same location-scale family.

How to interpret QQ plots?

Draw straight line through middle of QQ plot
o Points on left side below straight line?
→ left tail of sample is heavier than left tail of N(0, 1).
o Points on left side above straight line?
→ left tail of N(0, 1) is heavier than left tail of sample.
o Points on right side above straight line?
→ right tail of sample is heavier than right tail of N(0, 1).
o Points on right side below straight line?
→ right tail of N(0, 1) is heavier than right tail of sample.

How to assess normality of data with QQ plot

o Make normal QQ plots
o If points follow approximately straight line y = a + bx (with slope b > 0), then N(a, b2 ) is
reasonable as model distribution.
o If points don’t follow straight line: sample most likely not from normal distribution.
In latter case: sample most likely from location-scale family with lighter or heavier tails than those of
normal distribution, depending on shape of QQ plot.

Peak Fit
No ratings yet
Peak Fit
295 pages
Statistics Cheat Sheet
100% (3)
Statistics Cheat Sheet
23 pages
Statistics: a QuickStudy Laminated Reference Guide
From Everand
Statistics: a QuickStudy Laminated Reference Guide
BarCharts Publishing, Inc.
No ratings yet
Photogrammetry Mathematics 080116
100% (2)
Photogrammetry Mathematics 080116
128 pages
Eal R2
100% (4)
Eal R2
28 pages
Statistical Methods
No ratings yet
Statistical Methods
16 pages
Statistics
No ratings yet
Statistics
36 pages
s2 Revision Notes
No ratings yet
s2 Revision Notes
5 pages
Statistics and Probability
No ratings yet
Statistics and Probability
4 pages
Probablity
No ratings yet
Probablity
37 pages
Notes
No ratings yet
Notes
16 pages
Stat I Tried
No ratings yet
Stat I Tried
8 pages
BLG 313_083427
No ratings yet
BLG 313_083427
25 pages
Theoretical Questions in Basic Business Statistics
No ratings yet
Theoretical Questions in Basic Business Statistics
12 pages
Blue White Abstract Simple Project Presentation _20240804_193747_0000
No ratings yet
Blue White Abstract Simple Project Presentation _20240804_193747_0000
16 pages
Stats Review
No ratings yet
Stats Review
65 pages
Stats Outline
No ratings yet
Stats Outline
4 pages
Probability Theory For Machine Learning: Chris Cremer September 2015
No ratings yet
Probability Theory For Machine Learning: Chris Cremer September 2015
40 pages
Distribution Theory Questionnaire
No ratings yet
Distribution Theory Questionnaire
3 pages
Basic Stat Chapter 4 Probability & Probability Distribution
No ratings yet
Basic Stat Chapter 4 Probability & Probability Distribution
73 pages
2 Probability and Statistics
No ratings yet
2 Probability and Statistics
29 pages
Statistical Methods in Quality Management
No ratings yet
Statistical Methods in Quality Management
71 pages
Ch-3
No ratings yet
Ch-3
30 pages
Unit 5 & 6. Probability and Prob Disti
No ratings yet
Unit 5 & 6. Probability and Prob Disti
90 pages
Some Definitions in COMP Maths
No ratings yet
Some Definitions in COMP Maths
10 pages
Outline 2
No ratings yet
Outline 2
1 page
Statistical Methods in Quality Management
No ratings yet
Statistical Methods in Quality Management
71 pages
10) ISM-Session 10
No ratings yet
10) ISM-Session 10
61 pages
Statistics and Machine Learning
No ratings yet
Statistics and Machine Learning
51 pages
Please DO NOT Bring This Formula Sheet To The Class Room On Exam Day. Formula Sheet and Function Tables Will Be Provided
No ratings yet
Please DO NOT Bring This Formula Sheet To The Class Room On Exam Day. Formula Sheet and Function Tables Will Be Provided
5 pages
SullivanChapter 6 Outline
No ratings yet
SullivanChapter 6 Outline
12 pages
Statistics and Probability 2-Mark Questions and Answers with Formulas - Google Docs
No ratings yet
Statistics and Probability 2-Mark Questions and Answers with Formulas - Google Docs
6 pages
Unit 2 .Statistical Decision Making-1
No ratings yet
Unit 2 .Statistical Decision Making-1
213 pages
5juni2021 RandomVariabel
No ratings yet
5juni2021 RandomVariabel
17 pages
Intro To Probability (Pattern Recognition)
No ratings yet
Intro To Probability (Pattern Recognition)
94 pages
Math 215 Cheat Sheet
No ratings yet
Math 215 Cheat Sheet
3 pages
Mathematics in Machine Learning
No ratings yet
Mathematics in Machine Learning
83 pages
AP ECON 2500 Session 4
No ratings yet
AP ECON 2500 Session 4
18 pages
Chapter 3
No ratings yet
Chapter 3
37 pages
Mathematics Probability
No ratings yet
Mathematics Probability
2 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
41 pages
A Survey of Probability Concepts
No ratings yet
A Survey of Probability Concepts
4 pages
Statistics 2 intro prob
No ratings yet
Statistics 2 intro prob
21 pages
Econ1203 Notes
67% (3)
Econ1203 Notes
35 pages
Lecture 9
No ratings yet
Lecture 9
28 pages
Slides-Sksk
100% (1)
Slides-Sksk
151 pages
Class 12 Applied Mathematics Complete Theory
No ratings yet
Class 12 Applied Mathematics Complete Theory
15 pages
Slides 11 09 PDF
No ratings yet
Slides 11 09 PDF
105 pages
5juni2021 RandomVariabel
No ratings yet
5juni2021 RandomVariabel
17 pages
Satistics
No ratings yet
Satistics
18 pages
Chapter 03
No ratings yet
Chapter 03
18 pages
Probability Notes-1
No ratings yet
Probability Notes-1
14 pages
Unit 4
No ratings yet
Unit 4
45 pages
Lecture 01 Probability
No ratings yet
Lecture 01 Probability
51 pages
I Unit
No ratings yet
I Unit
16 pages
Probability and Random Variables
No ratings yet
Probability and Random Variables
14 pages
Module 4 - Fundamentals of Probability
No ratings yet
Module 4 - Fundamentals of Probability
50 pages
MSD_Discrete_count_models_2
No ratings yet
MSD_Discrete_count_models_2
42 pages
Module01_ProbabilityAndHypothesisTesting
No ratings yet
Module01_ProbabilityAndHypothesisTesting
62 pages
Chapter 5 Discrete Probability Distributions: Definition. If The Random Variable
No ratings yet
Chapter 5 Discrete Probability Distributions: Definition. If The Random Variable
9 pages
Module 2 in IStat 1 Probability Distribution
No ratings yet
Module 2 in IStat 1 Probability Distribution
6 pages
Chapter 1
No ratings yet
Chapter 1
35 pages
S2 Revision Notes
No ratings yet
S2 Revision Notes
2 pages
Gaussian Smoothing: Gaussian Smoothing Is The Result of Blurring An Image by A
No ratings yet
Gaussian Smoothing: Gaussian Smoothing Is The Result of Blurring An Image by A
30 pages
Module - 4 Bayeian Learning
No ratings yet
Module - 4 Bayeian Learning
44 pages
Newsvendor Model
No ratings yet
Newsvendor Model
6 pages
The RANDOM Statement and More Moving On With PROC MCMC
No ratings yet
The RANDOM Statement and More Moving On With PROC MCMC
21 pages
Ap Stats 7.2
No ratings yet
Ap Stats 7.2
15 pages
Functional Loss in The Magnocellular and Parvocellular Pathways in Patients With Optic Neuritis
No ratings yet
Functional Loss in The Magnocellular and Parvocellular Pathways in Patients With Optic Neuritis
8 pages
ApproxBinomial2Normal PDF
No ratings yet
ApproxBinomial2Normal PDF
9 pages
Evans Analytics2e PPT 12
100% (1)
Evans Analytics2e PPT 12
63 pages
(eBook PDF) The Analysis of Biological Data Second Editioninstant download
100% (4)
(eBook PDF) The Analysis of Biological Data Second Editioninstant download
57 pages
Probabilistic Programming in Python Using PyMC
No ratings yet
Probabilistic Programming in Python Using PyMC
19 pages
Grad Handbook 2013
No ratings yet
Grad Handbook 2013
20 pages
Dhaapps Datascience With Gen AI-1
No ratings yet
Dhaapps Datascience With Gen AI-1
23 pages
Test For Normality PDF
No ratings yet
Test For Normality PDF
30 pages
2-1 questions papers (3)
No ratings yet
2-1 questions papers (3)
10 pages
Chapter 12 Inference About A Population: QMDS 202 Data Analysis and Modeling
No ratings yet
Chapter 12 Inference About A Population: QMDS 202 Data Analysis and Modeling
7 pages
Statistical Techniques in Business & Economics, Lind/Marchal/Wathen, 13/e 105
100% (1)
Statistical Techniques in Business & Economics, Lind/Marchal/Wathen, 13/e 105
7 pages
Ea 2 Perfect
No ratings yet
Ea 2 Perfect
10 pages
A Fuzzy Number Based Methodology For Harmonic Load-Flow (Final)
No ratings yet
A Fuzzy Number Based Methodology For Harmonic Load-Flow (Final)
8 pages
Choose The BEST Answer.: Practice Test 2 - Assessment of Learning Multiple Choice
100% (1)
Choose The BEST Answer.: Practice Test 2 - Assessment of Learning Multiple Choice
6 pages
Mathematics in The Modern World Midterm Reviewer
No ratings yet
Mathematics in The Modern World Midterm Reviewer
8 pages
Syndicate 02 - Project Management
No ratings yet
Syndicate 02 - Project Management
3 pages
MT2 - Wk8 - S15 Notes - Random Walks
No ratings yet
MT2 - Wk8 - S15 Notes - Random Walks
6 pages
CIForMean LargeSample Lesson PDF
No ratings yet
CIForMean LargeSample Lesson PDF
21 pages
Download Statistical Process Control and Data Analytics 8th Edition John Oakland & Robert Oakland ebook All Chapters PDF
100% (4)
Download Statistical Process Control and Data Analytics 8th Edition John Oakland & Robert Oakland ebook All Chapters PDF
77 pages
Docx
No ratings yet
Docx
16 pages
OFDM Simulation EE810
100% (1)
OFDM Simulation EE810
18 pages
Business Analytics
No ratings yet
Business Analytics
3 pages

Statistical Methods

Uploaded by

Statistical Methods

Uploaded by

Statistical Methods

Parameter: numerical measurement describing a population’s characteristic, often Greek symbols.

Variable: varying quantity

• Discrete: countable number of possible values

Measures of center: value at center/middle of a dataset

(sample) standard deviation: common measure of variation (or deviation from x̄ )

➔ Square root of sample variance (‘mean quadratic deviation from x̄ )

Population standard deviation: σ

Percentiles: measures of location and dispersion

whiskers: lines extending from the box

3 ways to determine probability P(A) of event A:

2. Classical (theoretical) approach

Determining P(A) is all outcomes are equally likely:

How to find P(A) in discrete case

➔ Example biased dice: probability of even number

o A and B are disjoint if they exclude each other, A ∩ B = ∅

General addition rule for disjoint events

o : complement of A: outcomes which are not in A

o Complement of at least one:

Then, multiplication rule:

➔ Simple law of total probability:

Combine with multiplication rule:

➔ Bayes’ Theorem (simple):

o Law of total probability

o Discrete random variable

o Continuous random variable

Outcomes ω in Ω and probability measure P determine probability distribution of X:

Find probability distribution of discrete random variable:

o The standard deviation of X is

Law of Large Numbers:

Find probability distribution of discrete random variable

If E(X) = µ and SD(X) = σ ,

Standard normal distribution: N(0,1)

P(X ∈ [a, b]) = P(X ≤ b) − P(X ≤ a)

o Probability density function

The Central Limit Theorem (CLT)

The population can have any distribution

The Central Limit Theorem for normal population (special case)

N can be any number

The variable < . . . > is (modelled as) a random variable

A location-scale family of probability distributions

Normal distributions form a location-scale family

How to interpret QQ plots?

How to assess normality of data with QQ plot

You might also like