0% found this document useful (0 votes)

24 views

Stat and Prob

Statistics is the science of collecting, analyzing, and drawing conclusions from data. It involves methods like descriptive statistics which describe data properties, inferential statistics which test hypotheses and draw conclusions, and probability which quantifies likelihoods of events. Important distributions in statistics include the binomial, Poisson, and normal distributions which model count data and continuous variables. Random sampling techniques are used to select representative samples from populations.

Uploaded by

Mc Larens Escarmosa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views

Stat and Prob

Uploaded by

Mc Larens Escarmosa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

01 HANDOUT AND PPT

Statistics

 A science that studies data to be able to make a decision

 It is a tool in the decision-making process
 It involves the methods of collecting, processing, summarizing, and analyzing data in order to
provide answers or solutions to an inquiry

FAMOUS STATISTICIANS
 Gertrude Cox  William Sealy Gosset
 Florence Nightingale  Ronald A. Fisher
 J. Stuart Hunter  George E.P. Box
 John Carl Friedrich Gauss  Thomas Bayes

Area of Statistics
Descriptive Statistics
- Describes the properties of sample and population data
- Include mean (average), variance, skewness and kurtosis

Inferential Statistics
- Use those properties to test hypotheses and draw conclusions
- Include linear regression analysis, analysis of variance (ANOVA), and null hypothesis testing

Sources of Data
Primary Data – the researcher gathers the data him/herself
Secondary Data – the researcher uses data gathered by somebody

Data Science
- The center of data science is data, especially Big Data
- The purpose of data science is to obtain information or knowledge from the data that will help
in making better decisions and understanding the development and change of nature or
society better
- Data science is a multidisciplinary field that has applied theories and technologies from several
disciplines
R is a language and environment for statistical computing and graphics developed by Bell
Laboratories (present-day Lucent Technologies).
Python is an object-oriented, interpreted, and interactive programming language developed by Guido
van Rossum.
The SAS language is a programming language developed by Anthony James Barr as a statistical
analysis tool.

Probability
- A number that reflects the chance or likelihood that a particular event will occur
- 0 to 1 or 0% to 100%

Interpretation of Probability
Classical – equally likely to happen
Frequentist – long frequency of repeatable experiments
Subjective – a probability derived from an individual’s personal judgement or own experience
Bayesian – measures a degree of belief

Sample Space – the collection of all possible outcomes

Tree Diagram – a way of organizing the information of two or more probability events
Events – the set of outcomes from an experiment
 Union – combine the elements of the 2 sets
 Intersection – must be in BOTH sets
 Mutually Exclusive/Disjoint – 2 events have no elements in common
02 HANDOUT AND PPT
Random Variables – is a set of possible values from a random experiment

Types of Random Variable

Discrete – countable or finite (whole number)
Continuous – infinite (decimal)

Discrete Probability Distribution – is a table, graph, or a formula listing all possible values that a
discrete random variable can take on, along with the associated probabilities
03 HANDOUT AND PPT
Binomial Distribution

 In an experiment of trials, each trial has two (2) possible outcomes: success or failure.
 The trials are independent, meaning, the result of the first trial does not affect the result of the
next.
 The process is called binomial experiment, and each trial in a process that has two (2) possible
outcomes is called the Bernoulli Trial

Binomial Distribution Formula where:

n = the number of (Bernoulli) trials
x = total number of choices
p = the number of probabilities of each success
q = the probability of each failure (1-p)

Poisson Distribution – counts the number of rare events or successes that occur in a specified time
interval or region

Poisson Distribution Formula where:

x = the number of choices we want
e = the natural base of the natural algorithms, also known as Euler’s constant
λ = the average number of successes occurring in an interval
04 NORMAL DISTRIBUTION
The Normal Curve
The most important of all continuous probability distributions is the normal distribution. Its graph, called
the normal curve, is a bell-shaped curve. It lies entirely above the horizontal axis. It is symmetrical,
unimodal, and asymptotic to the horizontal axis.

Properties of the Normal Curve

• The entire family of the normal probability distributions is differentiated by two (2) parameters: the
mean 𝜇 and the standard deviation 𝜎.
• The highest point on the normal curve is at the mean, which is also the median and mode of the
distribution.
• The mean of the distribution can be any numerical value: the negative, zero, or positive.
• The normal distribution is symmetric, with the shape of the normal curve to the left of the mean a
mirror image of the shape of the normal curve to the right of the mean.
• The standard deviation determines how flat and wide the normal curve is.
• The total area under the curve for the normal distribution is 1.

The Empirical Rule

The empirical rule, also known as the three-sigma rule or the 68-95-99.7 rule, provides a quick estimate
of the spread of data in a normal distribution given the mean and standard deviation. For a distribution
that is symmetrical and bell-shaped (in particular, for a normal distribution):
• Approximately 68% of the data values will lie within 1 standard deviation on each side of the mean.
• Approximately 95% of the data values will lie within 2 standard deviations on each side of the
mean.
• Approximately 99.7% (or almost all) of the data values will lie within 3 standard deviations on each
side of the mean.

Formula for 𝒛-scores

The 𝑧 value or 𝑧 score gives the number of standard deviations between a measurement 𝑥 and the
mean 𝜇 of the 𝑥 distribution.

Standard Normal Distribution

The standard normal distribution is a normal distribution with mean 𝜇=0 and standard deviation 𝜎=1.
04 NORMAL DISTRIBUTION
Random Sampling is a method of selecting a sample (random sample) from a statistical population.

Types of Random Sampling

1. Simple Random Sampling is a sampling technique in which every element of the population
has the same probability of being selected for inclusion in the sample
2. Systematic Sampling is a random sampling technique in which a list of elements of the
population is used as a sampling frame, and the elements to be included in the desired sample
are selected by skipping through the list at regular intervals
3. Stratified Sampling is a random sampling technique in which the population is first divided into
strata and then samples are randomly selected separately from each stratum.
4. Cluster or Area Sampling is a random sampling technique in which the entire population is
broken into small groups, or clusters, and then, some of the clusters are randomly selected.

Parameter – is a measure that describes a population

Statistic – is a measure that describes a sample
Sampling Distribution – describes the probability for each mean of all samples with the same sample
size n
Central Limit Theorem – If samples of size 𝑛, where 𝑛 is sufficiently large, are drawn from any
population with a mean 𝜇 and a standard deviation 𝜎, then the sampling distribution of sample means
approximates a normal distribution.

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6412)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (640)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1173)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (990)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1852)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4101)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (627)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1015)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1138)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (581)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (297)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4355)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (278)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1087)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2032)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2876)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (835)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (918)
Wind Resource Assessment - Coursera
100% (1)
Wind Resource Assessment - Coursera
1 page
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (814)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)
Stat Quiz Ball
No ratings yet
Stat Quiz Ball
85 pages
LBOLYTC Quiz 1 Reviewer
No ratings yet
LBOLYTC Quiz 1 Reviewer
21 pages
Household Income and Expenditure Survey 2018 en 27-6-2019
No ratings yet
Household Income and Expenditure Survey 2018 en 27-6-2019
110 pages
Biostatistics Module 3
No ratings yet
Biostatistics Module 3
9 pages
Expectations: Mathematics 10 Quarter 1 Week 2
No ratings yet
Expectations: Mathematics 10 Quarter 1 Week 2
9 pages
PERIODICAL TEST 2nd Garding Quantitative For Printing
No ratings yet
PERIODICAL TEST 2nd Garding Quantitative For Printing
6 pages
Biostatistics
No ratings yet
Biostatistics
23 pages
Quantitative Methods Quiz 1 Prelim
No ratings yet
Quantitative Methods Quiz 1 Prelim
11 pages
Mathematics10 Quarter1 Module4 Week4
No ratings yet
Mathematics10 Quarter1 Module4 Week4
6 pages
5B Bayesian Inference: Class Problems
No ratings yet
5B Bayesian Inference: Class Problems
9 pages
X and Moving Range Charts: Exit Program
No ratings yet
X and Moving Range Charts: Exit Program
21 pages
Question Bank - Maths-III - 1677825614
No ratings yet
Question Bank - Maths-III - 1677825614
12 pages
Free Access to Using and Understanding Mathematics 6th Edition Bennett Solutions Manual Chapter Answers
100% (15)
Free Access to Using and Understanding Mathematics 6th Edition Bennett Solutions Manual Chapter Answers
52 pages
SB Quiz 3
100% (1)
SB Quiz 3
16 pages
USACE Freeman Grogan 1997 - Statistical Analysis Variability Pavement Materials
No ratings yet
USACE Freeman Grogan 1997 - Statistical Analysis Variability Pavement Materials
164 pages
Download full Test Bank for Elementary Statistics, 7th Edition, Ron Larson, Betsy Farber ebook all chapters
100% (22)
Download full Test Bank for Elementary Statistics, 7th Edition, Ron Larson, Betsy Farber ebook all chapters
59 pages
325-Town and Country Planning
No ratings yet
325-Town and Country Planning
51 pages
Introduction To Business Statistics (Revision Questions) : IBS/Revision Worksheet/ BHRM/ 2020
No ratings yet
Introduction To Business Statistics (Revision Questions) : IBS/Revision Worksheet/ BHRM/ 2020
4 pages
NEP Syllabus UG Economics
No ratings yet
NEP Syllabus UG Economics
54 pages
Social and Economics Statistics Multiplechoose Question
No ratings yet
Social and Economics Statistics Multiplechoose Question
12 pages
Stats Medic Ultimate Inference Guide For AP Statistics
No ratings yet
Stats Medic Ultimate Inference Guide For AP Statistics
3 pages
APstat-Ch2 HW Kirkwood Solutions PDF
No ratings yet
APstat-Ch2 HW Kirkwood Solutions PDF
9 pages
Lecture No.10
No ratings yet
Lecture No.10
8 pages
Definition of Mean PDF
No ratings yet
Definition of Mean PDF
7 pages
STPDF2 - Descriptive Statistics
100% (1)
STPDF2 - Descriptive Statistics
74 pages
15 Hardest SAT Math Questions to Improve Your Score
No ratings yet
15 Hardest SAT Math Questions to Improve Your Score
1 page
Use 1: To Summarize Data With Central Values
No ratings yet
Use 1: To Summarize Data With Central Values
34 pages
Civil Courses (Session 2015 Onward) Channab UET
No ratings yet
Civil Courses (Session 2015 Onward) Channab UET
32 pages
Bir Glen 2018
No ratings yet
Bir Glen 2018
10 pages

Stat and Prob

Uploaded by

Stat and Prob

Uploaded by

01 HANDOUT AND PPT

 A science that studies data to be able to make a decision

Sample Space – the collection of all possible outcomes

Types of Random Variable

Binomial Distribution Formula where:

Poisson Distribution Formula where:

Properties of the Normal Curve

The Empirical Rule

Formula for 𝒛-scores

Standard Normal Distribution

Types of Random Sampling

Parameter – is a measure that describes a population

You might also like