BSTATS
BSTATS
DISTRIBUTION
INTRODUCTION
PROBABILITY VS PROBABILITY DISTRIBUTION
RANDOM VARIABLE
PROBABILITY MASS FUNCTION
PROBABILITY DENSITY FUNCTION
CUMULITIVE DISTRIBUTION FUNCTION
EXPECTATION
TYPES OF PROBABILITY DISTRIBUTION
APPLICATIONS OF PROBABILITY DISTRIBUTION
BINOMIAL
POISSON
NORMAL
INTRODUCTION TO PROBABILITY DISTRIBUTIONS
Probability distribution yields the possible outcomes for any random event. It is also
defined based on the underlying sample space as a set of possible outcomes of any
random experiment. These settings could be a set of real numbers or a set of vectors or a
set of any entities. It is a part of probability and statistics.
If I ask, "What’s the chance of rolling a 3?", the answer would be 1 out of 6 (or about 16.67%).This is probability: the chance of one specific
event happening (rolling a 3).
Probability Distribution:
If I ask, "What are the chances of rolling any number from 1 to 6?",
you would list the probabilities for each possible outcome:
1: 1/6
2: 1/6
3: 1/6
4: 1/6
5: 1/6
6: 1/6
This is a probability distribution: it shows the likelihood of all possible outcomes of rolling the die.
In summary:
Probability is the chance of a single outcome (e.g., rolling a 3).Probability distribution shows the chances for all possible outcomes (e.g.,
rolling a 1, 2, 3, 4, 5, or 6).
KEY CONCEPTS OF PROBABILITY DISTRIBUTION
• Random Variable: A variable that takes on different values based on the outcome of
a random event.
• Discrete: Can take on a countable number of values.
• Continuous: Can take on an infinite number of values within a range.
Imagine you are tossing a fair coin. The possible outcomes are either heads (H) or tails (T). We can define
a random variable ( X ) that represents the outcome of the coin toss as follows:
•( X = 1 ) if the outcome is heads (H)
•( X = 0 ) if the outcome is tails (T)
There are following two types of Random Variable-
Example -0.1,0.2.0.3
(height – 5’8,5’4)
PROBABILITY MASS FUNCTION (PMF)
Definition:The PMF of a discrete random variable gives the probability that the
variable is exactly equal to some value. It is a function that maps each possible value
of the random variable to its corresponding probability.
Formula: 𝑃(𝑋=𝑥)=𝑝(𝑥) ,Where 𝑃(𝑋=𝑥) is the probability that the random variable 𝑋
takes the value x.
Example:
•Definition:
A PDF is a function that describes the likelihood of a continuous random variable taking
on a particular value. Unlike PMF, the value of the PDF at any specific point is not the
probability itself but the density of probability.
•Formula:
f(x)
Where f(x) is the value of the PDF at point x
PROBABILITY DENSITY FUNCTION GRAPH
The graph of a CDF is a non-decreasing curve that starts at 0 and approaches 1 as the x increases. It is
useful for visualizing the probability distribution of the random variable
EXPECTATION (MEAN)
•Definition:
•The expectation or mean of a random variable is the long-term average value of the
variable if the experiment is repeated many times.
•Formula:
•For Discrete Variables:
1. Risk Assessment: Probability distributions are used to model financial risks, such
as stock price movements, interest rates, and insurance claims. For example, the
Normal distribution is often applied in risk management to model the returns of
assets.
2. Inventory Management: The Poisson distribution helps in determining the
optimal inventory levels by predicting the demand for products or services over a
fixed period.
3. Market Research: Binomial distribution is used to analyze survey results, such
as estimating the probability of customer preferences or purchasing decisions.
APPLICATIONS OF PROBABILITY DISTRIBUTIONS
7. Network Traffic Analysis: The Poisson distribution models the number of packets
arriving at a network server, helping optimize server performance and reduce latency.
BINOMIAL DISTRIBUTION
• Definition:
• The Binomial distribution is a discrete probability distribution that models the number of successes in a fixed
number of independent and identical trials, where each trial has only two possible outcomes: success or
failure.
• Key Characteristics:
• Binary Outcomes: Each trial results in one of two possible outcomes, commonly labeled as "success" and
"failure."
• Fixed Number of Trials (n): The distribution considers a fixed number of independent trials.
• Constant Probability (p): The probability of success (p) remains the same across all trials.
• Independence: The outcome of any given trial does not affect the outcome of the others.
MATHEMATICAL REPRESENTATION OF BINOMIAL
DISTRIBUTION
The mean of a binomial distribution is calculated by multiplying the number of trials by the probability
of successes, i.e, “(np)”, and the variance of the binomial distribution is “np (1 − p)”.
When p = 0.5, the distribution is said to be symmetric about mean, when p > 0.5, the distribution is
skewed to the left, and when p < 0.5, the distribution is skewed to the right.
It incorporates the following properties;
•Key Characteristics:
•Discrete Events: The distribution counts the number of occurrences of an event.
•Fixed Interval: The interval can be time, space, volume, etc., and is held constant.
•Constant Rate (λ): The average number of events (λ) in the interval remains constant.
•Independence: The occurrence of an event in one interval does not affect the occurrence in another .
MATHEMATICAL REPRESENTATION OF POISSON DISTRIBUTION
Poisson Distribution is only suited in the conditions where events occur at random points of time and
space, and point of interest exists only in the total number of occurrences of the event.
In addition to that, a few symbols that are used in Poisson distribution are; “λ” is the rate at which an
event occurs, “t” is the length of a time interval, and X is the number of events in that time interval,
then mean (denoted by “µ”) is defined as the “λ” times length of that time interval, “µ = λ*t”.
Some features of Poisson distribution are following;
One successful event would not affect the outcome of another successful event.
Over a small interval, a probability of success must be equivalent to the probability of success around a
larger interval.
If the interval tends to be smaller then the probability of success approaches zero in an interval.
APPLICATIONS OF POISSON DISTRIBUTION
1. Telecommunications:
Modeling the number of phone calls received by a call center per minute or hour, helping to
optimize staffing and resource allocation.
2. Traffic Flow Analysis:
Estimating the number of vehicles passing through an intersection or toll booth in a given time
period, which is crucial for traffic management and infrastructure planning.
3. Epidemiology:
Predicting the number of occurrences of rare diseases in a population over a specific period,
aiding in public health planning.
4. Queueing Theory:
Analyzing the arrival of customers at a service point (e.g., supermarket checkout, bank teller) to
optimize service efficiency.
5. Natural Disasters:
Estimating the frequency of events like earthquakes, floods, or wildfires in a region over a certain
period, which can inform disaster preparedness strategies.
NORMAL DISTRIBUTION
Definition:
The Normal distribution, also known as the Gaussian distribution, is a continuous
probability distribution characterized by its bell-shaped curve, which is symmetric around
the mean.
Key Characteristics:
• Symmetry: The distribution is perfectly symmetrical around the mean (μ).
• Bell-Shaped Curve: The highest point on the curve corresponds to the mean, and the
curve decreases gradually on both sides.
• Mean, Median, and Mode: In a normal distribution, these three measures of central
tendency are all equal.
• Asymptotic: The tails of the distribution curve approach the horizontal axis but never
touch it, meaning extreme values are possible but rare.
FEATURES OF NORMAL DISTRIBUTION
The normal distribution is completely described by its mean and standard deviation, i.e., the
distribution is not skewed and does show kurtosis due to which the distribution is symmetric and
depicted as a “bell-shaped curve” when plotted.
It has the following features:
1. Standardized Testing:
Test scores (e.g., SAT, IQ tests) are often normally distributed, allowing for the
creation of percentile ranks and standardized scoring.
2. Finance:
Modeling the returns on stocks and other financial instruments, where asset
returns are often assumed to be normally distributed in risk management and
option pricing (e.g., Black-Scholes model).
3. Quality Control:
In manufacturing, the distribution of product dimensions is assumed to be
normal, which is used to set tolerance levels and improve product quality.
Thank you