
Random Variables and Probability Theory Review

In the field of pattern recognition, patterns are typically represented as random vectors, each consisting of multiple random variables. When combined, these random variables exhibit joint probabilistic behavior, forming a comprehensive representation of the pattern. This approach allows pattern recognition systems to incorporate uncertainty and statistical characteristics into their modeling and decision-making processes, making them well suited for tasks such as image recognition and speech processing, where patterns often exhibit complex statistical relationships.

Random Variables and PDF:

A random variable is a fundamental concept in probability theory and statistics that serves as a
mathematical representation of uncertain or random events. It quantifies the possible outcomes of a
random experiment. Consider a classic example: a coin flip. In this scenario, the outcome, either "heads"
or "tails," is inherently uncertain. We use a random variable to represent this uncertainty. Let's call this
random variable $X$. It takes on one of two values: $X = 1$ for heads and $X = 0$ for tails. Importantly, these values are
not fixed; they depend on the outcome of the coin flip. For a fair coin, the probability associated with
each value of the random variable is $P(X = 1) = P(X = 0) = 1/2$. This means that there's an equal chance of getting a head or a tail
when you perform the coin flip. This specific example illustrates a discrete random variable because it
can only take on a finite set of distinct values (0 and 1, in this case). However, random variables can also
be continuous, where the possible values form a continuous range. For instance, if you measure the
height of individuals, the height values can take on any real number within a certain range, making it a
continuous random variable.
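As a quick illustration, here is a minimal Python sketch (assuming NumPy; the 0/1 coin encoding and the height parameters are illustrative choices, not from the notes) that samples both kinds of random variable:

```python
import numpy as np

rng = np.random.default_rng(0)

# Discrete RV: a fair coin encoded as 1 ("heads") or 0 ("tails").
flips = rng.integers(0, 2, size=10_000)      # each value has probability 1/2
print("empirical P(heads):", flips.mean())   # close to 0.5

# Continuous RV: heights in cm, modeled here as Gaussian for illustration.
heights = rng.normal(loc=170.0, scale=8.0, size=10_000)
print("three sampled heights:", heights[:3])
```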

Probability Density Function (PDF):

A Probability Density Function, often denoted as $f_X(x)$, is a mathematical function used to specify the
likelihood of a continuous random variable taking on a specific value or falling within a particular range.
Unlike discrete random variables, which have a finite set of possible values, continuous random
variables can take on an infinite number of values within a specified interval.

Properties of PDF:

A valid PDF must satisfy:

• Non-negativity: $f_X(x) \ge 0$ for all $x$.
• Normalization: $\int_{-\infty}^{\infty} f_X(x)\, dx = 1$.
• Probabilities are areas under the PDF: $P(a \le X \le b) = \int_a^b f_X(x)\, dx$.

There is also the cumulative distribution function (CDF), $F_X(x) = P(X \le x)$, but we are generally less concerned with it in pattern recognition.
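These properties can be checked numerically; the following sketch (assuming SciPy, with a standard Gaussian as the example PDF) does so:

```python
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

pdf = norm(loc=0.0, scale=1.0).pdf  # standard Gaussian as the example PDF

# A valid PDF integrates to 1 over the whole real line.
total, _ = quad(pdf, -np.inf, np.inf)
print("integral of pdf:", total)    # ~1.0

# Probabilities are areas under the PDF.
prob, _ = quad(pdf, -1.0, 1.0)
print("P(-1 <= X <= 1):", prob)     # ~0.683 for the standard Gaussian
```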

Mean and Expectation:

Consider a scenario where we have a random variable, denoted as $X$. Understanding the distribution of
this random variable can sometimes be challenging due to its complexity. To make sense of its behavior
more easily, it's valuable to summarize its characteristics using summary statistics. The most encountered
ones are the mean, the variance, and the standard deviation.
Before we start with the summary statistics, let's review expectation. The expectation of a
random variable $X$, denoted as $E[X]$, is a measure of the central tendency or "average" value that $X$ is
likely to take on over a large number of trials or observations. For discrete random variables, the
expectation is computed as the weighted sum of all possible values that $X$ can take, where the weights
are given by the probabilities associated with each value. In mathematical notation:

$$E[X] = \sum_{i} x_i \, P(X = x_i)$$

where $x_i$ represents each possible value of $X$, and $P(X = x_i)$ is the probability of $X$ taking on the value $x_i$.

For continuous random variables, the expectation is calculated as the integral of the random variable
multiplied by its probability density function (PDF) over its entire range:

$$E[X] = \int_{-\infty}^{\infty} x \, f_X(x) \, dx$$

The way we should interpret the mean (albeit with caution) is that it tells us essentially where the
random variable tends to be located. The mean is the long-run average value of the RV, which is not necessarily its most likely value.

Example:
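For instance, for a fair six-sided die (a stand-in illustration), each face $1, \ldots, 6$ occurs with probability $1/6$, so

$$E[X] = \sum_{i=1}^{6} i \cdot \frac{1}{6} = \frac{21}{6} = 3.5$$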

Means are useful for understanding the average behavior of a random variable; however, the mean alone is
insufficient for a full intuitive understanding. Making a steady profit per sale is very different from
making a profit that swings wildly from sale to sale, even when both have the same average value. The second one has a much larger
degree of fluctuation and thus represents a much larger risk. Thus, to understand the behavior of a
random variable, we will need at minimum one more measure: some measure of how widely a random
variable fluctuates.

Variance:
Variance is a quantitative measure of how far a random variable deviates from the mean. Consider the
expression $X - E[X]$: this is the deviation of the random variable from its mean. This value can be positive
or negative, so we square it to make it positive, ensuring that we are measuring the magnitude of the
deviation.
Variance formula:

$$\mathrm{Var}(X) = E\big[(X - E[X])^2\big] = E[X^2] - (E[X])^2$$

Let's consider the same example above where we calculated the mean:
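Continuing the fair-die illustration:

$$E[X^2] = \sum_{i=1}^{6} i^2 \cdot \frac{1}{6} = \frac{91}{6}, \qquad \mathrm{Var}(X) = \frac{91}{6} - (3.5)^2 = \frac{35}{12} \approx 2.92$$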
Common distribution models

Gaussian:

$$f_X(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(x - \mu)^2}{2\sigma^2}\right)$$

Uniform:

$$f_X(x) = \begin{cases} \dfrac{1}{b - a} & a \le x \le b \\ 0 & \text{otherwise} \end{cases}$$

Exponential:

$$f_X(x) = \begin{cases} \lambda e^{-\lambda x} & x \ge 0 \\ 0 & x < 0 \end{cases}$$
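As a minimal sketch of how these models are evaluated in practice (assuming SciPy; the parameter values are illustrative, not from the notes):

```python
from scipy import stats

x = 1.0  # point at which to evaluate each density (arbitrary choice)

# Gaussian with mean 0 and standard deviation 1.
print("Gaussian pdf at 1:", stats.norm(loc=0, scale=1).pdf(x))

# Uniform on [0, 2]: loc is the lower bound a, scale is the width b - a.
print("Uniform pdf at 1:", stats.uniform(loc=0, scale=2).pdf(x))  # = 1/2

# Exponential with rate lambda = 1.5; SciPy parameterizes by scale = 1/lambda.
print("Exponential pdf at 1:", stats.expon(scale=1 / 1.5).pdf(x))
```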

Joint Statistics:
The above work assumes we are working with a single real-valued random variable. However, what
happens when we encounter situations with two or more random variables that may be highly
interrelated? This scenario is quite common in machine learning, where we often encounter pairs or
groups of correlated random variables. Consider, for instance, random variables like $R_{ij}$, which encode
the red value of the pixel at coordinate $(i, j)$ in an image. In this example, adjacent pixels in an image
tend to exhibit similar colors. Treating these variables as independent entities and trying to build a
successful model using this assumption can be challenging. We can use multiple integrals to characterize
the relationship of correlated random variables. Let's start with two random variables $X$ and $Y$ with joint density $f_{X,Y}(x, y)$, which satisfies

$$f_{X,Y}(x, y) \ge 0, \qquad \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} f_{X,Y}(x, y)\, dx\, dy = 1$$

When working with multiple variables, there are situations where we want to disregard the
interdependencies and focus solely on one variable at a time. This concept involves examining the
distribution of a single variable in isolation, irrespective of the others, and it's referred to as a "marginal
distribution." Let's consider the random variables $X$ and $Y$ with joint density given by $f_{X,Y}(x, y)$. When we discuss
the marginal distribution, we are essentially taking this joint density function, which encompasses both
$x$ and $y$, and using it to determine the distribution of just one variable, $f_X(x)$. The subscript here indicates
which random variable the density is for:

$$f_X(x) = \int_{-\infty}^{\infty} f_{X,Y}(x, y)\, dy$$

For this, hold $x$ fixed and integrate over all values of $y$ when finding the marginal distribution for $x$.

Similar to the single-variable case, we can determine summary statistics for joint densities.

Expectations for joint probabilities:

$$E[g(X, Y)] = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} g(x, y)\, f_{X,Y}(x, y)\, dx\, dy$$

Covariance:

When working with multiple random variables, there's an additional summary statistic that proves to be
quite useful: covariance. Covariance quantifies the extent to which two random variables tend to vary or
fluctuate together:

$$\mathrm{Cov}(X, Y) = E\big[(X - E[X])(Y - E[Y])\big] = E[XY] - E[X]\, E[Y]$$

Let us see some properties of covariances:

• $\mathrm{Cov}(X, X) = \mathrm{Var}(X)$
• Symmetry: $\mathrm{Cov}(X, Y) = \mathrm{Cov}(Y, X)$
• Linearity: $\mathrm{Cov}(aX + b, Y) = a\, \mathrm{Cov}(X, Y)$
• If $X$ and $Y$ are independent, then $\mathrm{Cov}(X, Y) = 0$
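A short numerical sketch of the linearity property (assuming NumPy; the coefficients and the synthetic data are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)

# Two correlated variables: y shares a linear component with x.
x = rng.normal(size=100_000)
y = 0.8 * x + rng.normal(size=100_000)

cov_xy = np.cov(x, y)[0, 1]          # off-diagonal entry is Cov(x, y)
print("Cov(x, y):", cov_xy)          # close to 0.8

# Linearity: Cov(a*x + b, y) = a * Cov(x, y); the shift b drops out.
a, b = 3.0, 5.0
print("Cov(3x + 5, y):", np.cov(a * x + b, y)[0, 1])  # close to 3 * 0.8 = 2.4
```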

Correlation:

Let's turn our attention to units of quantities. If one variable, let's call it $X$, is measured in one unit (for
example, inches), and another variable, say $Y$, is measured in a different unit (like dollars), the
covariance between them is measured in the product of these two units. These units can be hard to
interpret. In many cases, what we're really interested in is a measure of relatedness between variables
that is independent of their specific units of measurement. We often don't require an exact quantitative
correlation but rather seek to understand if the variables move in the same direction and the strength of
this relationship.
To gain a clearer understanding, let's convert our random variables, one measured in inches and the
other in dollars, into inches and cents. In this conversion, the random variable $X$, initially measured in
inches, remains unchanged. However, the random variable $Y$, originally measured in dollars, is now
multiplied by 100 to represent cents. If we work through the definition, this means that $\mathrm{Cov}(X, Y)$ will be
multiplied by 100. To arrive at a unit-invariant measure of correlation, we need to counteract this unit
change by dividing the covariance by something that also scales in the same way. The natural candidate
for this role is the standard deviation. Indeed, if we define the correlation coefficient as

$$\rho_{XY} = \frac{\mathrm{Cov}(X, Y)}{\sigma_X \, \sigma_Y}$$

we see that this is a unit-less value.

Properties of correlation:

• $-1 \le \rho_{XY} \le 1$
• $\rho_{XY} = \pm 1$ if and only if $Y$ is an exact linear function of $X$
• $\rho_{XY}$ is unchanged by positive linear rescaling of either variable
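The unit-invariance argument can be checked numerically; the following sketch (assuming NumPy, with invented inches/dollars data) rescales dollars to cents and observes that the covariance scales by 100 while the correlation is unchanged:

```python
import numpy as np

rng = np.random.default_rng(2)

inches = rng.normal(loc=68.0, scale=3.0, size=50_000)
dollars = 2.0 * inches + rng.normal(scale=5.0, size=50_000)
cents = 100.0 * dollars               # unit change: dollars -> cents

print("Cov in dollar units:", np.cov(inches, dollars)[0, 1])
print("Cov in cent units:  ", np.cov(inches, cents)[0, 1])        # 100x larger

print("corr in dollar units:", np.corrcoef(inches, dollars)[0, 1])
print("corr in cent units:  ", np.corrcoef(inches, cents)[0, 1])  # identical
```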

Independence:

Two random variables $X$ and $Y$ are independent if:

$$f_{X,Y}(x, y) = f_X(x)\, f_Y(y) \quad \text{for all } x, y$$

This means that knowing $x$ tells us nothing about $y$. Example: When rolling two fair dice, the outcomes of
each die are independent random variables because the probability of each die's outcome is not
influenced by the other die's outcome.

The two RVs are uncorrelated if:

$$\mathrm{Cov}(X, Y) = 0, \quad \text{equivalently} \quad E[XY] = E[X]\, E[Y]$$

Uncorrelated random variables imply that there is no linear relationship between them. However, they
may still be dependent in a nonlinear or non-monotonic way. In other words, knowing the value of one
variable does not provide information about the linear relationship with the other, but they can still
exhibit other forms of statistical dependence. Consider $x$ distributed uniformly on $[-1, 1]$ and $y = x^2$. These two variables are uncorrelated
because there is no linear trend between them, but they are clearly dependent, since the value of $x$ completely determines the value of $y$.
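A quick numerical sketch of this uncorrelated-but-dependent example (assuming NumPy):

```python
import numpy as np

rng = np.random.default_rng(3)

x = rng.uniform(-1.0, 1.0, size=200_000)  # symmetric about zero
y = x ** 2                                # fully determined by x

# Correlation is ~0 (no linear trend), yet y is a deterministic function of x.
print("corr(x, y):", np.corrcoef(x, y)[0, 1])
```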

Conditional Statistics

Now that we know joint probabilities, let's talk about conditional statistics, which form the core of
supervised machine learning. For random variables $X$ and $Y$, $f_{X|Y}(x \mid y)$ is the PDF for $X$ conditioned on $Y = y$:

$$f_{X|Y}(x \mid y) = \frac{f_{X,Y}(x, y)}{f_Y(y)}$$

A conditional statistic like this is very important for pattern recognition, since we are assessing the
unknown (e.g., identity of pattern) conditioned on what's known (e.g., measurement).

Bayes’ Rule:

In machine learning and Bayesian statistics, we are often interested in making inferences about unobserved
(latent) random variables given that we have observed other random variables. Let us assume we have
some prior knowledge about an unobserved random variable $x$ and some relationship between $x$
and a second random variable $y$, which we can observe. If we observe $y$, we can use Bayes’ theorem to
draw some conclusions about $x$ given the observed values of $y$:

$$f_{X|Y}(x \mid y) = \frac{f_{Y|X}(y \mid x)\, f_X(x)}{f_Y(y)}$$

Here, $f_X(x)$ represents the prior distribution, which encapsulates our subjective understanding of the
unobserved (latent) variable $x$ before we have observed any data. We have the flexibility to select any
prior distribution that aligns with our reasoning, but it's of utmost importance to guarantee that this
prior has a non-zero probability density function (PDF) for all conceivable values of $x$, even if these
values are exceptionally infrequent or rare.

The likelihood $f_{Y|X}(y \mid x)$ describes how $x$ and $y$ are related, and in the case of discrete probability
distributions, it is the probability of the data $y$ if we were to know the latent variable $x$. Note that the
likelihood is not a distribution in $x$, but only in $y$. We call $f_{Y|X}(y \mid x)$ either the “likelihood of $x$ (given $y$)” or the
“probability of $y$ given $x$” but never the likelihood of $y$.

The posterior $f_{X|Y}(x \mid y)$ is the quantity of interest in Bayesian statistics and in pattern recognition because it
expresses exactly what we are interested in, i.e., what we know about $x$ after having observed $y$.
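As a minimal sketch of Bayes' rule in the discrete case (the two-coin setup and its probabilities are invented for illustration):

```python
# Infer which of two coins was flipped (latent x) from one observed
# outcome (y = heads).
prior = {"fair": 0.5, "biased": 0.5}             # prior over the latent x
likelihood_heads = {"fair": 0.5, "biased": 0.9}  # P(y = heads | x)

# Evidence P(y = heads): marginalize the joint over the latent variable.
evidence = sum(prior[c] * likelihood_heads[c] for c in prior)

# Posterior P(x | y = heads) via Bayes' theorem.
posterior = {c: prior[c] * likelihood_heads[c] / evidence for c in prior}
print(posterior)  # {'fair': ~0.357, 'biased': ~0.643}
```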
Random Vectors:

In pattern recognition, we will not be looking at one or two variables, but a large number of random
variables. We can represent them as a random vector $\mathbf{x} = (x_1, x_2, \ldots, x_d)^T$. The joint statistics that we developed above for
two variables transfer to the general case of random vectors with many random variables. For
example, the mean becomes a vector and the covariance becomes a matrix:

$$\boldsymbol{\mu} = E[\mathbf{x}], \qquad \Sigma = E\big[(\mathbf{x} - \boldsymbol{\mu})(\mathbf{x} - \boldsymbol{\mu})^T\big]$$

Note that the diagonal terms of the covariance matrix are the variances of the individual random variables (hence, always non-negative).

Sample Statistics:

Given the probability density, you can work out any expectation or correlation you want. However, in
reality, we often don't know the probability density, or even the mean or covariance of a random
vector! Instead, we will probably be given some training samples $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_N$, and we will have to infer the
probability density and other statistics from these samples. Such inferred statistics are
called sample statistics. For example, the sample mean and sample covariance are commonly defined as

$$\hat{\boldsymbol{\mu}} = \frac{1}{N} \sum_{n=1}^{N} \mathbf{x}_n, \qquad \hat{\Sigma} = \frac{1}{N-1} \sum_{n=1}^{N} (\mathbf{x}_n - \hat{\boldsymbol{\mu}})(\mathbf{x}_n - \hat{\boldsymbol{\mu}})^T$$
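A minimal sketch of computing sample statistics (assuming NumPy; the synthetic 3-dimensional data stands in for real training samples):

```python
import numpy as np

rng = np.random.default_rng(4)

# N = 1000 training samples of a 3-dimensional random vector (one per row).
samples = rng.normal(size=(1000, 3))

sample_mean = samples.mean(axis=0)           # estimates the mean vector
sample_cov = np.cov(samples, rowvar=False)   # estimates the covariance matrix
# (np.cov uses the 1/(N-1) normalization shown above)

print("sample mean:", sample_mean)
print("sample covariance:\n", sample_cov)
```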

Gaussian Distribution (Multivariate):

Since the Gaussian distribution is by far the most common statistical model used in pattern recognition,
it is important to understand how it can be used in the multivariate case. For a $d$-dimensional random vector $\mathbf{x}$ with mean $\boldsymbol{\mu}$ and covariance matrix $\Sigma$, the density is

$$f(\mathbf{x}) = \frac{1}{(2\pi)^{d/2} |\Sigma|^{1/2}} \exp\!\left(-\frac{1}{2} (\mathbf{x} - \boldsymbol{\mu})^T \Sigma^{-1} (\mathbf{x} - \boldsymbol{\mu})\right)$$

The Gaussian distribution has many convenient properties, which we will discuss later.

Let's see some interesting special cases:

Single variable: with $d = 1$, the density reduces to the familiar form

$$f(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(x - \mu)^2}{2\sigma^2}\right)$$

Diagonal covariances: $\Sigma = \mathrm{diag}(\sigma_1^2, \ldots, \sigma_d^2)$.

This means that all components of $\mathbf{x}$ are uncorrelated! This case is much easier to deal with, since the following then
holds true:

$$f(\mathbf{x}) = \prod_{i=1}^{d} \frac{1}{\sqrt{2\pi\sigma_i^2}} \exp\!\left(-\frac{(x_i - \mu_i)^2}{2\sigma_i^2}\right)$$
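A quick numerical check of this factorization (assuming SciPy; the mean, variances, and evaluation point are illustrative):

```python
import numpy as np
from scipy.stats import multivariate_normal, norm

mu = np.array([1.0, -2.0])
sigmas = np.array([0.5, 2.0])
cov = np.diag(sigmas ** 2)  # diagonal covariance: uncorrelated components

x = np.array([1.3, -1.1])   # an arbitrary evaluation point

joint = multivariate_normal(mean=mu, cov=cov).pdf(x)
product = np.prod(norm.pdf(x, loc=mu, scale=sigmas))  # product of 1-D Gaussians

print(joint, product)  # equal up to floating-point error
```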

Visualization of Gaussian Distribution:

One important way of visualizing bivariate Gaussian distributions is to sketch an equiprobability
contour (all points along the contour have equal probability density), defined as:

$$(\mathbf{x} - \boldsymbol{\mu})^T \Sigma^{-1} (\mathbf{x} - \boldsymbol{\mu}) = c \quad \text{for some constant } c > 0$$

After some derivation for the Gaussian distribution, what we need to sketch an equiprobability contour
is: the mean of the distribution, which will be the center of the ellipse; the eigenvectors of the
covariance matrix $\Sigma$, which will be the axes; and the lengths of the axes, which will be set to the square
roots of the eigenvalues of $\Sigma$.

Steps for getting the equiprobability contour of a Gaussian distribution:

• Given the mean $\boldsymbol{\mu}$ and covariance matrix $\Sigma$, compute the eigenvalues $\lambda_1, \lambda_2$ of $\Sigma$.

• Compute the corresponding eigenvectors $\mathbf{v}_1, \mathbf{v}_2$.

• Sketch the ellipse, centered at $\boldsymbol{\mu}$, with axes along $\mathbf{v}_1, \mathbf{v}_2$ and axis lengths proportional to $\sqrt{\lambda_1}, \sqrt{\lambda_2}$.

Example: Suppose we are given the following data:

How do we sketch its equiprobability contour?
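Since the example's data is not reproduced above, here is a sketch with an assumed mean and covariance (illustrative values only), following the steps listed:

```python
import numpy as np

# Assumed, illustrative mean and covariance matrix.
mu = np.array([2.0, 1.0])
Sigma = np.array([[3.0, 1.0],
                  [1.0, 2.0]])

# Eigendecomposition of the (symmetric) covariance matrix.
eigvals, eigvecs = np.linalg.eigh(Sigma)
print("axis directions (eigenvectors, one per column):\n", eigvecs)
print("axis lengths (sqrt of eigenvalues):", np.sqrt(eigvals))

# Trace the c = 1 contour: mu + V @ diag(sqrt(lambda)) @ (cos t, sin t).
t = np.linspace(0.0, 2.0 * np.pi, 200)
circle = np.vstack([np.cos(t), np.sin(t)])
ellipse = mu[:, None] + eigvecs @ (np.sqrt(eigvals)[:, None] * circle)
# `ellipse` is a 2 x 200 array of contour points, ready to plot.
```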
