Statistics 111 - Lecture 6: Probability
Statistics 111 - Lecture 6: Probability
Administrative Note
Homework 2 due Monday, June 8th
Look at the questions now!
June 3, 2008
Course Overview
Collecting Data
Comparing Variables
Means
Proportions
Regression
Contingency Tables
June 2, 2008
June 3, 2008
In random processes, the outcome is not known exactly, but we can still describe the probability distribution of possible outcomes
Eg. 10 coin tosses: we dont know exactly how many heads we will get, but we can calculate the probability of getting a certain number of heads
June 3, 2008 Stat 111 - Lecture 6 - Probability 5
Events
An event is an outcome or a set of outcomes of a random process
Example: Tossing a coin three times Event A = getting exactly two heads = {HTH, HHT, THH} Example: Picking real number X between 1 and 20 Event A = chosen number is at most 8.23 = {X 8.23} Example: Tossing a fair dice Event A = result is an even number = {2, 4, 6}
Notation: P(A) = Probability of event A Probability Rule 1: 0 P(A) 1 for any event A
June 3, 2008 Stat 111 - Lecture 6 - Probability 6
Sample Space
The sample space S of a random process is the set of all possible outcomes
Example: one coin toss S = {H,T} Example: three coin tosses S = {HHH, HTH, HHT, TTT, HTT, THT, TTH, THH} Example: roll a six-sided dice S = {1, 2, 3, 4, 5, 6} Example: Pick a real number X between 1 and 20 S = all real numbers between 1 and 20
Combinations of Events
The complement Ac of an event A is the event that A does not occur Probability Rule 3: P(Ac) = 1 - P(A) The union of two events A and B is the event that either A or B or both occurs The intersection of two events A and B is the event that both A and B occur
Event A Complement of A Union of A and B Intersection of A and B
June 3, 2008
Disjoint Events
Two events are called disjoint if they can not happen at the same time
Events A and B are disjoint means that the intersection of A and B is zero
Probability Rule 4: If A and B are disjoint events then P(A or B) = P(A) + P(B)
June 3, 2008 Stat 111 - Lecture 6 - Probability 9
Independent events
Events A and B are independent if knowing that A occurs does not affect the probability that B occurs
Independent
Example: One Dice Tossed P(even number) = |2,4,6| / |1,2,3,4,5,6| Note: equal outcomes rule only works if the number of outcomes is countable
Eg. of an uncountable process is sampling any fraction between 0 and 1. Impossible to count all possible fractions !
June 3, 2008
11
June 3, 2008
12
Break
June 3, 2008
16
Outline
Moore, McCabe and Craig: Section 4.3,4.5 Conditional Probability Discrete Random Variables Continuous Random Variables Properties of Random Variables
Means of Random Variables Variances of Random Variables
June 4, 2008 Stat 111 - Lecture 6 - Random Variables 17
Conditional Probabilities
The notion of conditional probability can be found in many different types of problems Eg. imperfect diagnostic test for a disease
Disease + Test + Test 30 10 Disease 10 50 Total 40 60
Total
40
60
100
What is probability that a person has the disease? Answer: 40/100 = 0.4 What is the probability that a person has the disease given that they tested positive?
More Complicated !
June 4, 2008 Stat 111 - Lecture 6 - Random Variables 18
June 4, 2008
19
Random variables
A random variable is a numerical outcome of a random process or random event Example: three tosses of a coin
S = {HHH,THH,HTH,HHT,HTT,THT,TTH,TTT} Random variable X = number of observed tails Possible values for X = {0,1, 2, 3}
June 4, 2008
21
2
0.413
3
0.236
4
0.211
5
0.090
6
0.032
7
0.018
June 4, 2008
22
Another Example
Random variable X = the sum of two dice X takes on values from 2 to 12 Use equally-likely outcomes rule to calculate the probability distribution:
X
# of Outco mes
2 1 1/36
3 2 2/36
4 3 3/36
5 4 4/36
6 5 5/36
7 6 6/36
8 5 5/36
9 4 4/36
10 3 3/36
11 2 2/36
12 1 1/36
P(X)
Probability Histograms
Probability histogram of sum of two dice:
Using the disjoint addition rule, probabilities for discrete random variables are calculated by adding up the bars of this histogram:
P(sum > 10) = P(sum = 11) + P(sum = 12) = 3/36
June 4, 2008 Stat 111 - Lecture 6 - Random Variables 24
June 4, 2008
25
Although it seems more complicated, it is often easier to integrate than add up discrete bars If a discrete r.v. has many possible values, we often treat that variable as continuous instead
June 4, 2008 Stat 111 - Lecture 6 - Random Variables 26
June 4, 2008
27
June 4, 2008
29
June 4, 2008
32