Math 10 Note Pack With Practice Summer 2016
Math 10 Note Pack With Practice Summer 2016
1
Chapter 1: Sampling and Data - Notes
Statistics:
Descriptive:
Inferential:
Probability:
Key Terms:
Example: We are interested in the average number of units students in this class are taking
in Summer 2016.
Population:
Sample:
Parameter:
Statistic:
Variable:
Data:
2
Data (actual values of a variable)
Qualitative Data:
Quantitative Data:
Discrete:
Continuous:
Sampling:
Convenience Sampling:
3
Frequency Tables
4
Chapter 1: Sampling and Data - Practice
Sixty (60) De Anza students were asked how many movies (at the movie theater) they saw
last month. The data is summarized in the frequency table below.
Number of Movies Frequency Relative Frequency Cumulative
Relative
Frequency
0 10 0.1667
1 14 0.2333 0.4000
2 17
3 7
4
5 5
6 1
7 2
1. Put the table into your calculator, then use your calculator to complete the table.
5. The data was collected at De Anza College by dividing all students into 10 groups
according to their majors (some groups had more than one major) and then randomly
selecting 6 students from each of the 10 groups. What type of sampling is this?
6. If we took another sample by getting the alphabetical listing of all De Anza students,
randomly choosing the first student in the sample from the list, and then choosing every
350th student, what kind of sample would we have?
7. The average number of movies seen by students in the sample is an example of:
A. Parameter B. Data C. Statistic D. Variable
8. Let X = the number of movies (in a theater) De Anza students saw in the last month. X is
the:
A. Parameter B. Data C. Statistic D. Variable
5
9. Identify the type of data:
a. Number of students enrolled in Math 10.
b. Brand of coffee.
e. Favorite movie.
10. De Anza Security is taking a survey of the number of people arriving in each car that
will be parked in the parking structure. Name the sampling method used for each of the
following.
c. Randomly pick one section of the parking structure and survey every car in that section.
11. A lawyer is interested in the average time it takes for bills to be paid by clients who pay
bills.
b. X = the time it takes for ONE client to pay their bill. What is X an example of?
c. The lawyer takes her sample by gathering data on 10 randomly selected clients who have
paid their bills. The lawyer’s sample produces an average time to pay of 2 months. What
is this value (2 months) is an example of?
d. One particular client took 4 months to pay the bill. What is this value an example of?
6
Chapter 2: Displaying and Measuring Data - Notes
Stemplots
Example 1: The number of minutes 20 internet subscribers spent on the internet during
their most recent session:
50 40 41 17 11 7 22 44 28 21
19 23 37 51 54 42 88 41 78 56
Draw stem-and-leaf graph (also known as stem plot) Stems here are 0 – 8.
7
Histograms
Horizontal axis:
Vertical Axis:
Number of bars:
Lowest Value:
Highest Value:
Width of bar:
8
Box Plots and Histograms
Consider the following data for the number of movies 60 students watched during Spring
Break:
2. Find:
lowest value:
highest value:
median:
Q1:
Q3:
IQR:
Mean:
Mode:
9
4. Draw a histogram.
POTENTIAL OUTLIERS: values that lie more than 1.5 IQR above Q3 or more than 1.5 IQR
below Q1
Q3 + 1.5*IQR
Q1 - 1.5*IQR
7. Skewness?
10
z-score
z-score:
Example: Let the population mean be 5 and the population standard deviation be 2.
11
Chapter 2: Displaying and Measuring Data – Practice
Consider the following data for the number of movies 80 students watched during Spring
Break:
2. Draw a histogram.
12
4. Find:
lowest value:
highest value:
median:
Q1:
Q3:
IQR:
Mean:
Mode:
13
7. Find the value that is 2 standard deviations above the mean. Are there any data values above
this?
8. Find the value that is 2 standard deviations below the mean. Are there any data values below
this?
9. Khong, Megan and Jabbar are runners on the track teams at three different schools. Their
running times, in minutes, and the statistics for the track teams at their respective schools for a
one mile run are given in the table below.
a) Which student is the FASTEST when compared to the other runners at his or her school?
HINT – calculate the z-score for each student.
b) Which student is the SLOWEST when compared to the other runners at his or her
school?
14
Chapter 3: Probability Topics - Notes
P(A)
P(B)
P(A AND B)
P(A OR B)
P(B | A)
P(A | B)
P(A’)
15
Independent Events:
• Addition Rule:
Example
Roll a dice. Event A = get a prime number. Event B = get a multiple of 3.
Find P(A)
Find P(B)
Find P(A | B)
Find P(B’)
16
Contingency Tables:
Example: Random Sample of 100 hikers and the areas of preferred hiking.
HIKING AREA
PREFERENCE
The Coastline Near Lakes On Mountain
and Streams Peaks Total
SEX
Female 18 16 45
Male 14 55
41
Total
M = being male F = being female MP = On Mt. Peaks LS = near lakes and streams Find:
• P(F)
• P(M)=
• P(LS)=
• P(F or MP)=
• P(M|MP)=
• P(LS|M)=
17
Trees:
A frequency tree has branches labeled with frequencies and a probability tree has
branches labeled with probabilities.
Example: There are 3 green balls and 4 red balls in a box that you cannot see into. Draw 2
balls WITHOUT replacement. R1 = red ball on first drawing, G1 = green ball on first
drawing, etc. Organize the information in a tree. Draw a probability tree.
• P(G2)
• P(G1 | G2)
18
Chapter 3: Probability Topics - Practice
1. The following table gives the number of medical claims by type of treatment and
geographic region.
Region
Type of Claim West Mid West South East TOTAL
Outpatient 99 65 326 100 590
Physician Visit 251 104 514 233 1102
Hospitalization 52 29 128 75 284
TOTAL 402 198 968 408 1976
If a claim is chosen at random, compute the following. Leave your answers in unreduced
fractional form.
f. Are Hospitalization AND South MUTUALLY EXCLUSIVE events? Use numbers to justify
your answer and explain.
19
2. Based on the results of a poll of 180 voters, out of the 80 Republican voters, 60 voted for
a certain proposition and 20 voted against the proposition. Among the100 Democrats, 30
voted in favor of the proposition and 70 voted against the proposition.
R = Republican; D = Democrat; F = for; A = against.
b) If a voter is randomly selected from those voters polled, what is the probability that he
was against the proposition?
c) If a voter who was against the proposition was randomly selected from those voters
polled, what is the probability that he was a Democrat?
20
Chapter 4: Discrete Random Variables-Notes
Terms
Random variable:
Example Nancy has class 3 days a week. She attends class 3 days a week 80% of the time, 2
days 15% of the time, 1 day 4%, 0 days 1%.
Random variable X =
21
Binomial Distribution X ~ B(n, p)
Bernoulli Trial:
Binomial Distribution:
X=
n=
p=
X~
22
Example: John comes to class totally unprepared for a 21 question math 10 Exam, so he
guesses randomly on each question. There are 4 possible answers per question.
Let X =
n=
p=
X~
23
Chapter 4: Discrete Random Variables - Practice
Based on past experience, the Math printer at De Anza College is operating properly 70% of
the time. Suppose inspections are made at 10 randomly selected times.
d) What is the probability that the Math printer is operating properly for exactly 7 of the
inspections?
e) What is the probability that the Math printer is operating properly on 8 or more
inspections?
f) What is the expected number of inspections in which the Math printer is operating
properly?
24
Chapter 5: Continuous Random Variables - Notes
Skip the Uniform Distribution
Exponential Distribution
• Notation: X
• decay parameter
• mean
• standard deviation
25
Probability:
Area to the left of k:
Percentiles:
26
Example: The length of time a randomly chosen 11-year old child spends playing video
games per day is approximately exponentially distributed with a mean equal to 1.5 hours.
Let X =
X~
f(X) =
2. Find the probability that a randomly chosen 11-year old spends more than 1.5 hours
playing video games per day.
3. Find the probability that a randomly chosen 11-year old spends less than 2 hours
playing video games per day.
4. Find the probability that a randomly chosen 11-year old spends between 1 and 1.8
hours playing video games per day.
5. Ninety percent of the 11-year olds spend at LEAST how long playing video games
per day?
6. Find the median length of time 11-year olds spend playing video games per day.
27
Chapter 5: Continuous Random Variables - Practice
1. Suppose the battery life (in years) of a battery in Kathy’s wrist watch is exponentially
distributed with a decay rate of 0.25.
a) In words, X =
b) In symbols, X ~
c) On average, how long will the battery last? Be sure to give units.
d) Find the probability that Kathy won’t have to replace her watch’s battery for at least 6
years. Draw an appropriate picture.
e) Find the probability the battery will last between 3 and 4.5 years. Draw an appropriate
picture.
f) 70% of batteries will last at least how long? Draw an appropriate picture.
28
2.. The time between patients arriving at an urgent care clinic follows an exponential
distribution with a mean of 7 minutes.
a) In words, X =
b) In symbols, X ~
c) Find the probability that the next patient arrives in less than 5 minutes after the
previous one. Draw an appropriate picture.
d) Find the 80th percentile for the amount of time that passes between patient arrivals.
Write a sentence interpreting this percentile. Draw an appropriate picture.
29
Chapter 6: Normal Distribution - Notes
• Parameters:
• Graph
• z-score
30
Example: Suppose Z ~ N(0, 1). Draw pictures and find the following.
1. P(-128 < Z < 1.28)
Nonstandard normal:
31
Example: Chickens at Colonel Mustard’s ranch have a mean weight of 1853 grams with a
standard deviation of 150 grams. The weights of the chickens are closely approximated by
a normal curve.
e) What is the minimum weight of a chicken that is in the top 12% weight group?
32
Chapter 6: Normal Distribution – Practice
1. A stamping machine produces can tops whose diameters are normally distributed with a
mean of 3.124 inches and a standard deviation of 0.03 inches. Let X be the diameter of a
can top produced by this stamping machine.
2. The time, in minutes, to complete the daily Sudoku puzzle in the San Jose Mercury News
follows a normal distribution. For Ali, the distribution is N(5, 1.2); for Binh, N(8, 2.1); for
Carlos, N(7, 2.8). On Saturday, the puzzle was particularly hard. Ali took 7.5 minutes, Binh
took 12 minutes and Carlos took 11 minutes.
Ali___________________________
Binh __________________
Carlos________________________
Is Ali faster than the others when each is compared to his/her usual performance?
33
3. The percent of fat calories that a person in America consumes each day is normally
distributed with a mean of about 37 and a standard deviation of 12. Suppose that one
individual is randomly chosen. Let X =percent of fat calories.
a. X∼_______(_______,_______)
b. Find the probability that the percent of fat calories a person consumes is more than 42.
Graph the situation. Shade in the area to be determined.
c. Find the minimum number for the upper quarter of percent of fat calories. Sketch the
graph and write the probability statement.
34
Chapter 7: Central Limit Theorem (CLT) – Notes
NOTE: SKIP The Central Limit Theorem (CLT) for Sums. We do CLT for AVERAGES.
Experiment:
35
Draw a histogram for each.
• µx = the mean of X
If you draw random samples of size n, then as n increases, the random variable X
which consists of averages tends to be normally distributed and
X ~
The CLT for Averages says that if you keep drawing larger and larger samples and
taking their averages, the averages themselves form a normal distribution.
37
Example 1: Suppose that the distance of fly balls hit to the outfield (in baseball) is
normally distributed with a mean of 250 feet and a standard deviation of 50 feet. We
randomly sample 49 fly balls.
a. If X = average distance in feet for 49 fly balls, then X ∼ ____ ( _____ , _____ )
c. What is the probability that the 49 balls traveled an average of less than 240
feet?
d. What is the probability that a single fly ball traveled less than 240 feet?
e. Find the 80th percentile of the distribution of the average distance of 49 fly
balls.
38
Example 2: The monthly income of the trainees at a large corporation is
approximately normally distributed. The mean monthly income of the trainees is
$1500 with a standard deviation of $200. We’ll take a random sample of size 25.
a. Define X and X . Give their distributions.
d. What is the probability that a trainee has income between $1400 and
$1600?
e. If 25 trainees are selected randomly, what is the probability that their average
income will be between $1400 and $1600?
39
Chapter 7: Central Limit Theorem (CLT) – Practice
1. We are interested in the lifetime of one battery. Define the random variable X in
words.
3. Find the probability that the lifetime of one battery is between 35 and 40 hours.
40
6. We are interested in the average lifetime of 16 of these batteries. Call this random
variable Y. In words, define Y.
8. Find the probability that the average lifetime of 16 batteries is between 35 and
40 hours.
10. Find the 75th percentile for the average lifetime of 16 batteries.
41
11. The amount of time that Humboldt State female soccer players play “Texas Hold
‘Em” each week has an unknown distribution with a mean of 6.5 hours. Suppose the
standard deviation is 2 hours. Consider a random sample of 20 soccer players
a) In words, X =
b) In words, X =
c) X ~
d) Find the probability that the average time the 20 female soccer players play
“Texas Hold ‘Em” is between 6.2 and 6.7 hours. Draw a picture, shading the
appropriate area. Label the x-axis.
e) Find the 95th percentile of the distribution for the average time that the 20
Humboldt State female soccer players play “Texas Hold ‘Em” each week. Round
answer to 2 decimal places.
42
Chapter 8: Confidence Intervals – Notes
point estimate
confidence interval
If you draw random samples of size n, then as n increases, the random variable X
which consists of averages tends to be normally distributed.
X ~ N(µx ,σx/ n )
43
Confidence interval for single population mean, population standard
deviation known – use normal distribution
Example: Unoccupied seats on flights cause airlines to lose revenue. Suppose a large
airline wants to estimate its average numbers of unoccupied seats per flight over the
past year. To accomplish this, the records of 15 flights are randomly selected and
the number of unoccupied seats is noted for each of the sample flights. The sample
mean is x = 11.6. Assume the standard deviation for the population of unoccupied
seats is σ = 2.25. Find and interpret a 95% confidence interval for the true average
number of unoccupied seats per flight.
c) What is a point estimate for the true average number of unoccupied seats per
flight?
d) Find a 95% confidence interval for the true mean number of unoccupied seats
per flight.
44
Confidence interval for single population mean, population standard
deviation unknown – use student-t distribution
Example: With the price of ski slope passes, most skiers or snowboarders are
interested in finding a reasonably priced room within the ski area. A random
sample of the minimum price per night of a room in 10 ski areas is as follows: $35,
$65, $65, $72, $79, $79, $79, $89, $99, $99.
c) What is a point estimate for the true mean minimum price per night of a room in
a ski area?
d) Find a 93% confidence interval for the true mean minimum price per night of a
room in a ski area.
45
Confidence interval for a population proportion – use normal approximation
to the binomial
e) CHOOSE ONE: The term “95% confidence” means if we took repeated samples,
then:
46
Chapter 8: Confidence Intervals – Practice
Practice 1: The Ice Chalet offers dozens of different beginning ice-skating classes.
All of the class names are put into a bucket. The 5 P.M., Monday night, ages 8 - 12,
beginning ice-skating class was picked. In that class were 64 girls and 16 boys.
Suppose that we are interested in the true proportion of girls, ages 8 - 12, in all
beginning ice-skating classes at the Ice Chalet.
ESTIMATED DISTRIBUTION
b. n = __________
c. p' = __________
P' = ______________________________________________________________
Construct a 92% Confidence Interval for the true proportion of girls in the age 8 - 12
beginning ice-skating classes at the Ice Chalet.
12. Fill in the blanks on the graph with the areas, upper and lower limits of the
Confidence Interval, and the sample proportion.
48
DISCUSSION QUESTIONS
1. Using the same p' and level of confidence, suppose that n were increased to 100.
Would the error bound become larger or smaller? How do you know?
2. Using the same p' and n = 80, how would the error bound change if the
confidence level were increased to 98%? Why?
Practice 2: The following real data are the result of a random survey of 39 national
flags (with replacement between picks) from various countries. We are interested in
finding a confidence interval for the true average number of colors on a national
flag. Let X = the number of colors on a national flag.
X Freq.
1 1
2 7
3 18
4 7
5 6
b. sx = __________
c. n = ___________
X = ___________________________________________________________
49
3. What is x estimating?
4. Is σx known?
5. As a result of your answer to (4), state the exact distribution to use when
calculating the Confidence Interval.
Construct a 95% Confidence Interval for the true average number of colors on
national flags.
50
10. Fill in the blanks on the graph with the areas, upper and lower limits of the
Confidence Interval and the sample mean.
DISCUSSION QUESTIONS
12. Using the same x , sx , and level of confidence, suppose that n were 69 instead of
39. Would the error bound become larger or smaller? How do you know?
13. Using the same x , sx , and n = 39, how would the error bound change if the
confidence level were reduced to 90%? Why?
51
Chapter 9: Hypothesis Testing with One Sample – Notes
§ Null hypothesis:
§ Alternate/Alternative hypothesis:
Example: State the null hypothesis, H0, and the alternative hypothesis, Ha, in terms
of the appropriate parameter (μ or p).
H0:
Ha:
Fewer than 5% of adults ride the bus to work in New York City.
H0:
Ha:
H0:
Ha:
Private universities cost, on average, more than $20,000 per year for tuition, room,
and board.
H0:
Ha:
52
Correct Decisions and Errors:
H0 is true H0 is false
Decision: Reject H0
α is preconceived. Its value is set before the hypothesis test starts. If there is no
given preconceived α, then use α=0.05
Private universities cost, on average, more than $20,000 per year for tuition, room,
and board.
Ho:
Ha:
Type I Error:
Type II Error:
53
To perform a hypothesis test:
§ Set up hypotheses
§ sample data is gathered
§ data typically favors one of the hypotheses
§ Calculate p-value
p-value =
§ Decisions
n if data favors the null hypothesis (H0), we “do not reject the null”
n if data favors the alternate hypothesis (Ha), we “reject the null”
54
Example 1: According to an article in The New York Times (5/12/2004), 19.3% of
New York City adults smoked in 2003. Suppose that a survey is conducted to
determine this year’s rate. Twelve out of 70 randomly chosen N.Y. City residents
reply that they smoke. At the 5% level, conduct a hypothesis test to determine if the
rate is less than 19.3%.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
55
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the
reason for it, and write an appropriate conclusion, using COMPLETE SENTENCES.
__________________ _______________________________
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
i. Construct a 95% Confidence Interval for the true mean or proportion. Include a
sketch of the graph of the situation. Label the point estimate and the lower and
upper bounds of the Confidence Interval.
<---------------------------------------------------------------------------->
56
Example 2: The average number of sick days an employee takes per year is believed to
be about 10. Members of a personnel department do not believe this figure. They
randomly survey 8 employees. The number of sick days they took for the past year are as
follows: 12; 4; 15; 3; 11; 8; 6; 8. Let x = the number of sick days they took for the past
year. Should the personnel team believe that the average number is about 10?
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
57
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the
reason for it, and write an appropriate conclusion, using COMPLETE SENTENCES.
__________________ _______________________________
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
i. Construct a 95% Confidence Interval for the true mean or proportion. Include a
sketch of the graph of the situation. Label the point estimate and the lower and
upper bounds of the Confidence Interval.
<---------------------------------------------------------------------------->
58
Example 3: Employees in a large firm claim that the mean annual salary of the firm’s
accountants is less than that of its competitors, which is $45,000. A random sample
of 30 of the firm’s accountants has a mean salary of $43999 with a standard
deviation of $5200. At a significance level of 5%, test the employee’s claim.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
59
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the
reason for it, and write an appropriate conclusion, using COMPLETE SENTENCES.
__________________ _______________________________
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
i. Construct a 95% Confidence Interval for the true mean or proportion. Include a
sketch of the graph of the situation. Label the point estimate and the lower and
upper bounds of the Confidence Interval.
<---------------------------------------------------------------------------->
60
Chapter 9: Hypothesis Testing with One Sample - Practice
1. Suppose the percent of students expected to pass the second exam of the quarter
in Calculus is expected to be 74%. A sample of 21 students results shows that 66%
of the students passed the exam. Conduct a hypothesis test to determine if the
survey result is different from the expected pass percentage of 74%.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
61
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the
reason for it, and write an appropriate conclusion, using COMPLETE SENTENCES.
__________________ _______________________________
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
i. Construct a 95% Confidence Interval for the true mean or proportion. Include a
sketch of the graph of the situation. Label the point estimate and the lower and
upper bounds of the Confidence Interval.
<---------------------------------------------------------------------------->
62
2. A student is doing a statistics project on the weight of small “fun-sized” bags of
M&M’s. The product information states that the average weight of a bag is 1.75
ounces. The student weighs 18 bags of candy and finds that the mean weight is 1.7
ounces with a standard deviation of 0.0475 ounces. At a 5% significance level, is the
manufacturer’s stated weight accurate? (Assume that the underlying distribution is
normal)
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
63
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the
reason for it, and write an appropriate conclusion, using COMPLETE SENTENCES.
__________________ _______________________________
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
i. Construct a 95% Confidence Interval for the true mean or proportion. Include a
sketch of the graph of the situation. Label the point estimate and the lower and
upper bounds of the Confidence Interval.
<---------------------------------------------------------------------------->
64
Chapter 10: Hypothesis Testing with Two Samples - Notes
Two proportions
65
Example 1: The average number of English courses taken in a two–year time period
by male and female college students is believed to be about the same. An experiment
is conducted and data are collected from 29 males and 16 females. The males took
an average of 3 English courses with a standard deviation of 0.8. The females took
an average of 4 English courses with a standard deviation of 1.0. Are the averages
statistically the same?
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
66
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
67
Example 2: Proportions
Medical researchers were interested in whether a new drug (N) reduces pain more
effectively than the current drug (C). Results of patient responses from two groups
of randomly selected patients taking one of the two drugs are summarized below.
Use a 3% level of significance.
Pooled Proportion:
Distribution to use:
_______________________________________________________________________________________________
68
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
69
Example 3 (Matched or paired samples)
Ten individuals went on a low–fat diet for 12 weeks to lower their cholesterol.
Evaluate the data below. Do you think that their cholesterol levels were significantly
lowered?
_______________________________________________________________________________________________
70
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
71
Chapter 10: Hypothesis Testing with Two Samples - Practice
Test the hypothesis that adults prefer warmer room temperatures than children.
_______________________________________________________________________________________________
72
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
73
2. A local women’s group has claimed that men and women differ in attitudes about
the environment. A group of 50 men (M) and 40 women (W) were asked if they
thought that protecting the environment was an important issue. Of those persons
sampled, 11 of the men and 19 of the women said they believed that protecting the
environment was an important issue. Conduct a hypothesis test to test the local
women’s group claim.
_______________________________________________________________________________________________
74
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
75
3. A city recently launched a neighborhood watch program to control crime. The
following table gives the number of crimes reported in six neighborhoods during the
six months before and six months after the new neighborhood watch program was
launched.
Before 57 73 47 68 79 39
After 41 65 28 73 61 32
Test the hypothesis that crime was better controlled after the new neighborhood
watch program was launched.
_______________________________________________________________________________________________
76
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
77
Chapter 11: The Chi-Square Distribution - Notes
Test of Independence
Goodness-of-Fit Test
Chi-square or χ 2
78
Example: The percentage of students who attend a local school in any given school
week is as follows:
Monday Tuesday Wednesday Thursday Friday
95% 96% 98% 97% 95%
In one given school week, the number of students (data) who attended school out of
a student population of 500 was:
a. Ho:
b. Ha:
79
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
80
Example 2: Conduct a hypothesis test to determine whether there is a relationship
between an employees performance in a company’s training program and his/her
ultimate success on the job. Use a level of significance of 1%.
Training program
Performance on Job
Performance in
a. Ho:
b. Ha:
Poor
Average
Very Good
81
g. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
h. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
i. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:_____________________________________________________________________________
_____________________________________________________________________________________________
_______________________________________________________________________________________________
82
Chapter 11: The Chi-Square Distribution – Practice
a. Ho:
b. Ha:
83
f. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
g. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
h. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
84
Example 2: The table shows a random sample of 100 hikers and the area of hiking
preferred. Are hiking area preference and gender independent?
Female 18 16 11
Male 16 25 14
Perform a hypothesis test at the 5% level to determine if hiking area preference and
gender are independent.
a. Ho:
b. Ha:
Female
Male
85
g. p-value = ______________ In 1 – 2 complete sentences, explain what the p-value
means for this problem.
_______________________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
h. Use the previous information to sketch a picture of this situation. CLEARLY, label
and scale the horizontal axis and shade the region(s) corresponding to the
p-value.
<---------------------------------------------------------------------------->
i. Indicate the correct decision (“reject” or “do not reject” the null hypothesis) and
write appropriate conclusions, using COMPLETE SENTENCES.
Conclusion:________________________________________________________________________________
_______________________________________________________________________________________________
_______________________________________________________________________________________________
86
Chapter 12: Linear Regression and Correlation - Notes
Bivariate data :
Linear Regression:
Correlation Coefficient:
Coefficient of Determination:
87
Determine degrees of freedom:
OR use p-value from LinRegTTest (p-value < alpha means correlation coefficient is
significant)
Outliers!
88
NOTE: NEED Diagnostic ON (under 2nd catalogue)
Example: Table shows quiz score (out of 20) and the grades on a midterm exam
(out of 100) for a sample of 8 students who took this course last quarter..
quiz 20 15 13 18 18 20 14 16
midterm 92 72 72 95 88 98 65 77
1. Draw the scatter plot. Does it look like a straight line will fit the data?
2. Calculate the equation of least squares line / regression line/ line of best fit.
89
7. What is the predicted score on the midterm for a student who got a score of
17 on the quiz? Do NOT predict outside the domain!
90
Example: Table shows income level versus percent of income donated to charity.
income 42 48 50 59 65 72
level in
$1000
percent 9 10 8 5 6 3
donated
to charity
10. Draw the scatter plot. Does it look like a straight line will fit the data?
11. Calculate the least squares line / regression line.
12. Draw the regression line over the scatterplot.
13. Find the correlation coefficient. Is it significant? Give the p-value and the
degrees of freedom.
14. Write an interpretation of the slope of the line.
15. Find the estimated donating percent for someone earning $62,000, $45,000.
Do NOT predict outside the domain!
16. Find s - the standard deviation of the residuals
17. Are the any outliers? If so, what are they?
91
TABLE for calculating outliers:
Income level Percent Predicted y Residual |y- ŷ|2
in $1000 donated to ŷ y- ŷ
charity
42 9
48 10
50 8
59 5
65 6
72 3
92
Chapter 12: Linear Regression and Correlation – Practice
The following table shows the calories and sugar content for one cup of each of 7
different breakfast cereals.
1. Draw the scatter plot. Does it look like a straight line will fit the data?
2. Calculate the least squares line / regression line. line of best fit. Put the
equation in the form yhat = a + b x.
3. Draw the regression line over the scatterplot.
4. Find the correlation coefficient. Is it significant? Give the p-value and the
degrees of freedom.
5. Write an interpretation of the slope of the line
6. Find the estimated grams of sugar for a cereal containing 115 calories in one
cup.
7. Find s - the standard deviation of the residuals
8. Are the any outliers? If so, what are they?
93
Table for calculating outliers:
Calories, x Sugar (g) y Predicted y Residual |y-ŷ|2
ŷ y- ŷ
130 16
100 2
120 14
120 15
120 11
110 6
110 4
94
Chapter 13: F Distribution and One Way ANOVA – Notes
ANOVA
F-Distribution
95
ANOVA summary table
Example: Are the means for the final exam scores the same for all statistics class
delivery types? The table below shows the scores on final exams from several
randomly selected classes that used the different delivery types.
Assume that all distributions are normal, the 3 population standard deviations are
approximately the same, and the data were collected independently and randomly.
Use the 0.05 level of significance.
H0: Ha:
96
Complete the ANOVA table:
Test Statistic:
Decision:
Conclusion:
97