
1. Using Excel, SPSS or any other software package of your choice, analyze the data in problem 3.40 on page 91. Have the computer generate descriptive statistics and a histogram on the age. The descriptiv


Uploaded by

Adel Hassan

1. Using Excel, SPSS or any other software package of your choice, analyze the data in problem 3.40 on page 91. Have the computer generate descriptive statistics and a histogram on the age. The descriptive statistics should at minimum include mean, median, mode, range, variance, standard deviation, measure of skewness, kurtosis, and the coefficient of variation.

1a. What are the values for mean, mode, median, standard deviation, and coefficient of variation?
Mean =
Mode =
Median =
Standard Deviation =
Coefficient of Variation =

1b. Based on the values for mean, mode, and median, what statement can be made related to skewness of the dist
Place your answer for question 1b in this cell

1c. What is the interpretation of the value for coefficient of variation? (i.e. explain what it does it mean if COV=1
Place your answer for question 1c in this cell

1d. Is the data for problem 3.40 mesokurtic, leptokurtic, or platykurtic? Justify your answer using descriptive sta
Place your answer for question 1d in this cell

2. Using a one population parameter test of hypothesis, test the hypothesis described for the following problem:
designed landscaping for residential areas. The estimated labor cost associated with a particular landscaping prop
plantings of trees, shrubs, and so on to be used for the project. For cost estimating purposes, managers use 2 hou
medium-size tree. Actual times from a sample of 10 plantings during the past month follow (times in hours): 1.9

2a. Conduct the appropriate directional test of hypothesis using a significance level of 5%, to test whether the me
Make sure your computer report has the answers to the following steps:

State the null and alternative hypothesis.


Place the null and alternative hypothesis in this cell
What is (are) the critical value(s) of sample test statistics based on problem specified significance level?
Place the critical value(s) of sample test statistics in this cell
What is the calculated sample statistic based on sample data? (the computer generated output should provide this
Place the value of sample test statistic in this cell
Reject or don't reject null hypothesis?
State "reject/don't reject null hypothesis" in this cell
What statement should be made about the mean planting times based on this test of hypothesis?
Place the statement about mean planting times in this cell
2b. Using the confidence interval approach, calculate a 95% confidence interval for the data in problem #2.
Place the 95% confidence interval in this cell

3. Use the data on page 367, problem #10.8, of the course text book to determine whether there is a difference in
Atlanta. Use significance level of 5%.

State the null and alternative hypothesis.


Place the null and alternative hypothesis in this cell
What is (are) the critical value(s) of sample test statistics based on problem specified significance level?
Place the critical value(s) of sample test statistics in this cell
What is the calculated sample statistic based on sample data?
Place the value of sample test statistic in this cell
Reject or don't reject null hypothesis?
State "reject/don't reject null hypothesis" in this cell
What statement should be made about the difference in cost of gallon of milk in Seattle and Atlanta?
Place the statement about the difference in cost of a gallon of milk in Seattle and Atlanta in this cell

4. Using your computer software program and the data on pg. 407, problem
10.63, answer the followings:

Construct a 90% confidence interval to estimate the average difference


between the price of name-brand soup and the price of store-brand soup.
Place the 90% confidence interval in this cell

Does it make sense to treat these variables (groups) as dependent or


independent when conducting test of hypothesis? Why would it matter?
What are the trade-offs?
Place your answer to the above questions in this cell (variable treatment,
trade-offs, …)
Conduct the appropriate test of hypothesis to determine whether there is a
difference between the price of name-brand soup and store-brand soup, using
significance level of 10%. Use the five step process and analyze your results
in relation to the information described in the exercise.

State the null and alternative hypothesis.


Place the null and alternative hypothesis in this cell
What is (are) the critical value(s) of sample test statistics based on problem
specified significance level?
Place the critical value(s) of sample test statistics in this cell
What is the calculated sample statistic based on sample data? (the computer
generated output should provide this information)
Place the value of sample test statistic in this cell
Reject or don't reject null hypothesis?
State "reject/don't reject null hypothesis" in this cell
What statement should be made about the difference between price of name-
brand soup and store-brand soup?
Place the statement about the difference between price of name-brand soup
and store-brand soup in this cell

5. Using the data in chapter 11, problem #11.13, pg. 428, have the computer do a one-way analysis of variance to
significant difference in the evaluations according to manager level.

5a. What are the appropriate null and alternative hypothesis?


Place the null and alternative hypothesis in this cell
5b. What is the test statistic that should be used for this type of test of hypothesis and its actual value based on yo
Place the value of the test statistic in this cell
5c. What is the critical value of the test statistic?
Place the critical value of the test statistic in this cell
5d. What statement should be made about the null hypothesis based on your answers to the previous two questio
Place the statement (reject/don't reject) null hypothesis in this cell
5e. What is the conclusion specific to this problem based on the statement made about the null hypothesis in the
Place the specific conclusion in this cell
6. If you obtain a significant F statistic for question 5, run a Post-Hoc Comparison’s test on the 3 means to answ
whichever Post-Hoc Comparison procedure you desire. If none exists on your software, do individual t-tests on a
relates to the concept in chapter 11.3)

6a. Which Post-Hoc Comparison procedure did you use for this problem?
Place the name of the Post-Hoc Comparison procedure used in this cell
6b. Which pairs of means are significantly different, if any?
Place your answer for question 6b in this cell
6c. What is the conclusion specific to this problem based on the answer to "Which pairs of means are significant
Place your answer for question 6c in this cell

===============

3.40 The 2010 U.S. Census also asked for each person's age. Suppose that a sample of 40 households taken from the census data showed the age of the first person recorded on the census form to be as follows.
42 29 31 38 55 27 28
33 49 70 25 21 38 47
63 22 38 52 50 41 19
22 29 81 52 26 35 38
29 31 48 26 33 42 58
40 32 24 34 25
Compute P10, P80, Q1, Q3, the interquartile range, and the range for these data.
3.41 Shown below are the top 15 market research firms in the United States in 2010 according to Inside Research and the 2011 Honomichl report. Compute the mean, median, P30, P60, P90, Q1, Q3, range, and the interquartile range on these data.
Company Sales ($millions)
The Nielsen Co. $2,407.0
Kantar 914.7
IMS Health Inc. 801.0
SymphonyIRI Group 457.0
Westat Inc. 455.3
Arbitron Inc. 390.4
Ipsos 379.6
GfK USA 290.9
Synovate 235.8
The NPD Group Inc. 173.7
ICF International Inc. 153.2
J.D. Power and Associates 147.3
comScore Inc. 142.0
Maritz Research 140.9
dunnhumbyUSA LLC 110.7
3.42 Shown in the left column are the top 10 companies receiving the largest dollar volume of contract awards from the U.S. Department of Defense in 2012 according to Barr Group Aerospace. Use this population data to compute a mean and a standard deviation for these top 10 companies.
Company Amount of Contracts ($billions)
Lockheed Martin 29.3
Boeing 27.8
Raytheon 14.2
General Dynamics 13.6
Northrop Grumman 8.7
United Technologies 7.9
L-3 Communications 6.4
BAE Systems 5.9
SAIC 5.1
Huntington Ingalls Industries 4.0
Supplementary Problems 91
3.43 Shown here are the top twelve biggest oil and gas companies in the world according to Forbes. Use these as population data and answer the questions that follow.
Company Production Volume (million barrels per day)
Saudi Aramco (Saudi Arabia) 12.5
Gazprom (Russia) 9.7
NIOC (Iran) 6.4
ExxonMobil Corp. (USA) 5.3
PetroChina (China) 4.4
BP (UK) 4.1
Royal Dutch/Shell (NL/UK) 3.9
Pemex (Mexico) 3.6
Chevron Corp. (USA) 3.5
KPC (Kuwait) 3.2
ADNOC (UAE) 2.9
Sonatrach (Algeria) 2.7
a. What are the values of the mean and the median? Compare the answers and state which you prefer as a measure of location for these data and why.
b. What are the values of the range and interquartile range? How do they differ?
c. What are the values of variance and standard deviation for these data?
d. What is the z score for ADNOC? What is the z score for ExxonMobil? Interpret these z scores.
3.44 The U.S. Department of the Interior releases figures on mineral production. Following are the 15 leading states in nonfuel mineral production in the United States for 2010.
State Value ($billions)
Nevada 7.55
Arizona 6.70
Utah 4.42
Minnesota 3.86
Alaska 3.24
California 2.71
Texas 2.56
Missouri 2.14
Florida 2.08
Michigan 1.96
Colorado 1.93
Wyoming 1.86
Pennsylvania 1.53
Georgia 1.50
New York 1.29
a. Calculate the mean and median.
b. Calculate the range, interquartile range, mean absolute deviation, sample variance, and sample standard deviation.
Answer

1. Using Excel, SPSS or any other software package of your choice, analyze the data in problem 3.40 on page 91. Have the computer generate descriptive statistics and a histogram on the age. The descriptive statistics should at minimum include mean, median, mode, range, variance, standard deviation, measure of skewness, kurtosis, and the coefficient of variation.

1a. What are the values for mean, mode, median, standard deviation, and coefficient of variation?
Mean = 38.075
Mode = 38
Median = 34.5
Standard Deviation = 14.0756
Coefficient of Variation = 0.36968

1b. Based on the values for mean, mode, and median, what statement can be made related to skewn

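Because the mean (38.075) exceeds the median (34.5), the distribution appears positively (right) skewed. The positive skewness coefficient (1.0956) in the output agrees.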

1c. What is the interpretation of the value for coefficient of variation? (i.e. explain what it does it mean
COV=100%)

The standard deviation is 36.97% of the mean. A COV of 100% would mean that the standard deviation equals the mean.

1d. Is the data for problem 3.40 mesokurtic, leptokurtic, or platykurtic? Justify your answer using desc

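The kurtosis coefficient in the output is 1.1211. Excel reports excess kurtosis, for which a normal (mesokurtic) distribution scores 0, so a positive value indicates that the data are leptokurtic (more peaked, with heavier tails than a normal distribution).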

PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDE


GENERATED TABULAR DATA IN THIS AREA.

Data

Mean 38.0750
Standard Error 2.2255
Median 34.5
Mode 38
Standard Deviation 14.0756
Sample Variance 198.1224
Kurtosis 1.1211
Skewness 1.0956
Range 62
Minimum 19
Maximum 81
Sum 1523
Count 40
Coefficient of Variation = 0.36968
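
As a cross-check on the table above, here is a minimal Python sketch that recomputes the descriptive statistics from the 40 ages of problem 3.40. It assumes the ages as transcribed from the textbook page and assumes Excel-style sample skewness and excess kurtosis formulas; the worksheet does not state which formulas the software used, and all variable names are illustrative.

```python
import statistics as st
from collections import Counter

# Ages from problem 3.40 (40 households), as transcribed from the textbook page
ages = [42, 29, 31, 38, 55, 27, 28,
        33, 49, 70, 25, 21, 38, 47,
        63, 22, 38, 52, 50, 41, 19,
        22, 29, 81, 52, 26, 35, 38,
        29, 31, 48, 26, 33, 42, 58,
        40, 32, 24, 34, 25]

n = len(ages)
mean = st.mean(ages)
median = st.median(ages)
mode = st.mode(ages)
sd = st.stdev(ages)   # sample standard deviation (n - 1 divisor)
cv = sd / mean        # coefficient of variation

# Assumption: Excel-style SKEW and KURT (sample skewness, excess kurtosis)
z = [(x - mean) / sd for x in ages]
skew = n / ((n - 1) * (n - 2)) * sum(v ** 3 for v in z)
kurt = (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum(v ** 4 for v in z)
        - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))

# Text histogram with 10-year bins (the bin width is an arbitrary choice)
bins = Counter(x // 10 * 10 for x in ages)
for start in sorted(bins):
    print(f"{start}-{start + 9}: {'*' * bins[start]}")

print(f"mean={mean} median={median} mode={mode} sd={sd:.4f} cv={cv:.5f}")
print(f"skewness={skew:.4f} kurtosis={kurt:.4f}")
```

A positive skewness together with a positive excess kurtosis is what supports the right-skewed and leptokurtic readings in 1b and 1d.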

2. Using a one population parameter test of hypothesis, test the hypothesis described for the following
specializes in custom-designed landscaping for residential areas. The estimated labor cost associated
proposal is based on the number of plantings of trees, shrubs, and so on to be used for the project. F
managers use 2 hours of labor time for planting of a medium-size tree. Actual times from a sample of
month follow (times in hours): 1.9 1.7 2.8 2.4 2.6 2.5 2.8 3.2 1.6 2.5

2a. Conduct the appropriate directional test of hypothesis using a significance level of 5%, to test whe
exceeds 2 hours. Make sure your computer report has the answers to the following steps:

State the null and alternative hypothesis.


H0: µ = 2 and Ha: µ > 2
What is (are) the critical value(s) of sample test statistics based on problem specified significance leve
The critical value is 1.8331

What is the calculated sample statistic based on sample data? (the computer generated output should
The sample statistic = 2.4495

Reject or don't reject null hypothesis?


Reject the null hypothesis
What statement should be made about the mean planting times based on this test of hypothesis?
Because we rejected the null hypothesis, the mean planting time is significantly greater than 2 hours at the 5% significance level.

2b. Using the confidence interval approach, calculate a 95% confidence interval for the data in proble

(2.0306, 2.7694)
PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDE
GENERATED TABULAR DATA IN THIS AREA.

t-Test: Two-Sample Assuming Unequal Variances

Time
Mean 2.4
Variance 0.2667
Observations 10
Hypothesized Mean Difference 0
df 9
t Stat 2.4495
P(T<=t) one-tail 0.0184
t Critical one-tail 1.8331
P(T<=t) two-tail 0.0368
t Critical two-tail 2.2622

Lower value of CI = 2.0306


Upper value of CI = 2.7694
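
The one-sample t test and confidence interval above can be reproduced by hand. The following Python sketch assumes the ten planting times listed in the problem and takes the critical values from the output table rather than computing them; the variable names are illustrative.

```python
import math
import statistics as st

# Planting times in hours, from the problem statement
times = [1.9, 1.7, 2.8, 2.4, 2.6, 2.5, 2.8, 3.2, 1.6, 2.5]
mu0 = 2.0   # hypothesized mean under H0

n = len(times)
xbar = st.mean(times)
se = st.stdev(times) / math.sqrt(n)   # standard error of the mean
t_stat = (xbar - mu0) / se

# Critical values copied from the output above (df = 9)
t_crit_one_tail = 1.8331   # alpha = 0.05, one tail
t_crit_two_tail = 2.2622   # alpha = 0.05, two tails

reject_h0 = t_stat > t_crit_one_tail
ci = (xbar - t_crit_two_tail * se, xbar + t_crit_two_tail * se)

print(f"t = {t_stat:.4f}, reject H0: {reject_h0}")
print(f"95% CI = ({ci[0]:.4f}, {ci[1]:.4f})")
```

The interval lies entirely above 2 hours, which is consistent with rejecting the null hypothesis.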

3. Use the data on page 367, problem #10.8, of the course text book to determine whether there is a difference in cost of gallon of milk in Seattle and Atlanta. Use significance level of 5%.

State the null and alternative hypothesis.
H0: µ(Seattle) - µ(Atlanta) = 0 and Ha: µ(Seattle) - µ(Atlanta) ≠ 0
What is (are) the critical value(s) of sample test statistics based on problem specified significance level?
The critical values are -2.0262 and 2.0262
What is the calculated sample statistic based on sample data?
Test Statistic = 3.8022
Reject or don't reject null hypothesis?
Reject the null hypothesis
What statement should be made about the difference in cost of gallon of milk in Seattle and Atlanta?
As we rejected the null hypothesis so the difference is significant at 5% significance level.

Seattle Atlanta
2.55 2.25
2.67 2.30
2.50 2.49
2.61 2.41
2.43 2.39
2.36 2.26
2.50 2.40
2.36 2.33
2.54 2.29
2.54 2.48
2.80 2.59
2.61 2.38
2.56 2.39
2.64 2.40
2.43 2.23
2.43 2.29
2.38 2.53
2.49 2.45
2.57
2.71
2.27

PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDED FUNCTIONS AND EXCELS GENERATED TABULAR DATA IN THIS AREA.

t-Test: Two-Sample Assuming Unequal Variances

Seattle Atlanta
Mean 2.5214 2.3811
Variance 0.0166 0.0103
Observations 21 18
Hypothesized Mean Difference 0
df 37
t Stat 3.8022
P(T<=t) one-tail 0.0003
t Critical one-tail 1.6871
P(T<=t) two-tail 0.0005
t Critical two-tail 2.0262
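
The unequal-variances (Welch) t statistic and its degrees of freedom can be recomputed from the raw prices. This Python sketch assumes the 21 Seattle and 18 Atlanta prices as read from the data columns of this answer, and takes the two-tail critical value from the output rather than computing it; the variable names are illustrative.

```python
import math
import statistics as st

# Price of a gallon of milk, as read from the Seattle and Atlanta data columns
seattle = [2.55, 2.67, 2.50, 2.61, 2.43, 2.36, 2.50, 2.36, 2.54, 2.54, 2.80,
           2.61, 2.56, 2.64, 2.43, 2.43, 2.38, 2.49, 2.57, 2.71, 2.27]
atlanta = [2.25, 2.30, 2.49, 2.41, 2.39, 2.26, 2.40, 2.33, 2.29, 2.48, 2.59,
           2.38, 2.39, 2.40, 2.23, 2.29, 2.53, 2.45]

n1, n2 = len(seattle), len(atlanta)
v1 = st.variance(seattle) / n1   # squared standard error, Seattle
v2 = st.variance(atlanta) / n2   # squared standard error, Atlanta

t_stat = (st.mean(seattle) - st.mean(atlanta)) / math.sqrt(v1 + v2)

# Welch-Satterthwaite approximation for the degrees of freedom
df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))

t_crit_two_tail = 2.0262   # copied from the output, alpha = 0.05
reject_h0 = abs(t_stat) > t_crit_two_tail

print(f"t = {t_stat:.4f}, df = {df:.2f} (rounds to {round(df)}), reject H0: {reject_h0}")
```

The approximate degrees of freedom are not an integer; the output reports them rounded to 37.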

4. Using your computer software program and the data on pg. 407, problem 10.63, answer the follow

Construct a 90% confidence interval to estimate the average difference between the price of name-br

(3.5527, 5.4473)

Does it make sense to treat these variables (groups) as dependent or independent when conducting t
What are the trade-offs?
Because the same stores are used for both samples, the data are dependent. If we use the independen
test won't consider the correlation or dependency between the variables and thus won't give the most

Conduct the appropriate test of hypothesis to determine whether there is a difference between the pri
soup, using significance level of 10%. Use the five step process and analyze your results in relation to

State the null and alternative hypothesis.

H0: µ(d) = 0 and Ha: µ(d) ≠ 0


What is (are) the critical value(s) of sample test statistics based on problem specified significance leve
The critical values are -1.8946 and 1.8946.

What is the calculated sample statistic based on sample data? (the computer generated output should
The test statistic is 9
Reject or don't reject null hypothesis?
Reject the null hypothesis.
What statement should be made about the difference between price of name-brand soup and store-b
As we rejected the null hypothesis so there is a significant difference between price of name-brand so

PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDE


GENERATED TABULAR DATA IN THIS AREA.

t-Test: Paired Two Sample for Means

Name Brand Store Brand


Mean 55 50.5
Variance 11.1429 7.1429
Observations 8 8
Pearson Correlation 0.9127
Hypothesized Mean Difference 0
df 7
t Stat 9
P(T<=t) one-tail 0.0000
t Critical one-tail 1.4149
P(T<=t) two-tail 0.0000
t Critical two-tail 1.8946
Lower value of CI = 3.5527
Upper Value of CI = 5.4473
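
To illustrate why the dependent versus independent choice matters, the sketch below rebuilds the paired t statistic and the 90% confidence interval from the summary figures in the output (means, variances, Pearson correlation), then shows what the statistic would be if the pairing were ignored. It works from rounded summary values only, since the raw soup prices are not reproduced here, so the results agree with the output only to rounding. Variable names are illustrative.

```python
import math

# Summary values copied from the paired t-test output
n = 8
mean_name, mean_store = 55.0, 50.5
var_name, var_store = 11.1429, 7.1429
r = 0.9127                  # Pearson correlation between the paired prices
t_crit_two_tail = 1.8946    # df = 7, alpha = 0.10

d_bar = mean_name - mean_store

# Variance of the paired differences: var(X) + var(Y) - 2 * r * sd(X) * sd(Y)
var_d = var_name + var_store - 2 * r * math.sqrt(var_name * var_store)
se_paired = math.sqrt(var_d / n)
t_paired = d_bar / se_paired
ci = (d_bar - t_crit_two_tail * se_paired, d_bar + t_crit_two_tail * se_paired)

# Ignoring the pairing drops the correlation term and inflates the standard error
se_indep = math.sqrt(var_name / n + var_store / n)
t_indep = d_bar / se_indep

print(f"paired:      t = {t_paired:.3f}, 90% CI = ({ci[0]:.4f}, {ci[1]:.4f})")
print(f"independent: t = {t_indep:.3f}")
```

With a correlation this high, the paired standard error is roughly a third of the independent one, so the paired test is far more sensitive. The price of pairing is fewer degrees of freedom (7 instead of up to 14).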

5. Using the data in chapter 11, problem #11.13, pg. 428, have the computer do a one-way analysis o
determine whether there is a significant difference in the evaluations according to manager level.

5a. What are the appropriate null and alternative hypothesis?


H0: µ(High) = µ(Mid) = µ(Low)
Ha: At least one mean is significantly different

5b. What is the test statistic that should be used for this type of test of hypothesis and its actual value
computer output?
The F statistic from the one-way ANOVA; its value is F = 11.7557

5c. What is the critical value of the test statistic?

3.6823

5d. What statement should be made about the null hypothesis based on your answers to the previous
Reject the null hypothesis
5e. What is the conclusion specific to this problem based on the statement made about the null hypot
previous question?
Because we rejected the null hypothesis, there is a significant difference in the evaluations according to manager level.

PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDED FUNCTIONS AN
GENERATED TABULAR DATA IN THIS AREA.

Anova: Single Factor

SUMMARY
Groups Count Sum Average Variance
High Level 5 38 7.6000 0.8
Midlevel 7 62 8.8571 0.8
Low Level 6 35 5.8333 2.1

ANOVA
Source of Variation SS df MS F
Between Groups 29.6095 2 14.8048 11.7
Within Groups 18.8905 15 1.2594

Total 48.5 17
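
The F statistic can be rebuilt from the summary table. The sketch below computes the between-groups sum of squares from the group counts and sums, and takes the within-groups sum of squares and the critical value directly from this answer, because the raw evaluations are not reproduced here. Variable names are illustrative.

```python
# Group counts and sums copied from the SUMMARY table
counts = {"High Level": 5, "Midlevel": 7, "Low Level": 6}
sums = {"High Level": 38, "Midlevel": 62, "Low Level": 35}

ss_within = 18.8905   # copied from the ANOVA table
f_crit = 3.6823       # critical value reported in 5c (alpha = 0.05, df = 2 and 15)

n_total = sum(counts.values())
grand_mean = sum(sums.values()) / n_total

# Between-groups SS: group size times squared distance of group mean from grand mean
ss_between = sum(counts[g] * (sums[g] / counts[g] - grand_mean) ** 2 for g in counts)

df_between = len(counts) - 1
df_within = n_total - len(counts)

ms_between = ss_between / df_between
ms_within = ss_within / df_within
f_stat = ms_between / ms_within
reject_h0 = f_stat > f_crit

print(f"SSB = {ss_between:.4f}, MSB = {ms_between:.4f}, MSW = {ms_within:.4f}")
print(f"F = {f_stat:.4f}, reject H0: {reject_h0}")
```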

6. If you obtain a significant F statistic for question 5, run a Post-Hoc Comparison’s test on the 3 mean
question. You can select whichever Post-Hoc Comparison procedure you desire. If none exists on yo
individual t-tests on all pairs of means. (This problem relates to the concept in chapter 11.3)

6a. Which Post-Hoc Comparison procedure did you use for this problem?

Tukey's HSD post hoc comparison.

6b. Which pairs of means are significantly different, if any?

The mean for Low Level is significantly different from High level and Mid level means.

6c. What is the conclusion specific to this problem based on the answer to "Which pairs of means are
different, if any?"?

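The mean evaluation for low-level managers is significantly lower than the means for high-level and mid-level managers. The high-level and mid-level means do not differ significantly from each other (Sig. = .169).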

PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDED FUNCTIONS AN
GENERATED TABULAR DATA IN THIS AREA.

Post Hoc Tests


Multiple Comparisons
Dependent Variable: Data
Tukey HSD

(I) Factor (J) Factor Mean Difference (I-J) Std. Error Sig.


High Level Mid Level -1.25714 .65710 .169
Low Level 1.76667* .67953 .050
Mid Level High Level 1.25714 .65710 .169
Low Level 3.02381* .62434 .001
Low Level High Level -1.76667* .67953 .050
Mid Level -3.02381* .62434 .001
*. The mean difference is significant at the 0.05 level.
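
The mean differences and standard errors in the Tukey table follow from the ANOVA summary in question 5. This sketch recomputes them under the assumption that each standard error is sqrt(MSW * (1/n_i + 1/n_j)), which matches the reported values. It does not reproduce the Sig. column, which requires the studentized range distribution. Variable names are illustrative.

```python
import math
from itertools import combinations

# Group means and sizes from the ANOVA SUMMARY table (sum / count)
means = {"High Level": 38 / 5, "Mid Level": 62 / 7, "Low Level": 35 / 6}
sizes = {"High Level": 5, "Mid Level": 7, "Low Level": 6}
ms_within = 18.8905 / 15   # within-groups mean square from the ANOVA table

pairs = {}
for a, b in combinations(means, 2):
    diff = means[a] - means[b]
    # Standard error of the difference between two group means
    se = math.sqrt(ms_within * (1 / sizes[a] + 1 / sizes[b]))
    pairs[(a, b)] = (diff, se)
    print(f"{a} vs {b}: difference = {diff:.5f}, std. error = {se:.5f}")
```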
