1. Using Excel, SPSS or any other software package of your choice, analyze the data in problem 3.40 on page 91. Have the computer generate descriptive statistics and a histogram on the age. The descriptiv
1. Using Excel, SPSS or any other software package of your choice, analyze the data in problem 3.40 on page 91. Have the computer generate descriptive statistics and a histogram on the age. The descriptiv
40 on page 91
statistics and a histogram on the age. The descriptive statistics should at minimum include mean, median, mode,
of skewness, kurtosis, and the coefficient of variation.
1a. What are the values for mean, mode, median, standard deviation, and coefficient of variation?
Mean =
Mode =
Median =
Standard Deviation =
Coefficient of Variation =
1b. Based on the values for mean, mode, and median, what statement can be made related to skewness of the dist
Place your answer for question 1b in this cell
1c. What is the interpretation of the value for coefficient of variation? (i.e. explain what it does it mean if COV=1
Place your answer for question 1c in this cell
1d. Is the data for problem 3.40 mesokurtic, leptokurtic, or platylurtic? Justify your answer using descriptive sta
Place your answer for question 1d in this cell
2. Using a one population parameter test of hypothesis, test the hypothesis described for the following problem:
designed landscaping for residential areas. The estimated labor cost associated with a particular landscaping prop
plantings of trees, shrubs, and so on to be used for the project. For cost estimating purposes, managers use 2 hou
medium-size tree. Actual times from a sample of 10 plantings during the past month follow (times in hours): 1.9
2a. Conduct the appropriate directional test of hypothesis using a significance level of 5%, to test whether the me
Make sure your computer report has the answers to the following steps:
3. Use the data on page 367, problem #10.8, of the course text book to determine whether there is a difference in
Atlanta. Use significance level of 5%.
4. Using your computer software program and the data on pg. 407, problem
10.63, answer the followings:
5. Using the data in chapter 11, problem #11.13, pg. 428, have the computer do a one-way analysis of variance to
significant difference in the evaluations according to manager level.
6a. Which Post-Hoc Comparison procedure did you use for this problem?
Place the name of the Post-Hoc Comparison procedure used in this cell
6b. Which pairs of means are significantly different, if any?
Place your answer for question 2b in this cell
6c. What is the conclusion specific to this problem based on the answer to "Which pairs of means are significant
Place your answer for question 2c in this cell
===============
/\‘
3.40 The 2010 U.S. Census also asked for each person’s age.
SuppOse that a sample of 40 households taken: from'the
census data showed the'age of the first person recorded
on the census form to be asfollows.
42 29 31 38 55 27 28
33 a49 70 '25 21 38‘ 47
63 22 38 52 50 41 19
22 29 81 52 26 35 38
29 31 48 26 33 42 58
40 32 ’ 24 34 25
Compute P10, P80, Q1, Q, the interquartile range,and
the range for these data. '
L3.41 Shown below are the top 15 market research firms in
the United States in 2010 according to Inside Research
and the 2011 Honomichl report. Compute the mean,
median, P30, P60, _P90, Q1, Q3, range, and the interquartile
range on these data.
Company Sales ($millions)
The Nielsen Co. $2,407.0 IV
Kantar 914.7
IMS Health Inc. 801.0
SymphonyIRI Group 457.0
Westat Inc. 455.3
Arbitron Inc. 3904
Ipsos 379.6
GfK USA 290.9
Synovate 235.8
The NPD Group Inc. 173.7
ICF‘InternationalInc. ’ 153.2 '
JD. Power and Associates 147.3
comScore Inc.- 142.0
Maritz Research 140.9
dunnhumbyUSA LLC 1107
3.42 Shown in the left column are the top 10 companies
receiving the largest dollar volume of contract awards
from the U.S. Department of Defense in 2012 according
to Barr Group Aerospace. Use this population data to
compute a mean and a standard deviation for these top
'10 companies.
« Amount of Contracts
Company ($billions)
LockheedMartin 29.3
Boeing 27.8
Raytheon 14.2
General Dynamics 13.6
Northrop Grumman 8.7
UnitedTechnologies 7.9
L‐3 Communications ' 6.4
BAE Systems » 5.9
SAIC 5.1
Huntington Ingalls Industries 4.0
‘ . Supplementary Problems ‘91
3.438hown here are the top twelve .biggestoil and gas
companies in the world according to Forbes. Use
these as population data and answer the questions
that follow. ' '
ProductionVolume .
0 Company (millionbarrels per day) .
SaudiAramco (Saudi Arabia) I 12.5
Gazprom (Russia) , 9.7
NIOC (Iran) A 6.4
ExxonMobil Corp. (USA) * 5.3
PetroChina (China) 4.4
BP(LH<) _4J
Royal Dutch/Shell (NL/UK) 3.9
Pemex (Mexico) . ' 3.6
Chevron Corp. (USA) 3.5
KPC (Kuwait) ' 3.2
ADN0C(UAm 29
Sonatrach (Algeria) 1 f 2.7
a. What are the values of the mean and the median?
Compare the answers and state which you prefer asa
measure of location for these data/and why.
1 ) .What are the values of the range and interquartile
range? Howdo they differ? ‘ ‘ ' ‘
What are the values of Variance and standard deviation
for these data? . ‘
‘ d. ‘What is the z score forADNOC?What is the z sCore fo
ExxonMobil? Interpret these 2Scores. . ‘
C
3.44 The U.S.Department of the Interiorreleases figures on
mineral production. Following are the 15 leading
states in nonfuel mineral productiOn in the United
States for 2010.
‘ State Value ($billions)
Nevada . L 7.55 3
Arizona 3 p’ 6.70
Utah 4.42
Minnesota / v3.86
Alaska 3.24
California 2.71
Texas 2.56
Missouri > I 2.14
Florida 2.08
Michigan , t 1.96
Colorado ‘ 1.93
Wyoming 1.86
Pennsylvania 1.53
Georgia 1.50
NewYork: ‘ ' ‘ ' 1.29
a. Calcu1ate the mean and median. .
b. Calculate the range, interquartile range, mean
absolute deviation, sample variance, and sample standard
deviation. ‘
Answer
1. Using Excel, SPSS or any other software package of your choice, analyze the data in problem 3.4
computer generate descriptive statistics and a histogram on the age. The descriptive statistics should
median, mode, range, variance, standard deviation, measure of skewness, kurtosis, and the coefficie
1a. What are the values for mean, mode, median, standard deviation, and coefficient of variation?
Mean = 38.075
Mode = 38
Median = 34.5
Standard Deviation = 14.0756
Coefficient of Variation = 0.36968
1b. Based on the values for mean, mode, and median, what statement can be made related to skewn
1c. What is the interpretation of the value for coefficient of variation? (i.e. explain what it does it mean
COV=100%)
1d. Is the data for problem 3.40 mesokurtic, leptokurtic, or platylurtic? Justify your answer using desc
Data
Mean 38.0750
Standard Error 2.2255
Median 34.5
Mode 38
Standard Deviation 14.0756
Sample Variance 198.1224
Kurtosis 1.1211
Skewness 1.0956
Range 62
Minimum 19
Maximum 81
Sum 1523
Count 40
Coefficient of Variation = 0.36968
2. Using a one population parameter test of hypothesis, test the hypothesis described for the following
specializes in custom-designed landscaping for residential areas. The estimated labor cost associated
proposal is based on the number of plantings of trees, shrubs, and so on to be used for the project. F
managers use 2 hours of labor time for planting of a medium-size tree. Actual times from a sample of
month follow (times in hours): 1.9 1.7 2.8 2.4 2.6 2.5 2.8 3.2 1.6 2.5
2a. Conduct the appropriate directional test of hypothesis using a significance level of 5%, to test whe
exceeds 2 hours. Make sure your computer report has the answers to the following steps:
What is the calculated sample statistic based on sample data? (the computer generated output should
The sample statistic = 2.4495
2b. Using the confidence interval approach, calculate a 95% confidence interval for the data in proble
(2.0306, 2.7694)
PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDE
GENERATED TABULAR DATA IN THIS AREA.
Time
Mean 2.4
Variance 0.2667
Observations 10
Hypothesized Mean Difference 0
df 9
t Stat 2.4495
P(T<=t) one-tail 0.0184
t Critical one-tail 1.8331
P(T<=t) two-tail 0.0368
t Critical two-tail 2.2622
4. Using your computer software program and the data on pg. 407, problem 10.63, answer the follow
Construct a 90% confidence interval to estimate the average difference between the price of name-br
(3.5527, 5.4473)
Does it make sense to treat these variables (groups) as dependent or independent when conducting t
What are the trade-offs?
Here as the stores are same for both the samples so the data is dependent. If we use the independen
test wont consider the correlation or dependency between the variables and thus wont give the most
Conduct the appropriate test of hypothesis to determine whether there is a difference between the pri
soup, using significance level of 10%. Use the five step process and analyze your results in relation to
What is the calculated sample statistic based on sample data? (the computer generated output should
The test statistic is 9
Reject or don't reject null hypothesis?
Reject the null hypothesis.
What statement should be made about the difference between price of name-brand soup and store-b
As we rejected the null hypothesis so there is a significant difference between price of name-brand so
5. Using the data in chapter 11, problem #11.13, pg. 428, have the computer do a one-way analysis o
determine whether there is a significant difference in the evaluations according to manager level.
5b. What is the test statistic that should be used for this type of test of hypothesis and its actual value
computer output?
11.7557
3.6823
5d. What statement should be made about the null hypothesis based on your answers to the previous
Reject the null hypothesis in this cell
5e. What is the conclusion specific to this problem based on the statement made about the null hypot
previous question?
As we rejected the null hypothesis so there is a significant difference in the evaluations accordingto m
PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDED FUNCTIONS AN
GENERATED TABULAR DATA IN THIS AREA.
SUMMARY
Groups Count Sum Average Variance
High Level 5 38 7.6000 0.8
Midlevel 7 62 8.8571 0.8
Low Level 6 35 5.8333 2.1
ANOVA
Source of Variation SS df MS F
Between Groups 29.6095 2 14.8048 11.7
Within Groups 18.8905 15 1.2594
Total 48.5 17
6. If you obtain a significant F statistic for question 5, run a Post-Hoc Comparison’s test on the 3 mean
question. You can select whichever Post-Hoc Comparison procedure you desire. If none exists on yo
individual t-tests on all pairs of means. (This problem relates to the concept in chapter 11.3)
6a. Which Post-Hoc Comparison procedure did you use for this problem?
The mean for Low Level is significantly different from High level and Mid level means.
6c. What is the conclusion specific to this problem based on the answer to "Which pairs of means are
different, if any?"?
The conclusion is the mean for low level is significantly different than the other two.
PLACE (COPY/PASTE) YOUR COMPUTER GENERATED OUTPUT SUCH AS EXCEL'S EMBEDDED FUNCTIONS AN
GENERATED TABULAR DATA IN THIS AREA.