0% found this document useful (0 votes)
16 views

regression_predict_PART_1of2 (1)

Copyright
© © All Rights Reserved
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views

regression_predict_PART_1of2 (1)

Copyright
© © All Rights Reserved
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
You are on page 1/ 26

Outside Magazine tested 10 different models of day hikers and backpacking boo

show the upper support and price for each model tested. Upper support was me
1 to 5, with a rating of 1 denoting average upper support and a rating of 5 denotin

a. Use these data to develop and estimated regression equation to estimate the
boot given the upper support rating.
b. Examine the output and determine whether the upper support and price are (s
c. Would you feel comfortable using the estimated regression equation develope
for a day hiker or backpacking boot given the upper support rating?
d. Estimate the price of a day hiker with an upper support rating of 4.
e. Create a scatter plot and superimpose the equation and RSQ obtained via sca

Manufacturer and Model Support


Salomon Super Raid 2
Merrell Chameleon Prime 3
Teva Challenger 3
Vasque Fusion GTX 3
Boreal Maigmo 3
L.L. Bean GTX Super Guide 5
Lowa Kibo 5
Asolo AFX 520 GTX 4
Raichle Mt. Trail GTX 4
Scarpa Delta SL M3 5
Price = a + bSupport
Price = 49.9 + 31.2*Support
174.7

coeff = [49.9, 31.2}


newdata = [1,4]

Chart Title
250

200 f(x) = 31.2079207920792 x + 49.9306930693069


R² = 0.798151361012574

150

100

50

0
1.5 2 2.5 3 3.5 4 4.5 5 5.5
models of day hikers and backpacking boots. The following data
each model tested. Upper support was measured using a rating from
age upper support and a rating of 5 denoting excellent upper support.

mated regression equation to estimate the price of a day hiker and backpacking

whether the upper support and price are (statistically) related.


he estimated regression equation developed in part (a) to estimate the price
ven the upper support rating?
th an upper support rating of 4.
ose the equation and RSQ obtained via scatter charts excel funtionality.

Price
120
125
130
135
150
189
190
195
200
220
Title

.9306930693069

Newdata
1
4

174.8
.5 4 4.5 5 5.5
r and backpacking

mate the price

untionality.

Y = a + bX
Price = a + b*Support
SUMMARY OUTPUT

Regression Statistics
Multiple R 0.8933932
R Square 0.7981514
Adjusted R Square 0.7729203
Standard Error 17.633999
Observations 10

ANOVA
df SS MS F Significance F
Regression 1 9836.74 9836.74 31.6337 0.0005
Residual 8 2487.66 310.958
Total 9 12324.4

CoefficientsStandard Error t Stat P-value Lower 95% Upper 95%


Intercept 49.930693 21.274 2.34703 0.0469 0.8728 98.9886
Support 31.20792079 5.5486857 5.6243807 0.000496 18.412629 44.003213
Lower 95.0%
Upper 95.0%
0.8728 98.9886
18.412629 44.003213
Below we have data on the speed for plain text printing in page per minute and th

a. Develop the estimated regression equation with speed as the independent var
b. Compute R^2. What percentage of the variation in price can be explained by
c. Is Speed a (statistical) determinant of Price?
Predict the Price of a Corporate Printer boasting a speed of 15

Name Type Speed Price


Minolta-QMS PagePro 1250W Small Office 12 199
Brother HL-1850 Small Office 10 499
Lexmark E320 Small Office 12.2 299
Minolta-QMS PagePro 1250E Small Office 10.3 299
HP Laserjet 1200 Small Office 11.7 399
Xerox Phaser 4400/N Corporate 17.8 1850
Brother HL-2460N Corporate 16.1 1000
IBM Infoprint 1120n Corporate 11.8 1387
Lexmark W812 Corporate 19.8 2089
Oki Data B8300n Corporate 28.2 2200
Price = a + bType + cSpeed
Price = 618.3 -931.2Type + 57.9*Speed
1486.8
age per minute and the price of each printer.

s the independent variable.


can be explained by the printing speed?

Price = a + b*Type + c*Speed


SUMMARY OUTPUT

Regression Statistics
Multiple 0.95019
R Square0.90287
Adjusted 0.87511
Standard281.976
Observatio 10

ANOVA
df SS MS F Significance F
Regressi 2 5E+06 3E+06 32.5325 0.0002856321
Residual 7 556572 79510
Total 9 5729910.9

Coefficients
Standard Error t Stat P-value Lower 95% Upper 95%Lower 95.0%
1 Intercept 618.373 448.455 1.3789 0.21037 -442.0549332 1678.8 -442.05
0 Type -931.24 247.93 -3.756 0.00711 -1517.499101 -344.98 -1517.5
15 Speed 57.995012 22.964807 2.5253864 0.0394983 3.6918718815693 112.29815 3.6918719

1488.3
Upper 95.0%
1678.8
-344.98
112.29815
Bergans of Norway has been making outdoor gear since 1908. The foll
rating (Deg F) and the price (USD) for 11 models of sleelping bags.

a. Develop a scatter diagram for these data with temperature rating as t


b. What does the scatter diagram tell you about the relationship betwee
c. Use Least Squares to estimate the regression model.
d. Using the estimated model, predict the price of a sleeping bag with a
e. Create a scatterplot, fit a linear model to the scatter, and place the RS

Model Rating Price


Ranger 3-Seasons 12 319
Ranger Spring 24 289
Ranger Winter 3 389
Rondane 3-Seasons 13 239
Rondane Summer 38 149
Rondane Winter 4 289
Senja Ice 5 359
Senja Snow 15 259
Senja Zero 25 229
Super Light 45 129
Tight & Light 25 199
Price
450

400

350
f(x) = − 5.27719665271967 x + 359.266736401674
R² = 0.804334051751381
300

250

200

150

100

50

0
0 5 10 15 20 25 30 35 40 45 50
since 1908. The following data show the temperature
sleelping bags.

mperature rating as the independent variable.


relationship between temperature rating and price?

sleeping bag with a temperature rating of 20 Deg F.


ter, and place the RSQ on the graph.

y = a + bx
Price = a + b*Rating
SUMMARY OUTPUT

Regression Statistics
Multiple R 0.8968467
R Square 0.8043341
Adjusted R Square 0.7825934
Standard Error 37.937208
Observations 11

ANOVA
df SS MS F
Regression 1 53246.914 53246.914 36.996762
Residual 9 12953.086 1439.2318
Total 10 66200

Coefficients
Standard Error t Stat P-value
1 Intercept 359.26674 20.064323 17.90575 2.403E-08
20 Rating -5.277197 0.8676038 -6.082496 0.0001831

prediction 253.723
45 50

RESIDUAL OUTPUT

Observation Predicted PriceResiduals


1 295.94038 23.059623
2 232.61402 56.385983
3 343.43515 45.564854
4 290.66318 -51.66318
5 158.73326 -9.733264
6 338.15795 -49.15795
7 332.88075 26.119247
8 280.10879 -21.10879
9 227.33682 1.6631799
10 121.79289 7.207113
11 227.33682 -28.33682
Significance F
0.0001831

Lower 95% Upper 95%Lower 95.0%


Upper 95.0%
313.87809 404.65539 313.87809 404.65539
-7.239853 -3.314541 -7.239853 -3.314541
The NFL rates prospects by position on a scale that ranges from 5 to 9. The ratin
are interpreted as follows: 8-9 should start the first year; 7-9 should start; 6-6.9 w
the team as backup; and 5-5.9 can make the club and contribute.
The data below shows position, weight, time in seconds to run the 40 yards, and r
25 NFL propsects.

a. Develop a dummy variable that will account for player's position.


b. Develop an estimated regression equation to show how rating is related to posi
weight, and time to run 40 yards.
c. Test whether the estimated regression equation
developed above indicates a significant relationship between the independent var
and the dependent variable.
d. Does the model provide a good fit?
e. Is position a significant factor in a player's rating?
f. Suppose a new offensive tackle prospect who weighs 300 pounds ran the 40 yar
in 5.1 seconds. Use the model to estimate the rating for this player.

Name Position Weight Time Rating


Cosey Coleman Guard 322 5.38 7.4
Travis Claridge Guard 303 5.18 7
Kaulana Noa Guard 317 5.34 6.8
Leander Jordan Guard 330 5.46 6.7
Chad Clifton Guard 334 5.18 6.3
Manula Savea Guard 308 5.32 6.1
Ryan JohanningmeGuard 310 5.28 6
Mark Tauscher Guard 318 5.37 6
Blaine Saipaia Guard 321 5.25 6
Richard Mercier Guard 295 5.34 5.8
Damion McIntoshGuard 328 5.31 5.3
Jeno James Guard 320 5.64 5
Al Jackson Guard 304 5.2 5
Chris Samuels Offensive tack 325 4.95 8.5
Stockar McDouglOffensive tack 361 5.5 8
Chris McIngosh Offensive tack 315 5.39 7.8
Adrian Klemm Offensive tack 307 4.98 7.6
Todd Wade Offensive tack 326 5.2 7.3
Marvel Smith Offensive tack 320 5.36 7.1
Michael ThompsoOffensive tack 287 5.05 6.8
Bobby Wiliams Offensive tack 332 5.26 6.8
Darnell Alford Offensive tack 334 5.55 6.4
Terrance BeadlesOffensive tack 312 5.15 6.3
Tutan Reyes Offensive tack 299 5.35 6.1
Greg Robinson-RaOffensive tack 333 5.59 6
om 5 to 9. The ratings
hould start; 6-6.9 will make

the 40 yards, and ratings for

ng is related to position,

Y = a + bx1 + cx2 + dx3


he independent variables Rating = a + bPosition + cWeight + dTime

unds ran the 40 yards


Rating = a + bPosition + cWrig
Rating = 11.9 -0.73Position + 0

PositionWeight Time Rating


1 322 5.38 7.4 SUMMARY OUTPUT
1 303 5.18 7
1 317 5.34 6.8 Regression Statistics
1 330 5.46 6.7 Multiple 0.6895
1 334 5.18 6.3 R Square 0.4755
1 308 5.32 6.1 Adjusted 0.4005
1 310 5.28 6 Standard 0.6936
1 318 5.37 6 Observations 25
1 321 5.25 6
1 295 5.34 5.8 ANOVA
1 328 5.31 5.3 df SS
1 320 5.64 5 Regressi 3 9.1562
1 304 5.2 5 Residual 21 10.101
0 325 4.95 8.5 Total 24 19.2576
0 361 5.5 8
0 315 5.39 7.8 Newdata CoefficientsStandard Error
0 307 4.98 7.6 1 Intercept 11.956 4.4999
0 326 5.2 7.3 0 Position -0.7324 0.2893
0 320 5.36 7.1 300 Weight 0.0222 0.0104
0 287 5.05 6.8 5.1 Time -2.2775461 0.92895489
0 332 5.26 6.8
0 334 5.55 6.4 6.9984
0 312 5.15 6.3
0 299 5.35 6.1
0 333 5.59 6
osition + cWright + dTime
0.73Position + 0.022Weight -2.27Time

MS F Significance F
3.0521 6.3451 0.0031
0.481

t Stat P-value Lower 95% Upper 95% Lower 95.0%Upper 95.0%


2.6569 0.0148 2.5976 21.314 2.5976 21.314
-2.5311 0.0194 -1.3341 -0.1306 -1.3341 -0.1306
2.1352 0.0447 0.0006 0.0438 0.0006 0.0438
-2.4517295 0.02305381 -4.2094135 -0.3456786 -4.2094135 -0.3456786

You might also like