Correlation and Simple Linear Regression (Problems With Solutions)
Correlation and Simple Linear Regression (Problems With Solutions)
1. ThefollowingdatarepresenttheyearsofexperienceXandsalaryY(inthousanddollars)ofarandom
sampleofprofessionalengineers(n=27)
X 1 2 2
Y 25 28 37
13
52
14
66
18
53
3 3 4 4 6 6 6
31 40 36 43 40 43 51
18
70
20
70
21
60
22
61
22
71
7 7
42 55
25
58
9
41
9
51
9
60
12
50
12
64
25
74
a. Findthecoefficientofcorrelationr.
b. Isitsufficientsampleevidencetoindicatethatthereislinearcorrelationbetweentheyearsofexperienceand
salary?
c. Testthehypothesisthat = 0.6 againstthealternative > 0.6 .Use = 0.01
2. Achemicalengineerisstudyingtheeffectoftemperatureontheyieldofacertainproductinchemicalprocess.
Theprocessisrun10timesandthefollowingdataisobservedforthetemperatureofeachprocessXand
correspondingyieldY.
TemperatureX(in C ) 95 110 118 124 145 140 185 190 205 222
Yield(inkgs)
108 126 102 121 118 155 158 178 159 184
Thefollowinginformationisavailable:
x = 1534 , x
xy = 226,463
a. Findthecoefficientofcorrelationr.
b. Isitsufficientsampleevidencetoindicatethatthereislinearcorrelationbetweentemperatureandyieldof
chemicalproduct?
c. Testthehypothesisthat = 0.8 againstthealternative 0.8 .Use = 0.05
3. Inthefollowingdata,XrepresentsthenumberofyearsofformaleducationandYrepresentsthesalaryin
thousandsofdollarsofarandomsampleofadultmales.
X
13
17
9
18
16
18
13
16
Y
21.6
25.8
15.9
48.3
38.2
56.4
28.4
43.3
a. Plotthescatterdiagram.
b. Computethecoefficientofcorrelationandtestthehypothesis H 0 : = 0 against H1 : 0 .
c. Findthecoefficientsoftheleastsquareslineandwritetheequationforestimatedregressionline.
d. Computemeansquareerror.
e. Finda95%confidenceintervalfor 1 and 0 .
f.
Testthehypothesisthat 0 = 0 againstthealternative 0 0 .
g. Testthehypothesisthat 1 = 0 againstthealternative 1 0 .
h. Estimatethetruemeanresponse Y | x tox=16yearsofeducation.
i. PredictthesalaryYcorrespondingto16yearsofeducation.
j. Findthecoefficientofdeterminationandexplainitsmeaning.
k. ConstructANOVAtableforregression.
4. Thetablebelowdisplaysthemathematicsachievementtestscoresforarandomsampleofn=10students,
selectedfromthepopulationof12thgraders,togetherwiththeirfinalcalculusgrades.
Mathachievement
39
43
21
64
57
47
28
75
34
52
Testscore
Finalcalculusgrade
65
78
52
82
92
89
73
98
56
75
a. Findthecoefficientsoftheleastsquareslineandwritetheequationforestimatedregressionline.
b. B.Determinewhetherthereissignificantrelationshipbetweenthecalculusgradesandtestscores.
Use = 0.05
c. Finda95%confidenceintervalfortheslopeofregressionline.
d. Estimatetheaveragecalculusgradeforthestudentswhoseachievementscoreis50witha95%CI.
e. Astudenttooktheachievementtest,buthasnotyettakenthecalculustest.Predictthecalculusgradefor
thisstudentwitha95%predictioninterval.
f. Finda95%confidenceintervalfortheinterceptofregressionline.
g. Calculatethecoefficientofdeterminationandexplainitsmeaning.
h. ConstructANOVAtableforregression.
5. Youaregivendatasetwith6pairsofxvaluesandyvalues.Weassumethatxisindependentvariableandyis
dependentvariable.
x
2
1
0
1
2
y
1
1
3
5
5
a. Findtheleastsquareslineforthedata.
b. Dothedatapresentsufficientevidencetoindicatethatyandxarelinearlyrelated?
c. ConstructtheeANOVAtableforlinearregressionandusetocalculateF=
MSR
.Verifythatthesquareoftest
MSE
statisticusedinpartb.coincideswithcriticalvalueforFat = 0.05 .
d. Find90%confidenceintervalfortheslopeoftheline.
e. Estimatetheaveragevalueofywhenx=1,using90%confidenceinterval.
f. Finda90%predictionintervalforfuturevalueofywhenx=1.
6. Thedata,togetherwithaportionofMinitabprintout,isgivenbelow.CertaininformationinMinitabprintoutis
missed.
x
1
2
3
4
5
6
y
5.6
4.6
4.5
3.7
3.2
3.7
a. Findtheleastsquareslineforthedata.
b. FillinthemissingentriesintheMINITABanalysisofvariancetable.
c. Dothedatapresentsufficientevidencetoindicatethatyandxarelinearlyrelated?
UsetheinformationintheMINITABprintouttoanswerthisquestionatthe1%levelofsignificance.
d. Findthecoefficientofdetermination.
e. Find90%confidenceintervalfortheslopeoftheline.
f. Estimatetheaveragevalueofywhenx=2,using90%confidenceinterval.
g. Finda95%predictionintervalforthefuturevalueofywhenx=2.
PredictorCoefSt.devtP
Constant6.00000.175934.100.000
X0.557140.0451812.330.000
S=0.1890Rsq=97.4%
Analysisofvariance
SourceDFSSMS
Regression**5.4221
Residualerror*0.1429*
Total*5.5750