ClassWork 04 Experiments With Blocking Factors - Sol
ClassWork 04 Experiments With Blocking Factors - Sol
Exercise 1 [M]
Three different washing solutions are being compared to study their effectiveness in retarding
bacteria growth in 5-gallon milk containers. The analysis is done in laboratory, and only three trials
can be run on a day. The experimenter suspects that days could represent a potential source of
variability. Observations are taken for four days, and the data are shown here.
Day
Solution 1 2 3 4
1 13 24 18 39
2 16 24 17 44
3 2 4 1 22
a) Does the washing solution affect the retarding bacteria growth? (α=0.05)
b) Which washing solutions are different with respect to the retarding bacteria growth? Use the
method of Tukey.
Solution
a) Does the washing solution affect the retarding bacteria growth? (α=0.05)
Main Effects Plot for Growth Interaction Plot for Growth
Data Means Data Means
Days Solution 50 Days
1
35
2
40 3
4
30
30
25
Mean
Mean
20 20
15 10
10
0
1 2 3
1 2 3 4 1 2 3 Solution
The factor washing solution seems to influence the response variable; also the block factor seems to
be relevant. Instead, the interaction lines of factor and block appear parallel; this allows us to not
suspect any interaction. Remember that this kind of design assumes that the interaction between the
factor and the block in absent. Vice versa, the estimated variance will be greater and the experiment
could become inefficient to discover the significance of the factor.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Days 3 1147,33 382,444 65,25 0,000
Solution 2 788,17 394,083 67,24 0,000
Error 6 35,17 5,861
Total 11 1970,67
1
Model Summary
S R-sq R-sq(adj) R-sq(pred)
2,42097 98,22% 96,73% 92,86%
Percent
60
SRES1
0 10 20 30 40 1 2 3 4 50
40
Solution
2 30
20
1
10
5
0
-1 1
-3 -2 -1 0 1 2 3
1 ,0 1 ,5 2,0 2,5 3,0 SRES1
All the residual values belong to the interval (-3;+3), thus no outliers appear. The hypothesis of
normality cannot be refused. From the analysis of the scatterplot, we can assume that the hypothesis
of homogenous variance is satisfied.
In conclusion, the factor washing solution affects the bacteria growth and we rightly add the block
factor to the analysis.
b) Which washing solutions are different with respect to the retarding bacteria growth? Use
the method of Tukey.
2
Exercise 2 [M]
Consider the ratio control algorithm experiment described in the exercise 3.8(M). The experiment
was actually conducted as a randomized block design, where six time periods were selected as the
blocks, and all four ratio control algorithms were tested in each time period. The average cell
voltage and the standard deviation of voltage (shown in parentheses) for each cell are follows:
Algorithm Observations
1 4,93(0,05) 4,86(0,04) 4,75(0,05) 4,95(0,06) 4,79(0,03) 4,88(0,05)
2 4,85(0,04) 4,91(0,02) 4,79(0,03) 4,85(0,05) 4,75(0,03) 4,85(0,02)
3 4,83(0,09) 4,88(0,13) 4,90(0,11) 4,75(0,15) 4,82(0,08) 4,90(0,12)
4 4,89(0,03) 4,77(0,04) 4,94(0,05) 4,86(0,05) 4,79(0,03) 4,76(0,02)
a) Analyze the average cell voltage data. Does the choice of ratio control algorithm affect the
average cell voltage? (α=0.05)
b) Does the choice of ratio control algorithm affect the pot noise cell voltage? (α=0.05)
c) Which ratio control algorithm would you select if your objective is to reduce both the average
cell voltage and the pot noise?
Solution
a) Analyze the average cell voltage data. Does the choice of ratio control algorithm affect the
average cell voltage? (α=0.05)
4,84 4,85
Mean
4,82
4,80
4,80
4,75
1 2 3 4 5 6
1 2 3 4 1 2 3 4 5 6 Period
The nuisance factor period might be significant. The interaction between the 2 factors seems relevant
but it cannot be evaluated in the model.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Algorithm 3 0,002746 0,000915 0,19 0,901
Period 5 0,017437 0,003487 0,72 0,615
Error 15 0,072179 0,004812
Total 23 0,092363
3
Model Summary
S R-sq R-sq(adj) R-sq(pred)
0,0693682 21,85% 0,00% 0,00%
Percent
60
SRES1
-1
1
-2 -3 -2 -1 0 1 2 3
2 4 6 SRES1
No outliers appear in the scatterplots, the hypothesis of normality cannot be refused. Observing the
scatterplots, the variance appears homogeneous. Thus, the residual assumptions are checked and the
ratio control algorithm does not affect the mean cell voltage (The interaction may have concealed the
significance of the factors).
b) Does the choice of ratio control algorithm affect the pot noise cell voltage? (α=0.05)
0,10 0,12
0,09
0,10
Mean
0,08
Mean
0,08
0,07
0,06 0,06
0,05 0,04
0,04
0,02
0,03 1 2 3 4 5 6
1 2 3 4 1 2 3 4 5 6 Period
Both factors seem to influence the response and their interaction seems moderate.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Algorithm 3 0,026013 0,008671 50,76 0,000
Period 5 0,002721 0,000544 3,19 0,037
Error 15 0,002563 0,000171
Total 23 0,031296
Model Summary
4
S R-sq R-sq(adj) R-sq(pred)
0,0130703 91,81% 87,45% 79,04%
Let us check the residual assumptions.
Scatterplot of SRES2 vs FITS2; Algorithm; Period Probability Plot of SRES2
Normal
FITS2 Algorithm
2 99
Mean -9,43690E-1 6
1 StDev 1 ,022
95 N 24
0 AD 0,454
90
P-Value 0,247
-1 80
70
Percent
-2 60
SRES2
-1
1
-3 -2 -1 0 1 2 3
-2
2 4 6 SRES2
All the standardized residuals belong to the interval [-3;+3], thus there are not any outliers. Analyzing
the scatterplot, problems of variance inhomogeneity do not appear. The hypothesis of normality
cannot be refuse. In conclusion the ratio control algorithm does affect the pot noise.
c) Which ratio control algorithm would you select if your objective is to reduce both the average
cell voltage and the pot noise?
The ratio control algorithm does not affect the average cell voltage, thus any algorithm can be
selected. Concerning the pot noise, we are interested in the control algorithm:
a ( a − 1) 4 ( 4 − 1)
Bα t α
= ( df E=
) rA = = 6= df E 15
2 rA 2 2
=Bα t0.05/12
= (15) 3.03624
1 1 4.08
Tα
= qα ( a, df
= E) q0.05 ( 4,15
= ) = 2.88
2 2 2
MS E 0.000171
The critical value
= is: Tα 2 2.88
= 2 0.02174
b 6
Using MINITAB we get:
Grouping Information Using the Tukey Method and 95% Confidence
Algorithm N Mean Grouping
3 6 0,113333 A
1 6 0,046667 B
4 6 0,036667 B
2 6 0,031667 B
Means that do not share a letter are significantly different.
In order to reduce the pot noise, there is not difference among the algorithms 1, 4 and 2. We would suggest a
future experiment focused on these algorithms.
5
Exercise 3 [M]
An aluminum alloy manufacturer produces grain refiners in ingot form. The company produces the
product in four furnaces. Each furnace is known to have its own unique operating characteristics, so
any experiment run in the foundry that involves more than one furnace will consider furnaces a
nuisance variable. The process engineers suspect that stirring rate affects the grain size of the
product. Each furnace can be run at four different stirring rates. A RCDB is run for a particular
refiner, and the resulting grain size data is as follow.
Furnace
Stirring rate (rpm) 1 2 3 4
5 8 4 5 6
10 14 5 6 9
15 14 6 9 2
20 17 9 3 6
a) Is there any evidence that stirring rate affects grain size? (use α=0.05)
b) What should the process engineers recommend concerning the choice of stirring rate and
furnace for this particular grain refine if small grain size is desirable?
c) Suppose that the observation for stirring rate 4 in furnace 3 is missing.
Solution
a) Is there any evidence that stirring rate affects grain size? (use α=0.05)
Main Effects Plot for Size Interaction Plot for Size
Data Means Data Means
Rate Furnace 18 Rate
14 5
16 10
13 15
14 20
12
12
11
10
Mean
10
Mean
8
9
6
8
4
7
2
6
0
5 1 2 3 4
5 10 15 20 1 2 3 4 Furnace
The stirring rate does not seem to be relevant; instead the furnace (factor block) seems to influence
the response. The interaction seems to be fairly relevant but we have to assume that it is absent.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Rate 3 22,19 7,396 0,85 0,500
Furnace 3 165,19 55,063 6,35 0,013
Error 9 78,06 8,674
Total 15 265,44
6
Model Summary
S R-sq R-sq(adj) R-sq(pred)
2,94510 70,59% 50,99% 7,05%
Percent
-2 60
SRES1
10
0
5
-1
1
-3 -2 -1 0 1 2 3
-2
1 2 3 4 SRES1
From the analysis of the scatterplot, no outliers appear and, moreover, the hypothesis of equal
variance cannot be refused. The hypothesis of normality cannot be rejected, in fact the p-value is
lower than 0.05.
In conclusion, the factor stirring rate does not affect the grain size, which instead is influenced by the
furnace (block factor).
b) What should the process engineers recommend concerning the choice of stirring rate and
furnace for this particular grain refine if small grain size is desirable?
The choice of the stirring rate is irrelevant for the grain size. We suggest using the stirring rate that
allows the manufacturer to decrease the production costs or the production time.
From an optimization point of view, we can compare the furnaces. The furnace (1) negatively
influences the grain size. We can suggest to analysis this furnace and to understand why it is not
working well as the other furnaces.
If a data is missing, we have to add a * in the corresponding cell in Minitab and then we can analyze
the data as usual.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Rate 3 37,94 12,648 1,94 0,202
7
Furnace 3 146,94 48,981 7,50 0,010
Error 8 52,22 6,528
Total 14 242,00
Model Summary
S R-sq R-sq(adj) R-sq(pred)
2,55495 78,42% 62,24% 25,74%
The conclusions do not change. The Stirring rate is still insignificant and the factor furnace influences
the grain size.
8
Exercise 4
yi2 y2
Consider a p x p Latin square without replications. Prove that=
SS A ∑i p p2
− and obtain least
Solution
The model is:
i = 1, 2.. p
yijk = µ + τ i + β j + α k + ε ijk j = 1, 2.. p
k = 1, 2.. p
∑(y − µ − τ i − β j − α k ) by solving the following system of linear
2
We must minimize ∑=
ε ijk2 ijk
i , j ,k i , j ,k
equations:
∂ ∑ ( yijk − µ − τ i − β j − α k )
2
i , j ,k
=0
∂µ
∂ ∑ ( yijk − µ − τ i − β j − α k )
2 ∑τ i
i =0
i , j ,k
∂τ i
= 0 considering the additional constraints ∑β j
j =0
...
∑α k =0
∂ ∑ ( yijk − µ − τ i − β j − α k )
2 k
i , j ,k
=0
∂α k
τˆi : yi = µˆ p + τˆi p + ∑ βˆ j + ∑ αˆ k
j k
βˆ j : y j =µˆ p + ∑τˆi + βˆ j p + ∑ αˆ k
i k
ˆ k y k − y
α=
9
yijk = µˆ + τˆi + βˆ j + αˆ k + eijk
∑( y )
2
ijk − µˆ= ∑δ τˆ + ∑ δ ijk βˆ j2 + ∑ δ ijkαˆ k2 + ∑ δ ijk eijk2
ijk i
2
10
Exercise 5 [M]
An industrial engineer is investigating the effect of four assembly methods (A, B, C, D) on the
assembly time for a color television component. Four operators are selected for the study.
Furthermore, the engineer knows that each assembly method produces such fatigue that the time
required for the last assembly may be greater than the time required for the first, regardless of the
method. That is, a trend develops in the required assembly time. To account this source of variability,
the engineer uses the Latin square design shown below.
Order of Operator
Assembly 1 2 3 4
1 C=10 D=14 A=7 B=8
2 B=7 C=18 D=11 A=8
3 A=5 B=10 C=11 D=9
4 D=10 A=10 B=12 C=14
a) Analyze the data from this experiment (use α=0.05) and calculate manually the residual of the
data (C,1,1);
b) Verify whether it is advantageous to conduct an experiment with two block factors.
Solution
1 y2 y2
SSOrder
= ∑
p j
y2j − 2= 18.5
p
SSTOT= ∑y
i , j ,k
2
ijk −
p2
= 153.0
Source SS df MS F0 p
Method 72.5 3 24.2 13.8 0.004
Operator 51.5 3 17.2 9.8 0.01
Order 18.5 3 6.2 3.5 0.089
Error 10.5 6 1.75
Total 153.0 15
11
The residual of the data (C,1,1) is:
eC11 = yC11 − yˆC11 where yˆC11 = µˆ + τˆC + βˆ1 + αˆ1
y 164
µˆ =2 = =
10.25 τˆC = 3 βˆ1 =
yC − y = −0.5 αˆ1 =
y1 − y = −2.25
y1 − y =
p 16
yˆC11 =10.25 + 3 − 0.5 − 2.25 =10.5 → eC11 =yC11 − yˆC11 =−0.5
Operator
11
Mean
15 1
2
3
10 Operator 10
4
5
9
8
Methods
7
1 2 3 4 1 2 3 4 1 2 3 4
Method seems to be relevant, the interactions appear moderate but they can not be considered in the
model.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Order 3 18,50 6,167 3,52 0,089
Operator 3 51,50 17,167 9,81 0,010
Methods 3 72,50 24,167 13,81 0,004
Error 6 10,50 1,750
Total 15 153,00
Model Summary
S R-sq R-sq(adj) R-sq(pred)
1,32288 93,14% 82,84% 51,20%
-1 80
70
Percent
-2 60
SRES1
5 10 15 1 2 3 4 50
Operator Methods 40
2 30
20
1
10
0 5
-1
1
-3 -2 -1 0 1 2 3
-2
1 2 3 4 1 2 3 4 SRES1
12
All the standardized residuals belong to the interval [-3;+3], thus no outliers appear. Moreover, the
hypothesis of homogeneous variance is assumed from the analysis of the scatterplot. The hypothesis
of normality cannot be refused. Finally the assembly method does affect the assembly time. Moreover
the assembly order and the operator are nuisance factors that influence the assembly time, thus we
can suggest a deep study to better understand the differences among the operators and the assembly
orders.
In order to understand the advantageous, we can do the analysis without the block factors.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Methods 3 72,50 24,167 3,60 0,046
Error 12 80,50 6,708
Total 15 153,00
Model Summary
S R-sq R-sq(adj) R-sq(pred)
2,59005 47,39% 34,23% 6,46%
Without using the block factors, we can conclude with some difficulty (the p-value is almost 5%) that
the factor method is significant.
13
Exercise 6 [September 21st 2005]
A restaurateur was studying the performance of five different brands of oil. A tester, after testing
each brand, had to assign a score for each one. The tester could taste only three brands per day. An
experiment was conducted and the data are shown below:
Day
Brand 1 2 3 4 5 6 7 8 9 10
1 32.4 37.3 38.2 36.4 41.4 40.8
2 30.7 31.1 34.4 33.2 37.0 41.2
3 32.3 29.4 32.1 34.8 40.7 46.6
4 32.7 34.1 31.3 41.5 43.3 43.3
5 36.0 34.1 41.4 35.7 45.6 44.3
Solution
The experiment was conducted using a Randomized Incomplete Block Design (RIBD). This design
is used when a nuisance factor exists and we may not be able to run all the treatment combinations in
each block. In our case, the design is not symmetric.
40
40
Mean
Mean
38
38
36
36
34
34 32
32 30
30 1 2 3 4 5 6 7 8 9 10
1 2 3 4 5 1 2 3 4 5 6 7 8 9 10 day
The block factor seems to affect the response and to be more relevant than the factor brand. The
interaction does not appear significant.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
brand 4 69,10 17,275 3,35 0,036
14
day 9 533,83 59,314 11,49 0,000
Error 16 82,56 5,160
Total 29 701,11
Model Summary
S R-sq R-sq(adj) R-sq(pred)
2,27163 88,22% 78,66% 58,60%
-1 80
70
Percent
-2 60
SRES1
30 35 40 45 1 2 3 4 5 50
day 40
2 30
20
1
10
0 5
-1
1
-3 -2 -1 0 1 2 3
-2
0,0 2,5 5,0 7,5 1 0,0 SRES1
No outliers appear in the scatterplots and the hypothesis of normality cannot be refuse.
We conclude that both factors are significant.
15
Exercise 7 [M]
The factors that influence the breaking strength of synthetic fiber are being studied. Four
production machines and three operators are chosen to run a factorial experiment. Two batches of
fiber are chosen to perform two replications. The results are as follows:
Machine
Operator 1 2 3 4
109 110 108 110
1
110 115 109 108
110 110 111 114
2
112 111 109 112
116 112 114 120
3
114 115 119 117
In blue the first batch.
Analyze the data (use α=0.05),
Solution
The batch is clearly a nuisance factor. Then, the analysis is done considering two factors of interest
(Operator and Machine) and one blocking factor (Batch). Let us notice that for each level of the
blocking factor, a complete plan is done. The model for the analysis will be complete for the factors
operator and machine and additive for the factor batch.
Main Effects Plot for Strength Interaction Plot for Strength
Data Means Data Means
Operator Machine Batch 1 2 3 4 1 2
120
116 Operator
1
115 2
115 3
Operator
110
114
120
Machine
Mean
1
113 2
115
Machine 3
4
112
110
111
110 Batch
1 2 3 1 2 3 4 1 2
The operator seems to be relevant. It affects the breaking strength more than the machine and batch
factors. The interaction between operator and machine seems to be fairly important. The interaction
between machine and batch is suspected but it cannot be verified with this kind of model.
Analysis of Variance
16
Source DF Adj SS Adj MS F-Value P-Value
Operator 2 160,333 80,167 20,29 0,000
Machine 3 12,458 4,153 1,05 0,409
Batch 1 2,042 2,042 0,52 0,487
Operator*Machine 6 44,667 7,444 1,88 0,172
Error 11 43,458 3,951
Total 23 262,958
Model Summary
S R-sq R-sq(adj) R-sq(pred)
1,98765 83,47% 65,44% 21,33%
-1 80
Percent 70
-2 60
SRES1
-1
1
-3 -2 -1 0 1 2 3
-2
1 2 3 4 1 ,00 1 ,25 1 ,50 1 ,75 2,00 SRES1
From the analysis of the scatterplot, no outliers appear and the hypothesis of homogeneous variance
cannot be refused. Also the hypothesis of normality cannot be rejected. Finally, the factor operator is
the only significant. Instead, the nuisance factor does not affect the breaking strength.
17
Exercise 8 [September 20th 2012]
The effect of five different ingredients (A, B, C, D, E) on the reaction time of a chemical process is
being studied. Each batch of new material is only large enough to permit five runs to be made.
Furthermore, each run requires approximately 1 hour and half, so only five runs can be made in one
day. The data of the experiment follows.
Day
Batch 1 2 3 4 5
1 A (8) B (7) D (1 C (7) E (3)
2 C (11) E (2) A (7) D (3) B (8)
3 B (4) A (9) C (10) E (1) D (5)
4 D (6) C (8) E (6) B (6) A (10)
5 E (4) D (2) B (3) A (8) C (8)
Solution
1
6 2
3
Day 5
4
5
5
0
Ingredient
3
1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
The factor Ingredient seems to be relevant and affects the reaction time more than the nuisance
factors. The interactions appear moderate but they cannot be tested with this model.
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Batch 4 15,44 3,860 1,23 0,348
Day 4 12,24 3,060 0,98 0,455
18
Ingredient 4 141,44 35,360 11,31 0,000
Error 12 37,52 3,127
Total 24 206,64
Model Summary
S R-sq R-sq(adj) R-sq(pred)
1,76824 81,84% 63,69% 21,19%
-1 80
70
Percent
-2 60
SRES1
2 4 6 8 10 1 2 3 4 5 50
Day Ingredient 40
2 30
20
1
10
0 5
-1
1
-3 -2 -1 0 1 2 3
-2
1 2 3 4 5 1 2 3 4 5 SRES1
From the analysis of the scatterplots, no outliers appear and the hypothesis of homogeneous variance
cannot be refused. The assumption of normality cannot be rejected with a p-value equal to 0.452. The
residual assumptions are verified and, in conclusion, the factor ingredient is significant whereas the
nuisance factors do not affect the reaction time.
c) What should the experiment recommend if we want to reduce the reaction time? (Use Tukey)
In order to reduce the reaction time, there is not difference among the ingredients B, D, E.
A further experiment focused on these ingredients is suggested.
19
Exercise 9 [February 1st 2012]
The following experiment was described by Kuehl (1994) and run by J. Berry and A. Deutschman at
the University of Arizona to obtain specific information about the effect of pressure on percent
conversion of methyl glucoside to monovinyl isomers. The conversion is achieved by addition of
acetylene to methyl glucoside in the presence of a base under high pressure. Five pressures were
examined in the experiment, but only three could be examined at any one time under identical
experimental conditions. The data of the experiment follows.
Experimental condition
Pressure I II III IV V VI VII VIII IX X
250 16 19 20 13 21 24
325 18 26 19 13 10 24
400 39 21 33 34 30 31
475 32 46 35 47 31 37
550 45 61 55 48 52 50
Solution
40
40
Mean
Mean
35 30
30
20
25
10
20
1 2 3 4 5 6 7 8 9 10
250 325 400 475 550 1 2 3 4 5 6 7 8 9 10 Block
The factor pressure seems to affect the percent of conversion. The interaction appears to be moderate
but it cannot be considered in this kind of model.
20
Factor Type Levels Values
Pressure Fixed 5 250; 325; 400; 475; 550
Block Fixed 10 1; 2; 3; 4; 5; 6; 7; 8; 9; 10
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Pressure 4 3688,6 922,14 29,90 0,000
Block 9 346,9 38,55 1,25 0,334
Error 16 493,4 30,84
Total 29 5576,7
Model Summary
S R-sq R-sq(adj) R-sq(pred)
5,55328 91,15% 83,96% 68,89%
-1 80
70
Percent
-2 60
SRES1
-1
1
-3 -2 -1 0 1 2 3
-2
0,0 2,5 5,0 7,5 1 0,0 SRES1
All the standardized residuals belong to the interval [-3;+3], thus no outliers appear. Observing the
scatterplot, no problems of inhomogeneous variance can be detected. The hypothesis of normality
cannot be refused. We conclude that the Blocking factor does not have an influence on the response,
while pressure has.
21