Tutorial Minitab
Tutorial Minitab
of Minitabs most important features. Each of the tutorials are designed to explain the fundamentals of using Minitabhow to use the menus and dialog boxes, how to manage and manipulate data and files, how to produce graphs, and more. There are five successive tutorials: Session One: Minitab Basics Session Two: Doing a Simple Analysis Session Three: Advanced Minitab Session Four: Quality Control and Improvement Session Five: Designing an Experiment
The worksheet POPLAR1 contained the data collected so far, but you just received new observations from the field, and there are five new rows to enter. 1 Press until you reach the first blank cell in row 16 or, with your mouse, click on the first blank cell in row 16. 2 Make sure the data entry arrow points to the right. If it does not, click on it to change its direction. Verify data window. 3 Type the following from left to right across each row: 1.52 [Enter] 2.9 [Enter] .07 [Ctrl]+[Enter] 4.51 [Enter] 5.27 [Enter] .79 [Ctrl]+[Enter] 1.18 [Enter] 2.2 [Enter] .03 [Ctrl]+[Enter] 3.17 [Enter] 4.93 [Enter] .44 [Ctrl]+[Enter] 3.33 [Enter] 4.89 [Enter] .52 [Ctrl]+[Enter] Tip If you make a mistake: click on or move to a cell (the contents will be automatically selected, type the correct value, and press [Enter].
The new Site column will appear in the Columns folder in the Project Manager. Note To view the column info, double-click on the folder POPLAR1.MTW in the Project Manager and click on the Columns folder. The column info will appear in the right pane of the Project Manager.
Minitab displays text output in the Session window and each graph (three, in this case) in its own Graph window. Now you can tile the graphs to view all of them at one time on your screen. 1 Activate the Project Manager by typing [Ctrl]+[I] or by clicking the Project Manager button on the toolbar. 2 Click on the Graphs folder. 3 In the right pane of the Project Manager, select the graphs you want. You can select multiple graphs by clicking on the graph titles while holding down the [Ctrl] key, 4
or by clicking below or to the side of the graph titles and dragging across the desired selections. 4 Right-click on the selected graphs and choose Tile. Judging from the boxplots , poplars grown at Site 2 are larger than those grown at Site 1. The Session window data confirm that the median values for Diameter, Height, and Weight of the poplars are larger for Site 2 than for Site 1. Also, the variable Weight has a very large standard deviation relative to its mean. At Site 2, the minimum weight is only 0.03 kg while the maximum is 1.11 kg. It appears that some of our poplars are doing very well, while others are barely alive.
Looking at the scatter plot, you see a positive linear relationship between Weight and D2H. That is, as D2H increases, so does Weight. You also notice an unusual data point a tree that has a very low weight for a relatively high D2H value. For now, you decide to ignore it, but it is something you may want to check on later. Next, you will compute the correlation between these two variables to quantify the relationship.
If there is no check mark next to Output Editable, then select it to enable Session window editing. If there is already a check mark next to Output Editable, then Session window editing is already enabled. Press [Esc] twice to close the menu. Now you can edit your output. 3 Delete all the text above the Descriptive Statistics output and all the text between the Descriptive Statistics output and the Correlation output. Select the text by dragging over it with your mouse, then delete it by choosing Edit > Cut or pressing [Delete]. 4 Scroll to the top of the Session window and type four comment lines as shown below: Original data set was POPLAR1.MTW Five lines of data were added. 6
Two variables were added. Site and D2H = (Diameter**2)* Height. The resulting data set was stored in the project POPLAR.MPJ. 5 Save your work. Choose File > Save Project.
You could go to each Graph window and print them separately, but if you have more than one graph there is a faster way. 2 Activate the Project Manager by typing [Ctrl]+[I] or by clicking the Project Manager button on the toolbar. 3 Left-click on the Graphs folder in the Project Manager. 4 In the left pane of the Project Manager, select the four graphs you have created: click below Plot Weight * D2H and drag up. Verify Project Manager. 5 Right-click on the selected graphs and choose Print. Click OK.
More If you want to use output or data in another application or another Minitab project, you can save your Session window output, data, and graphs as separate files. These separate files are copies of what is currently in your projectthe contents of your project are not changed in any way.
2 Minitab may ask if you want to save changes to your project. Since you already saved your project above, there is no need to do it again here. Click No. Otherwise you may continue with Session 2: Doing A Simple Analysis.
Time required About 30 minutes. Proceed with Step 1: Start a New Project.
You decide you might as well do a series of plots for residual analysis to check for any potential problems. 4 Click Graphs. 5 Under Residuals for Plots, click Standardized. 6 Under Residual Plots, check Histogram of residuals and Normal plot of residuals. 7 In Residuals versus the variables, enter D2H. Verify dialog box. 8 Click OK in each dialog box. In the Session window output, Minitab displays the regression equation, the table of coefficients, the analysis of variance table, andin the table of unusual observations the identity of the outlier and influential observations (rows 12 and 15). Before proceeding with further analysis, you want to examine rows 12 and 15 to make sure they contain valid data. Minitab also displays three graph plots. A quick glance at the Residuals Versus D2H plot shows you that the data contains an outlier.
1 To view the worksheet, click on the Data window, choose Window > POPLAR2.MTW, or press [Ctrl]+[D]. 2 Now go to the first unusual observation, in row 12 of the column named Weight: Choose Editor > Go To. (Select the first Go To.) In Enter column number or name, type Weight. In Enter row number, type 12 and click OK. Verify dialog box.
The Data window now shows the 12th observation of Weight as the highlighted cell. Both Weight and D2H seem rather large, so you double-check the researchers log sheets. It turns out that poplar #12 is a very healthy treethe values are correct. 3 Click on the Weight value in row 15 to highlight it, or press three times.
Double-checking the log sheet shows that this value is actually an error. The correct value should be 0.70, not 0.07. 4 Type .7 and press [Enter]. Verify data window.
As before, Minitab displays the text output in the Session window, and displays each of the three graphs in its own Graph window. First, look at the Session window output. If you have a good model and have satisfied all the statistical assumptions, then you can measure the diameter and height of any poplar in this population and be able to predict its weight without cutting it down, drying it, and weighing it on a scale. From the regression output, you see a high t-ratio and a low p-value for D2H in the table of coefficients, indicating strong evidence of a relationship between D2H and Weight. The large F-statistic and low p-value in the analysis of variance table quantify this relationship in a different way. The R2 and adjusted R2 values of greater than 98% 10
further reinforce the assertion that there is a strong linear relationship between D2H and Weight. Before making a final conclusion, however, you decide to look at the plots. You notice from the Residuals Versus D2H plot that the variance does not appear to be constantan important assumption for a regression model to meet. It is larger at bigger values of D2H. In the interest of time we will continue with our session, but this is something you would want to examine more closely.
Put the graph in edit mode 1 Make the Graph window active by clicking on it or choosing its name from the Window menu. 2 Maximize the Graph window. 3 Choose Editor > Edit. More A graph can be in one of three modes: View mode allows you to view your graph but nothing else, Edit mode allows you to edit your graph, and Brush mode allows you to identify data points. You choose the mode from the Editor menu.
11
Two palettes should appear. If they do not, open them: Choose Editor > Show Tool Palette. Choose Editor > Show Attribute Palette.
Your Graph window should look like this. Your palettes may be in different positions. You can move a palette around by dragging the title bar at the top of the palette. Change the text of the title 1 2 On the Tool palette, click the selection tool if it is not already selected. Click anywhere on the title and press [Enter], or double-click on the title.
A text box containing the current title will appear on the screen. 3 In the box, edit the title to say Regression Plot for Poplar2 Data. Click OK.
Resize the text box of the title Because the title is longer, it is now on three lines. You need to resize the box that surrounds the title. 1 If necessary, click on the title to make it active. Handles will appear. 2 Place the cursor over the second handle up on the right edge. The cursor changes to crossed diagonal lines. 3 Click and hold the mouse button down and drag the right edge so the title is on two lines. 4 Place the cursor over the second handle up on the left edge. Click and hold the mouse button down and drag the left edge so the title is on one line. Now you can position the title where you want it. 1 Put the cursor in the middle of the title. 2 Click and hold the mouse button down and drag the title to the position you want. Release the mouse button.
12
The Brushing palette widens to display the additional information. If you like, you can move it so both the palette and the plot can be easily seen. You want to identify the two points on the extreme right of the plot. You can select a block of points by enclosing them in a rectangle. To draw the rectangle, you begin at the upper-left corner and drag down to the lower-right corner of the rectangle. 4 Move the cursor to the location where you want to begin drawing the rectangle. This location will be the upper-left corner. 5 Hold the mouse button down and drag down to the lower-right until the rectangle encloses the two points. The points are enclosed in a rectangle and identified in the Brushing palette. These are the same points identified previously as being an outlier and influential observation. Here, brushing lets you quickly see the diameter, height, weight, and site information for these points. The brushed points are also marked in the Data window.
13
14
Two windows can show you information about this worksheet. 3 Choose Window > Project Manager or press [Ctrl]+[I].
If the left page of the Project Manager is too narrow, simply click the divider between the panes and drag to the right. 4 In the left pane of the Project Manager, click on the Columns folder. Verify.
The right pane displays information about the data in the worksheet. 5 If the worksheet is not visible, open the Data window by pressing [Ctrl]+[D].
The Data window shows you the columns of data in detail. This worksheet contains seven variables: Site, Year, Treatment (experimental treatment), Diameter (cm), Height (m), Weight (kg), and Age (years). Tip If you want to adjust the column widths to fit the data, point with your mouse to the top of a line dividing the two columns until the mouse cursor turns into a two-sided arrow. Then, press the mouse button down and drag the column border to make it wider or narrower.
You want to maximize yield, so you will focus on what factors influence the weight of trees. Begin by looking at the descriptive statistics for the variable Weight. 1 box. 2 Choose Stat > Basic Statistics > Display Descriptive Statistics. Verify dialog In Variables, enter Weight. Click OK.
When you examine the session window output, notice the minimum value for Weight. It is certainly impossible to have a weight of -99 kilograms! The real story here is that our data gatherers in the field recorded the value -99 to represent a dead tree. Leaving values of -99 in the worksheet is going to considerably throw off any analyses you do. In fact, it has already affected the output of the descriptive statistics just computed. The means and medians are artificially low, while the standard deviation is artificially high. You need to convert all -99s to missing values. Missing values do not affect the results of any statistical analyses. Minitab represents a missing value for numerical data with an asterisk (*).
1 2
Choose Stat > Tables > Tally. In Variables, enter Age. Click OK. Verify dialog box.
The session window output shows that you have 147 three-year-old trees and 151 fouryear-old trees.
Minitab will split the POPLAR3 worksheet using the values of Age. Since there are two unique values in the age column (3 and 4), Minitab will create two new worksheets. The worksheet that contains the data for the three-year-old trees will be named POPLAR3.MTW(Age = 3); the worksheet that contains the data for the four-year-old trees will be named POPLAR3.MTW(Age = 4). Rename the worksheet that contains the four-year-old trees data 1 2 4). 3 4 5 Press [Ctrl]+[I]. Verify dialog box. In the left pane of the Project Manager, right-click on POPLAR3.MTW(Age = Click Rename. Verify dialog box. Type 4YROLDS.MTW. Press [Enter].
You will now perform the analysis on the four-year-old poplar data.
17
You see from the histogram that the weights of the poplars are approximately normally distributed (in a bell-shaped curve).
This tells Minitab to produce a separate boxplot of weight for each treatment. 4 Click OK.
Examine the boxplot of Weight by Treatment. The line drawn across each box indicates the median, or middle, of the data. The bottom and top edges of the box mark the first (25th percentile) and third (75th percentile) quartiles, respectively. The boxplots suggest that Treatments 2 and 4 (fertilizer and fertilizer/irrigation) have produced the heaviest trees, while Treatments 1 and 3 (control and irrigation) have yielded lighter trees. You might also expect the site to have an impact on weight. The Site 1 trees planted in the rich, well-drained soil would be expected to weigh more than the Site 2 trees planted in the dry, sandy soil. You can determine if this assumption is true by looking at a boxplot of weight for each site. Rather than repeat your previous menu selection from the beginning, recall the last Boxplot dialog box and change the X, or category, variable. 5 6 7 Choose Edit > Edit Last Dialog, or press [Ctrl]+[E]. In X, enter Site. Verify dialog box. Click OK.
The new boxplot will appear. Surprisingly, the Site 1 tree weights do not seem very different from the Site 2 tree weights. The spreads are different for each site, but the medians are almost the same.
18
You have seen from the boxplots that poplar weights differ noticeably among the four treatments, but not as noticeably between the two sites. Now you decide to use analysis of variance to see if there are statistically significant differences in weight due to the different levels of the factors site and treatment. When you have two or more factors, Minitab gives you a choice between balanced ANOVA and the general linear model (GLM). Balanced ANOVA requires a balanced design, that is, you must have the same number of observations for each site/treatment combination. Because your design is not balanced, you will use the general linear model. 1 2 Choose Stat > ANOVA > General Linear Model. In Responses, enter Weight.
Next, you will enter the model you want GLM to fit. You decide to look at a model with Site, Treatment, and the Site* Treatment interaction. 3 In Model, type Site | Treatment.
The vertical bar tells Minitab that you want to include all possible interactions in the model. To make a vertical bar on most keyboards, press [Shift]+[\], or you can use the symbol ! instead. 4 Click OK. Verify dialog box.
The session window output for the GLM lists each factor in the model and the number of levels in each factor. Next, GLM lists the analysis of variance table, and finally, it lists unusual observations. Suppose you want to perform an F-test for each effect in the model. For example, to test the null hypothesis that the treatment effect is the same for both sites (the Site*Treatment interaction), compare Minitabs p-value with the commonly used alpha level of 0.05. Because the p-value is 0.091 (a value larger than 0.05) you cannot reject the null hypothesis. That is, you cannot conclude that the treatment effect differs for the two sites. Now you can look at the main effects, Site and Treatment. The Site p-value of 0.219 is also larger than 0.05, so you cannot conclude that poplar weights differ significantly between the two sites. The p-value for Treatment is small (0.000) thereby supporting the conclusion that mean weights do differ significantly for different treatments. This agrees with what you saw earlier in the boxplotsthat poplar weights were different for different treatments, but only varied slightly between the two sites. Before you decide that Treatment is the only important factor influencing poplar weight, take a look at the Year effectremember that the researchers planted half the trees in Year 1 and half in Year 2.
19
You decide to look at a boxplot to compare the weight of poplars planted in Year 1 with those planted in Year 2. 1 2 3 Choose Graph > Boxplot. In Y, enter Weight. In X, enter Year.
This says to draw a separate boxplot of weight for each year. Notice the first two rows of the Data display table. IQRange Box instructs Minitab to display a box showing the interquartile range, from the 25th to the 75th percentile. Outlier Symbol instructs Minitab to display an asterisk (*) for all outlier values. You decide to also display a confidence interval box within the IQ Range Box. 4 5 6 7 In the Data display table, in the Display column, click in the row for item 3. Click the down arrow beside Display, and choose CI Box. Click in the next cell to the right. Click the down arrow beside For each and choose Graph. Verify dialog box.
This row tells Minitab to include a confidence interval on each boxplot. By default, Minitab draws boxplots vertically, but you also can draw them horizontally. 8 9 Click Options. Check Transpose X and Y. Verify dialog box. Click OK in each dialog box.
In the boxplot, the inner boxes show a 95% confidence interval for the median. The boxplot suggests that poplars planted in Year 2 are heavier than those planted in Year 1. But why is year important? Trees were planted in two different years simply to replicate the data. You interview the field researchers and learn that they did not apply herbicides to control weeds during the first year planting. As a result, many young trees either died or were severely stunted. To improve the trees ability to survive, researchers did apply herbicides when they planted poplars the second year. You draw three preliminary conclusions from your analysis. One, fertilization appears to be an effective way to maximize the weight of poplar clones. Two, it is important to control weeds while the trees are very young. Three, given proper planting and nutrient conditions, poplar clones may not require a high-quality site in order to yield a substantial amount of biomass. Not only were the Year 2 trees heavier, their weights were more consistent. But before you recommend the use of herbicides and fertilizers, you want to look more closely at the Year 2 trees. Specifically, you want to know if you still see Site and Treatment effects, when you look at the Year 2 trees alone.
20
Repeat the analysis on the Year 2 trees 1 Make sure the Data window named YEAR2.MTW is active. To make a Data window active, click on it, or choose its name from the Window menu. 2 3 Press [Ctrl]+[I]. In the left pane of the Project Manager, click the History folder.
The right pane of the Project Manager displays all the commands you have executed in the current project. 4 Scroll through the commands until you find Histogram.
This was the command you used to check for normality. 5 Click on the Histogram command to select it.
6 Scroll to the bottom of the commands, hold down [Shift], and click the last command. All the commands from Histogram to the end are selected. Verify history folder contents. 7 Choose Edit > Command Line Editor, or as a shortcut, press [Ctrl]+[L].
21
A dialog box appears containing the Minitab commands from the section you highlighted. Verify dialog box. 8 Click Submit Commands.
The entire analysis, a histogram of Weight, boxplots of Weight by Treatment and Weight by Site, an analysis of variance, and a boxplot of Weight by Year are all done, with no further work. Tip You can also select consecutive commands in the History folder by clicking and dragging through them. You can select nonconsecutive commands by holding down [Ctrl] while you click them.
22
Time required About 30 minutes. Proceed with Step 1: Start a New Project.
The R chart for Length does not show any points out of control.
23
The Xbar chart shows that the process is out of control. Specifically, one point has failed test 1, and two points have failed test 6. To find out what these tests mean, look in the Session window. 7 Choose Window > Session.
By looking at the Session window output, you see that the process produced one point more than 3 sigmas from the center line, and four of five points more than 1 sigma from the center line (on one side of the center line). Now that you have confirmed that a problem does exist, it is time to look for causes and solutions. Unfortunately, the sampling plan did not allow for detailed inspection of precisely where and when the problems occurred because only one sample was taken per shift. A better plan would have been to take multiple samples per shift for the troubleshooting phase, and to switch to this monitoring plan after special causes were found and eliminated. Nonetheless, you are determined to get what you can out of the data that you have.
The Graphs subdialog box appears. 4 5 Check Histogram of data, with normal curve. Click OK in each dialog box.
Examine the histogram. In general, we expect a variable such as Length to follow the normal distribution. In this case, the histogram would be approximately bell-shaped. The histogram you just created is certainly not bell-shaped. In fact, it would appear 24
from the spikes at 598, 599, and 601 that we may be dealing with more than one separate and distinct distributions. An examination of the inventory records indicates that there are two suppliers for the camshafts. Now you are starting to understand the odd histogram. You decide to obtain measurements from both suppliers and run Xbar and R charts separately on each set of data with a subgroup size of 5 for each. The data for each supplier are stored in the columns Supp1 and Supp2 of your worksheet.
According to the Xbar-R Chart, both the means and ranges for Supplier 1 appear to be in control, although you notice that the mean is 599.5 mm, not 600. The average range for Supplier 1 is 1.36 mm. Evaluate Supplier 2 You can produce the same control charts for Supplier 2, using the variable named Supp2, as you did for Supplier 1. 1 Press [Ctrl]+[E].
This keyboard shortcut, for Edit > Edit Last Dialog, brings up the Xbar-R Chart dialog box again. 2 In Single column, enter Supp2. Click OK.
You do not need to enter a subgroup size because it was still set to 5 from the last time you used this dialog box. Minitab remembers the dialog box settings from the last time a dialog box was used in a session. Supplier 2s Xbar-R chart reveals problems. From the chart, you can see that two points are above the upper control limit.
25
The R chart does not indicate that the process is out of control. However, you notice that the center line is at 3.72, which is almost three times larger than Supplier 1s Rbar of 1.36. As team leader, you recommend that longer production runs be accepted from Supplier 1 until Supplier 2 can demonstrate that camshaft production is in control. You will work with Supplier 2 to reduce process variability to an acceptable level. Because of the statistical evidence to support your position, your recommendation is implemented.
Minitab produces a histogram with normal curve again, based on your earlier selections. You are satisfied by the bell shape of the distribution, and you do not see multiple modes or peaks as you did earlier. You are ready to proceed.
Next, enter the specification limits. 4 5 6 7 In Lower spec, type 598. In Upper spec, type 602. Verify dialog box. Click Options. In Target (adds Cpm to table), type 600. Verify dialog box. 26
From the graph you can see that the process mean falls short of the target and the process distribution mean lies to the left of the target. Also, the left tail of the distribution falls outside the lower specification limit. Therefore, some camshafts will not meet the lower specification of 598 mm. The Capability Analysis command also produces a table of statistics. The Cpk index indicates whether the process will produce units within the tolerance limits. A Cpk index of 1 means that a process is exactly capable of meeting specifications, while less than 1 means that the process is not meeting specification limits. Ideally, you would like to see a Cpk much larger than 1, because the larger the index, the more capable the process. The Cpk index for Supplier 1 is only 0.90, indicating that they need to improve by reducing variability and by centering the process around the target. Since Supplier 1 is currently your best supplier of camshafts, you will work with them to improve their process and, therefore, your own. Minitab offers analysis of variance (ANOVA), regression, design of experiments (DOE), and many other statistical tools that you will use to continuously improve your processes.
design a factorial experiment to tell which factors are important to the reaction fit a full model to the data use several simple graphical methods to help determine which effects are active (important) or inactive fit a reduced model to the data, and then assess the adequacy of the model Time required About 30 minutes.
Since you have three factors that are of interest, you can see in the table that you have two options. You can choose a fractional factorial design of resolution III with 4 runs, or a full factorial design with 8 runs
A two-level design with three factors has 2**3 (or eight) possible factor combinations. By choosing a design with all possible combinations, called a full factorial design, you will get results that show effects free from confounding, that is, all effects are distinguishable from other effects. However, you may also be able to obtain meaningful results by doing fewer runs or combinations. Designs that use less than all possible combinations are called fractional factorial designs. You decide that the full factorial design with 3 factors and 8 runs is more appropriate than the fractional factorial design. At your chemical plant, runs that manipulate the factors of interesttime, temperature, and type of catalyst are not expensive or time-consuming. Also, the experiment can be performed at a non-peak period without disturbing the workflow at the plant. If the runs of the experiment were costly or difficult to perform, you may have made a different decision. 28
3 4 5 6
Click OK. You are now back in the main dialog box. Choose 2-level factorial (default generators). In Number of factors, choose 3. Click Designs.
The box at the top shows all available designs for the design type and number of factors you selected. 7 8 In the Designs box, select Full factorial. In Number of replicates, choose 2. Verify dialog box. 9 Click OK. This selects the design and brings you back to the main dialog box. Notice that the remaining buttons are now enabled.
Verify dialog box. 3 Click OK. This brings you back to the main dialog box.
More If you have a design that includes center points and you have both numeric and text factors, you need to be aware that there really is no true center to the design. In this case, center points are called pseudo-center points See Help or Chapter 19, Factorial Designs, in Minitab Users Guide 2 for a discussion of pseudo-center points.
29
Entering a base for the random data generator allows you to control the randomization so that you obtain the same pattern every time. This way you will get the same design order that is used in this sample session. 3 Make sure Store design in worksheet is checked. Verify dialog box. Click OK. 4 You are now back in the main dialog box. Click OK. This will generate the design and store the design in the worksheet. Tip It is usually a good idea to randomize the run order. Randomizing the order of the runs lessens the effects of other factors that are not included in the study, particularly effects that are time-dependent.
The Data window should now look like this: Notice the columns named StdOrder (C1) and RunOrder (C2). Every time you create a design, Minitab reserves C1 and C2 to store the standard order and run order. StdOrder shows what the order of the runs in the experiment would be if the experiment were done in standard, or Yates, order. RunOrder shows what the order of the runs in the experiment would be if the experiment was run in random order. If you do not randomize a design, the standard order and run order are the same. In addition, Minitab stores the center point indicators in C3 and the block numbers in C4. Since you did not add center points or block the design, Minitab sets all the values in C3 and C4 to one. Next in the worksheet are the factor columns, beginning with C5. In this example, the factors are in C5 through C7. Since you entered the factor levels in the Factors subdialog box, you see the actual levels in the worksheet. More You can use Stat > DOE > Display design to switch back and forth between a random and standard order display, and between a coded and uncoded display in the worksheet. There are two ways to change the factor settings or names: use Stat > DOE > Modify Design, or type new factor names directly in the Data window. 30
You can use both the Session window output and the two effects plots to help you decide which effects are important to your process. First, you will look at the Session window output. You fit the full model, which includes the three main effects, three two-way interactions, and one three-way interaction. Use the values in the P column of the Estimated Effects and Coefficients table to determine which of the effects are significant. Using a = 0.05, the main effects for Pressure and Catalyst, and the Pressure*Catalyst interaction are significant; that is, their p-values are less than 0.05.
You will fit a model that includes Pressure, Catalyst, and the Pressure*Catalyst interaction. 1 2 3 Choose Stat > DOE > Factorial > Analyze Factorial Design. Click Terms. Set up the model you want to fit.
From Include terms in the model up through order, choose 2. Notice this moves ABC to the Available Terms box. Click on A:Temp in the Selected Terms box, then click . This will move the A:Temp variable to the Available Terms list box. Repeat these actions to move the AB and AC interactions to the Available Terms box. 4 Verify dialog box. Click OK. You are now back in the main dialog box. 5 Click Graphs. Uncheck Normal and Pareto. 6 Check Histogram, Normal plot, Residuals versus fits, and Residuals versus order. Verify dialog box. Click OK and return to the main dialog box. 7 Click OK in the Analyze Factorial Design dialog box. The output will display in the Session window and the residual plots will display in Graph windows. Was your choice of active effects a good one? Is your model valid? You will try to answer these questions next when you evaluate the reduced model.
33
Click on B:Pressure in the Available box, then click on the single arrow that points to the right. This will move the B:Pressure variable to the Selected box. Repeat these actions to move C:Catalyst to the Selected box. Verify dialog box. Click OK. 5 6 Check Interaction and click Setup. Repeat steps 3 and 4. Verify Interactions dialog box. 9 Click OK in the main Factorial Plots dialog box to display each plot in a separate Graph window.
34
To calculate main effects, Minitab subtracts the mean response at the low or first level of the factor from the mean response at the high or second level of the factor. The table below summarizes the findings: Factor Size of Effect Interpretation Pressure +14.13 runs at 4 atmospheres of pressure had higher yields than runs at 1 atmosphere of pressure Catalyst 30.37 runs that used catalyst A had higher yields than runs that used catalyst B If you have no interactions between the factors, this graph will adequately describe where you can get the biggest payoff for changes in your factors. The next step, then, is to look at the significant interaction. Although you have already verified a significant interaction with the Session window output, you can look at the interaction plot to see how big this effect is. 2 To make the interaction plot the active window, choose Window > Interaction Plot for Yield. Verify Interactions Plot. An interaction plot shows the impact that changing the settings of one factor has on another factor. Because an interaction can magnify or diminish main effects, evaluating interactions is extremely important. The significant interaction between pressure and catalyst shows up as two lines with sharply differing slopes. The yields for catalyst A are greater than yields for catalyst B at both 1 and 4 atmospheres of pressure. However, you can see that the difference in yields between runs using catalyst A and runs using catalyst B at 4 atmospheres is much greater than the difference in yields between runs using catalyst A and runs using catalyst B at 1 atmosphere. In order to get the highest yield for your experiment, your results suggest that you should set pressure to 4 atmospheres and use catalyst A.
2 You ran the experiment and entered the responses. 3 You fit the full model to look at some numerical values and generated two effects plots to see which terms seemed to be active. 4 You screened out unimportant effects, then fit a reduced model. 5 You generated main effects and interactions plots with the Factorial Plots command to visualize the effects. 6 You evaluated the reduced model with the p-values in the Analyze Factorial Design output and the various residuals plot. You could have used additional analysis techniques in Minitab as well. Lets summarize what you have learned: From looking at the effects plots, you determined that pressure, type of catalyst, and the interaction between pressure and catalyst were active. Evaluating interactions is extremely important, because an interaction can magnify or cancel out main effects. You can eliminate (screen out) the other terms without significantly affecting predictions. Now that you have a model to predict the yield, you can apply this model to help obtain higher yields in future experiments. In order to get the highest yield for your experiment, your results suggest that you should: set pressure to 4 atmospheres use Catalyst A evaluate higher levels of pressure with future experiments
36