Guide To Using Minitab
Guide To Using Minitab
Minitab Step-By-Step
A guide to the following subjects:
1. Obtaining a Simple Random Sample 2. Frequency of Relative Frequency Distributions from Raw Data 3. Bar Graphs from Summarized Data 4. Bar Graphs from Raw Data 5. Pie Chart from Raw or Summarized Data 6. Histograms 7. Stem-and-Leaf Plots 8. Dot Plots 9. Determining the Mean and Median 10. Drawing Boxplots Using Technology 11. Determining Quartiles 12. Scatter Diagrams 13. Correlation Coefficient 14. Determining the Least-Squares Regression Line 15. The Coefficient and Determination, 16. Residual Plots 17. Simulation 18. Computing P(x) as Binomial Probabilities 19. Computing P(X x) as Binomial Probabilities 20. Computing P(x) as a Poisson Probabilities 21. Computing P(X x) as a Poisson Probabilities 22. Finding Areas under the Standard Normal Curve 23. Finding z-Scores Corresponding to an Area 24. Finding Areas under the Normal Curve 25. Finding Normal Values Corresponding to an Area 26. Normal Probability Plots 27. Confidence Intervals about , Known 28. Confidence Intervals about , Unknown 29. Confidence Intervals about p 30. Confidence Intervals about 31. Hypothesis Tests Regarding , Known 32. Hypothesis Tests Regarding , Unknown 33. Hypothesis Tests Regarding a Population Proportion 34. Hypothesis Tests Regarding a Population Standard Deviation
5. NOTE2: Sometimes students have played with Minitab for a while and have checked or unchecked one of the options in whatever they were doing. If they cannot undo that and now the histogram, box-and-whiskers display, scatterplot, etc. comes up messy every time, the best thing to do is to save everything, close Minitab and open it again. It should come back to normal again. 6. HINT: Books for Data Analysis have a Technology Step-By-Step guide at the end of some sections on how to use the technology tools including Minitab. If you are working with a student ask him/ her for a book to look up the proper procedure(s). The book for Engineering Statistics (1016-314) has no guide to Minitab except for occasional printouts used in examples. Students taking Probability and Statistics for Engineers (1016-345) are not required to complete Minitab projects. Also some of the statistics books contain the tables and formulas with explanation of notations either on the inside covers of the book or in one of the appendixes. So even if you are not familiar with a particular symbol you may know very well the concept behind it. When you work with a student on his/her project leave any interpretation up to the student.
II.
WORKING IN MINITAB
1. CREATING A SET OF DATA IN MINITAB a. From previously created set: (Example: rolling a fair six-sided die) i. Create the set to sample from: Calc > Make Patterned Data > Simple Set of Numbers Store it in C1, from first value (choose a number) to last value (choose a number) in steps of 1 ii. Sample from (created) set: Calc > Random Data > Sample From Columns Number of rows: . (whatever number of data pieces you need) From sample C1 Store data in C2 (do it again with C1 and see what happens). If the data set that you need is larger than your original set that you are sampling from, make sure to check the with replacement box). You can experiment with different types of data (check how much you remember from statistics). iii. Rank sort data: To sort data: Data > Sort To rank date: Data> Rank b. Or you can always type the collected set of data into Minitab. (Make sure that the data is in numeric format) 2. FREQUENCY DISTRIBUTION Stat > Tables > Tally Individual Variables. Select the appropriate column into the Variables: box. Choose one of the options from the results that can be displayed. Counts- (i.e. frequency), -Percents- (i.e. relative frequency) > OK Minitab does not have any option to create a grouped data frequency table. The closest thing to it is a histogram.
3. DESCRIPTIVE STATISTICS a. To be displayed in a session window: Stat> Basic Statistics> Display Descriptive Statistics> check columns to calculate the descriptive statistics from>Statistics> check whatever measures are needed. b. All required statistics to be shown as single entries in cells of a worksheet: Stat> Basic Statistics> Store Descriptive Statistics> check columns to calculate the descriptive statistics from>Statistics> check whatever measures are needed. c. To graph a histogram, test for normality and display all statistics: Stat> Basic Statistics> Graph Descriptive Statistics> OK 4. INFERENTIAL STATISTICS a. Confidence Intervals: i. 1-Sample Z: Stat> Basic Statistics> 1-sample Z > check the alpha level in options. ii. 1-Sample t: Stat> Basic Statistics> 1-sample t > check the alpha level in options. b. Hypothesis Testing: i. 1-Sample Z: Stat> Basic Statistics> 1-sample Z > check the alpha level and alternative hypothesis in options ii. 1-Sample t: Stat> Basic Statistics> 1-sample t> check the alpha level and alternative hypothesis in options 5. REGRESSION AND CORRELATION a. Correlation coefficient: Stat > Basic Statistics >Correlation > choose columns - make sure that you choose a different column for each type of data. b. Regression Equation Stat > Regression > Regression. Select output column for the Response and input column for the Predictors > OK. c. Scatter diagram Graph>Scatterplot> choose simple or with regression > OK > Enter the Y-variables and X variables > OK d. Fitted line plot (Scatter diagram with Regression Lin, Equation and Correlation Coefficient displayed) Stat>Linear Regression>Fitted Line Plot Choose columns for x and y according to the scatterplot you have chosen > OK.
6. HOW TO CREATE AND EDIT A GRAPH IN MINITAB a. To graph a histogram from a row data: Graph > Histogram > Simple or Simple with Fit > Graph variable - choose column with the row data > OK b. To graph a histogram from a data organized in a frequency table: Graph > Histogram > Simple or Simple with Fit > Graph variable - choose column with grouped data > Data Options > Frequency choose the column with the frequencies > OK c. To graph a cumulative histogram: Graph > Histogram > Simple or Simple with Fit > Graph variable - choose column with the row data > Scale> Y-scale Type - check Accumulate values across bins box > ok d. To edit a histogram: Do not worry about labeling everything appropriately before you create a histogram. You can always do it on the existing histogram by double clicking on the appropriate area or by clicking on the right button. To change the number of intervals: Double click on the histogram (make sure that you are clicking the whole graph not a single bar)> Binning > Select the appropriate midpoint cutpoint, in interval definition enter the midpoint/cutpoint values separated by space. Or Double click on the histogram>Binning > Number of intervals type the number of intervals e. To graph and edit a pie chart from a frequency table: Graph > Pie Chart > check the box: Chart values from a table> enter the categorical data, enter the summary variables> OK f. To graph and edit a box-and whiskers-display from a frequency table: Graph > Box plot > Simple> Choose a column for: Graph variables> OK (this is usually graphed from the row data not grouped data)
III.
TYPES OF GRAPHS 1. To assess relationships between pairs of variables: a. Scatterplot shows relationship between two variables b. Matrix Plot shows the relationships between many pairs of variables at once. c. Marginal Plot Similar to scatterplot, but adds a histogram or boxplot of each variable in the margins of the graph. 2. To assess distributions: a. Histogram displays the shape and central tendency of data. b. Dotplot similar to a histogram but more useful with small amounts of data. c. Steam-and-Leaf - displays actual data values in binned format. d. Probability Plot Displays how well your data follow a specific distribution. e. Empirical CDF similar to probability plot but its scales are always linear. f. Boxplot (Box-And-Whiskers display) compares sample distribution characteristics such as median, range and symmetry, and identifies outliers. 3. To compare summaries or individual values of a variable a. Boxplot (Box-And-Whiskers display) compares sample distribution characteristics and screens outliers. b. Interval Plot compares means and confidence intervals. c. Individual Value Plot assesses and compares individual data values. d. Bar Chart compares a summary statistic, such as the mean, across grouping levels. e. Pie Chart compares the proportion of each group relative to the whole. 4. To assess distributions of counts: a. Bar Chart compares the distribution of counts. b. Pie Chart compares the proportion of each group relative to the whole. 5. To plot a series of data over time: a. Time Series Plot- for data that was collected in equally spaced time intervals and is in chronological order. b. Area Graph shows how the compositions of the sum changes over time with stacked data. c. Scatterplot for data that is collected at irregular intervals or not in chronological order in the worksheet.
5. The reason we generate 10 rows of data (instead of 5) is in case any of the random numbers repeat. Select OK, and the random numbers will appear in column 1 (C1) in the spreadsheet.
6. Histograms
1. Enter the raw data in C1. 2. Select the Graph menu and highlight Histogram . . . 3. Highlight the simple icon and press OK.
4. Put the cursor in the Graph Variables box. Highlight C1 and press Select. Click SCALE and select the Y-Scale Type tab. For a frequency histogram, click the frequency radio button. For a frequency histogram, click the frequency radio button. For a relative frequency histogram, click the percent radio button. Click OK twice. Note: To adjust the class width and to change the labels on the horizontal axis to the lower class limit, double-click inside one of the bars in the histogram. Select the binning tab in the window that opens. Click the cut point button and the midpoint/cutpoint positions radio button. In the midpoint/cutpoint box, enter the lower class limits of each class. Click OK.
7. Stem-and-Leaf Plots
1. With the raw data entered in C1, select the Graph menu and highlight Stem-and-Leaf. Select the data in C1 and press OK 2. Select the data in C1 and press OK.
8. Dot Plots
1. 2. 3. 4. Enter the raw data in C1. Select the Graph menu and highlight Dotplot. Highlight the simple icon and press OK. Put the cursor in the Graph variables box. Highlight C1 and press Select. Click OK.
11.Determining Quartiles
Follow the same steps given to compute the mean and median from raw data. (Section 3.1)
12.Scatter Diagrams
1. Enter the explanatory variable in C1 and the response variable in C2. You may want to nae the variables. 2. Select the Graph menu and highlight Scatterplot . . . . 3. Highlight the Simple icon and click OK.
4. With the cursor in the Y column, select the response variable. With the cursor in the X column, select the explanatory variable. Click OK
13.Correlation Coefficient
1. With the explanatory variable in C1 and the response variable in C2, select the Stat menu and highlight Basic Statistics. Highlight Correlation. 2. Select the variables whose correlation you wish to determine and click OK.
16.Residual Plots
Follow the same steps as those used to obtain the regression output (Section 4.2). Before selecting OK, click GRAPHS. In the cell that says Residuals versus the variables, enter the name of the explanatory variable. Click OK
17.Simulation
1. Set the seed by selecting the Calc menu and highlighting Set Base . . . Insert any seed you with into the cell and click OK. 2. Select the Calc menu, highlight Random Data, and then highlight Integer. To simulate rolling a single die 100 times, fill in the windows shown in Figure 4 on page 267. 3. Select the Stat menu, highlight Tables, and then highlight Tally . . . Enter C1 into the variables cell. Make sure that the Counts box is checked and click OK.
3. In the Graph variables cell, enter the column that contains the raw data. Make sure Distribution is set to Normal. Click OK.
Known
1. Enter raw data in column C1 if necessary. 2. Select the Stat menu, highlight Basic Statistics, and then highlight 1-Sample Z . . . 3. Click Options. In the cell marked Alternative, select the appropriate direction for the alternative hypothesis. Click OK.
4. If you have raw data, enter C1 in the cell marked Samples in columns: If you have summary statistics, select the Summarized Data radio button. Enter the value of the sample size and the sample mean. Enter the population standard deviation. Check the box for perform hypothesis test and enter and the hypothesized mean (this is the value of the mean stated in the null hypothesis). Click OK.
Unknown
1. Enter raw data in column C1. 2. Select the Stat menu, highlight Basic Statistics, then highlight 1-Sample t . . . 3. If you have raw data, enter C1 in the cell marked Samples in columns.: If you have summary statistics, select the summarized data radio button. Enter the value of the sample size, sample mean, and sample standard deviation. Check the box for Perform hypothesis test and enter the hypothesized mean (this is the value of the mean stated in the null hypothesis). Click OK.