Measurement Systems Analysis - Appendices
Measurement Systems Analysis - Appendices
Step 11
Be Prepared
Step 44
Type of Study
In This Section:
Continuous Data
1. Setting-up and Randomising the
Spreadsheet in Minitab
4. Supplementary Information on
interpreting the output
Setting up and randomising the worksheet for a Gauge R&R Study for
Variable Data
1. The starting point for setting up the worksheet for a variable data Gauge R&R
study is the same regardless of which randomisation method for the
worksheet is required. To begin go to:
Stat > Quality Tools > Gage Study > Create Gage R&R Study Worksheet
2) Enter the
identity of
the parts to 3) Enter the
be used number of people
in the study
4) Enter the
identities of the
people in the
study
Options
At this stage you must decide which randomisation method to use taking into
consideration the practicalities of running the experiment and the most economical use
of people’s time.
a. Do not randomise: As it states, this option does not randomise the data. This
option will sequence the parts then the people for each part and provide a run
order column as shown below.
b. Randomise all runs: This will completely randomise the order that the
measurements are taken in as shown below. This is useful to prevent the
appraisers from memorising their previous measurements and also to reduce the
impact of time related factors. It does however require all of the appraisers to be
present at once which can be impractical in many situations such as where
different shifts are worked.
Following data collection and analysis the standard order can be used to re-sort the
recorded data so that the pattern of collection may give an insight into what happened.
This should only be done if the measurement system analysis study is not clearly
acceptable and in this case can be useful in identifying combinations which were
awkward for the appraisers.
On completion of Note 4, Minitab returns you to previous dialogue box (Create Gage
R&R Worksheet) then press OK on this screen.
Minitab will now generate the worksheet, you will need to add in your data column and
collect the data before running the study.
1) Areas of the worksheet have been previously used OR the wrong sort of data
has been entered resulting in the column being in the wrong data format
The wrong type of data format for columns then has the effect of hiding columns that
are expected to be numeric (or vice versa) when conducting an MSA study.
The most common occurrences of this is when a space is added somewhere in the
column or when the letter O is used instead of 0 (zero). In both cases, even if the
typing error is rectified this will change the format of the column from numeric to text
format.
Text format columns can be identified by the addition of a ‘T’ to the column number as
in the example above.
2) The second issue is that of human mistakes when entering the data.
To guard against this, the facilitator of the measurement study must control the
study to maintain the concentration, time, speed and discipline required to
type/record each data point.
In addition to this, it is possible to assist the person entering the data to select the
correct cell by highlighting the line (descriptions and details) of the active entry.
The example shown has used the option ‘randomise runs within operators’ with the
next entry being from appraiser 2 for part identity 9.
It can also be very beneficial to record comments when entries are typed. This
additional information can be useful for analysis where the Measurement System is
not acceptable and further investigation is needed.
A dialogue box will appear. Enter the data into the fields as shown below:
Click on
“Do not display percentage contribution” & “Do not display percentage study
variation”.
1.75
100
1.50
0
Gage R&R Repeat Reprod Part-to-Part 1 2 3 4 5 6 7 8 9 10
Parts
R Chart by Appraiser
Measurement by Appraiser
1 2 3
Sample Range
2.00
0.10
UCL=0.0789
0.05 _ 1.75
R=0.0307
0.00 LCL=0
1.50
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10
Parts 1 2 3
Appraiser
Xbar Chart by Appraiser
1 2 3 Parts * Appraiser Interaction
2.00
Sample Mean
_
_ 2.00 Appraiser
UCL=1.8504
X=1.819 1
LCL=1.7876
Average
1.75 2
1.75 3
1.50
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10
1.50
Parts 1 2 3 4 5 6 7 8 9 10
Parts
In the following appendix the results and interpretation is explained for each of the six
graphs in the Minitab Graphical Output. Please note that all of the statistical analysis and
graphs should be considered before making conclusions for the study and for the potential
actions required. Note also that ‘no action required’ is a possible and valid conclusion.
The first graph to look at is the “Components of Variation” on the top right
of the graphical output.
This graph shows where most of the variation in the study came from.
The Gauge R&R column shows the % Tolerance taken up by the
measurement system variation. Remember this was 76.99%
If the Part-to-Part columns are high (or very high compared to the others)
Components of Variation
this tells us that most of the variation in the study was due to the fact that
the parts being measured were not identical (which we would expect).
If the Repeat columns are high compared to the others, this indicates that
there is a problem with Repeatability (i.e. one or more of the appraisers is
inconsistent with themselves). The remaining graphs will help us
investigate this further.
If the Reprod columns are high compared to the others, this indicates that
there is a problem with Reproducibility (i.e. some of the appraisers are
inconsistent with each other). The remaining graphs will help us
investigate this further.
The Gauge R&R column is the variation component total for Repeat and
Reprod.
Graphical Data
However, it is unlikely that they will all be zero, therefore we use this
chart to help us identify any measurements of concern.
We interpret this graph by saying that any point which is above the
upper red line is worth investigating, as this indicates that the range of
results for that appraiser and part was higher than expected.
Next Julie and the team look at the Xbar Chart by Appraiser graph –
this is the graph from the case study.
The 2 red lines on the chart are control limits. We would expect all at
least 50% of the points to lie outside the control limits (red lines) on
this chart. This is different and the opposite to the conventional use of
SPC charts.
Julie and the team note the results of this graph and ask each
appraiser what they did to identify differences then move to the next
one.
If you are not sure on interpreting control limits on an Xbar chart then ask a
local Black Belt to help you choose the most appropriate type of MSA.
Next Julie and the team look at the Measure by Part graph – this is the graph from the
case study.
This graph shows circles for all measured values of each part,
together with the average values for each part (shown by ‘crossed
circles’.)
We interpret this graph by saying that any part for which there is a
noticeably larger spread in the results, might be worth investigating.
In this case parts 10, 8, 4 & 5 appear to have greater variation than
the rest of the parts. The team need to consider why these parts were
more difficult to measure? Julie and the team note this finding and
move on to the next graph.
We would also expect the spread of results (boxes and whiskers) for
all 3 appraisers to be the same (however, unlike the previous graph,
we wouldn’t necessarily expect the spread to be small, as the results
for all of the parts are shown against each appraiser).
We interpret this graph by saying that if, for any appraiser, there is a
noticeably larger spread in results, or the average value is noticeably
different from the others, this might be worth investigating.
Again Julie and the team note the findings and move onto the final
graph.
Julie and the team agree that this graph confirms some of their
thoughts from the previous graphs. They are now ready to summarise
their findings.
1. To begin go to Stat > Quality Tools > Gage Study > Gage Run Chart
This is a plot of all of the observations by operator (denoted by different colours) and
by part number (each box numbered 1 – 10 represents one of the 10 parts).
The plot allows you to see if any patterns are evident in the data. For instance, you
might see that one operator consistently measures higher than the others or that the
measurements on certain parts vary more when compared to other parts.
Here for example you can see that for part 4, appraiser 1 (in black) has measured
higher than the other two appraisers. You can also see for part 10 noticeable
difference between the measurements of the three appraisers.
Looking at the repeatability within appraisers, for parts 2 and 8 it can be seen that
appraiser 2 (in red) noticeably differs in their three measurements indicating a
repeatability problem.
30 | © 2013 Rolls-Royce plc
Continuous Data
Q: “We only have 2 operators using this gauge, shall I take part in the study to
make the numbers up?”
A: “Not unless the operator is trained to use the gauge and familiar with the component
being measured. Having an untrained operator may result in the failure of the study due
to the stability and/or reproducibility of the measurements due to the untrained operator.
It would be far better to compromise on the amount of measurement readings than to
perform a study that is not representative of the way the process works”
Q: “We can’t get parts that represent the full process variation as the only ones
we have are from the same batch. What should we do?”
A: “If the sample does not represent the true process variation then this will affect the
results of gauge R&R against study variance, the number of distinct categories and the
limits on the X bar chart on the R&R output.
If this is the case 2 options exist. Either proceed and study the % Gauge R&R against
tolerance.”
Q: “Our gauge measures hundreds of features, do I have to run gauge R&R study
on each of them or can I use read-across methodology?”
A: “The Quality Management System requires that all product features/characteristics
are measured using capable measurement systems. That said there are situations
where similar features measured with the same gauge presents an opportunity to
demonstrate capability without direct study of every single feature as long as this is
done robustly. This must however be done in a robust and traceable way. Read across
is not permitted on CMM equipment, not because the CMM’s tend to be incapable but
because of the risk of program errors due to the manual nature of the program
creation.”
© 2013 Rolls-Royce plc | 31
MSA How-to Guide
Continuous Data
Q: “I have a surface finish gauge, and it keeps failing R&R. What do I do?”
A:”Some gauges are notoriously difficult to perform gauge R&R on. Some gauges are
the best available for a given measurement. In this situation contact a measurement
practitioner or Metrologist.”
Q: “My gauge R&R against tolerance is very good but I only get 1 distinct
category. Is my measurement process good or not?”
A: “It is likely that the parts selected for the study are not representative of the total
process variation. This will affect the %R&R against study variance, the number of
distinct categories and the X bar chart. If the parts are representative then the
measurement process is not adequate for the application of SPC analysis as the
majority of the variation seen will be from the measurement system and not the
underlying process.”
Q: “How many decimal places on my gauge shall I use when conducting the
gauge study?”
A: “As a minimum you should ensure that the study represents the requirements on the
part drawing but the more the better. For instance if the drawing requires measurement
to 2 decimal places, run the gauge R&R to 3.”
Q: “My gauge passes the gauge R&R study. Is this all I need to consider?”
A: “No – gauge R&R will not highlight gauge bias. For instance it is possible to be
repeatibly wrong. Comparison with a known standard will enable you to study the
amount of gauge bias. Calibration is not done on production parts, real parts can
introduce large differences.”
Attribute Data
6. Setting-up and Randomising the
Spreadsheet in Minitab
9. Supplementary Information on
how to interpret the output
3) Enter the
identity of
the parts to 4) Enter
be used and the
the text number
standard of people
in the
study
5) Enter
the
identities
of the
people in
the study
Options
a. Do not randomise: As it states, this option does not randomise the data. This
option will sequence the parts then the people for each part and provide a run
order column as shown below.
b. Randomise all runs: This will completely randomise the order that the
measurements are taken in as shown below. This is useful to prevent the
appraisers from memorising their previous measurements and also to reduce the
impact of time related factors. It does however require all of the appraisers to be
present at once which can be impractical in many situations such as where
different shifts are worked.
Following data collection and analysis the standard order can be used to re-sort the
recorded data so that the pattern of collection may give an insight into what happened.
This should only be done if the measurement system analysis study is not clearly
acceptable and in this case can be useful in identifying combinations which were
awkward for the appraisers.
1) Areas of the worksheet have been previously used OR the wrong sort of data
has been entered resulting in the column being in the wrong data format
The wrong type of data format for columns then has the effect of hiding columns that
are expected to be numeric (or vice versa) when conducting an MSA study.
The most common occurrences of this is when a space is added somewhere in the
column or when the letter O is used instead of 0 (zero). In both cases, even if the
typing error is rectified this will change the format of the column from numeric to text
format.
Text format columns can be identified by the addition of a ‘T’ to the column number as
in the example above.
2) The second issue is that of human mistakes when entering the data.
To guard against this, the facilitator of the measurement study must control the
study to maintain the concentration, time, speed and discipline required to
type/record each data point.
In addition to this, it is possible to assist the person entering the data to select the
correct cell by highlighting the line (descriptions and details) of the active entry.
The example shown has used the option ‘randomise runs within operators’ with the
next entry being from appraiser 2 for part identity 9.
It can also be very beneficial to record comments when entries are typed. This
additional information can be useful for analysis where the Measurement System is
not acceptable and further investigation is needed.
To run an Attribute Measurement System Analysis (MSA) then use the menu commands:
Stat>
Quality Tools>
Attribute Agreement Analysis
A dialogue box will appear. Enter the data into the fields as shown below:
The information dialogue will appear, so that you can complete the details as
appropriate.
Once completed, click OK button to close the information screen, then click OK again
to close on main window.
In addition to the ‘How to guide: Measurement System Analysis’ the following pages
give supplementary information on some of the statistical concepts to deepen your
understanding of how to fully interpret the output
Interpretation
of graphical
output
The ‘blue’ dots have been explained as the actual proportion of agreement within the
sample of parts appraised in the study. Also shown are the ‘red’ lines which end in a
cross. These indicate the confidence intervals for each Appraiser and for each
Appraiser vs. Standard.
The actual numbers for each confidence interval are recorded in the session window
and used in plotting this graph.
These confidence intervals take into account the fact that the actual agreement %
calculated (as represented by the blue dot) is based only on a relatively small sample
of data. If it was possible to know the ‘true’ agreement % of the appraiser (based on
every part they ever inspected) then this % would be likely to be different from the %
seen in the study sample. The confidence interval indicates the possible range of
values that the ‘true’ % agreement could be. Minitab defaults to a confidence level of
95%. This means that we can interpret the confidence interval for Appraiser 1 for
example as saying “I am 95% confident that the true % of within appraiser agreement
for appraiser 1 is between 62.1% and 96.8% (which is the range indicated by the blue
crosses and red line).
© 2013 Rolls-Royce plc | 49
MSA How-to Guide
Attribute Data
Appendix 9: Supplementary Information on interpreting the Output of Attribute
Agreement Analysis
Each ‘red’ line and two crosses should be as short as possible indicating less potential
error in the ‘blue’ dot point indication, when using 95% confidence (the default level for
most statistical analysis).
These confidence intervals are calculated using the F distribution which also requires
a thing called degrees of freedom. The larger the degrees of freedom, which can be
influenced by using a larger selection of items for inspection (sample size) and the
quantity of matched agreements (such as Pass / Pass and the standard also as Pass)
also contribute to reducing the confidence intervals.
The F distribution is not symmetrical and therefore the confidence intervals can look a
little odd, one (usually the lower) will be longer than the other.
You must however take into consideration that the sample size will significantly
influence this. Where the sample size is small (less than 20 parts for example) the
lower confidence intervals will nearly always fall below 70% even when the average
agreement within the sample is fairly good). In these circumstances you should
discuss with a Black Belt an appropriate sample size to use for the assessment of
critical measurement systems. If the measurement needed is highly critical then
consideration should also be given to whether it is possible to redesign the
measurement system to use variable data rather than attribute and assess the system
using Gauge R&R instead.
Each of the percentage results shown in the session window tables have been
explained in the ‘How to guide: Measurement System Analysis’. This section will
explain the other statistics such as the ‘Kappa’ listed and the P-value. Each of the
tables is constructed with the same format and therefore the ‘Within Appraisers’ table
will be used for these explanations.
Interpretation
of the
numerical
output
Within Appraisers table is shown for a simple Fail or Pass judgement which gives a
Kappa value similar BUT not the same as the percent listing.
Fleiss’ Kappa becomes very useful when a scale or multiple class of judgement is
used which then has a Kappa value indicating agreement for each scale value.
An example of this could be shade judgements, where the ends of the scale, shades A
or B (light), I or J (dark) have very high Kappa values near to 1 and middle shades
such as E or F have lower (approximately 0.778291 say) showing for that person these
were harder to judge as correct. © 2013 Rolls-Royce plc | 51
MSA How-to Guide
Attribute Data
Appendix 9: Supplementary Information on interpreting the Output of Attribute
Agreement Analysis
P-value Judgements
Interpretation
of the
numerical
output
The P-value (which is short for the probability value) is a back-up statistic describing
the chance of having a Fleiss Kappa near 0 (zero).
As the tabular notation shows, the judgement is made about the ‘chance’ (probability)
where the lower the P (vs. > 0) column p-value shows a low number, the less
statistical chance the Kappa values are 0 (zero) or less.
In other words, the p-values of 0.0000 are very good and what should be seen.
Change History