Eee 502 Notes - Maintenance and Maintainability
Eee 502 Notes - Maintenance and Maintainability
1
o Short down
o Breakdown
B. Unplanned
• Emergency
2.2. Factors to consider in designing a maintenance policy
(i) Operational requirements
(ii) Equipment/system characteristics
(iii) Tools and test equipment
(iv) Spares provisioning
(v) Maintenance instructions and manuals
(vi) Logistics
3. TESTING METHODS
3.1. Categories of Testing
3.2. Prototype testing
3.3. Pre-production (qualification) testing
o Environmental
o Reliability
o Maintainability
o Packaging and Transport
o Physical and Electrical Characteristics
o Ergonomic
3.4. Production testing
3.5. Reliability Demonstration and Acceptance
3.6. Accelerated Life Testing
3.7. Relationship Between Testing and Inspection
3.8. Relationship Between Quality and Reliability
3.9. Sampling Plans (Approaches for Testing Small and Large Batch Quantities)
2
1.1. MAINTAINABILITY
In a layman’s language, maintainability is the ease and speed with which a failed item can be
repaired and brought back into service. Perhaps at this stage, we may consider a simple
example in order to illustrate the concept of equipment/system maintainability. Suppose there
are two Television (TV) sets from different manufacturers, one of them has a control knob
outside the back cover to adjust the TV set when pictures are rolling vertically on the screen
while the other TV set has a similar control knob inside the back cover. Now, we take it that a
TV set is faulty (i.e. it has failed) if it has pictures rolling vertically on its screen and it is to be
repaired and brought back into normal service. At this juncture, it is not difficult to be
convinced that the TV set with the external knob will not require opening of the back cover
before reaching the knob and carrying out appropriate adjustment manually. Consequently, it
can be repaired in this case with greater ease and speed than the other TV set with internal
control knob. Therefore, we can say that the TV set with external knob has a higher
maintainability. In a more general case, we must consider many repair factors in an equipment
(or a TV set, as in our example) before we conclude whether or not one equipment has a lower
or higher maintainability than another similar equipment.
However, maintainability has a precise definition which is stated as follows:
Maintainability is the probability that a unit or system will be restored to operational efficiency
within a given period of time when the maintenance action is performed in accordance with
prescribed procedures. From the definition, it is clear that maintainability can be quantified
and can assume a dimensionless numerical figure between 0 and 1, except if expressed in
percentage form. However, it is important that maintainability figure should be kept as high as
possible if optimum use is to be derived from a unit or system. At this stage, it is pertinent to
mention that the achievement of high maintainability is dependent on some factors; such as
type of system location, efficiency and skill of the maintenance crew, and provision of spare
parts etc. Furthermore, we should note from the definition of maintainability the emphasis on
two very important phrases, viz; carrying out ‘maintenance action in accordance with
prescribed procedures’, and within ‘given time’. The simple explanation of these phrases is that
if we decide to put two skillful and efficient men on a repair job instead of one, we should
expect to increase the probability that the job will be completed within “a given time” if they do
the job as technically expected. The aim of maintainability is to design and develop
systems/equipment which can be maintained in the least time, at the least cost and with a
minimum expenditure of supporting resources (such as manpower, spare parts, tools and test
equipment, etc) without adversely affecting the equipment performance or safety
characteristics.
3
The mean time anticipated for these operations is obtained and the MTTR is evaluated by
using the relationship:
∑ 𝑛𝑖 𝜆𝑖 𝑡𝑚𝑖
𝑀𝑇𝑇𝑅 = (1.1)
∑ 𝑛𝑖 𝜆 𝑖
A worked example is shown in Fig. 1.1 to further illustrate this point.
∑ 𝑛𝑖 𝜆𝑖 𝑡𝑚 = 0.689595
∑ 𝑛𝑖 𝜆𝑖 𝑡𝑚 0.689595
𝑀𝑇𝑇𝑅 = = = 0.235 ℎ𝑜𝑢𝑟𝑠
∑ 𝑛𝑖 𝜆 𝑖 2.935
(iii) Maintenance action (repair) rate: (𝜇) is the number of maintenance actions that can be
carried out on a particular item per hour or average number of items (equipment) which can be
restored to normal working condition per hour, assuming that there is no waiting between
repair jobs. Maintenance action rate (u) is the reciprocal of the MTTR,
1
𝜇= (1.2)
𝑀𝑇𝑇𝑅
(iv) Maintenance time constraint: (tm) is the permissible repair time.
(v) Maintainability equation: The probability of completing one or more maintenance actions is
expressed through the maintainability equation as:
−𝑡𝑚
𝑀 = 1 − 𝑒 −𝜇𝑡𝑛 = 1 − exp ( ) (1.3)
𝑀𝑇𝑇𝑅
Example 1.1
The average time to repair any fault in a given system is 1 hour. Determine maintainability of
the system for a time of 21/2 hours.
4
Solution
From equation (1.3)
𝑀 = 1 − 𝑒 −2.5/1
= 1 − 0.082 = 0.918
Comment: The probability, M of the system being returned to a working condition within
21/2 hours is 0.918 (or 91.8%).
Note:
From the foregoing, it is clear that the prediction of maintainability involves establishing a value
for the MTTR of a system. Although an accurate assessment of MTTR is a difficult task, however
a designer can aim at a low rate value of MTTR by considering such factor as accessibility of
components, built-in fault units etc.
(vi) Utilization factor (U): Utilization factor of a unit or system is defined as the ratio of the
operating time (top) to the sum of the maintenance time, (tm), idle time (tid), (which may
occur between completion of maintenance and use due to administrative reasons) and the
operating time. Mathematically, the utilization factor, U can be expressed as
𝑡𝑜𝑝
𝑈= (1.4)
𝑡𝑚 + 𝑡𝑖𝑑 + 𝑡𝑜𝑝
(vii) Repairability: Repairability is the probability that a unit or system will be restored to
operational efficiency within a given ‘active repair time’, if specified maintenance is
performed on it. Meanwhile, it should be noted that active repair time is the time during
which technicians are actually working on repairs, adjustment etc.
(viii) Maintenance action time: Maintenance action may consume time due to the following
activities which are associated with it:
(a) Reporting failure to the appropriate staff;
(b) The maintenance team or a maintenance technician collects the repair tools and possibly
some spares he guesses may be needed;
(c) Sending for additional spares, replacement, etc, if the need arises;
(d) Trying to verify the precise way in which the item fails to function and consequently
deciding to disassemble some parts prior to the proper commencement of fault
diagnosis;
(e) Locating the actual fault;
(f) Carrying out repairs or replacing a faulty unit with a good one and making any
adjustments, if necessary, and
(g) Testing the item after repairs to ensure satisfactory performance.
NOTE
From the foregoing it is clear that the ‘maintenance action time’ is the sum total of the times
spent on all the activities listed (a – g) above.
(ix) Active repair time: this is the time during which actual repair work; adjustments etc are
carried out by the maintenance technician(s). This period is covered by activities (d) – (g).
(x) Logistics time: This is part of the total repair time and represents the time taken to obtain
all the repair tools, materials and spares etc, and the movement of repair crew to the failed
item. It is covered by activities (b) and (c).
(xi) Administrative time: This is the time which is not specifically allocated to either active
repair or logistics. This includes the time lost due to the non-availability of maintenance
5
staff, partly as a result of meal breaks (i.e. tea break or lunch time) or because the
maintenance technician needed for the repair job is yet to complete a previous job etc.
This time may be covered by the activities (a) and (e).
(xii) Availability (A): Availability is the probability that an item will perform its required function
at a stated instant of time or over a stated period of time. From Equation (1.4), if the idle
time is equal to zero (i.e. 𝑡𝑖𝑑 = 0) and the maintenance time becomes as small as possible,
then utilization factor will approach its maximum value, and can now be called availability
of a unit or system. Mathematically, this can be expressed as:
𝑡𝑜𝑝
𝐴 = 𝑈𝑚𝑎𝑥 = (1.5)
𝑡𝑚(𝑚𝑖𝑛) + 𝑡𝑜𝑝
On the other hand, we know from our previous knowledge of MTBF and
MTTF that the following assumptions are true:
(i) 𝑡𝑜𝑝 = MTBF (mean time between failures)
(ii) 𝑡𝑚(𝑚𝑖𝑛) = MTTR (mean time to repair)
Using these assumptions in equation (1.5) above,
𝑀𝑇𝐵𝐹
𝐴= (1.6)
𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅
If the availability of equipment is given as 0.90, this means that the equipment is working
satisfactorily for 90 percent of the time, and under repair, for the remaining 10 percent. The
concept of availability is further illustrated through worked examples shown immediately
below.
Example 1.2
An electrical generating set designed for continuous operation fails twice in a period of 123
days. The total time for repairs during the period is 3 days. Determine the following
parameters;
(a) MTBF (days)
(b) MTTR (days)
(c) Availability
Solution
(a) Total operating time = (123 – 3) days = 120 days
Total number of failures = 2
MTBF = 120/2 = 60 days
(b)
𝑡𝑜𝑡𝑎𝑙 𝑡𝑖𝑚𝑒 𝑓𝑜𝑟 𝑟𝑒𝑝𝑎𝑖𝑟𝑠 3
𝑀𝑇𝑇𝑅 = = 𝑑𝑎𝑦
𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑓𝑎𝑖𝑙𝑢𝑟𝑒𝑠 2
(c)
𝑀𝑇𝐵𝐹 60
𝐴𝑣𝑎𝑖𝑙𝑎𝑏𝑖𝑙𝑖𝑡𝑦 = = × 100% = 97.6%
𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅 60 + 3/2
Example 1.3
The ratio between maintenance time and operating time of a complex system is 1:10. Calculate
the ratio between maximum time the system can be allowed to stand idle and the operating
time to achieve a maximum utilization of 0.8.
Solution
6
From equation (1.4)
𝑡𝑜𝑝 1
𝑈= = 0.8 = 𝑡 𝑡𝑖𝑑
𝑡𝑚 + 𝑡𝑖𝑑 + 𝑡𝑜𝑝 𝑚
𝑡𝑜𝑝 + 𝑡𝑜𝑝 + 1
Using the data given,
1
0.8 =
1 𝑡𝑖𝑑
+ +1
10 𝑡𝑜𝑝
𝑡𝑖𝑑
= 0.15 𝑜𝑟 0.15: 1
𝑡𝑜𝑝
1.3. Analysis of System Availability
In this section, we are going to discuss on different types of availability, and how to determine
the availability of a complex system consisting of sub-units connected either in series or parallel
configuration. There are three forms of system availability, namely: steady-state availability,
instantaneous availability, and mission availability.
1.3.1 Steady-state Availability (Ass)
Steady-state availability is the proportion of time that a system is availability for use when the
overall period is of considerable duration. Since availability of a repairable system is a function
of its failure rate, , and of its repair rate, , therefore Equation (1.6) can be put in another
form to read
1
𝑀𝑇𝐵𝐹 𝜇
𝐴𝑠𝑠 = = 𝜆 = (1.7)
𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅 1 1 𝜆 + 𝜇
+𝜇
𝜆
On the other hand, we can express system unavailability as:
Steady-state unavailability,
𝜆
̅̅̅̅̅
𝐴𝑠𝑠 = 1 − 𝐴𝑠𝑠 = (1.8)
𝜆+𝜇
Note
Example 1.2 previously discussed fits the situation of steady-state availability. The reader may
wish to have another look at that example. In the example, the values of MTBF and MTTR
involved are large enough for our purpose.
Fig. 1.2: Simplified block diagram of a missile launching and guidance system.
𝑀𝑇𝐵𝐹 250
𝑢𝑛𝑖𝑡 𝐴 = = = 0.98
𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅 250 + 5
200
𝑆𝑖𝑚𝑖𝑙𝑎𝑟𝑙𝑦, 𝑎𝑣𝑎𝑖𝑙𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑢𝑛𝑖𝑡 𝐵 = 210 = 0.95
650
𝐴𝑣𝑎𝑖𝑙𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑢𝑛𝑖𝑡 𝐶 = 656 = 0.99
800
𝐴𝑣𝑎𝑖𝑙𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑢𝑛𝑖𝑡 𝐷 = 808 = 0.99
Since the units are connected in series, the overall system availability is the product of
individual availability of the units given by, Equation (1.11).
𝑇ℎ𝑒𝑟𝑒𝑓𝑜𝑟𝑒 𝑜𝑣𝑒𝑟𝑎𝑙𝑙 𝑠𝑦𝑠𝑡𝑒𝑚 𝑎𝑣𝑎𝑖𝑙𝑎𝑏𝑖𝑙𝑖𝑡𝑦 = 0.98 × 0.95 × 0.99 × 0.99 = 0.91
9
(ii) If two of the unit C modules are connected in parallel, then then whole system
arrangement will be configured as shown in Fig. 1.3.
A B D
C
Fig. 1.3
Using Equation (1.12) the resultant availability 𝐴𝐶𝑃 of two unit C modules connected in parallel
is given by
𝐴𝐶𝑃 = 1 − (1 − 𝐴𝐶 )2
Where 𝐴𝐶 = availability of one unit C,
From the above calculation, 𝐴𝐶 = 0.99
𝐻𝑒𝑛𝑐𝑒, 𝐴𝐶𝑃 = 1 − (1 − 0.99)2 = 0.999
Using equation (1.12) the overall system availability
= 0.98 × 0.95 × 0.999 × 0.99 = 0.922
̅
(xiii) Unavailability (𝐴): Unavailability of an equipment or system due to breakdown is the
complement of availability.
𝑖. 𝑒. 𝐴̅ = 1 − 𝐴
Multiplying both sides of Equation (1.6) by -1 and adding 1 to both sides of the result, we have
𝑀𝑇𝐵𝐹 𝑀𝑇𝐵𝐹 + 𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅
𝐴̅ = 1 − =
𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅 𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅
𝑀𝑇𝑇𝑅
̅=
𝑇ℎ𝑒𝑟𝑒𝑓𝑜𝑟𝑒 𝑈𝑛𝑎𝑣𝑎𝑖𝑙𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝐴
𝑀𝑇𝐵𝐹 + 𝑀𝑇𝑇𝑅
Planned Maintenance
Planned maintenance is the type of maintenance work organized and carried out with
forethought, control and records. Planned maintenance is not a specific type of maintenance,
but represents the application of maintenance tackled in a scientific manner. This covers the
comprehensive planning of the maintenance function and includes the whole range of
maintenance such as fixed-time maintenance, planned replacement, preventive, operate-to-
failure maintenance and condition-based maintenance, etc.
Fixed-time maintenance is the individual or group replacement or repair of items after the
expiration of a fixed period (e.g. changing the oil filter in a car after 2000 km, changing the
engine oil in an electric generator after, say, 500 running hours) before failure occurs.
Operate-to-failure maintenance involves only such maintenance tasks as normal lubrication and
adjustments of parts but no other predetermined action is taken to prevent failure.
Planned replacement can be very suitably applied in situations when an equipment operates as
an individual unit e.g. machine tools, mobile electric generators etc. Where a policy of planned
replacement is practiced the pivot of the planning is to ensure the money is available for the
purchase of new equipment and the provision of spare parts.
In condition-based maintenance, it is the actual current condition of the equipment/system
which determines the form and frequency of maintenance. Condition monitoring of equipment
can be carried out by using human senses (visual inspection, hearing, touch and smell), portable
test instruments and fixed monitoring systems such as simple remote alarm system,
minicomputers, XY plotters, etc.
12
The fundamental basis of any planned maintenance system hinges on deciding in advance the
following points:
(i) The individual items of equipment to be maintained;
(ii) The form, method and details of the maintenance scheme for each item;
(iii) The tools, test gears, replacement spares and time needed for carrying out a maintenance
job;
(iv) How frequency maintenance operations must be carried out;
(v) The method of administering the system; and
(vi) The method of analyzing the results.
The details required for (ii), (iii) and (iv) are likely to be found in the manufacturer’s manual,
normally supplied with the equipment). In order to apply the above listed points practically the
following elements are required:
(i) An inventory of all equipment to be maintained;
(ii) A complete list containing all the individual tasks that must be carried out on each item of
equipment;
(iii) A calendar of events, indicating the sequence of maintenance which must be carried out;
(iv) A method of monitoring and ensuring that the work listed in the programme is carried out;
and
(v) A method of recording the results and assessing the effectiveness of the overall
maintenance.
The benefits derivable from planned maintenance include the following:
(i) Fewer breakdowns of regular and correct maintenance operations are carried out;
(ii) Appreciable reduction of downtime since spares and equipment demands are known in
advance and are available immediately on request;
(iii) It ensures regular, simple servicing which is cheaper than sudden expensive stop-gap
repairs; and
(iv) It guarantees greater and more effective labour utilization because it is planned ahead and
is made to spread evenly throughout the year, thus reducing widely fluctuating demands
upon the labour force.
13
this clarification, there are four specific types of preventive maintenance, viz: routine
maintenance, preventive replacement, identification of dormant failures, and identification of
degradation conditions.
(i) Routine preventive maintenance
This covers such operation as clearing, lubricating and adjusting at specified intervals of time.
There is the need for care in carrying out this type of maintenance because more failures may
be caused than prevented.
(ii) Preventive Replacement
Preventive replacement is based on the policy of replacing items after a particular period of use
even if they have not failed. In that case, it is imperative to have knowledge of the wear-out
distribution of the particular items concerned in the equipment. Preventive replacement is
characteristic of wastages because many items may be replaced before they actually fail. On
the other hand, the great benefit of avoiding equipment failure can be readily achieved. The
cost of equipment failure and its attendant inconvenience may be far outweighing the cost of
the preventive replacement of certain items.
(iii) Identification of Dormant Failures
Some items fail in an equipment without necessarily causing the equipment itself to fail or
malfunction immediately. However, a failed item in equipment will definitely increase the
stresses on the remaining items and hence accelerate equipment failure. Consequently,
occasional checks for dormant faults are practicable although it may be expensive. A very
useful example is the indicator lamps in a motor car. They are there to signify the occurrence of
a fault, which if not rectified immediately may lead to greater problems. A burnt-out lamp
must be replaced immediately. A careful motorist must always check out for the proper
functioning of the indicators in his/her vehicle.
(iv) Identification of degradation condition:
In systems where extremely high reliability is expected, such as in aircrafts, it is often necessary
to carry out measurements of some parameters for the purpose of identifying parts which have
degraded or drifted and which might be liable to become a failure in the near future. In such
cases, preventive replacement may become inevitable.
14
breakdown maintenance is normally worked out after an equipment/system has been
considered in advance and provision made by planning the method of repair; tools and spares
requirements and possible replacements.
Unplanned Maintenance
Unplanned maintenance is characteristic of maintenance work not initially organized and not
carried out with any forethought, control and records. Whenever a system/equipment fails to
work as intended, then repairs and adjustments must be carried out to bring it back into
service. Since we can never foresee exactly when a particular type of failure will occur we
cannot plan for it in advance.
Unplanned Emergency Maintenance
Unplanned emergency maintenance is the type which necessitates a maintenance which has
not been foreseen and to which no advance thought has been given. This involves a failure that
is thought could not possibly happen. In a situation of emergency maintenance, all available
resources are urgently utilized to restore the equipment to its normal operational standard in
the shortest possible time. For example, a radio broadcast transmitter may suddenly cease to
transmit messages to public listeners probably due to a sudden failure of one of its
components. In this case, the maintenance crew will swing into action immediately in order to
rectify the fault and ensure it is put back into service without much delay. If a standby
transmitter is available it will be put immediately to use while repair work progresses on the
faulty one. Unfortunately, many industries/establishments in developing countries rely on this
type of maintenance, leaving things to chance because of lack of financial resources or foresight
to embark on planned maintenance. This is why many industrial machinery and equipment
pack up, leading occasionally to the closure of the industry itself. That is the bane of industrial
undevelopment.
2.2. Factors to Consider in Deciding a Maintenance Policy
In the preceding section, we have discussed different forms of maintenance, any or a
combination of which may be adopted for an equipment/system maintenance. Therefore it is
pertinent to discuss here those factors which should always be considered in choosing a
maintenance policy. In fact, the choice of a maintenance policy invariably leads to the decision
on the choice of a maintenance method. Such factors will include:
(vii) Operational requirements
(viii) Equipment/system characteristics
(ix) Tools and test equipment
(x) Spares provisioning
(xi) Maintenance instructions and manuals
(xii) Logistics
Now, we shall throw some light on the effect which each factor has on a maintenance
policy and how they assist in achieving good maintainability.
Operational Requirements
By operational requirement we mean the function equipment has to perform and the
conditions under which it operates. It is perhaps easy to see the considerable difference
between maintaining a communication satellite launched and working in outer space several
thousands of kilometers from the earth surface, and a domestic radio receiver used in a living
room. In line with the operational requirement of the satellite, once launched, it is kept
15
working continuously in the outer space for several years without direct contact with human
beings for any maintenance operation. One must therefore be prepared to pay heavily for this
by improving reliability in order to eliminate the possibility of a breakdown and maintenance.
Conversely, a domestic radio receiver is in an environment where spare parts are readily
available, and less reliability will be accepted.
Equipment characteristics
Equipment characteristics encompass such things as electrical and mechanical features which
the equipment must satisfy in line with its operational requirement. The latter includes factors
as reliability complexity, safety precautions, and the degree of heat generation and evacuation,
as in thermionic circuits. The complexity of an electronic circuit is manifested by the number of
components and the interconnections among them. It is easy to see that the greater the
complexity of a circuit, the more difficult it is to isolate faults, and the more tedious a
maintenance operation. However, if the task is difficult then, there is the need for increase in
demand for well-trained maintenance staff, or for other aids to the maintenance task.
Mechanical feature of equipment also affects the maintenance task mainly through
requirements for manual skills, involving dismantling and assembling one thing or the other.
Some of these tasks may be generally delicate since they may have to be carried out in confined
or narrow spaces within equipment.
Tools and Test Equipment
The provision of appropriate tools and test equipment influence the maintainability of
equipment considerably. However, the following points must be considered when specifying
tools and test equipment. These are: the test gear must be easy to set up and operate. It must
also be more reliable than the equipment/system for which it is designed. Finally, it must be
maintainable and available for purchase for as long as the system for which it is made exists. A
great deal can be done to achieve rapid maintenance by the use of built-in test equipment
(BITE) or indicators for immediate visual detection of fault.
Maintenance Personnel
The following points concerning maintenance personnel vis-à-vis their influence on the
maintainability of equipment must be considered:
- Level of training given
- Skill level employed
- Motivation
- Quantity and deployment of maintenance personnel.
Definitely, a complex system may involve a wide range of maintenance and hence a good level
of training will be required for the maintenance personnel. Ability in carrying out corrective
maintenance is achieved by a combination of theoretical knowledge and practical diagnostic
skills. Consequently, a very effective maintenance technician should have a balance of
theoretical knowledge and practical skill commensurate to the complexity and diversity of
equipment or systems for which he is engaged. An ill-motivated maintenance technician, no
matter the level of training given him and skill acquired, may be less productive. Staff
motivation is closely associated with a number of factors such as staff promotion,
accommodation, condition of service etc. In fact, an ill-motivated technician can possibly
sabotage the maintenance effort (policy) of his employer; although if he does that, it will be at
the risk of losing his job. The MTTR may be reduced by deploying two technicians where
16
necessary to a maintenance work instead of one, especially where complicated maintenance
operation is involved.
Maintenance instructions and manuals
The provision of a well-explained and carefully illustrated maintenance manual and instructions
is a very important tool in achieving good maintainability. Maintenance instructions and
manuals, containing a logical sequence of tests are normally used as an aid to fault diagnosis.
They may also contain information on safety hazards and precautions, spare requirements and
replacements, procedure for failure reporting, and details of preventive maintenance.
Spares Provisioning
Based on the information concerning failure rate of some components, provision could be
made for spares. Faulty equipment without a provision for urgently needed spare parts can be
temporarily forgotten altogether! No complete maintenance job can be achieved without
spares. However, for spares provisioning, allowance must be made for damages during
transportation and storage. The main considerations of spares provisioning are: failure rate-
which determines essentially the quantity of spares needed, cost of each spare which also
assists in fixing the quantity of spares to be maintained, and the supply time after ordering. If
the spares can be obtained easily and locally without much delay, there may be no sense in
massive stock-piling of spares.
Logistics
Logistics deals with time and effort involved in transporting equipment, spares and manpower
to the venue where maintenance action is needed. The aspects of what to centralize or
decentralize should be well appreciated particularly in relation to the nature and location of
equipment/system to be maintained. As a guide especially where small MTTR is vital, it is
advisable to decentralize:
(i) Small tools and standard items that may be frequently required for maintenance
operations;
(ii) Spares stocking;
(iii) Fragile test gear.
The best maintenance policy can be obtained as a result of the optimum combination of the
contributing factors discussed earlier. It is important for the technician to realize and
appreciate that the maintenance policy he is following evolves as a result of a balance between
these factors. Consequently, the recently discussed factors contribute immensely towards the
achievement of good maintainability.
3 Testing Methods
3.1 Categories of Testing
It is no simple matter trying to classify testing. This is because the criteria for the classification
are not so distinct and are therefore not free from causing possible overlapping. Nonetheless,
three stages/categories of testing can be identified in an ideal manufacturing process. The
three categories of testing are called prototype testing, pre-production (qualification) testing
and production testing. Apart from these, there exists reliability demonstration and acceptance
testing which of course, may not be a mandatory process after the production of an item. At
this stage, it suffices to say that this type of testing is based purely on an agreement between
the manufacturer and the customer. More light will be shed on it later on in this chapter.
17
Finally, there is also what is known as ‘accelerated life testing’ which actually does not stand on
its own as a category of testing but associated with any of the three categories of testing earlier
mentioned. Further explanation on accelerated life testing will be provided later on.
Meanwhile, we shall examine in some detail all the types of testing already referred to.
3.2 Prototype Testing
Before a design goes into production a prototype or few prototypes are first produced and are
thoroughly tested to prove that a design will meet the specification. If the tests reveal any
inadequacies in either performance or any other parameters, the design must be modified and
the testes repeated.
It is very desirable that prototypes are tested under the correct environmental conditions by
competent staff who are independent of the designers. Besides, there is the need for care not
to allow every shortcoming to be glossed over with the hope that it will be corrected at the pre-
production or production stage. At times, prototype testing might be made the responsibility
of competent staff in the reliability or the quality control department. Since the designer
should be free from the responsibility of testing the prototype of his own design, therefore, the
designer must be completely answerable for its performance and reliability. If tests reveal
some shortcomings, it is the designer’s responsibility to correct them. As a result of prototype
testing three types of errors may be revealed. They are:
(i) Design errors – here the prototype might have been produced according to drawing, but
still it does not work satisfactorily.
(ii) Manufacturing errors – where the prototype has not been made correctly, in line with the
design.
(iii) Parts faults – where individual components parts of the prototype are found to be
defective.
19
3.5 Reliability Demonstration and Acceptance Testing
As mentioned earlier in Section 3.1, there exists what is called reliability demonstration and
acceptance testing associated with the post-production stage. It should be reiterated that this
is not a mandatory exercise, and when it happens it will be an affairs between the
manufacturer and the customer.
Reliability demonstration test is a test which is normally carried out when a manufacturer,
having produced certain equipment, wishes to demonstrate to a customer that its reliability is
at least as good as he claims it to be. Conversely, a customer may receive a finished product
untested and he decides to test it himself to verify whether or not the product satisfies the
reliability figure as stated in the specification. In this case the customer will be said to be doing
what is called a reliability acceptance test.
Accelerated life tests can take the following two main forms:
(i) Tests which may be carried out under normal use conditions except that what would in
practice be intermittent use now becomes a continuous use; and
(ii) Tests where the normal conditions of use are intensified, although it may be under
intermittent use.
Suppose we want to test a batch of microprocessor chips. In this case, the test engineer
operates the microprocessor beyond the normal limits specified for voltage, temperature, and
clock rate to test for operating margins and to expose latent failures. The stresses caused by
testing the device at excessive conditions generate information which are helpful in predicting
reliability over the devices’ lifetime. However, this type of test must always be carried out with
considerable caution.
20
However, the inspection process may take place before, during and after each production
stage. Inspection is usually directed at detecting defective items. An item is said to be
defective if it fails to conform with the pre-stated specification. In any industrial process there
are a number of places where inspection can and should be done. It can take place at the input
(when inspection of raw materials or output of previous stage is involved), at the output (to
reduce the risk of bad quality being passed on and causing customers’ complaints), and finally it
may involve inspection of the process itself, sometimes at several points, and checking up how
it is working. The aims of inspection are mainly two-fold. Firstly, it may provide information
that may necessitate the adjustment of the production process in order to reduce the amount
of bad production. Secondly, it may be necessary to sort out the bad production and return
items for re-processing.
Ideally, inspection should extend to each produced item for the entire attribute. It is in general
impracticable due to the following reasons:-
(i) The inspection test may be destructive. An example is testing which involves bending
strength of transistor leads.
(ii) Inspection may take too much time and may make the product uneconomical.
(iii) Insufficiency of inspection personnel that may be needed to carry out inspection of
individual item.
(iv) Individual inspection of all produced items may even fail to detect all the defective ones,
more so if the border line cases are large. This may occur if there is either a calibration
problem with the testing equipment or operators’ tiredness.
3.9 Sampling Plans (Approaches for Testing Small and Large Batch Quantities)
Concept of Sampling Plan
21
We have discussed in Section 3.7 why it may be unreasonable to carry out inspection test on
individually produced items (especially when the quantity involved is large) unless absolutely
defect-free items are needed and we are ready to pay the cost. In practice, instead of
embarking on inspection testing of each item, some sort of acceptance sampling plan is always
adopted. The underlying idea is to reject the whole lot of the produced items based on
information received from a sample. As an example, if a batch size ‘N’ is taken for inspection a
random sampling is employed to collect ‘n’ samples for test. These samples are then tested
according to the prescribed and pre-decided sampling plans. The main aim of the sampling
plan is to provide criteria for deciding the lot acceptability in the light of the number found
defective from the sample. The point will be made clearer in the succeeding sub-stations
treated immediately below.
Single Sampling Plans
The single sampling plan is concerned with the selection of a sample of a given size (say, 50
resistors) from a lot of a known size (say, 1000 resistors) and check against its known
specifications may be, its resistance value, tolerance etc). The allowable number of defective
items is specified in the sampling plan depending on the risks both the customer and producer
are ready to accept. If the number of defective items (resistors) is less than or equal to that
agreed in the sampling plan, then, the lot is accepted (i.e. all the 1000 resistors will be assumed
alright), otherwise the lot is rejected.
For example, we can specify that a batch of 1000 transistors is to be inspected at 1 percent
A.Q.L. (Acceptable Quality Level). (A.Q.L is defined as the maximum percent of defectives
which can be considered satisfactory as a process average for the purpose of sampling
inspection). We may take a sample of ninety, and accept the batch if the number of defectives
does not exceed say, two.
Double Sampling Plans
In a double sampling plan, there are two sample sizes, and two allowed numbers of defectives.
An example of a double sampling plan may follow the pattern given below:
This means that a sample of one hundred and twenty should be taken and inspected. If the
number of defective items discovered does not exceed one, accept the batch on the first
sample. It will then be needless to take the second sample. However, if the number of
defective items is four or more reject the batch on the first sample. If the number of defective
items is more than one, but less than four, take a second sample of one hundred and twenty.
Now if the number of defectives from the second sample does not exceed four, accept the
batch, otherwise reject it.
Multiple and Sequential Sampling Plans
By extending the principle of double sampling plans to reach any desired number of stages then
we shall have what are called multiple sampling plans. In this case, the number of defective
items for each sample is normally specified in the sampling plan. It is worthy of note that as the
number of samples required to arrive at a decision increases, so the size of each sample must
22
decrease. Ultimately, a limit is reached with what is called ‘sequential sampling’, where the
sample size is one. The results are normally plotted on a graph, and the method involves taking
a sample of one, testing it and plotting the result. This procedure is then repeated, taking
successive samples of one, until the graph contains enough information on which to arrive at a
decision.
23