Machine Learning For Predictive Maintenance A Multiple Classifier Approach
Machine Learning For Predictive Maintenance A Multiple Classifier Approach
net/publication/277723565
CITATIONS READS
168 5,826
5 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Gian Antonio Susto on 17 February 2018.
Abstract—In this paper a multiple classifier machine and newly introduced state-of-the-art algorithms. At the
learning methodology for Predictive Maintenance (PdM) is same time the efficient management of maintenance
presented. PdM is a prominent strategy for dealing with activities is becoming essential to decreasing the costs
maintenance issues given the increasing need to minimize
downtime and associated costs. One of the challenges with associated with downtime and defective products [14],
PdM is generating so called ’health factors’ or quantitative especially in highly competitive advanced manufactur-
indicators of the status of a system associated with a given ing industries such as semiconductor manufacturing.
maintenance issue, and determining their relationship to op- Approaches to maintenance management can be
erating costs and failure risk. The proposed PdM methodol-
grouped into three main categories which, in order of
ogy allows dynamical decision rules to be adopted for main-
tenance management and can be used with high-dimensional increasing complexity and efficiency [22], are as follows:
and censored data problems. This is achieved by training i. Run-to-Failure (R2F) - where maintenance interventions
multiple classification modules with different prediction are performed only after the occurrence of failures. This
horizons to provide different performance trade-offs in terms is obviously the simplest approach to dealing with main-
of frequency of unexpected breaks and unexploited lifetime
tenance (and for this reason it is frequently adopted),
and then employing this information in an operating cost
based maintenance decision system to minimise expected but it is also the least effective one, as the cost of
costs. The effectiveness of the methodology is demonstrated interventions and associated downtime after failure are
using a simulated example and a benchmark semiconductor usually much more substantial than those associated
manufacturing maintenance problem. with planned corrective actions taken in advance. ii.
Index Terms—Classification Algorithms, Data Mining, Ion Preventive Maintenance (PvM) - where maintenance ac-
Implantation, Machine Learning, Predictive Maintenance, tions are carried out according to a planned schedule
Semiconductor Device Manufacture. based on time or process iterations. With this approach,
also referred to as scheduled maintenance, failures are
I. I NTRODUCTION usually prevented, but unnecessary corrective actions are
often performed, leading to inefficient use of resources
The increasing availability of data is changing the and increased operating costs.
way decisions are taken in industry [17] in important iii. Predictive Maintenance (PdM)1 - where maintenance is
areas such as scheduling [15], maintenance management performed based on an estimate of the health status of
[24] and quality improvement [6], [23]. Machine learning a piece of equipment [7]. PdM systems allow advance
(ML) approaches have been shown to provide increas- detection of pending failures and enable timely pre-
ingly effective solutions in these areas, facilitated by the failure interventions, thanks to prediction tools based
growing capabilities of hardware, cloud-based solutions, on historical data, ad hoc defined health factors, statistical
inference methods, and engineering approaches.
Manuscript received March 17, 2014. Accepted for publication Au-
gust 7, 2014. Among statistical inference based methods, those
Copyright 2014 IEEE. Personal use of this material is permitted. based on ML are the most suitable for dealing with mod-
However, permission to use this material for any other purposes
must be obtained from the IEEE by sending a request to pubs-
eling of high-dimensional problems, such as those aris-
[email protected] ing in semiconductor manufacturing where hundreds
G.A. Susto (corresponding author) is with the Department of In- or thousands of physical variables (pressures, voltages,
formation Engineering, University of Padova, Italy and with Statwolf
LTD, Ireland. E-mail: [email protected].
currents, flows, etc.) act on the process [10], [26].
A. Schirru and S. Pampuri are with the National University of In this paper a new PdM methodology based on mul-
Ireland, Maynooth and with Statwolf LTD, Ireland. tiple classifiers is introduced for integral type faults (the
S. McLoone is with Queen’s University Belfast, United Kingdom and
with the National University of Ireland, Maynooth. most frequent in semiconductor manufacturing), a term
A. Beghi is with the Department of Information Engineering, Uni- which describes the failures that happen on a machine
versity of Padova, Italy. due to the accumulative ’wear and tear’ effects of usage
This work has been done within the framework of INTEGRATE (In-
tegrated Solutions for Agile Manufacturing in High-mix Semiconduc-
tor Fabs), an European Nanoelectronics Initiative Advisory Council’s 1 Other authors [8] refer to this class of maintenance approaches as
project. Condition-Based Maintenance
0000–0000/00$00.00©2014 IEEE
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS. 2
and stress on equipment parts. Even if no direct evidence From a ML perspective, supervised approaches re-
of process/machine degradation is available, PdM tools quire the availability of a dataset S
exploit process and logistic variables collected during n
S = {xi , yi }i=1 , (1)
production to identify the ’footprint’ of this degradation
in the data. The proposed methodology, referred to as where a couple {xi , yi } (called observation) contains the
Multiple Classifier (MC) PdM, can effectively deal with information related to the i-th process iteration. Here,
the unbalanced datasets that arise in maintenance classi- vector xi ∈ R1×p contains information related to the p
fication problems, that is datasets where the observations variables associated with available process or logistic
relating to normal production greatly outnumber the ob- information. Depending on the type of output y two
servations associated with abnormal/faulty production classes of supervised problem are possible:
[22]. It also allows planning of maintenance schedules i. if y assumes continuous values a regression problem is
using a statistical cost minimization approach. obtained;
The remainder of the paper is structured as follows: ii. if y assumes categorical values a classification problem
Section II provides a brief literature review and an results.
introduction to ML-based approaches to PdM, while In PdM problems, regression-based formulations gen-
Section III is dedicated to presenting the proposed MC erally arise when predicting the Remaining Useful Life
PdM methodology. Tuning guidelines and classification of a process/equipment, directly [2] or indirectly (eg.
approaches are discussed in Section IV. Then in Section through the computation of the conditional reliability
V, to demonstrate the principles and efficacy of MC [5]), while classification-based PdM formulations occur
PdM, results are presented comparing MC PdM with when seeking to discriminate between healthy and un-
PvM for a benchmark semiconductor manufacturing healthy conditions of the system being monitored [1].
maintenance problem, namely, changing of filaments in While classification tools are a natural choice for dis-
ion implantation tools. Finally, in Section VI, concluding tinguishing between faulty and non-faulty process itera-
remarks and possible extensions of the work are dis- tions based on observed process data, they do not map
cussed. naturally to health factors that can be extrapolated for
maintenance related decision making, unlike for exam-
ple, regression models of Remaining Useful Life. In the
II. L ITERATURE R EVIEW following a methodology based on multiple classifiers is
presented to address this limitation.
Maintenance issues can be completely different in
nature and the predictive information to be fed to the
III. M ULTIPLE C LASSIFIER P D M
PdM module has in general to be tailored to the par-
ticular problem at hand, thus justifying the presence A. Classification for PdM
in the literature of many different approaches to PdM Let us suppose that data regarding N maintenance
PN
[12]. However, PdM-related solutions based on ML tech- cycles are available, for a total of n = i=1 ni machine
niques seem to be among the most popular (see e.g. [5] runs. We define the matrix X ∈ Rn×p containing all the
and [21] with reference to semiconductor manufacturing collectible information
applications). T
X = x1 x2 . . . x n (2)
ML-based PdM can be divided into two main classes:
i. supervised - where information on the occurrence of regarding p physical variables or logistic information on
failures is present in the modeling dataset; the process or on the tool that are used as inputs to
ii. unsupervised - where logistic and/or process informa- the PdM module. Information on maintenance events is
tion is available, but no maintenance related data exists. contained in the variable Y . If during the run the fault
The availability of maintenance information mostly under consideration takes place, then the observation
depends on the nature of the existing maintenance man- is marked as faulty (F), and not faulty (NF) otherwise.
agement policy: in the case of R2F policies the data Accordingly:
related to a maintenance cycle (the production activity
F if iteration i is faulty
between two successive failure events) is available and Y (i) = yi = . (3)
N F if iteration i is not faulty
therefore supervised approaches can readily be adopted;
on the other hand, when PvM policies are currently in When dealing with R2F data, each maintenance cycle
place the full maintenance cycle may not be observable ends with a failure, hence the available data contains N
given the fact that maintenance is generally performed samples for class F and n − N samples for class N F .
well in advance of any potential failure, and hence only Based on X and Y , a classifier learns a decision rule f (·)
unsupervised learning approaches are feasible. When that assigns one of the two classes {F, N F } to each point
possible, supervised solutions are evidently preferable: in the input space Rp .
given the wide diffusion of R2F maintenance policies in The formulation presented above is weak from a PdM
industry, and hence the availability of suitable datasets, perspective in two aspects:
we consider in this work a supervised approach to PdM. 1. only the current process iteration is classified, i.e. no
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS. 3
2 The
employed as a performance indicator.
R2F approach clearly guarantees a total of ρUB = 1 and ρUL =
0, while the performance of PvM and PdM approaches depend on Fab integration and on-line operation of the MC PdM
some tuning parameters as will be illustrated in Section IV module are described in Fig. 2. Estimates of ρUB and
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS. 4
B. Classification Algorithms
The MC PdM methodology presented does not im-
Fig. 2. Overview of how the MC PdM module integrates with the pose any restrictions on the classification algorithm that
Fab
can be adopted for the individual classifiers. Here, we
consider two well known and widely used classification
techniques, namely: Support Vector Machines (SVMs)
ρUL are provided by the historical and simulation per- and k-Nearest Neighbors (k-NN). This choice is moti-
formances of the PdM module, while cUB and cUL are vated by the fact that it provides a comparison between
provided by the user, and may be changed at each two contrasting types of classifier; a powerful, but com-
evaluation of the PdM module (i.e. unexpected breaks putationally complex to estimate, model based classifier
are much more costly during highly intense production (SVM); and a low-complexity non-parametric classifier
periods than during normal production periods). (k-NN). In the following the two selected classifiers are
While undesirable from a operating cost perspective, briefly presented.
when an unexpected break occurs during operation of a 1) Support Vector Machines: SVMs are probably the
PdM module it means that a full maintenance cycle has most popular approach to classification, thanks to their
been observed, and hence valuable new data is available high classification accuracy, even for non-linear prob-
with which to update the MC PdM module and related lems, and to the availability of optimized algorithms for
performance metrics. Therefore, as illustrated in Fig. 2, it their computation [3].
is important to retain the facility to update the MC PdM Consider the problem where two classes of data (F
module following deployment. and N F in the problem at hand) have to be classified;
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS. 5
90
PvM−µ PvM−µ
80 PvM−η 0.5 PvM−η
PdM−lin PdM−lin
Unexploited Lifetime
Unexpected Breaks
70
PdM−rbf 0.4 PdM−rbf
60 MC PdM−knn MC PdM−knn
MC PdM−svm 0.3 MC PdM−svm
50
40 0.2
30
0.1
20
0
5 15 25 35 45 55 65 75 85 5 15 25 35 45 55 65 75 85
m m
Fig. 4. Average ρUL over Q2 Monte Carlo simulations with the various Fig. 5. Average ρUB over Q2 Monte Carlo simulations with the various
maintenance management approaches maintenance management approaches
0.6
of ρUL and ρUB to be effective. The criterion described PvM−µ
in Eq. (5) and Eq. (7) was used to select the failure
0.55 PvM−η
0.5 PdM−lin
horizon for each classifier as this was found to provide a
J [Arbitrary Unit]
PdM−rbf
better distribution of results in terms of ρUL and ρUB than 0.45
MC PdM−knn
the one described in Eq. (8) and Eq. (9). In the interest 0.4
MC PdM−svm
of compactness, the results of this investigation are not 0.35
classifiers that make up MC PdM-knn and MC PdM- classifier distance based PdM alternative. It has also been
svm. In computing these values a process iteration with shown in this case study that SVMs offer superior per-
a broken filament is considered as the positive condition, formance to k-NN classifiers when implementing MC-
while an uninterrupted process iteration is considered PdM, and that in general MC-PdM-knn also consistently
as the negative condition. It can be observed that as the outperforms PvM approaches.
fault horizon m increases, precision decreases (more false The high dimensionality of datasets typical of PdM
positives, i.e. unexploited lifetime) while recall increases problems encountered in semiconductor manufacturing
(less false negatives, i.e. unexpected breaks). and other manufacturing industries, suggests that par-
In Fig. 6 the performance of the six maintenance ap- simonious (or sparse) classification approaches such as
proaches is plotted as a function of the ratio of the costs, Relevance Vector Machines [27] could provide competi-
cUL /cUB . Here, J is the minimum total cost achieved with tive results. This will be the subject of a future extension
each approach and is computed based on the average of the present work.
Monte Carlo results reported in Fig. 4 and 5. The values We have seen how the choice of the fault horizon m
of the costs ratio cUL /cUB were chosen with the aid of strongly affects the performance of the corresponding
process experts in order to have plausible results for the classifier in MC PdM. It may be the case that some areas
problem at hand. of the [ρUB , ρUL ] space are more ’interesting’, given the
These results highlight that MC PdM-svm consistently associated costs, than others in the context of minimis-
guarantees better performance than any of the other ing total operating costs. One approach to cope with
approaches considered. This is the most important out- this issue is to employ a two steps procedure, where
come of the present work, since it demonstrates that classifiers assigned to an uninteresting area of [ρUB , ρUL ]
the proposed maintenance methodology achieves the are reallocated in the second training iteration to more
lowest operating costs while at the same time being relevant areas of the cost space.
robust to different choices of ρUL and ρUB . MC PdM-knn
consistently delivers lower operating costs than classical R EFERENCES
PvM techniques and exhibits comparable performance to [1] R. Baly and H. Hajj. Wafer classification using support vector
PvM-svm, especially for low values of the ratio cUL /cUB . machines. IEEE Transactions on Semiconductor Manufacturing,
25(3):373–383, 2012.
[2] S. Butler and J. Ringwood. Particle filters for remaining useful
VI. C ONCLUSIONS life estimation of abatement equipment used in semiconductor
A novel multiple classifier PdM system for integral manufacturing. In IEEE Conference on Control and Fault-Tolerant
Systems, pages 436–441, 2010.
type faults has been presented. The multiple ML clas- [3] C.-C. Chang and C.J. Lin. Libsvm: a library for support vector
sifiers work in parallel to exploit the knowledge of the machines. ACM Transactions on Intelligent Systems and Technology
tool/logistic variables at each process iteration in order (TIST), 2(3):27, 2011.
[4] T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical
to enhance decision making. The proposed tool guar- Learning. Data Mining, Inference and Prediction. Springer, 2009.
antees improved maintenance management decisions in [5] A. Heng, A. Tan, J. Mathew, N. Montgomery, D. Banjevic, and
terms of minimising operating cost and can be applied A. Jardine. Intelligent condition-based prediction of machinery
reliability. Mechanical Systems and Signal Processing, 23:1600–1614,
to any maintenance problems characterised by integral 2009.
type faults provided R2F historical data is available or [6] G. Koksal, I. Batmaz, and M.C. Testik. A review of data mining
can be collected. applications for quality improvement in manufacturing industry.
Expert Systems with Applications, 38(10):13448 – 13467, 2011.
The MC PdM module is robust to variations in the [7] L. Krishnamurthy, R. Adler, P. Buonadonna, J. Chhabra, M. Flani-
operating costs associated with unexpected breaks and gan, N. Kushalnagar, L. Nachman, and M. Yarvis. Design and
unexploited equipment part lifetime that can occur over deployment of industrial sensor networks: Experiences from a
semiconductor plant and the north sea. In SenSys, 2005.
time. It also allows the user to dynamically change main- [8] R. Langone, C. Alzate, B. DeKetelaere, and J.A.K. Suykens. Kernel
tenance policy based on current costs and needs of man- spectral clustering for predicting maintenance of industrial ma-
ufacturing production. Process engineers are provided chines. In Computational Intelligence and Data Mining (CIDM), 2013
IEEE Symposium on, pages 39–45, April 2013.
with a tool that enables them to adjust the performance [9] S.S. Lee. Noisy replication in skewed binary classification. Com-
and the action policy so that the total operating cost is putational statistics & data analysis, 34(2):165–191, 2000.
minimized. Moreover, the presence of several classifiers [10] B. Lenz and B. Barak. Data mining and support vector regression
machine learning in semiconductor manufacturing to improve
allows the proposed tool to be employed as a health virtual metrology. In System Sciences (HICSS), 2013 46th Hawaii
factor indicator in a more natural and straightforward International Conference on, pages 3447–3456. IEEE, 2013.
fashion that with other PdM approaches. Specifically, the [11] S.-Y. Lin and S.-C. Horng. A classification-based fault detection
and isolation scheme for the ion implanter. IEEE Transactions on
k classifiers provide an estimate of the current status of Semiconductor Manufacturing, 19(4):411–424, 2006.
the maintenance issue, enhancing process understand- [12] M. Luo, Z. Xu, H.L. Chan, and M. Alavi. Online predictive
ing and allowing cost aware maintenance management maintenance approach for semiconductor equipment. In Industrial
Electronics Society, IECON 2013-39th Annual Conference of the IEEE,
decisions at any time. pages 3662–3667. IEEE, 2013.
The proposed methodology has been demonstrated for [13] C.M. McKenna. A personal historical perspective of ion implan-
a semiconductor manufacturing Ion-Implanter related tation equipment for semiconductor applications. In Conference
on Ion Implantation Technology, pages 1 – 19, 2000.
maintenance task and shown to guarantee better perfor- [14] R.K. Mobley. An Introduction to Predictive Maintenance.
mance than classical PvM approaches and a single SVM Butterworth-Heinemann, 2002.
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS. 9
[15] L. Mönch, J.W. Fowler, S. Dauzère-Pérès, S.J. Mason, and O. Rose. Andrea Schirru received the M.S. degree (cum
A survey of problems, solution techniques, and future challenges laude) in computer science and the Ph.D. in
in scheduling semiconductor manufacturing operations. Journal Information Engineering from the University of
of Scheduling, 14(6):583–599, 2011. Pavia, Pavia, Italy, in 2008 and 2011. He is cur-
[16] R.R. Picard and R.D. Cook. Cross-validation of regression models. rently a post-doctoral associate at National Uni-
Journal of the American Statistical Association, 79:575–583, 1984. versity of Ireland, Maynooth and Chief Technol-
[17] General Electrics Intelligent Platforms. The rise of industrial big ogy Officer at Statwolf Ltd.
data, August 2012. He has been an Intern Researcher at Infi-
[18] A. Schirru, G.A. Susto, S. Pampuri, and S. McLoone. Learning neon Technologies Austria AG, Villach, Austria
from time series: Supervised aggregative feature extraction. In (2010-2011), and during his studies he has been
51st IEEE Conference on Decision and Control, Proceedings, pages awarded with the ICINCO Best Student Paper
5254–5259. IEEE, 2012. Award (2011) and the Best Student Presentation at the Intel European
[19] B. Scholkopf and A.J. Smola. Learning with Kernels. MIT Press, Research and Innovation Conference (2010).
2002.
[20] J. Shao. Linear model selection by cross-validation. Journal of the
American Statistical Association, 88:486–494, 1993.
[21] Y.-C. Su, F.-T. Cheng, M.-H. Hung, and H.-C.Huang. Intelligent
prognostics system design and implementation. IEEE Trans. on
Semiconductor Manufacturing, 19:195–207, 2006. Simone Pampuri received the M.S. degree in
[22] G.A. Susto, A. Beghi, and C. DeLuca. A predictive maintenance computer science and the Ph.D. in Informa-
system for epitaxy processes based on filtering and prediction tion Engineering from the University of Pavia,
techniques. IEEE Trans. on Semiconductor Manufacturing, 25:638– Pavia, Italy, in 2009 and 2012. He is currently
649, 2012. a post-doctoral associate at National University
[23] G.A. Susto, S. Pampuri, A. Schirru, A. Beghi, and G. DeNicolao. of Ireland, Maynooth and Chief Executive Offi-
Multi-step virtual metrology for semiconductor manufacturing: a cer at Statwolf Ltd.
multilevel and regularization methods-based approach. Comput- He has been an Intern Researcher at Infi-
ers & Operations Research, (0):–, 2014. neon Technologies Austria AG, Villach, Austria
[24] G.A. Susto, A. Schirru, S. Pampuri, D. Pagano, S. McLoone, and (2011). His research interests include manu-
A. Beghi. A predictive maintenance system for integral type facturing data analytics, machine learning and
faults based on support vector machines: An application to ion sentiment analysis.
implantation. In Automation Science and Engineering (CASE), 2013
IEEE International Conference on, pages 195–200. IEEE, 2013.
[25] J. Suykens and J. Vandewalle. Least squares support vector
machine classifiers. Neural Processing Letters, 9(3):293–300, 1999.
[26] K. Tamaki and S. Kaneko. Multiparametric virtual metrology
model building by job-shop data fusion using a markov chain Seán McLoone (S’94, M’96, SM’02) received a
monte carlo method. 2013. Master of Engineering degree with Distinction
[27] M.E. Tipping. Sparse bayesian learning and the relevance vector and a Ph.D. degree in Control Engineering
machine. The journal of machine learning research, 1:211–244, 2001. from Queens University Belfast (QUB), Belfast,
[28] M.-L. Zhang and Z.-H. Zhou. Ml-knn: A lazy learning approach Ireland, in 1992 and 1996, respectively. He is
to multi-label learning. Pattern recognition, 40(7):2038–2048, 2007. currently Professor and Director of the Energy
Power and Intelligent Control (EPIC) Research
Cluster at QUB.
His research interests lie in the general area
to data based modelling and analysis of dy-
namical systems. This encompasses techniques
ranging from classical system identification, fault diagnosis, and sta-
tistical process control to modern computational intelligence based
adaptive learning algorithms and optimization techniques. He is a Past
Chairman of the UK and Republic of Ireland (UKRI) Section of the
IEEE.