Parameter Estimation and Remaining Useful Life Prediction of Lubricating Oil With HMM
Parameter Estimation and Remaining Useful Life Prediction of Lubricating Oil With HMM
Wear
journal homepage: www.elsevier.com/locate/wear
art ic l e i nf o a b s t r a c t
Article history: Lubricating oil plays a vital role in the full life-span performance of the machine. Lubricating oil dete-
Received 2 September 2016 rioration, which leads to the attenuation of oil performance and severe wear afterwards, is a slow de-
Received in revised form grading process, which can be observed by condition monitoring, but the actual degree of the oil de-
23 November 2016
gradation is often very difficult to examine. The main purpose of lubricating oil degradation prediction is
Accepted 25 November 2016
to estimate the failure time when the oil no longer fulfills its functions. We suppose that the state process
evolution of lubricating oil degradation can be modeled using a hidden Markov model (HMM) with three
Keywords: states: healthy state, unhealthy state, and failure state. Only the failure state is observable. While the
Lubricating oil deterioration lubricating oil is in service, vector data that are stochastically related to the deterioration state are ob-
Hidden Markov model
tained through on-line condition monitoring by an OLVF (On-line Visual Ferrograph) sensor at regular
Parameter estimation
sampling epochs. A method of Time Series Analysis (TSA) is applied to the healthy portions of the oil data
Remaining useful life
histories to get the residuals as the observable process containing partial information to fit the hidden
Markov model. The unknown parameters of the fitted hidden Markov model are estimated by the Ex-
pectation-Maximization (EM) algorithm. The remaining useful life (RUL) of lubricating oil can be eval-
uated through explicit formulas of the characteristics such as the conditional reliability function (CRF)
and mean residual life (MRL) function in terms of the posterior probability.
& 2017 Elsevier B.V. All rights reserved.
1. Introduction Wear debris level in lubricating oil has been proved to be one of
the most common degradation features to evaluate lubricating oil
Lubricating oil is used to reduce wear and friction from the degradation [2]. Relative wear debris concentration has been ob-
mobile components, eliminate contamination, remove heat from tained from an image captured by an on-line sensor (OLVF) at a
friction surfaces, and avoid machine failure and reduce the cost for sampling epoch [4]. By this, the wear debris presenting in a lu-
unscheduled maintenance afterwards. Therefore, lubricating oil bricating oil sample can be categorized as a large and small group
condition should be monitored and the oil should be replaced according to their sizes by controlling the oil flow rate and magnet
regularly to extend the period when the machine is in good state field intensity [4]. When the machine is in operation, wear debris
[1]. Recently, condition monitoring (CM) of lubricating oil has at- accumulate in the lubricating oil and the concentration increases,
tracted a considerably attention in research and it plays a vital role which leads to the lubricating oil degradation [5].
in industries [2]. The oil data obtained from CM have been used to Although analysis on wear debris in lubricating oil has been
assess the actual condition of the operating machine in [3], but to utilized in practice for many years to estimate machine condition,
little work has been done using statistical approaches to model
our knowledge, HM models for lubricating oil deterioration and
and analyze oil data for the purpose of assessing the lubricating oil
replacement when the machine is in the healthy state have not
degradation and predicting its remaining useful life. The predic-
been developed in the literature. Taking into account that the time
tion with the CM data can be obtained by the conditional relia-
period when a machine is in the healthy state is usually con-
bility function (CRF) and mean residual life (MRL) function [3],
siderably longer than the length of time between oil replacements,
which indicates the failure time when the oil cannot fulfill its
it is assumed in this paper that the machine condition is stable and
functions anymore, and should be changed. In condition mon-
will not affect considerably the speed of oil deterioration. itoring area, RUL prediction was applied for particle contaminated
lubricating oil by applying physical models using a particle filter-
n
Corresponding author. ing technique [5], as well as the application on rotational bearings
E-mail address: [email protected] (T. Wu). with two-phase threshold model using Bayesian methods [6].
http://dx.doi.org/10.1016/j.wear.2016.11.047
0043-1648/& 2017 Elsevier B.V. All rights reserved.
1228 Y. Du et al. / Wear 376-377 (2017) 1227–1233
The real condition monitoring data were obtained for the de-
tection of lubricating oil deterioration from a four-ball test rig [15]
Fig. 2. Wear debris concentration of small and large particles.
in order to predict the RUL of the lubricating oil afterwards. During
the operational life of the tribo-pairs, oil data were collected every
▵ = 4 minutes by an OLVF sensor, which is an on-line ferrographic Table 1
Working conditions for the test.
sensor based on Image Technology, and it provided wear debris
concentrations that came from the direct wear during these Test no. Load/N Rotated rate/rpm Time/min Downtime duration/min
4 minutes. The total number of data histories recorded is 27, which
consist of N ¼ 11 failure histories and M¼ 16 suspension histories. 1 1500 1000 360 0
2 1500 1000 360 240
The failure history is defined as the history that ends with ob-
3 2000 1000 210 720
servable failure, which indicates that the lubricating oil is out of 4 2000 1000 240 60
use at that moment, and the suspension history is defined as the 5 2000 2000 60 480
history that ends when the lubricating oil is still in operation and
has not lost its functions.
To avoid over-parameterization, we use the 2-dimensional history is given in Fig. 2, and the working conditions are listed in
monitoring data consisting of small wear particles and large wear Table 1. IPCA shows three stages including run-in, normal, and
particles obtained from the OLVF sensor for analysis. A typical data severe stages, which agrees with the typical “Bathtub Curve”.
Y. Du et al. / Wear 376-377 (2017) 1227–1233 1229
⎡ 1 z N+M M ⎤
… ztNN++ M
⎢ tN + M − 1 − p⎥
⎢1 ⋮ ⋮ ⋮ ⎥
⎢ N+M
⎥
⎢ 1 z p … z1N + M ⎥
V = ⎢1 ⋮ ⋮ ⋮ ⎥
⎥
⎢
⎢ 1 zt1 − 1 … zt1− p ⎥
1
1
⎢ ⎥
⎢1 ⋮ ⋮ ⋮ ⎥
⎢1 z 1p z11 ⎥⎦
⎣ … (4)
A = ⎡⎣ μ , Φ1, …, Φp ⎤⎦ ′ (5)
Using the method of least squares [16], the estimates for A and
C are given by
Fig. 3. Scatter plot for all the residuals. (For interpretation of the references to color
^
A = (V′V)−1V′W (7) in this figure, the reader is referred to the web version of this article.)
1230 Y. Du et al. / Wear 376-377 (2017) 1227–1233
Table 2 oil data histories as { F1, F2, …FN }, with the form of
P-value of the residual normality test.
( )
Yi = y1i , y2i , …yTi , (i = 1, 2, …N ) for each failure history Fi. The
i
Test Healthy data set Unhealthy data set observable failure time ξi for each history is
ξi = ti, (Ti ▵ < ti ≤ (Ti + 1)▵). The sampling history Yi consists of the
Normality(Henze-Zirkler) 0.8232 0.2983
residuals yti ∈ R2, t ≤ Ti , calculated from the collection of oil data
until the lubricating oil fails at time ti.
Table 2, which show that the calculated residuals follow the multi- Similarly, we denote the M ¼16 suspension oil data histories as
variate normal distribution and satisfy the assumption of multi-
variate normality for both healthy and unhealthy data sets [19].
{ S1, S2, …SM }, with the form of Yj = ( y , y , …y ), (j = 1, 2, …M )
1
j
2
j j
Tj
Table 3
Iterations of the EM algorithm.
Explicit formulas of the CRF and the MRL functions of the g (yn|XnΔ = 1, ξ > nΔ, yn − 1) × P (XnΔ = i|ξ > nΔ, yn − 1)
Πn =
model proposed in Section 3 for the RUL prediction of lubricating ∑j g (yn|XnΔ = j, ξ > nΔ, yn − 1) × P (XnΔ = j|ξ > nΔ, yn − 1) (19)
oil will be considered in this section, and as it is shown in [3], both
formulas are functions of the posterior probability of the lu- where
bricating oil being in the warning state (state 1). ⎧ f (y |μ , Σ 0 ), i = 0
⎪
n 0
We assume that the deteriorating state process of lubricating g (yn|XnΔ = i, ξ > nΔ, yn − 1) = ⎨ ⎪
oil is described by a continuous-time homogeneous Markov chain ⎩ f (yn|μ1, Σ1), i = 1 (20)
(Xt : t ∈ R +), with state space Z = { 0, 1, 2}, where state 0 denotes
Therefore, we have
the state that the lubricating oil is working in a healthy condition,
state 1 denotes the state that the lubricating oil is operating in a (P01 (Δ)(1 − Πn − 1 ) + P11 (Δ) Πn − 1 )
Πn =
warning condition, and state 2 represents the failure or absorbing f (yn |μ0 , Σ 0 )
P00 (Δ)(1 − Πn − 1 ) + (P01 (Δ)(1 − Πn − 1 ) + P11 (Δ) Πn − 1 )
state of lubricating oil, which indicates that it requires an im- f (yn |μ1, Σ1 ) (21)
mediate replacement. The transition rate matrix of the state pro-
where Pij are given in Eq. (17), and the ratio of normal densities has
cess is given by Eq. (15). Moreover, lubricating oil starts in state
0 and runs on a continuous basis. While the lubricating oil is in the following representation
service, the residual observation process (Yn ▵: n ∈ N), which is ⎛ 1 ⎞
obtained from Eq. (10) using the collected data through condition exp ⎜ − (yn − μ 0 )′Σ 0−1 (yn − μ 0 ) ⎟
f (yn|μ 0 , Σ 0 ) Σ1 ⎝ 2 ⎠
monitoring at equidistant sampling times ▵, 2▵, …n▵ , for = ·
f (yn|μ1, Σ1) Σ0 ⎛ 1 ⎞
▵ = 40 min , has a state-dependent multivariate normal distribu- exp ⎜ − (yn − μ1)′Σ1 (yn − μ1) ⎟
− 1
⎝ 2 ⎠ (22)
tion defined in Eqs. (15,16). By solving the Kolmogorov backward
differential equations [20], the transition probability matrix for the For more details, see reference [17].
state process Xt has the following form: Suppose that at decision epoch n, the lubricating oil has not
failed, i.e. ξ > n▵ . For any t ∈ [0, ▵], the conditional reliability
where the transition probabilities Pij (t ) = P (Xt = j|X0 = i ), i, j ∈ { 0, 1}. function (CRF), which denotes the probability that the lubricating
We will show that the CRF and MRL functions can be expressed oil will not fail by n▵ + t , is defined as
in terms of the posterior probability statistic Πn, denoting the
posterior probability that the lubricating oil is in warning state R (t|Πn ) = P (ξ > n▵ + t|ξ > n▵, Y▵, …, Yn ▵, Πn )
(state 1) given all available information until time n▵ , which is
= P00 (t ) + P01 (t ) + (P11 (t ) − P00 (t ) − P01 (t )) Πn (23)
defined as
Πn = P (XnΔ = 1|ξ > nΔ, YΔ, Y2Δ, …, YnΔ ) (18) Using the CRF in Eq. (23), the MRL function at the nth epoch can
where Π0 = P (X0 = 1) = 0. Using Bayes’ rule for n ≥ 1, we have be obtained by the following formula:
1232 Y. Du et al. / Wear 376-377 (2017) 1227–1233
Acknowledgment
Fig. 6. The conditional reliability function of real failure history.
The financial support for the present research was provided by
→ ∞
the National Science Foundation of China (Grant Nos. 51675403,
μ nΔ = E (ξ − nΔ|ξ > nΔ, Πn ) = ∫0 R (t|Πn ) dt
51275381), which is very much appreciated. Special thanks are also
λ12 + λ 01 + Πn (λ 02 − λ12 ) due to the China Scholarship Council (Grant No. 201506280150) for
=
λ12 (λ 01 + λ 02 ) (24) their financial support during the term of this project.
The CRF and MRL functions have been widely used for RUL
prediction [3]. (See e.g., [3] and the references in that paper for
References
more details).
We apply the above formulas to one of our failure histories
[1] Y. Du, T. Wu, J. Cheng, Age detection of lubricating oil with on-line sensors, in:
with bivariate residual observations, which is shown in Fig. 4. This SENSORS, 2015 IEEE, IEEE, 2015a, pp. 1–4.
failure history ended between 25th and 26th sampling epoch. The [2] J. Zhu, J. Yoon, D. He, B. Qiu, E. Bechhoefer, Online condition monitoring and
remaining useful life prediction of particle contaminated lubrication oil, in:
black solid line denotes the residual observations of small parti- Proceedings of the Prognostics and Health Management (PHM), 2013 IEEE
cles, and the red solid line denotes the residual observations of Conference on, IEEE, 2013, pp. 1–14.
large particles. [3] A. Khaleghei, V. Makis, Model parameter estimation and residual life predic-
tion for a partially observable failing system, Nav. Res. Logist. (NRL) 62 (3)
Fig. 5 plots the posterior probability of lubricating oil data (2015) 190–205.
history presented in Fig. 4. The posterior probability was calcu- [4] T. Wu, J. Mao, J. Wang, J. Wu, Y. Xie, A new on-line visual ferrograph, Tribol.
Trans. 52 (5) (2009) 623–631.
lated using Eqs. (21,22). The plot shows that the lubricating oil [5] J. Zhu, J.M. Yoon, D. He, E. Bechhoefer, Online particle-contaminated lubrica-
started in the healthy state (state 0) at time 0, ran under the tion oil condition monitoring and remaining useful life prediction for wind
healthy condition till sometime between 1st and 15th sampling turbines, Wind Energy 18 (6) (2015) 1131–1149.
[6] N. Chen, K.L. Tsui, Condition monitoring and remaining useful life prediction
epoch, and after 16th sampling epoch, the lubricating oil was using degradation signals: revisited, IIE Trans. 45 (9) (2013) 939–952.
working in the warning state (state 1). [7] M.-Y. You, L. Li, G. Meng, J. Ni, Two-zone proportional hazard model for
equipment remaining useful life prediction, J. Manuf. Sci. Eng. 132 (4) (2010)
Fig. 6 provides the estimated conditional reliability at each 041008.
sampling epoch for this failure oil data history shown in Fig. 4. It [8] X.-S. Si, W. Wang, C.-H. Hu, M.-Y. Chen, D.-H. Zhou, A Wiener-process-based
can be seen that the value of the reliability dropped at the 15th degradation model with a recursive filter algorithm for remaining useful life
estimation, Mech. Syst. Signal Process. 35 (1) (2013) 219–237.
sampling epoch when the state changed to the warning state [9] D. Lin, V. Makis, Filters and parameter estimation for a partially observable
(state 1), and after 25th sampling epoch, the lubricating oil shifted system subject to random failure with continuous-range observations, Adv.
Appl. Probab. (2004) 1212–1230.
to the failure state (state 2). It is shown that there is a high [10] R. Jiang, System availability maximization and residual life prediction under
probability at that time that the state of the lubricating oil has partial observations, (Ph.D. thesis), University of Toronto, 2011.
Y. Du et al. / Wear 376-377 (2017) 1227–1233 1233
[11] M.J. Kim, V. Makis, R. Jiang, Parameter estimation for partially observable Tribology Conference 2015, Malaysian Tribology Society, vol. 2015, 2015b, pp.
systems subject to random failure, Appl. Stoch. Model. Bus. Ind. 29 (3) (2013) 185–186.
279–294. [16] G.C. Reinsel, Elements of Multivariate Time Series Analysis, Springer Science &
[12] Z.-J. Zhou, C.-H. Hu, D.-L. Xu, M.-Y. Chen, D.-H. Zhou, A model for real-time Business Media, Springer, New York, 2003.
failure prognosis based on hidden Markov model and belief rule base, Eur. J. [17] M.J. Kim, Optimal control and estimation of stochastic systems with costly
Oper. Res. 207 (1) (2010) 269–283. partial information, (Ph.D. thesis), University of Toronto, 2012.
[13] V. Makis, Multivariate Bayesian control chart, Oper. Res. 56 (2) (2008) [18] N. Henze, B. Zirkler, A class of invariant consistent tests for multivariate nor-
487–496. mality, Commun. Stat.-Theory Methods 19 (10) (1990) 3595–3617.
[14] A. Ghasemi, S. Yacout, M.-S. Ouali, Evaluating the reliability function and the [19] J. Yang, V. Make, Dynamic response of residuals to external deviations in a
mean residual life for equipment with unobservable states, IEEE Trans. Reliab. controlled production process, Technometrics 42 (3) (2000) 290–299.
59 (1) (2010) 45–54. [20] G. Grimmett, D. Stirzaker, Probability and Random Processes, Oxford Uni-
[15] Y. Du, T. Wu, J. Cheng, R. Gong, Lubricating oil deterioration on a four-ball test versity Press, Oxford, 2001.
rig via on-line monitoring, in: Proceedings of Malaysian International