A Decision Tree Based Data-Driven Diagnostic Strategy For Air Handling Units
A Decision Tree Based Data-Driven Diagnostic Strategy For Air Handling Units
a r t i c l e i n f o a b s t r a c t
Article history: Data-driven methods for fault detection and diagnosis of air handling units (AHUs) have attracted wide
Received 11 May 2016 attention as they do not require high-level expert knowledge of the system of concern. This paper presents
Received in revised form 10 August 2016 a decision tree based data-driven diagnostic strategy for AHUs, in which classification and regression tree
Accepted 20 September 2016
(CART) algorithm is used for decision tree induction. A great advantage of the decision tree is that it can
Available online 21 September 2016
be understood and interpreted and therefore its reliability in fault diagnosis can be validated by both
testing data and expert knowledge. A steady-state detector and a regression model are incorporated
Keywords:
into the strategy to increase the interpretability of the diagnostic strategy developed. The proposed
Decision tree
CART
strategy is validated using the data from ASHRAE 1312-RP. It is shown that this strategy can achieve a
Feature selection good diagnostic performance with an average F-measure of 0.97. The interpretation of the diagnostic
Air handling unit decision tree using expert knowledge showed that some diagnostic rules generated in the decision tree
Fault diagnosis comply with expert knowledge. Nevertheless, the interpretation also indicated that some diagnostic rules
Interpretability generated are not reliable and some of them are only valid under certain operating conditions, which
indirectly demonstrated the importance of the interpretability of fault diagnostic models developed using
data-driven methods.
© 2016 Elsevier B.V. All rights reserved.
1. Introduction knowledge of the system and the computational costs are generally
manageable [4].
The operation of building Heating, Ventilation, and Air- Air handling unit (AHU) is one of the important components
Conditioning (HVAC) systems is vulnerable to various faults, and in HVAC systems and a number of studies have been focused on
the occurrence of any fault could lead to increased energy con- detection and diagnosis of various faults in AHUs. For instance, a
sumption or indoor thermal discomfort [1,2]. Successful detection rule-based FDD strategy for AHU temperature sensors was devel-
and isolation of any faults in HVAC systems in a timely manner can oped by Yang et al. [7], in which a set of if-then rules were
improve building energy efficiency and reduce carbon footprint. formulated based on expert knowledge. The performance evalu-
Over the last two decades, considerable efforts have been made ation using the data from a green building and a small scale AHU
in the development of fault detection and diagnosis (FDD) strate- simulator showed that this strategy is capable of isolating AHU sen-
gies for building HVAC systems [3–5]. Fault detection is a process sor faults under different operating modes. A FDD strategy based
that determines whether or not an HVAC system is operating on Bayesian Belief Network (BBN) for AHUs was presented by Zhao
in a healthy condition while fault diagnosis aims to identify the et al. [8,9], in which a probabilistic graphical model was used to
causes of the fault. Compared to fault detection, fault diagnosis represent the relationships of probabilistic dependencies within
is more challenging and complicated. The existing FDD methods different variables. It was shown that this method can success-
can be generally categorised into model-based methods, rule-based fully isolate common faults occurred in AHUs. A drawback of this
methods and data-driven methods [6]. Data-driven methods have method is that it requires high-level expert knowledge, especially
attracted wide attention as they do not require high-level expert in determining the probability parameters. A sensor fault detection
strategy for AHUs using cluster analysis was developed by Yan et al.
[5], in which the clustering algorithm Ordering Points to Identify
the Clustering Structure (OPTICS) was used to identify the spa-
∗ Corresponding author.
tial separated data groups which possibly indicate the occurrence
E-mail addresses: [email protected], [email protected] (Z. Ma).
of sensor faults. Wang et al. [10] described a hybrid FDD method
http://dx.doi.org/10.1016/j.enbuild.2016.09.039
0378-7788/© 2016 Elsevier B.V. All rights reserved.
38 R. Yan et al. / Energy and Buildings 133 (2016) 37–45
Fig. 2. The fully grown AHU fault diagnostic tree with the residual feature.
There are a finite number of ␣ values corresponding to a AHU faults considered in this study. These faults are among the typ-
sequence of pruned trees T0 , T1 ,. . .,Tn . Each Ti in this sequence is ical AHU mechanical faults and are commonly considered in AHU
characterised by a different value ˛i . To generate the sequence, FDD studies.
CART adopted an iterative algorithm starting with the fully grown The diagnostic strategy developed was validated based on the
tree (T0 ) and the first pruned tree (T1 ) is then developed by finding data collected under the AHU mechanical cooling mode tested dur-
the node t ∈ T0 that minimises Eq. (5) [35]. ing the summer period. In the mechanical cooling mode, the supply
The minimum value of this function is the value of ␣0 corre- air temperature was controlled at 12.8 ◦ C and the indoor temper-
sponding to T0 and a new pruned tree T1 is generated by pruning ature was maintained at 22.2 ◦ C. The supply fan was controlled to
tree T0 at the node t. The same procedure will be repeated based maintain the supply air static pressure at 348 Pa while the return
on tree T1 and generates ˛1 and T2 . The iteration terminates until fan operating speed was maintained at 80% of the supply fan oper-
the root node is reached. ating speed [40].
Once the sequence of the minimal cost-complexity trees is gen-
erated, the final pruned tree can be determined by plotting the 4.2. Classification tree induction
relative error (RE) versus the complexity parameters. The relative
error of a decision tree is defined as in Eq. (9). The optimal prune Based on the randomly selected 2/3 training data from the new
point is the point where RE does not have any significant decrease data set, a fully grown decision tree with a minimum number of 20
if further decreasing the cost of complexity. observations in a leaf node was generated, as illustrated in Fig. 2.
ncorrect The total number of observations corresponding to each fault and
RE = 1 − (9) fault free conditions used as the training data is presented in Table 1.
ntotal
The number of the observations in a leaf node is constrained in order
where ncorrect is the number of correctly classified observations and to avoid generating a too large decision tree and also for making the
ntotal is the total number of observations. visualization easier. The fully grown decision tree has a total of 26
splits. The root and internal nodes were labelled with the test condi-
4. Validation and interpretation of the AHU diagnostic tions in order to split the observations with different characteristics
strategy while the leaf nodes were labelled with the classification results.
The left branches of the root and internal nodes are the branches
In this study, the proposed strategy was implemented in R [36] that met the node splitting criteria while the right branches did not.
with CART implemented using the rpart [37] and rpart.plot [38] The number in the leaf node represents the number of the correctly
packages. The majority of figures presented were generated using classified observations out of the total number of the observations
R package ggplot2 [39]. classified to this leaf node. For instance, 333/334 in the cooling
coil valve stuck at 65% opening (i.e. CCVS65%) indicated that 333
4.1. Experimental data observations were correctly classified to CCVS65% fault while one
observation was mis-classified.
The experimental data from ASHRAE project 1312-RP [40–42] From Fig. 2, it can be seen that some faults such as cooling coil
was used to test and evaluate the proposed AHU fault diagnos- valve stuck at fully closed (i.e. CCVSFC) and AHU duct leakage before
tic strategy. ASHRAE 1312-RP tested various AHU faults based on supply fan (i.e. DLBSF) can be easily isolated after only a few node
an experimental AHU facility. Each fault was tested for 12 h from splitting steps. However, isolating some faults such as the heating
6.00a.m. to 6.00p.m. while the fault-free test was conducted for coil valve leakage (i.e HCVL) and exhaust air damper stuck at fully
three days. The sampling rate was one minute. The details of the closed (i.e. EADSFC) required considerable partitions, which sig-
experimental AHU facility and how the faults were introduced and nificantly decreased the interpretability of the diagnostic decision
tested can be found in [41–43]. Table 1 summarizes the different tree. Some leaf nodes only included a very small proportion of the
R. Yan et al. / Energy and Buildings 133 (2016) 37–45 41
Table 1
Summary of AHU faults considered and their abbreviations.
Case No. Fault description Abbreviation Number of data points used as training data
Fig. 4. Post-pruned AHU fault diagnostic tree with the residual feature.
In this section, the rules generated by the decision tree are inter-
preted in order to confirm the reliability of the proposed strategy.
It is worthwhile to note that the following analysis is based on the
faults investigated in this study (Table 1) without considering other
possible AHU faults.
The five faults with F-measure of 1.0, which are also the top five
branches of the diagnostic tree were first analyzed. The first split
in Fig. 4 indicated that the AHU suffered from the cooling coil valve
stuck at fully closed (i.e. CCVSFC) when the supply air temperature
(Tsa ) is not less than 19.2 ◦ C. This rule is easy to understand as there
is no chilled water supply when the cooling coil valve is fully stuck,
and therefore the supply air cannot be cooled down to the required
set-point.
The next split in Fig. 4 isolated the AHU duct leakage before
Fig. 5. Number of splits in the sequence of the pruned subtrees with different costs of supply fan (i.e. DLBSF) fault based on the residual between the pre-
complexity and the corresponding relative errors without using the residual feature. dicted and actual supply fan speed control signal. If the difference
between the predicted and the actual supply fan speed control sig-
nal is not less than 5.23%, the AHU will then be deemed as suffering
Fig. 6. Post-pruned AHU fault diagnostic tree without the residual feature.
R. Yan et al. / Energy and Buildings 133 (2016) 37–45 43
Fig. 8. Density distribution of the return fan power under the faulty and fault-free
conditions.
Fig. 7. Relationship between supply fan speed control signal and supply air flow
rate under the faulty and fault-free conditions.
From Fig. 7, it can be seen that, for the remaining non-isolated faults,
only the data of CCVS65% fault was not overlapped with the fault-
from the DLBSF fault. This also complies with the expert knowledge free data, which indicated that CCVS65% fault may change the air
that the duct leakage before the supply fan leads to a decreased duct duct resistance. However, this symptom did not match with the
resistance before the supply fan as the static pressure before the common knowledge that abnormal operation of the cooling coil
supply fan is negative. Fig. 7 illustrates the relationship between the valve should not result in a variation in the air duct resistance.
AHU supply air flow rate and supply fan speed control signal under ASHRAE 1312-RP report [40] mentioned that the control strategy
different faulty and fault-free conditions, where the black curve of the test AHU facility opened the heating coil valve to maintain
represents the predicted supply fan speed control signal under the the supply air temperature if it is under the over-cooled condition.
fault-free conditions. A large deviation can be observed between Nevertheless, the control strategy had a malfunction which led to
the data points under the AHU normal operating condition and the fully close the outdoor air damper and exhaust air damper and fully
operating condition suffering from the DLBSF fault, which clearly open the return air damper. Due to the malfunction of the test AHU
illustrates why the residual feature was selected by CART for iso- under CCVS65% condition, the training data of this fault was con-
lating this fault. The patterns of other faults will be discussed later. sidered as invalid and was not representative to the AHU CCVS65%
Once CCVSFC and DLBSF faults have been isolated, the return fan fault.
power was then selected as the next splitting feature. Fig. 8 shows To isolate the exhaust air damper stuck at fully closed (i.e.
the density distribution of the return fan power under the fault-free EADSFC) and heating coil valve leakage (i.e. HCVL) faults, CART algo-
condition and all faulty conditions considered except the above two rithm further selected the outdoor air temperature (Toa ) as a new
faults that have been isolated. It should be noticed that the density splitting feature. If the outdoor air temperature is less than 26.8 ◦ C,
distribution of the return fan completely failed (i.e. RFCF) fault is the AHU was more likely suffering from the HCVL fault. The isola-
not visible in Fig. 8 as the power of the return fan of all observations tion of faults based on the outdoor air temperature alone does not
under the RFCF fault was 0. It is clearly shown that the boundary have any physical meaning, but could be meaningful when con-
of the return fan power of 182 W can isolate the return fan fixed at sidered with the previous rule of the cooling coil valve opening
30% speed (RFF30%) and RFCF faults from the other faulty and fault- (Ucc ) that is not less than 64.3%. In another word, if the outdoor
free conditions while the splitting point 77.9W was further picked temperature is not very high but the cooling coil valve opening
up to separate the RFCF fault from the RFF30% fault. Actually, it is is relatively large, the AHU might experience the HCVL fault. This
obvious that 77.9W is not the only point that can separate both combined rule complies with the common knowledge that an HCVL
faults as there is a large gap in the return fan power distribution fault will result in a larger cooling coil valve opening, which will be
under the RFCF fault and RFF30% fault conditions. more significant under the low cooling load conditions. However,
The residual feature was then further used to isolate the outdoor the cooling coil valve opening is not only influenced by the out-
air damper stuck at fully closed (i.e. OADSFC) fault from the other door air temperature but also is influenced by the factors such as
non-isolated faulty and fault-free conditions. When the difference supply air flow rate, supply water temperature, etc. Therefore, a
between the predicted and actual supply fan speed control signal is conclusive fault diagnostic result cannot be drawn based on these
less than −4.09%, the CART determined that the outdoor air damper rules. Fig. 9 shows the isolated fault boundaries in terms of the cool-
was stuck at fully closed. This is because that the fully closed out- ing coil valve control signal and outdoor air temperature. The right
door air damper increased the duct resistance, resulting in a higher bottom corner area with the light red color labelled as Condition
supply fan speed at a given air flow rate. This relationship can also 1 is the data space that the cooling coil valve opening (Ucc ) is not
be observed in Fig. 7. less than 64.3% and the outdoor air temperature (Toa ) is less than
After isolating all faults with F-measures of 1.0, further tree split- 26.8 ◦ C. As the majority data points in this region were with the
ting isolated the following remaining faults, including the cooling heating coil valve leakage (i.e. HCVL) fault and some of them with
coil valve stuck at 65% opening (i.e. CCVS65%) fault, exhaust air the exhaust air damper stuck at fully closed (i.e. EADSFC) fault, it is
damper stuck at fully closed (i.e. EADSFC) fault and heating coil therefore highly possible that the AHU is under the HCVL fault but
valve leakage (i.e. HCVL) fault, from the fault-free condition. The this conclusion is not definite. Another leaf node labelled with the
next split picked up the cooling coil valve control signal (Ucc) as HCVL fault (see Fig. 4) was isolated in the same way, which labelled
the splitting feature, but without resulting in a leaf node. The left as Condition 2 in the area with the light brown color (Fig. 9). It
branch first isolated CCVS65% fault based on the residual feature. deemed that the AHU is also under the HCVL fault if the cooling
44 R. Yan et al. / Energy and Buildings 133 (2016) 37–45
data-driven methods with expert knowledge might be a possible [18] I.T. Jolliffe, Principal Component Analysis, Springer-Verlag, New York, 1986.
solution for developing effective data-driven based fault diagnostic [19] M. Padilla, D. Choinière, A combined passive-active sensor fault detection and
isolation approach for air handling units, Energy Build. 99 (2015) 214–219.
strategies. [20] B. Fan, Z. Du, X. Jin, X. Yang, Y. Guo, A hybrid FDD strategy for local system of
AHU based on artificial neural network and wavelet analysis, Build. Environ.
Acknowledgment 45 (12) (2010) 2698–2708.
[21] Z. Du, X. Jin, Y. Yang, Fault diagnosis for temperature, flow rate and pressure
sensors in VAV systems using wavelet neural network, Appl. Energ. 86 (9)
The authors would like to sincerely thank ASHRAE for granting (2009) 1624–1631.
the permission to use the data from ASHRAE 1312-RP to validate [22] W.Y. Lee, J.M. House, C. Park, G.E. Kelly, Fault diagnosis of an air-handling unit
using artificial neural networks, Trans.-Am. Soc. Heating Refrigerating Air
the strategy developed. Conditioning Eng. 102 (1996) 540–549.
[23] Z. Du, B. Fan, X. Jin, J. Chi, Fault detection and diagnosis for buildings and
References HVAC systems using combined neural networks and subtractive clustering
analysis, Build. Environ. 73 (2014) 1–11.
[24] W.-Y. Lee, J.M. House, N.-H. Kyong, Subsystem level fault diagnosis of a
[1] V. Gunes, S. Peter, T. Givargis, Improving energy efficiency and thermal
building’s air-handling unit using general regression neural networks, Appl.
comfort of smart buildings with HVAC systems in the presence of sensor
Energ. 77 (2) (2004) 153–170.
faults, in: IEEE 17th International Conference on High Performance
[25] J. Wall, Y. Guo, J. Li, S. West, A dynamic machine learning-based technique for
Computing and Communications (HPCC), 2015, 2015, pp. 945–950.
automated fault detection in HVAC systems, ASHRAE Trans. 117 (part 2)
[2] Z. Ma, S. Wang, Fault-tolerant supervisory control of building condenser
(2011) 449–456.
cooling water systems for energy efficiency, HVAC&R Res. 18 (1–2) (2011)
[26] Z. Yu, F. Haghighat, B.C.M. Fung, H. Yoshino, A decision tree method for
126–146.
building energy demand modeling, Energy Build. 42 (10) (2010) 1637–1646.
[3] Z. Ma, S. Wang, Online fault detection and robust control of condenser cooling
[27] K. Basu, L. Hawarah, N. Arghira, H. Joumaa, S. Ploix, A prediction system for
water systems in building central chiller plants, Energy Build. 43 (1) (2011)
home appliance usage, Energy Build. 67 (2013) 668–679.
153–165.
[28] K. Bruton, P. Raftery, B. Kennedy, M. Keane, D.T.J. O’sullivan, Review of
[4] S. Katipamula, M.R. Brambley, Methods for fault detection, diagnostics, and
automated fault detection and diagnostic tools in air handling units, Energy
prognostics for building systems - a review (Part I), HVAC&R Res. 11 (1)
Effic. 7 (2) (2014) 335–351.
(2005) 3–25.
[29] S. Li, A Model-Based Fault Detection and Diagnostic Methodology for
[5] R. Yan, Z. Ma, G. Kokogiannakis, Y. Zhao, A sensor fault detection strategy for
Secondary HVAC Systems, Ph.D. Thesis, College of Engineering, Drexel
air handling units using cluster analysis, Autom. Constr. 70 (2016) 77–88.
University, 2009.
[6] Z. Du, B. Fan, J. Chi, X. Jin, Sensor fault detection and its efficiency analysis in
[30] P.N. Tan, M. Steinbach, V. Kumar, Introduction to Data Mining, Pearson
air handling unit using the combined neural networks, Energy Build. 72
Addison Wesley, USA, 2006.
(2014) 157–166.
[31] W.Y. Loh, Classification and regression trees, Wiley Interdiscip. Rev.: Data
[7] H. Yang, S. Cho, C.S. Tae, M. Zaheeruddin, Sequential rule based algorithms for
Min. Knowl. Discov. 1 (1) (2011) 14–23.
temperature sensor fault detection in air handling units, Energy Convers.
[32] L. Rokach, Data Mining with Decision Trees: Theory and Applications, World
Manage. 49 (8) (2008) 2291–2306.
Scientific, Singapore, 2008.
[8] Y. Zhao, J. Wen, F. Xiao, X. Yang, S. Wang, Diagnostic bayesian networks for
[33] Y. Yohannes, P. Webb, Classification and regression trees, in: CART: A User
diagnosing air handling units faults − Part I: faults in dampers, fans, filters
Manual for Identifying Indicators of Vulnerability to Famine and Chronic Food
and sensors, Appl. Therm. Eng. (2016), http://dx.doi.org/10.1016/j.
Insecurity, International Food Policy Research Institute, 1999.
applthermaleng.2015.09.121, available online.
[34] L. Breiman, J. Friedman, R.A. Olshen, C.J. Stone, Classification and Regression
[9] Y. Zhao, J. Wen, S. Wang, Diagnostic Bayesian networks for diagnosing air
Trees, Wadsworth, Inc, USA, 1984.
handling units faults −Part II: Faults in coils and sensors, Appl. Therm. Eng. 90
[35] L.F.R.A. Torgo, Inductive Learning of Tree-based Regression Models, Ph.D
(2015) 145–157.
Thesis, Department of Computer Science University of Porto, 1999.
[10] H. Wang, Y. Chen, C.W.H. Chan, J. Qin, J. Wang, Online model-based fault
[36] R. Development Core Team, R: A Language and Environment for Statistical
detection and diagnosis strategy for VAV air handling units, Energy Build. 55
Computing [Software], R Foundation for Statistical Computing, Vienna,
(2012) 252–263.
Austria, 2008.
[11] L.H. Chiang, R.D. Braatz, E.L. Russell, Fault Detection and Diagnosis in
[37] T. Therneau, B. Atkinson, B. Ripley, rpart: Recursive Partitioning and
Industrial Systems, Springer London, Great Britain, 2001.
Regression Trees, 2015.
[12] S. Wang, F. Xiao, AHU sensor fault diagnosis using principal component
[38] S. Milborrow, rpart.plot: Plot ’rpart’ Models: An Enhanced Version of
analysis method, Energy Build. 36 (2) (2004) 147–160.
’plot.rpart’, (2015).
[13] Z. Du, X. Jin, L. Wu, Fault detection and diagnosis based on improved PCA with
[39] H. Wickham, Ggplot2: Elegant Graphics for Data Analysis, Springer, New York,
JAA method in VAV systems, Build. Environ. 42 (9) (2007) 3221–3232.
2009.
[14] Z. Du, X. Jin, Multiple faults diagnosis for sensors in air handling unit using
[40] S. Li, J. Wen, Description of fault test in Summer of 2007, In: ASHRAE 1312
Fisher discriminant analysis, Energy Convers. Manage. 49 (12) (2008)
Report, 2007.
3654–3665.
[41] S. Li, J. Wen, X. Zhou, C.J. Klaassen, Development and validation of a dynamic
[15] W.Y. Lee, J.M. House, D.R. Shin, Fault diagnosis and temperature sensor
air handling unit model, Part 1, ASHRAE Trans. 116 (1) (2010) 45–56.
recovery for an air-handling unit, Trans.-Am. Soc. Heating Refrigerating Air
[42] S. Li, J. Wen, X. Zhou, C.J. Klaassen, Development and validation of a dynamic
Conditioning Eng. 103 (1997) 621–633.
air handling unit model, part 2, ASHRAE Trans. 116 (1) (2010) 57–73.
[16] J. Liang, R. Du, Model-based fault detection and diagnosis of HVAC systems
[43] L.K. Norford, J.A. Wright, R.A. Buswell, D. Luo, C.J. Klaassen, A. Suby,
using support vector machine method, Int. J. Refrig. 30 (6) (2007) 1104–1114.
Demonstration of fault detection and diagnosis methods for air-handling
[17] T. Mulumba, A. Afshari, K. Yan, W. Shen, L.K. Norford, Robust model-based
units, HVAC&R Res. 8 (1) (2002) 41–71.
fault diagnosis for air handling units, Energy Build. 86 (2015) 698–707.