Thesis Feature Engineering Feature Extraction and ML Classifier Development FDD
Thesis Feature Engineering Feature Extraction and ML Classifier Development FDD
i
MASTER OF APPLIED SCIENCE (2020) McMaster
Ontario, Canada
ii
Abstract
In this research, a fault detection and diagnosis strategy for internal combustion
engine is developed using measurements that are readily available in engine testing
the engine on a cycle-by-cycle basis using measurements that are accessible on a running
vehicle. Pressure measurements are easily accessible in a testing facility that provide useful
insight into the quality of the combustion occurring inside the engine. However, due to its
cost and complex installation procedures, it is not feasible to obtain in-cylinder pressure
investigated using vibration. Due to the low cost and non-invasive nature of
cylinder pressure measurement is used during the development of the FDD strategy to
assist in characterizing the vibration measurement. Upon data acquisition, features are
measurements are analyzed using thermodynamic theories to assess the combustion quality
of each cycle. The vibration measurements are labelled corresponding to the pressure
analysis. An artificial neural network classifier is developed using the extracted and
labelled features. Developed classifier detected the fault and its location with an overall
accuracy of 96.3%.
iii
To my parents, my sister & R.T for their endless love and support
iv
Acknowledgements
I would like to thank my supervisor, Dr. Habibi for his guidance throughout this
research. I would also like to thank Dr. Tjong and his team at Ford Powertrain Engineering
Research and Development Centre for their support during my time at PERDC.
v
Table of Contents
Acknowledgements ..............................................................................................................v
2.2 Fault Detection & Diagnosis for Internal Combustion Engines ..............................13
vi
2.2.1 FDD Approach ................................................................................................. 14
vii
3.1.2 Proposed Fault Induction Method .................................................................... 57
viii
4.2 Engine Test Procedure .............................................................................................84
ix
Step 3: Defining Baseline ....................................................................................... 118
Chapter 6: Conclusion......................................................................................................136
References ............................................................................................................................ I
x
List of Figures
Figure 1: Four-stroke cycle of internal combustion engine with piston and crankshaft
Figure 9: Frequency and Time Resolution of analysis in different domains [46] ............ 29
xi
Figure 17: Development overview of the proposed strategy: ........................................... 54
Figure 23: Overhead valve vs overhead cam engine design [57] ..................................... 73
Figure 25: Engine cylinder numbering for the V8 test engine [59] .................................. 73
Figure 27: The mechanical load on the crystal producing electrical charge through electric
Figure 32: Raw accelerometer data from front valley, axis parallel with front-back
xii
Figure 33: Raw accelerometer data from front valley, axis parallel with right-left
Figure 35: Raw accelerometer data from front left knock sensor ..................................... 88
Figure 39: Overlay of all of the raw pressure traces from each cylinder @600RPM....... 93
Figure 44: Overlay of trigger data and Cylinder 1 pressure trace @ 600RPM ................ 99
Figure 46: Filtered accelerometer data in crank angle domain ....................................... 100
Figure 47: All pressure traces for 8 cylinder engine cycle ............................................. 102
xiii
Figure 50: Cylinder 8 Pressure Trace of 2 Consecutive Cycles ..................................... 105
Figure 51: Cylinder 1 Pressure Traces for Faulty and Healthy sample cycles ............... 107
Figure 52: Front-Back Axis Accelerometer data from Front Valley for Faulty and Healthy
Figure 53: Right-Left Axis Accelerometer data from Front Valley for Faulty and Healthy
Figure 54: Up-Down Axis Accelerometer data from Front Valley for Faulty and Healthy
Figure 55: Axis Accelerometer data from Front Left Knock Sensor for Faulty and Healthy
Figure 57: Low and high pass filter for db16 ................................................................. 115
Figure 58: Low pass and high pass filter for db10.......................................................... 116
Figure 59: Low pass and high pass filter for db4............................................................ 116
xiv
List of Tables
xv
List of Abbreviations
xvi
List of Symbols
𝑋𝑇 Transpose of X
𝜑 Combined Index
xvii
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Chapter 1: Introduction
1.1 Overview
Fault detection and diagnosis has been a topic of interest for many years in the
industrial sector such as in the oil and gas, the power generation and the automotive
industries. The origins of FDD is rooted in the field of control systems and more
early form of automation is called regulatory control, which allows for low level control
actions to be performed without a human operator [1]. More recently, due to great
regulatory control and advanced well into more complex tasks such as condition
monitoring (including fault detection and diagnosis) of very complex processes and
systems.
Fault Detection and Diagnosis (FDD) has mostly been applied to industrial
components and systems to prevent events that may bring forth tremendous amount of
consumer products become more and more in demand and as sophisticated data analysis
methods arise such as machine learning algorithms, FDD technologies are in demand by a
variety of industries that aim to provide the best quality for service and product.
1
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Currently, the automotive sector is at the verge of a very important change that may
redefine the automotive sector itself and the way consumers perceive transportation and
vehicles. The factor that are initiating such monumental changes in the automotive sector
learning algorithms have proven that there is very useful information embedded in data
that are being collected by sensors that are currently not being extracted. In the automotive
industry large quantities of data are being collected every day. One of the areas that have
great potential for extracting useful information through tools like machine learning is in
components of the vehicle such as the engine. In this research, the potential opportunities
for technological advancement through the access of unused data or unused data source is
The motivation for this research can be viewed from different perspectives. For
example, the condition monitoring or FDD technology for internal combustion engines can
be developed and utilized in a variety of different sectors of the automotive industry such
as in the manufacturing lines, fleet operations, engine diagnostics, and engine controls.
achieved with the help of FDD technology, which could ensure the detection of faulty
engine components. Automated FDD strategy can greatly reduce the number of human
operators that are required in quality assurance. Automation of quality assurance would
2
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
reduce the time required to assess the quality of the components and increase the accuracy
rates of the quality assessment, which allows for human operators to attend to more
Productivity of the manufacturer and the quality of their products can be improved
For a fleet operator, in-vehicle FDD technology would be useful in monitoring the
health of the engine. Although On-Board Diagnostics (OBD) exists in every consumer
vehicle, it can be argued that there is room for improvement in the sophistication of the
data analysis methods to broaden the range of faults and the levels of faults that are
monitored.
For engine controls, an FDD technology that is capable of monitoring the engine’s
combustion quality in real time would be of great significance as control systems could be
for Mechatronics and Hybrid Technologies (CMHT). This research is a continuation of the
3
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The objective of this research is to develop a FDD strategy for internal combustion
intermittent faults. The development of a FDD strategy involves three main stages as
follows, [2]:
This thesis consists of seven chapters that are organized as follow. Chapter 2 presents
the literature review of FDD and feature extraction methods. Chapter 3 present the theory
behind the feature extraction and classification strategy that is implement in this research.
Chapter 4 presents the experimental set-up, data collection, and data pre-processing
4
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the results, and the observations. Chapter 6 presents the noteworthy findings of the research
5
Chapter 2: FDD Strategies for Internal Combustion Engines
2.1 Background
In this section, a review of the internal combustion engine and associated FDD
strategies is provided. The objective of this section is not to provide an exhaustive overview
of each topic, but only the important factors that influence their interaction and integration.
applications by converting chemical energy that are contained in the fuel into useful
mechanical power. The type of internal combustion engine that is most often used in
consumer vehicles is the spark-ignition four stroke engine using the Otto Cycle [3]. The
Otto cycle has four strokes that consist of intake, compression, power and exhaust. These
occur over two full revolutions of the crank shaft [4]. As shown in Figure 1, one rotation
of the crank shaft results in linear motion of the piston from top-dead center (TDC) to
bottom-dead center (BDC) then back to TDC. This repetitive linear motion, which is also
rotational motion of the shaft into linear motion of the piston and vice versa.
6
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 1: Four-stroke cycle of internal combustion engine with piston and crankshaft position [5]
Each stroke in the Otto cycle can be expressed as a thermodynamic process as shown in
Figure 2.
7
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
1. Intake stroke: The intake stroke starts with the piston at top-dead center where the
volume of the chamber is at its minimum and ends with the piston at (BDC) where the
volume of the chamber is at its maximum [3]. As the piston travels down the chamber,
the intake valve opens. During this process, the air-fuel mixture enters the chamber
2. Compression stroke: The compression stroke begins with the intake valve closing and
the piston traveling up the chamber from BDC. This compression is described to be
adiabatic and reversible which is also known as isentropic [7]. During this compression,
the piston does work on the mixture inside the chamber by decreasing the volume and
3. Power stroke: In between the compression and the power strokes, a thermodynamic
process called combustion occurs where the air-fuel mixture is ignited. In the case of a
spark ignition engine, which will be the focus of this research, the mixture is ignited by
using an electrical discharge. During the very rapid combustion process, the volume
stays constant while the pressure and temperature are significantly increased due to the
substantial amount of heat that is released inside the chamber. The power stroke, which
can be described as an isentropic expansion follows the combustion process [7]. During
the power stroke, the piston is pushed from top-dead center to BDC by the gas that is
expanded adiabatically upon the combustion of the air-fuel mixture [3]. In the power
stroke, pressure decreases while the volume increases and the work is said to be done
8
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
4. Exhaust stroke: The exhaust stroke is where the residual gas is removed from the
chamber as the piston travels back to top-dead center with the exhaust valve open. The
piston movement decrease the volume while the open exhaust valve maintains a
The analysis of each stroke shows that the power generated from the cylinder does
not occur constantly throughout the cycle instead it occurs in bursts, specifically during the
power stroke. This means that a single cylinder engine would deliver power in a pulsating
form. Therefore, most engines in a vehicle are equipped with multiple cylinders to provide
more consistent power output by staggering the occurrences of the power stroke for each
cylinder. Although this results in more consistent power output, it does not eliminate the
presence of the pulsation. The fluctuations observed in the multi-cylinder engine occurs
more rapidly with smaller amplitude variations. This fluctuation in power delivery is
reflected in the rotation of the crankshaft which is never constant in a given engine cycle.
Heywood describes the combustion process that occurs inside the chamber as a fast-
exothermic gas-phase reaction [4]. During a normal engine operation, the fuel is mixed
together with air and inducted into the chamber through the open intake valve during the
intake stroke. As the air-fuel mixture enters the chamber, it gets mixed with the residual
gas that remains in the chamber after the exhaust stroke. This mixture of fuel, air, and
residual gas from the previous combustion process is then compressed and ignited by an
electric discharge of a spark plug. Upon ignition, a flame develops and propagates through
the chamber until it reaches the chamber walls where the flame is extinguished [4]. Flames
9
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
propagate in gaseous state [4]. In addition, flame development and propagation vary cycle-
by:
3. variations in the ratio of fresh mixture to residual gases in the cylinder [4].
This leads to the fact that no two engine cycles, even for a given cylinder, are identical.
Due to this inconsistent nature of the engine, the combustion may not occur optimally for
a given cycle since most engines are controlled to run at an “optimal” condition that was
configured for an average cycle [4]. Further information regarding internal combustion
engines that has not been covered in this section can be found in Heywood’s text [4].
Fault detection is defined as the capability to recognize that a fault has occurred
and fault diagnosis is defined as the ability to locate the fault and the cause of the fault [9].
which are arguably the greatest assets of an industrial company, and to prevent extreme
costs associated with unexpected machinery or facility downtime. FDD technology poses
great benefits as it eliminates the escalation of minor faults through their prompt
identification and initiation of timely mitigation measures. Lately, Fault Detection and
10
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Diagnosis (FDD) technology is being pursued by a wide range of industries that are simply
in the system, isolation and localizing of faults and identification of the fault condition
[10]. Before delving into the different concepts of FDD, a few key terms should be
explained and defined. A fault can be broadly defined as a state where a measured variable
or a calculated parameter within a process or a system is out of its acceptable range [11].
Malfunction of a component can be defined as the root cause for the symptoms exhibited
transformations that are involved in determining a fault or failure class given a set of
measurements.
1. Transformation of the input to the overall FDD system called measurements into
features through a process called feature extraction. These methods use a priori
11
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
given FDD system, it becomes evident that feature extraction and diagnostic search
When developing a FDD system, there are two different strategies called model-
based and signal-based approaches. As the name suggests, the model-based approach
utilizes a mathematical model that formulates an expected behavior of the system. The
variations in the parameters that results in change of the system behavior is monitored using
observers or filters [12]. On the other hand, signal based strategies rely purely on historical
measurement data. For complex systems that are challenging to model, signal based FDD
may be better suited. This eliminates the need for a mathematical model and the difficulties
and limitations of modeling [1]. Instead of mathematical models, signal based approaches
12
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
artificial neural network algorithms and statistical methods are utilized to obtain patterns
FDD strategies for internal combustion engines have been explored extensively in
the past. Earlier FDD strategies for internal combustion engines were motivated by the
need for improved productivity and quality in production such presented by Miller and
Tjong [13], [14]. These researchers envisioned a system that is capable of detecting
manufacturing and assembly defects through an online engine monitoring and diagnostic
system [13], [14]. Utilization of such monitoring and diagnostic systems were intended to
be integrated in the manufacturing line through a process that was later popularized as cold
tests and hot tests [13]. During these tests, the engine is put on a test stand where it is
operated in cold or hot modes while measurements are collected at very high sampling
rates [15]. Cold test refers to the operation of the engine without combustion and hot test
refers to the operation of the engine with combustion. The obtained measurements are
analyzed to determine whether or not the engine should proceed to its next manufacturing
process [16]. Due to its simplicity and low costs, more research efforts have been found in
the development and improvement of cold tests. Following Miller and Tjong, a variety of
measurement analysis methods were studied to improve the performance of the cold test
that can be used in the engine manufacturing line. These strategies involve running the
13
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
engine in different modes such as cold testing, hot testing with load, and hot testing without
load while obtaining measurements such as speed, torque, and pressure. Although it is not
clear as to how the measurements are analyzed, it is noted that a variety of defects such as
missing piston rings, leaky head gaskets, leaky intake and exhaust valves along with many
More recently, FDD algorithms for in-vehicle applications were explored, where
with the appropriate hardware, online monitoring of the engine can be achieved within the
vehicle [22]. Currently, vehicles are equipped with On-Board Diagnostic (OBD) systems
that monitor the engine closely with a variety of measurements. However, as concepts of
autonomous and connected vehicles are becoming closer and closer to reality, it is
becoming increasingly important for the automotive industry to broaden the diagnostics
for a wider range of components in the vehicle [22], [23]. With ever increasing computation
power, more sophisticated methods such as wavelet analysis, Principal component analysis
and artificial neural networks (ANN) are becoming tangible for automotive applications
In the literature, the widely accepted FDD approach for internal combustion
engines is the signal-based approach. Model-based strategies have been used but with very
limited scope. Wei applied model-based FDD to diesel engines by modeling the in-cylinder
pressure and torque produced by each cylinder in crank angle domain based on
14
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
behaviors were not within the scope of the model as it was intended for the cold test
application [17]. Chandroth confirms that signal-based strategies are preferred over the
engine, especially the combustive process, given its turbulent and variable nature that was
correlations that are assumed to exist in the measurements selected act as the a priori
knowledge. The type of the measurement such as speed, pressure and/or voltage, the sensor
used to obtain the selected type of measurement and the location of the measurement are
example of all the decisions that must be made in selecting the measurements for a signal-
based strategy. In the following section, the types of measurements that should be
considered for the development of FDD for internal combustion engines are discussed.
Vibration Signals
Sound and vibration measurements are commonly used in FDD systems for
mechanical systems [28], [29], [30], [31] . Vibration is inherent in any mechanical system
that consists of moving components [32]. Every system has its own specific vibration
modes that can be observed during its normal operating conditions. These modes are
dependent on system parameters such as mass, system stiffness, fitting tolerances and
frictions. When changes occur in these system parameters due to a faulty component,
observable vibration behaviors may deviate from its normal signature. In addition,
15
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
vibration signals can be obtained through non-intrusive methods that are relatively cheap
and easy to install. In the literature, a wide range of vibration analysis for FDD of rotating
machines are present [1], [28], [31], [33], [34], [35], [36].
engines are very complex. Vibration of an engine may arise from a variety of components
such as crank train, valve train, piston assembly, fuel system and exhaust system [37]. In
addition, the turbulent combustion process contributes to the overall vibration of the
engine, which further complicates the task analyzing the vibration. Tjong categorized three
components such as valve train, gears, connecting rod, pistons and bearing;
components are exhibited in different forms. For example, the engine is comprised of
components that function in reciprocating motion and rotating motion. Vibration due to
motion are exhibited in impulses in the time domain or relative to the crank position.
16
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Furthermore, engine vibrations also comprise of vibration components that are induced by
Pressure Signals
amount of information on the quality of the combustion in a cycle by cycle basis [4].
Researchers such as Sharkey had performed FDD studies using vibration and in-cylinder
pressure signal trace to detect faults such as leaking intake and exhaust valves [30]. Miller,
Tjong, Jones and Chandroth presented valuable studies that exhibit great potential for
accurate FDD strategies using pressure traces. However, it should be noted that obtaining
in-cylinder pressure traces require retrofitting of engine to allow for installation of very
This section presents a review of a variety of feature extraction methods that are
commonly utilized for signal based FDD strategies involving vibrational measurements.
Time domain analysis is the simplest and cheapest approach to analyzing signals
for FDD strategies as measurements are often obtained in time domain. These time series
signals are usually referred to as raw signal. A variety of statistical indicators such as crest
factor, kurtosis root mean square and standard deviation have been applied to FDD
17
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Time domain analysis has been performed on diesel engines for cold test
applications, which were aimed at determining the reliable threshold values for pass or fail
decision for the cold test given vibrational measurements with very low computational
costs [15]. The research concluded that kurtosis coefficients and the RMS value of the
vibration signal in time domain was successful at characterizing and detecting faults such
tightened oil pump screw and improperly tightened oil jet during cold test operating
conditions. Cold test operating conditions refer to conditions at which the engine is
operated without combustion. However, time domain analysis is not the most ideal
vibration analysis method for FDD strategies for engines at normal operating conditions,
which includes combustion, as time domain measurements of engines are very difficult to
understand and analysis in time domain yield information in reference to time, which is not
very meaningful for the application of FDD on ICE. Time referenced information is not
very valuable as engines operate in various different rotating speeds, which would indicate
that a fault condition analyzed at one speed would not be able to correlate its learned
As mentioned in the previous section, time domain is not a good reference domain
obtained in reference to the angle of the crank shaft is widely used. Miller, Tjong and
Delvecchio applied statistical indicators from time domain analysis on crank angle domain
[13, 15, 14]. Analysis in crank angle domain allows for occurrences of abnormal behaviors
18
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
of a signal to be trace to a specific location in an engine cycle which gives insight into to a
range of components that may have induced such abnormal behavior. This is because
movement of a certain component within the engine is directly related to the position of
the crank shaft. Sophisticated methods of converting time domain signal to crank angle
domain are explored by Arasaratnam [38]. It is important to note that conversion methods
cannot assumed that the crankshaft of the internal combustion engines has a constant
rotational speed. If such assumptions were to be made during the conversion process, the
reference of the measurements with respect to specific crank angles will be skewed
addition, analysis in crank angle domain eliminates the need for FDD strategy to operate
only under the same operating speeds that it was previously trained on. This characteristic
is especially important for the development of in-vehicle condition monitoring systems for
given time-domain signal into their frequency components. These frequency components
can be used as features for a given condition. Frequency analysis are most appropriate for
stationary signals that are periodic as time information is not available in Fourier transform.
In the literature, frequency analysis such as Fourier transform has been used for
FDD of machinery [39], [40], [41]. However, it is often concluded that the event of interest
must be periodic for frequency analysis to be effective [42]. Therefore, frequency domain
19
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
analysis is not an ideal method to be applied for internal combustion engines where signals
are time varying. If internal combustion engines were stationary signals such as in the case
of motoring, frequency domain analysis may be able to detect the presence of a fault and
narrow down the list of potential culprits. However, if multiples of a component that share
the same frequency component exist, the features obtained from frequency domain analysis
will not be able to tell which of those components are faulty. This is because the frequency
analysis provides information on which frequency appears in the signal but not when the
rotating and reciprocating machineries such as engines, turbines and pumps [45]. Instead
20
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
of performing FFT on time-domain signals, order analysis describes a method where FFT
is performed on crank angle domain signals. This allows frequency components of non-
the case of engines would be the crankshaft. Order analysis provide behavior of harmonic
orders of the crankshaft speed with each order corresponding to the multiple of the
crankshaft’s rotational speed. This method is especially useful for an application where a
wide range of speeds are being observed. However, like frequency domain analysis, order
component unless the harmonic order associated to the faulty component is unique in the
whole system.
Retrieving the frequency components and the occurrences of those components for
a given signal is essential for FDD of rotating and reciprocating machines or for any
application that requires the analysis of a non-stationary or transient signal. However, time-
domain analysis is only able to provide information on when a component of the signal
occurred with respect to time, and frequency analysis is only able to provide information
on the frequency components that the signal consists of. To mitigate this constraint and
obtain time localization, a variety of approaches that are known as time-frequency analysis
was introduced. These approaches include but are not limited to Short-Time-Fourier-
21
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
time and frequency. In this literature review, Short Time Fourier Transform (STFT) and
from a compromise between time and frequency analysis. STFT achieves the two-
through the utilization of windowing technique and Fourier Transform [45]. The
windowing technique divides the given signal into short equal time-framed segments,
applied with the assumption that signals can be considered to be stationary if they are
segmented into small enough segments. Figure 5 illustrates the basics of the windowing
technique. As shown in Figure 5, the windowing function is first located at time, t = 0, then
the window is shifted throughout the signal. The most notable characteristic of STFT is
that its window size is kept constant throughout the whole transformation.
22
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
where 𝜏 is the time, 𝜔 is the frequency, 𝑡 is the offset, 𝑥(𝑡) is the signal to be transformed,
and 𝑤(𝑡) is the windowing function. And the energy density of the signal at a given time
23
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
frequency components and its occurrences given a non-stationary signal. However, the
compromise between time and frequency domain analysis has its limitation, which is
described by the Heisenberg uncertainty Principal. It infers that the increase in resolutions
cannot be achieved concurrently due to the inability to know the exact time frequency
representation of a signal. Figure 7 and Figure 8 depict the two extreme cases of high
frequency and high time resolution. It can be observed that when the resolution of one
domain is increased, the resolution of the other decreases. For example, when the size of
the window is decreased the time resolution improves while frequency resolution decreases
and vice versa when the size of the window is increased. This phenomenon is often referred
24
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
A FDD strategy using STFT was applied to bearings by Yazici, where it was
reported that high accuracy was achieved [46]. However, the application of STFT on
engine measurements may not be enough for the development of an effective FDD strategy
characteristic of the feature extraction methods for FDD of engines due to their non-
stationary and transient behavior. STFT has been applied to the analysis of engine vibration
by Chen, where different vibration sources are attempted to be identified and characterized.
The research concluded that STFT is capable of identifying events such as strong valve
impact and piston slap but unable to separate closely-overlapping or weak events such as
valve closures of different cylinders. This finding supports the prior judgment that STFT
is simply not sophisticated enough for the objective of the FDD strategy that is being
explored in this research [50]. Due to the fixed resolution in time and frequency in STFT,
other Time-Frequency techniques such as wavelet analysis are suggested in the literature
25
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
non-stationary and time-varying characteristics. Unlike STFT, where the resolution of the
signal is fixed in both time and frequency domains throughout the whole transformation of
the signal, wavelet utilizes different window sizes to develop a multi-resolution analysis
(MRA) in the time and frequency domain of a signal. In this section, the fundamental
review of continuous wavelet transform (CWT), discrete wavelet transform (DWT), and
26
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The word wavelet originates from the equivalent French word ondelette, which
translates to small wave. It is defined as continuous functions that decay to zero at both
sinusoids. The rate of decay is altered by the dilation or compression of the wavelet. The
wavelet is then translated to capture localized events along with the time domain of the
details of the signal being analyzed, which other forms of analysis may not be able to.
Therefore, the ability to detect such minor details of a signal that is very favorable in
components. It is known that low frequencies can be observed throughout longer time
periods and that higher frequencies can be observed in very short time periods. With this
intuition, it was suggested that a large window size be used to obtain information regarding
low-frequency components and very small window sizes to obtain information regarding
high-frequency components. The ability to capture both the time and frequency
components of a signal with high resolution makes wavelet transform a promising method
for analyzing transient signals that consist of both high and low-frequency components. It
is important to note that the Heisenberg uncertainty Principal still applies to wavelets,
however, wavelet transform is a very clever work around method. The resolution problem
is mitigated by prioritizing the resolution that is more important at a given frequency range.
27
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The difference between STFT and wavelet analysis is best presented in Figure 9.
The wavelet transform’s multiresolution property is illustrated by the different sizes of the
boxes. The boxes indicate the frequency and time resolution at which each transform can
analyze the signal. As illustrated in Figure 9, time and frequency domain analysis are only
able to provide information on the signal at their own domain. This is indicated by their
rectangular shape that has a fixed width in the frequency domain for time-domain analysis
and time domain for frequency domain analysis. STFT is shown to have a square shape
with the same width for time and frequency. This infers that STFT has a fixed resolution
in both time and frequency throughout the entire frequency ranges. On the other hand, the
wavelet transform has both rectangular and square shapes in different sizes. Every box in
the wavelet transform has the same area due to the Heisenberg uncertainty Principal;
however, the widths of both frequency and time are altered, providing different proportions
of resolutions at different frequency levels. For example, at low frequencies, the boxes
have larger heights with thinner widths, which indicates that it has better frequency
resolution with lower time resolution. And at high frequencies, the boxes are shorter and
wider, indicating that it has lower frequency resolution but higher time resolution.
28
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Similar to STFT, wavelet transform multiplies the signal with the wavelet function,
which acts as STFT’s window function, at different time segments in the signal. The
continuous wavelet transform and the inverse continuous wavelet transform are described
∞
1 𝑡−𝑏
𝑋𝑤 (𝑎, 𝑏) = 1⁄∫ 𝑥(𝑡)𝜓 ( ) 𝑑𝑡 , 𝑎 𝜖ℝ + 𝑎𝑛𝑑 𝑏 𝜖ℝ (2.3)
|𝑎| 2 −∞ 𝑎
∞
1 𝑑𝑎 𝑑𝑏
𝑥(𝑡) = ∬ 𝑋𝑤 (𝑎, 𝑏) 𝜓𝑎,𝑏 (𝑡) 2 (2.4)
𝐶𝜓 −∞ 𝑎
29
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
where a is the scale factor, b is the translational value and 𝜓(𝑡) is the mother wavelet where
1 𝑡−𝑏 (2.4)
𝜓𝑎,𝑏 (𝑡) = 1 𝜓( )
|𝑎| ⁄2 𝑎
Wavelet analysis is a process where different scaled and time-shifted versions of the mother
wavelets are compared with the signal that is being analyzed to measure the similarity
between the signal and the wavelet function. The scale parameter corresponds to frequency
resolution of the analysis where higher scale provides more global behaviors of the signal
and lower scale provide more detailed information that may only last for a short time span.
Scaling parameter achieved varied frequency resolution analysis through dilation and
Unlike Fourier transform, the wavelet functions are localized in both time and
frequency [47]. Due to this localization, the wavelet functions must be translated
throughout the time duration of the signal to ensure characterization of the entire signal.
Similarly, wavelets must be scaled in time and shifted in frequency domain due to the
described as a process where the signal being analyzed is multiplied with analysis windows
that are changed in scale and shifted in time then integrated over time. The square root of
the scaling factor is an energy normalization term which ensures that wavelets of different
30
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
All mother wavelets and its offspring have two main characteristics that are very
∞
∫ 𝜓𝑎,𝑏 (𝑡) = 0 (2.5)
−∞
which basically states that all wavelets must have zero average. Second characteristic is
described as,
‖𝜓‖ = 1 (2.6)
which states that their energy must equal to unity. The ability to obtain the transformation
inverse is very useful in Fourier transform. In wavelet transform, dual wavelet function
which is a synthesis function for a given wavelet function is utilized to perform the inverse
mother wavelets are chosen under admissibility condition which is described in the
following equations.
∞
|𝜓(𝜔)|
𝐶𝜓 = ∫ 𝑑𝜔 < ∞ (2.7)
−∞ 𝜔
The admissibility condition requires that 𝜓(𝜔) is differentiable. There are numerous
amounts of wavelets that have been developed for different applications to analyze
very important for the success of the application. Although the concept of wavelet
31
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
to reduce computation time through elimination of redundancy while still ensuring that an
appropriate amount of information is provided for analysis and synthesis of the signal.
Multi-resolution analysis method suggests the decomposition of the input signal into two
different signals using two channel filter bank and down sampling of the filtered signal. In
order to analyze the signal at different scales, the cut-off frequencies of the filter are altered.
The signal is essentially being decomposed into different frequency bins using a series of
half band high and low pass filters. This process is illustrated in Figure 10 where g[n] and
h[n] are filters which are associated with the mother wavelet and scaling function [48]. The
The filtering of signals is described by a convolution operation of the signal and the impulse
The low pass filter removes all the frequencies that are higher than the half of the full
frequency range of the signal and the result is known as approximation. The high pass filter
removes all the frequencies that are lower than the half of the full frequency range of the
signal and the result is known as detail. Through this process, half of the information from
the original signal is lost which results in the reduction of the resolution in half. The
32
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
filtering operation varies the frequency of the signal. The down-sampling process upon
filtering alters the scale through the reduction of the sampling rate, which is achieved
exist in the filtered signal according to the Nyquist sampling theorem and doubles the scale.
The following equations describes how approximate and detail representations are
obtained.
representations of a given signal can be performed iteratively to break down the original
signal into smaller frequency ranges. The decomposition of a signal reduces the time
resolution in half as the number of sample has been reduced by half. On the other hand, the
33
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Reconstruction of the original signal can be obtained using Equation 2.12, where 𝑦ℎ𝑖𝑔ℎ and
𝑦𝑙𝑜𝑤 are given by Equations 2.12a and 2.12b. The summary of the decomposition and
𝑥[𝑛] = ∑(𝑦ℎ𝑖𝑔ℎ [𝑘] ∙ 𝑔[−𝑛 + 2𝑘]) + (𝑦𝑙𝑜𝑤 [𝑘] ∙ ℎ[−𝑛 + 2𝑘]) (2.12)
34
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Wavelet packet transform is very similar to the discrete wavelet transform in that
However, the wavelet packet transform differs from the discrete wavelet transform as it
passes both approximation and detail coefficients through low and high pass filters. Figure
12 shows the wavelet packet decomposition. The main benefit of wavelet packet transform
is that the entire frequency spectrum is highlighted equally unlike DWT, which mainly
highlights the high frequency components. The difference in distribution of the frequency
35
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Measurements from multiple sensors, either from the same type or different types
of sensors, are required as the complexity of the system that is being observed is increased.
For fault detection and diagnosis applications, behaviors that relate to faults must be
36
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
analysis refers to statistical methods in which more than one statistical variable is observed
and analyzed. Multivariate analysis aims to obtain relationships that exist among the
different variables which effect the system’s behavior and to obtain dominant variables that
measurements into statistical indices. One of the key factors that must be accounted for is
the amount of cross-correlation that exists in the entire dataset. The level of cross-
correlation would indicate that, although many different measurements were obtained, very
little information can be extracted from it. Depending on the situation, an additional
measurement may reach a point of diminishing return due to the lack of uniqueness that
popular statistical method for multivariate analysis in the field of engineering including
correlations among a set of multiple variables that are provided for a given system. With
that assumption, PCA aims to convert the dataset into principal components using
orthogonal transformation. In other words, a dataset with many variables that are assumed
which are a set of linearly uncorrelated orthogonal axes. The contribution of each principal
component on the system’s behavior is dictated by the variance that exists in the data along
behavior is high if large variance is observed along its axes. For example, the first principal
component has the largest variance of the dataset, the second principal component has the
37
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
largest variance of the dataset given the constraint that it must be orthogonal to the first
eliminating the data points that dependent on principal components with very low variances
which are considered to have very little contribution to the system’s behavior. This
performs statistical analysis on the compressed dataset to understand the dynamics of the
method that extracts the dominant dynamics that exist in the complex dataset, which is
38
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
𝑇 = 𝑋𝑃 (2.12)
PCA transform is described by Equation 2.12, where 𝑋 is the data matrix of size
number of variables. Principal component loadings represent the basis vectors of the new
uncorrelated signal representation of the dataset and is organized from highest to low
variance components. This indicates that first couple of columns of the principal
components scores hold the most information regarding the data set when compared to the
last couple of principal components. This implies that PCA can identify which
Every column of P represents a basis vector in the new principal component dimension.
There are two conditions that must be met by the transformation matrix P. These conditions
states that the basis vectors must be orthogonal and that the magnitude of variance
influences the principal direction [24]. In addition, the principal component scores matrix
must be a diagonal. A diagonal matrix is defined as a matrix whose entries outside of the
main diagonal equals to zero. As described by Equation 2.12, the aim of PCA is to
transform the data that are described by correlated variables into data that are described by
uncorrelated variables. In other words, the main purpose of PCA is to remove cross
correlation that exist in the raw dataset. Mathematical, this would translate to obtaining the
orthogonal transform matrix P that maps the raw dataset into transformed diagonal
39
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
covariance matrix [24]. The derivation of the solution is introduced in the following series
1
∑𝑇 = 𝑇𝑇𝑇
𝑛−1
1
∑𝑇 = (𝑋𝑃)𝑇 𝑋𝑃
𝑛−1
1
∑𝑇 = 𝑃𝑇 𝑋 𝑇 𝑋𝑃
𝑛−1
1
∑𝑇 = 𝑃𝑇 (𝑋 𝑇 𝑋)𝑃
𝑛−1
1 (2.13)
∑𝑇 = 𝑃𝑇 𝐴𝑃
𝑛−1
𝐴 = 𝑉Λ𝑉 −1 (2.14)
matrix 𝐴, orthogonality of its eigenvectors can be inferred. A matrix property which states
that a transpose of an orthogonal matrix is equal to its inverse is applied to Equation 2.14
to arrive at Equation 2.15. When Equation 2.15 is substituted into Equations 2.13, Equation
40
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
𝐴 = 𝑉Λ𝑉 𝑇 (2.15)
1
∑𝑇 = 𝑃𝑇 (𝑉𝛬𝑉 𝑇 )𝑃 (2.16)
𝑛−1
1
∑𝑇 = 𝛬 (2.17)
𝑛−1
2.18 describes the SVD theorem, where 𝑋 is any arbitrary matrix with size 𝑛 𝑥 𝑚, 𝑈 is a
unitary matrix of size 𝑛 𝑥 𝑛 called left singular matrix, 𝑉 is a unitary matrix of size 𝑚 𝑥 𝑚
called right singular matrix, and 𝛴 is a rectangular diagonal matrix of size 𝑛 𝑥 𝑚 called
𝑋 = 𝑈𝛴𝑉 𝑇 (2.18)
SVD is applied to calculate PCA as shown in Equation 2.19 given that 𝑈 is a unitary matrix
and 𝑈 𝑇 𝑈 = 𝐼. The relation provided in Equation 2.20 relates the SVD and eigenvalue
deposition which infers that transformation matrix P is equal to the right singular matrix V
since 𝑃 = V.
𝑋 𝑇 𝑋 = 𝑉Σ 𝑇 𝑈 𝑇 𝑈𝛴𝑉 𝑇
= 𝑉Σ 𝑇 𝛴𝑉 𝑇 (2.19)
Σ = 𝑋𝑇 𝑋 (2.20)
The application of SVD ensures that columns of 𝑉 , the eigenvectors and the
corresponding elements of Σ, the eigen values are organized in the descending order which
means that the first principal components will have the highest variance. The small
41
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
eigenvalues are often eliminated in practice as these components of the measurement have
PCA transform described in Equation 2.21, where 𝑇̂ and 𝑃̂ are the remaining
𝑋̂ = 𝑇̂𝑃̂𝑇 (2.21)
The method of defining the threshold limit is a field of its own that is being
researched extensively as it is crucial for the performance of PCA. Heuristic rule and
The heuristic rule states that components that consists of 95% of the total variance
should be considered which infers that any principal component with an eigenvalue higher
than 0.05 should remain. The Kaiser’s rule states that eigenvalues that are greater than the
42
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the application of PCA should be considered. Non-centered PCA is the method of PCA
application with no prior processing of the raw dataset. Centered or covariance PCA is
method where the mean value of each variable is subtracted from each instance. And
correlation PCA is a method where dataset is scaled and normalized to have zero mean and
unit standard deviation by subtracting the mean value and dividing it by the variable’s
standard deviation. For applications where the variables are in different units or where the
PCA has been widely utilized in signal based FDD applications due to its ability to
quantify the correlation between the components of a multivariate dataset and the system’s
behavior. This quantification, which provides measure of the covariance, allows for the
ability to highlight the components that have the most significant influence on the system’s
system. PCA technique is especially useful in signal based FDD approaches where system
is complex, and dataset is composed of high dimensional, noisy and correlated data [51].
For the application of PCA on FDD, Cherry and Qin suggests decomposing the
multidimensional data collected for a given system into two smaller sets as shown in
Equation 2.21 [52]. 𝑋̂ represents the main features that consists of high varying principal
components and 𝑋̃ represents the residuals that consists of low varying principal
components. Equations 2.22 shows how the principal component scores and loading
43
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
𝑋 = 𝑋̂ + 𝑋̃ (2.21)
= [𝑇̂ 𝑇̃][𝑃̂ 𝑃̃ ]
= 𝑇𝑃𝑇
The thresholding is applied to the variance of the entire dataset by setting a criterion
on the variance, where meeting the criterion means that the principal component in
question belongs to 𝑋̂ subset and inability to meet the criterion means that the principal
components, 𝑧 number of principal components belong to the 𝑋̂, feature subset while 𝑝 −
thresholding techniques that differ in the criterion that it uses to decompose the dataset into
main features and residual subsets. Most popular methods of thresholding are Heuristic
rule, Kaiser rule and manual selection. Heuristic rule categorizes the principal components
that collectively represent 95% of the variance of all principal components into the feature
subset and the remaining into the residual subset. Kaiser rule categorizes the principal
components that have higher variance than the average variance of all the principal
components into the feature subset and the remaining into the residual subset.
44
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Upon decomposition of the dataset into features and residuals subsets, the
information that are embedded in the principal components must be expressed in terms of
statistical indices to proceed with fault detection and diagnosis methods. Indices
combined index 𝜑. These indices provide a measure of how different a given measurement
Hoteling’s index is calculated using Equation 2.23, where 𝑥𝑛𝑒𝑤 is the new
̂ is 𝒯 2 projecting matrix. 𝐷
measurements, and 𝐷 ̂ is
̂ is defined in Equation 2.24, where Λ
defined in Equations 2.25 and 𝑛 is the size of the measurements. Hoteling’s index
calculates the distance between the feature subset of the baseline and the new measurement
̂ 𝑥𝑛𝑒𝑤
= 𝑥𝑛𝑒𝑤 𝐷 𝑡 (2.23)
𝐷 ̂−1 𝑃̂𝑡
̂ = 𝑃̂ Λ (2.24)
1 (2.25)
̂=
Λ 𝑇̂𝑇̂ 𝑡
𝑛−1
SPE index is calculated using Equation 2.26, where 𝐶̃ is the residual subspace that
the new measurement is projected on. 𝐶̃ is defined in Equation 2.27. SPE index calculates
45
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the distance between the new measurement and the residuals subset of the baseline
2 (2.26)
𝑆𝑃𝐸 = 𝒬 = ‖𝑥̃𝑛𝑒𝑤 ‖2 = ‖𝑥̃𝑛𝑒𝑤 𝐶̃ ‖
𝐶̃ = 𝑃̃𝑃̃𝑡 (2.27)
The combined index introduced by Yue and Qin is obtained using Equation 2.28
and 2.29, where 𝛿 is the upper limit of 𝒬, 𝜏 is the upper limit of 𝒯 2 , and Φ is the projecting
matrix. Equations 2.30 defines Φ. As shown in Equation 2.28, the combined index is a
𝒬 𝒯2
𝜑= 2+ 2 (2.28)
𝛿 𝜏
𝑡
𝜑 = 𝑥𝑛𝑒𝑤 Φ𝑥𝑛𝑒𝑤 (2.29)
The calculated indices are compared to an upper limit where exiting the limit
indicates the presence of a fault condition in the system. Upper control limit of index 𝒯 2 ,
which is proportional to the F-distribution, is calculated using Equation 2.31 or 2.32 [53],
[18]. ∝ is the assigned confidence level, 𝑛 is the sample size of the original data, and 𝑙 is
the number of principal components. Equation 2.31 is utilized for larger sample size
applications whereas, Equation 2.32 is more suitable for smaller sample size applications.
𝑙(𝑛 − 𝑙) (2.31)
𝜏 2 = 𝜏𝛼2 = Ƒ (𝑙, 𝑛 − 𝑙), 𝑤ℎ𝑒𝑟𝑒 ∀𝒯 2 , 𝒯 2 ≤ 𝜏 2
𝑛−𝑙 𝛼
46
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
𝑙(𝑛2 − 𝑙) (2.32)
𝜏 2 = 𝜏𝛼2 = Ƒ (𝑙, 𝑛 − 𝑙), 𝑤ℎ𝑒𝑟𝑒 ∀𝒯 2 , 𝒯 2 ≤ 𝜏 2
𝑛(𝑛 − 𝑙) 𝛼
For index 𝒬, which is quadratic in nature, the upper control limit is calculated using
the weighted chi-squared distribution as shown in Equation 2.33, where 𝒳𝛼2 is the chi-
square distribution, 𝑚 is the sample mean, 𝑣 is the sample variance, 𝑔 is the weight of the
𝑣 (2.34)
𝑔=
2𝑚
2𝑚2 (2.35)
ℎ=
𝑣
the chi-square distribution by Yue and Qin [15 ismail]. The upper control limit is described
in Equation 2.36, where 𝑔𝜑 is the weight of distribution, ℎ𝜑 is the degree of freedom and
∝ is the assigned confidence level. Definitions of the weight of distribution and degree of
̃), 𝜃2 =
freedom are shown in Equation 2.37 and 2.38, respectively, where, 𝜃1 = 𝑡𝑟𝑎𝑐𝑒 (Λ
1
̃2 ), and Λ
𝑡𝑟𝑎𝑐𝑒 ( Λ ̃= 𝑇̃𝑇̃ 𝑇 .
𝑛−1
𝑙 𝜃2 𝑙 𝜃1 (2.37)
𝑔𝜑 = ( 4
+ 2 )⁄( 2 + 2 )
𝜏𝛼 𝛿𝛼 𝜏𝛼 𝛿𝛼
47
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
𝑙 𝜃1 𝑙 𝜃2 (2.38)
ℎ𝜑 = ( 2
+ 2 )2 ⁄( 4 + 4 )
𝜏𝛼 𝛿𝛼 𝜏𝛼 𝛿𝛼
As discussed in previous sections, PCA and wavelet transforms are very powerful
tools that can be applied to FDD applications. More specifically, PCA is specialized in its
ability to analyze multiple sensor measurements and identifying the relationships between
time series measurement into time and frequency components. However, FDD strategy for
complex systems such as an internal combustion engine requires the specialities of both
PCA and wavelet transform simultaneously. In other words, for a successful FDD strategy,
component is vital.
Analysis (MSCPA), which can be described as a hybrid between PCA and wavelet
wavelet transform then performing PCA on the wavelet coefficients [55]. The concept of
MSPCA is presented in Figure 15 where the signal is shown to be decomposed using DWT
and the resulting wavelet coefficients become inputs to PCA where the loading and scores
matrices are calculated for each wavelet coefficient. Upon loading and scores calculation,
upper limits of 𝒯 2 , 𝒬, and 𝜑 are calculated for fault detection and diagnosis that were
48
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
introduced in Section 2.3.6. The difference between PCA and MSPCA is Principal
components and the upper limits are calculated for each wavelet bin.
2.3.8 Mod-MSPCA
MSPCA [56]. The schematic of Mod-MSPCA is provided in Figure 16, which shows that
the wavelet coefficients are not directly inputted into PCA upon decomposition through
DWT. The wavelet coefficients in Mod-MSPCA are reconstructed using wavelet synthesis
method then inputted into PCA at each level. The purpose of this modification is to utilize
Reconstruction Based Charts for better detection and diagnosis of faults. RBC required
sample size closer to the measurement sample size but due to the decomposition step prior
to the PCA original MSPCA was not compatible with RBC. The reason for applying RBC
for fault detection and diagnosis was to mitigate the issues of the upper limit technique
such as inability to quickly detect changes in states of the operating conditions. Such
limitation would manifest itself as false alarm indicating a presence of a fault even when
49
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the fault has been resolved. Yoon and MacGregor states that RBC guarantees a higher
accuracy in fault diagnosis. Further information regarding RBC can be found in Yoon and
50
Chapter 3: Proposed FDD Strategy
This section will provide the details on the proposed fault detection and diagnosis
strategy that will be developed and tested using real engine test-data. An overview of the
purpose, constraints and assumptions involved in the development of the proposed FDD
The purpose of the proposed FDD strategy is to monitor the internal combustion
engine in a cycle-by-cycle basis to detect intermittent misfire faults that may occur in
different cylinders of the engine. In this context, intermittent faults are faults that do not
exhibit in every engine cycle. The application of the FDD strategy is aimed at detecting the
presence of a misfire and the location of the fault condition, in other words, in which
Due to the complexity of engine dynamics discussed in Section 2.2.1, the proposed
FDD strategy utilizes the signal-based approach to enable rapid development of a robust
strategy. The measurements used in the development of the FDD strategy includes
pressure, vibration and angular position. The data acquisition for the development of the
development testing procedures. However, it is very important to note that the in-vehicle
51
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
application of the proposed FDD strategy is intended to only consider the vibration and
angular position measurement. This is due to the very expensive cost associated with
pressure sensors and the retrofitting of the pressure sensors in the engine as discussed in
Section 2.2.2. The ability to detect fault conditions using only the vibration signal is
correlation between the pressure and vibration measurement in the crank angle domain.
With the knowledge of the pressure measurement’s ability to assess the combustion quality,
plausible with just the vibration measurement if the knowledge of the combustion quality
obtained through the analysis of the pressure measurement is transferred through data
labeling. Another important concept that will be utilized in the proposed FDD strategy is
the analysis of data in the crank angle domain to correlate signal behaviors to specific
outlines four major components of the development method: equipment set-up & data
chapter will introduce detailed concept of each component of the strategy. Chapter 4 and 5
will provide the detailed implementation of the concepts discussed in this chapter on real
data set.
52
53
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
54
3.1 Proposed Experimental Set-up & Data Acquisition
As mentioned before, the proposed FDD strategy utilizes the signal-based approach
due to the complex nature of the system being monitored. From the literature review, it can
be concluded that modeling of engines requires tremendous amount of effort and resources
that is not feasible for the development of a FDD strategy. Therefore, the use of engine
testing process is suggested to develop a signal based FDD strategy for internal combustion
engines. The benefit in utilizing the testing environment for the data acquisition component
of the FDD development process is the amount of quality measurement that is available.
These measurements have potential to provide valuable insight into the engine conditions
and behaviors. The specifics of the engine and test cell set-up utilized in this research is
vibration in crank domain due to its low cost and effectiveness in characterizing faults in
development of the proposed FDD strategy is the pressure traces of each cylinder. Pressure
monitored in the application of the FDD strategy as pressure sensors are extremely costly
and require complex retrofitting of the engine. Therefore, the pressure traces are proposed
to be incorporated only in the development of the FDD strategy. The role of the pressure
55
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
traces during the development of the FDD strategy is to provide valuable information,
specifically on the combustion quality of each engine cycle. This high quality information
is to be incorporated into the development of the FDD strategy through date labeling
method, where each vibration cycle is labeled as healthy or faulty through the analysis of
the pressure measurement. The specifications and the locations of the accelerometers is
further elaborated in Section 4.1.5. Similarly, the specifications of the pressure transducer
are provided in Section 4.1.4. The pressure analysis method proposed is explained in
Section 3.2 and the implementation of the method in this research is presented in Section
4.3.
The position measurement of the crankshaft of the engine is utilized to segment the
continuous vibration and pressure measurement for the development process and just the
vibration measurement for the application of the developed FDD strategy into aligned
individual engine cycles. Also, the position measurement is to be utilized to convert the
vibration and pressure measurement into crank angle domain from time domain. The
rotational speed of an internal combustion engine is variable and never constant which
means that events that are referenced in time provide little information regarding what
process or component caused that event. However, when events are referenced using crank
angle, simple analysis can infer useful information regarding the components or the process
that caused the observed event. Detailed information regarding the optical encoder utilized
is provided in Section 4.1.3. The theoretical concepts involved in the analysis of the
encoder signals will be discussed in Section 3.2 and the implantation of the method is
56
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The purpose of data acquisition is to obtain engine behavioral data on healthy and
faulty conditions under the same operating conditions to ensure that the fault condition is
not being induced due to the change in the operating condition. The proposed method of
fault induction during data acquisition process is to test the engine at parametric conditions
that increases the possibility of a misfire. This method is proposed to obtain data that
closely resembles the behavior that would be observed from an engine under normal
unplugging the spark plug from the engine [22]. This provides conditions of the engine that
does not provide a realistic representation of how a misfire may occur in an engine that is
operating within a vehicle. The details on the test equipment such as the engine,
dynamometer, and the data acquisition hardware are provided in Section 4.1 and the testing
The purpose of the proposed data pre-processing method is to transform the raw
signal that has been acquired through testing into a format that will allow the feature
extraction method to extract the most information it can infer from the raw signals. This
transformation includes the conversions of raw time series measurement into crank angle
engine cycles.
57
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
measurements in crank angle domain is very important for reciprocating machines such as
the internal combustion engine, where all events of the engine are intricately timed in terms
of the crank position. The raw measurements that are obtained during the data acquisition
is in the time domain and engine events are difficult to accurately be inferred in this domain
due to wide operating speed ranges and its inability to hold a constant speed, which was
angular displacement of the crank shaft in time domain. The time measurements are
proposed to be converted into crank angle domain using the crank angle and time
In order for the FDD strategy to monitor the engine in cycle-by-cycle manner, the
continuous crank angle domain measurements need to be segmented into individual engine
cycles. Engine cycle is defined as two revolution of the crankshaft which is equivalent of
720 degrees. Using the measurements in crank angle domain, 720 degrees of the crank
shaft can easily be obtained. However, each individual cycle must be aligned to a specific
engine event to ensure that alignment of the entire data set of cycles. Alignment is
58
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
If the engine cycles of the same fault condition are not aligned at the same engine event,
the time information that is extracted from the feature extraction process will differ
between the features extracted from the two separate cycles. This means that the
representation of the fault condition using both of these engine cycles can be considered
invalid. For successful characterization of each fault condition, it is crucial that aligned
engine cycles are utilized for fault extraction. The engine event that is utilized for this
The combustion quality information of the engine cycles inferred from the pressure
proposed FDD strategy. The method of pressure analysis that is proposed in this research
is the calculation of Indicated Mean Effective Pressure (IMEP). The purpose of this
represents the average pressure that is acting on the piston during the engine cycle. IMEP
is a very accurate and an easy to calculate indicator of misfires. For the pressure trace of a
combustion chamber when an engine is motoring, which means that there are no
combustion occurring inside the combustion chambers, the IMEP value is zero. A
calculated value of IMEP below a set threshold limit will be defined as a misfire Therefore,
value close to the threshold limit, it would indicate that a misfire has occurred in that
59
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
combustion chamber. The steps involved in calculating the IMEP values of each engine
The information regarding the combustion quality obtained through the pressure
labeling. Labeling of vibration data for each engine cycle is required as the fault conditions,
which are differentiated by its combustion quality, are not easily inferred using vibration
signals. With the assumption that correlation exists between the vibration measurements
and the corresponding fault condition, the label is aimed at guiding the characterization of
each condition using vibration measurements. The true state of the combustion quality is
to be determined through the pressure analysis discussed in the previous section. Different
fault labels were given to misfires occurring in different cylinders to allow the classifier to
be able to differentiate a misfire that has occurred in one cylinder or another, for fault
diagnosis component of the FDD strategy. The main purpose of the labeling process is to
eliminate the need for using pressure transducers in vehicles for the application of the FDD
engine cycle that are labeled using pressure measurements are proposed to be performed
60
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
that utilizes PCA and wavelet packet transform (WPT). This method is used due to its
ability to analyze non-transient behaviors of a complex system that have been captured
through multiple sensors. In short, the Extended-MSPCA decomposes a given signal using
WPT, which provides corresponding wavelet coefficients then applies PCA on those
coefficients.
2.3.7. However, the feature that differentiates the Extended-MSPCA from Mod-MSPCA
difference between the two methods is that Extended-MSPCA utilizes Wavelet Packet
Transform (WPT) while Mod-MSPCA utilizes Discrete Wavelet Transform (DWT). The
mathematical theories regarding WPT and DWT were discussed in Section 2.3.5.2. The
two transforms differ in the way they decompose the signal being transformed. DWT
decomposes the signals in a way that highlights the higher frequency contents whereas,
WPT decomposes the signal equally along the signal’s frequency spectrum. Therefore,
WPT provides better granularity in terms of frequency in comparison to the DWT. The
highlight all of the frequency bands equally. As faults may occur in both high and low
frequencies, it is very important to utilize the feature extraction method that is capable of
61
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
from Mod-MSPCA in the way normalization is achieved. In Mod-MSPCA the raw signal
is immediately decomposed using DWT and the output coefficients are normalized for
them using WPT. Therefore, the outputs of WPT, which are the wavelet coefficient, are
considered to be normalized which means that normalization upon WPT is unnecessary for
PCA. The summary of the proposed normalization method is provided in Figure 18 and
Figure 19. The proposed order of normalization allows for features extraction process to
minimize the effects of varying scale and variances of measurements due to difference in
steps involved in executing the Extended-MSPCA is as follows. Upon data acquisition and
necessary data pre-processing that were introduced previously in this chapter, the baseline
62
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
normal or healthy measurement. However, due to the variability that is inherent in engine
operation, which was discussed in Section 2.1.1, the selection of the baseline measurement
is not trivial. It is clear that the attempt to represent what normal engine cycle looks like
is to be plotted in the feature domain, the normal cycle will be represented as a single point.
If all of the normal cycles were to be plotted in the feature domain, the normal cycle will
normal conditions; the cluster representation is a more realistic depiction of what normal
conditions look like as opposed to the single point representation. Therefore, the baseline
calculated baseline data such as an average of the multiple baseline measurement. Instead,
Upon selection of the baseline measurements, the measurement, that the feature
will be extraction from, is proposed to be normalized using Equations 3.1 and 3.2 where
The assigned baseline measurement is also normalized using its own parameters.
63
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
1
𝑥𝑟𝑎𝑤 − 𝑛 ∑𝑛1 𝑥𝑟𝑎𝑤,𝑖
𝑥𝑛𝑜𝑟𝑚𝑎𝑙𝑖𝑧𝑒𝑑 = 2 (3.2)
1 1
∑𝑛1 (𝑥𝑟𝑎𝑤,𝑖 − ∑𝑛1 𝑥𝑟𝑎𝑤,𝑖 )
𝑛−1 𝑛
using WPT. The PCA component of the proposed feature extraction method is then applied
to calculate the principal component matrices for the healthy and the faulty measurements
using Equations 3.3 and 3.4 given the relationship provided in Equation 3.5.
A statistical index was introduced by Ismail to enable fault diagnosis capabilities of the
specific application method is proposed. This application method is heavily focused around
64
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
how to perform feature extraction using multiple baselines. The proposed feature extraction
process for the development of the FDD strategy aims to produce multiple representations
of the fault signature based off of multiple baseline data as shown in Figure 17. This will
lead to a clustering of healthy and faulty data in the feature domain. More representation
of one faulty data with respect to multiple baseline data allows for the classification
algorithm to account for the variable nature of the engine cycle. It should be noted that
these baseline data are not used to represent a condition. In other words, a feature
representation of baseline data is not extracted or used in the training process. It is also
important to have healthy data represented in comparison to the multiple baseline data to
develop a classifier algorithm that would classify features into fault categories. Supervised
learning algorithms take labelled data set that provide input-output examples and learns the
relationship between the input features and the output labels [57]. During the learning
process, the algorithm refines the mapping of the inputs and the outputs by minimizing the
error between the desired outputs and the classifier algorithm’s output. The errors are
65
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
proposed to be the classifier algorithm for this research due to its compatibility with
nonlinearity and ease of application. Similar to the human brain, ANN are composed of
clustering or a row of neurons called a layer, as shown in Figure 20. The simplest MLP
consists of three layers: input, hidden and output layer. Figure 20 depicts MLP architecture
with two hidden layers. It is important to note that every neuron from a layer is connected
to every neuron in the previous and the following layer, this type of architecture is known
as a fully connected neural network. Figure 20 also shows that the information flows from
the input to the hidden layers then to the output layer, this type of architecture is known as
66
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
function and an output, is depicted in Figure 21. As shown in the figure, each neuron
receives m number of inputs which are multiplied with their respective weights. The
weights correspond to the connection between a neuron from the previous layer and a
neuron in the following layer. The weights are unique to each connection and are updated
during the training process. Figure 21 also shows that each neuron consists of its own bias
which also gets multiplied with its corresponding weight value. The values obtained
is fed into the activation function as an input to calculate the output of the neuron, 𝑦𝑘 =
𝜑(𝑣𝑘 ) [57]. A variety of activation functions are available such as sigmoid, tangent
hyperbolic, softmax and rectified linear units. The nonlinear behavior of these activation
multi-class classification problem, a softmax activation function is used in the output layer.
67
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
are two types of signals that flow through the network which are called function and error
signals as shown in Figure 22 [57]. The main distinction between the two signals are the
direction in which they travel through the network. The forward propagation is carried out
by the function signal and the back propagation is carried out by the error signal [57].
During forward propagation, the function signal travels through each layer from one neuron
to another to transform the input into an output value. During back propagation, an error
signal is obtained by comparing the output value of the neural network to the desired
output. This error signal travels backwards through the network layer by layer. As the error
signal travels, the weights associated with each neuron to neuron connection is adjusted to
minimize the error [57]. As mentioned before, errors are represented by loss functions
repeated until the loss function is minimized to a satisfactory value. For this research a
cross-entropy loss function is proposed to be minimized using the scaled conjugate gradient
backpropagation methods.
68
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
take into consideration the variable engine dynamics. Given a single engine cycle, n
individual features are extracted if n number of baselines are used. These individual
features are different feature representations of a single engine cycle. During the training
individual training sample. The classifier algorithm will not know that these
representations were obtained from one engine cycle. This method is proposed to provide
classifier with enough variations of each fault condition to learn from. Exposing the
classifier to as many variations possible allows the classifier to understand the variable
nature of the engine behavior. This is especially useful if the dataset obtained is small.
Once the classifier is fully trained, the FDD strategy will utilize a majority voting
method to detect a fault. For example, if n number of baselines are used to extract features
from a new engine cycle, n number of feature representations will be extracted. Each of the
feature representations will separately be classified using the trained classifier, which will
69
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
provide n different assessments of the fault condition. A majority voting method is utilized
to make the final prediction on the fault condition of the new engine cycle. The concept of
the majority voting method is to select, among all the of the different predictions that were
provided by individual features, the prediction that had the most votes. The schematic
process of this method is shown in Figure 17 under the classifier development subset. This
majority voting method is utilized to reduce detection error caused by uncertainty of the
70
Chapter 4: Data Acquisition
In this section, the details of the proposed strategies regarding data acquisition are
provided. These subsections include the experimental setup, the engine test procedures and
data pre-processing.
This chapter provides detailed descriptions of the experimental setup utilized for
the development of the FFD algorithm for internal combustion engines. The FDD
algorithm monitors the engine on a cycle by cycle basis. It is a signal-based approach that
The experimental setup was used to acquire data for feature extraction of different
conditions, training of the machine learning classification tool, and testing of the classifier.
The main elements of the experimental setup include a test cell, an engine, an engine
transducers and accelerometers). The data acquired includes measurements from the
71
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
4.1.1 Engine
All the experiments for data acquisition were performed on a Ford’s V8 spark
ignition, pushrod developmental engine. The engine was provided by Ford’s Powertrain
engine are omitted due to confidentiality. However, the focus of this research and the
resulting conclusions are not dependent on the specifics of this particular engine. Instead,
the measurements acquired and the strategy implemented consider phenomena that are
universal to engines of this type, at the least; V-style, 8 cylinder, and overhead valve model.
Figure 23 illustrates the difference between an overhead valve and an overhead cam engine
design and Figure 24 illustrates the difference between an inline engine and a V-style
engine.
72
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 25 shows the cylinder numbering for the test engine utilized in this research.
The firing order of the engine is cylinders 1-5-4-8-6-3-7-2 as shown in Figure 22.
Figure 25: Engine cylinder numbering for the V8 test engine [60]
73
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The testing facility was provided by Ford Powertrain Engineering Research and
assembly and monitoring instrumentations. The test cell that was utilized for this research
was unique amongst all of the test cells that are located at PERDC. The test cell is in a
semi-anechoic chamber which ensures the elimination of surrounding noise and absorption
of the energy created within the chamber. In addition, only the engine assembly, including
the engine, was housed in the test cell to ensure that vibrations from the engine alone are
The engine assembly consisted of the clutch, transmission assembly, fan belt,
power steering pump, alternator, compressor, powertrain mounts, and mount brackets. The
purpose of the engine assembly is to simulate the in-vehicle conditions that the engine is
to operate within.
4.1.2.2 Dynamometer
The dynamometer was housed outside the chamber in a separate utility room. This
separation ensures that only the noise and vibration from the engine assembly is being
captured during data acquisition without external noise. The following table summarizes
74
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
4.1.3 Encoder
The encoder is used to measure the rotational position of a shaft [61]. An encoder
is equipped with an optical sensor, light source and rotating disk with either opaque and
transparent sections or slits. The optical encoder is mounted on the rotating shaft to be
studied and the optical sensor outputs “on” or “off” pulses. depending on whether or not it
sensed the light that was emitted by the light source. The optical sensor would sense the
light in the case where the transparent or the opening of the slit is passing through the light
source and the sensor which are in line with each other with the disk rotating in between
the two as shown in Figure 26. The output will be “off” if the light is blocked due to the
opaque section or the solid portion of the disk. The output of the encoder is a digital signal
which means that it will be in the form of a square wave with the high values indicating
the “on” state and the low values indicating the “off” state. With the knowledge of the
angular width of the sections or the angular distances between slits, the angular position of
75
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the shaft can be obtained with respect to time. The angular speed and acceleration of the
shaft can also be derived from the position and time data obtained from the encoder.
In this experiment, the AVL 365 angle encoder was used as it provides high
precision and high resolution angle information that is required for converting the
transducer signals from time domain to crank angle domain. This encoder has a resolution
of 0.5-degree crank angle which means that each engine cycle, which involves two
rotations of the crankshaft, will consist of 1440 data points. A trigger signal was also
obtained from this encoder which indicates the occurrence of the top dead centre (TDC) of
Piston 1 with an offset. The offset of the trigger signal to TDC of Piston 1 is 131.662 degree
which means that the trigger signal gets switched “on” 131.662 degrees after the
occurrence of TDC of Piston 1. The trigger signal is crucial in resampling of the transducer
signals as it ensures that every cycle is synchronized properly in crank angle domain and
76
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
allows for the data set to be divided into cycles in the time domain. The AVL 365 angle
encoder has the RS422 transistor-to-logic (TTL) output driver, output range of 0 – 5V and
Pressure traces provide useful insight into the combustion happening inside the
engine. To correctly label the data, pressure traces were used to calculate values such as
Indicated Mean Effective Pressure (IMEP) to determine if a combustive fault has occurred
Two main types of transducers for sensing pressure are piezoelectric or piezo-
resistive. The application determines which type of sensor should be used. These two types
of transducers differ in that piezoelectric pressure sensors measure the electrical charge
Figure 27: The mechanical load on the crystal producing electrical charge through electric dipole [64]
77
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
For the piezoelectric sensor, the electrical charge is produced by the piezoelectric
material, such as quartz, and occurs when the surfaces of the crystal are loaded
mechanically. This load causes the shift of positive and negative crystal lattice elements in
relation to each other resulting in the formation of an electric dipole [64]. These sensors
have a very high natural frequency which makes them ideal for measuring fast pressure
rise times. However, one of the downsides of piezoelectric sensors is that the measurement
For the data acquisition portion of the research, pressure traces were obtained
through Kistler’s PiezoStar Type 6125C pressure sensors which are designed for in-
heads with machined holes for each cylinder that are compatible with the pressure
transducers for testing. PiezoStar pressure transducers are piezoelectric sensors with
PiezoStar crystals which are produced by Kistler instead of quartz. It has a measuring range
78
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
of 0 – 300 bar and a sensitivity of -33 pC/bar [66]. The negative sensitivity of the pressure
transducer is due to the fact that pressure applied to a piezoelectric sensor produces
negative charge signal. This phenomenon is not found in the output signal as external
charge amplifier converts the negative going charge into a positive voltage signal [64].
4.1.5 Accelerometers
transducers that were discussed in the previous section, accelerometers use piezoelectric
piezoelectric material and preload bolt. The acceleration experienced by the component
being studied is transmitted to the accelerometer and causes the seismic mass inside it to
also accelerate. A proportional force generated by the acceleration of the seismic mass is
induced on the piezoelectric material that is clamped between the seismic mass and the
base of the accelerometer by a preloaded bolt. The proportional force applied to the
piezoelectric material by the seismic mass results in an electric dipole formation due to the
shift in the lattice structure of the piezoelectric material. This electric dipole then generates
electrical charge proportional to the acceleration that can be acquired by the data
acquisition hardware.
79
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
There are two types of accelerometers, charge type piezoelectric sensors and
integrated electronic piezoelectric (IEPE) sensor and they differ in the way they convert
the charge from the piezoelectric crystal. In the charge type accelerometers, the charge is
sensor does not require an external amplifier as it has a built-in charge amplifier. The IEPE
sensors require a data acquisition hardware that provides current excitation for the built-in
amplifier.
In this research, Dytran IEPE tri-axial accelerometers were used to obtain the
vibration from various parts of the engine. The accelerometer has a range of +/- 5000g,
frequency response of 10 kHz and a sensitivity of 1mV/g [68]. It should be noted that some
may argue that charge type accelerometer would have been preferred over the IEPE as they
are suited for high temperature applications. However, given the fact that the normal
operating temperature for an engine is anywhere between 195 to 220 degrees Fahrenheit
80
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
and that the operating temperature of the IEPE accelerometers are rated up to 250 degrees
Fahrenheit, IEPE sensors are used with cautionary measures in place such as wrapping the
The accelerometers were mounted on the engine with epoxy or with bolts when
possible to ensure that the true vibrational readings are being obtained. Two accelerometers
were placed on top of two knock sensors in the left bank of the V8 engine. From the
accelerometer placed on top of the knock sensors, only the z-axis measurement was
acquired. The z-axis is parallel to the vibrational measurement obtained by the knock
that knock sensors observe. This location is strategically selected as the success of the
proposed FDD strategy using sensors and sensor locations that already exist in the
manufacturing process would reduce cost of implementing such strategy. The locations of
81
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the knock sensors for the experimental engine are shown on a similar V8 engine in Figure
30. Two other accelerometers were placed in the valley of the V-style engine as shown in
Figure 31. The placement of these sensors was solely explorative which aims to capture
the overall vibrational behaviors of the engine. All 3 axes from these accelerometers were
acquired to observe the up and down, left to right and front to back vibrations of the engine.
Y Y
X X
prior to mounting. The calibrations were performed using The Modal Shop 9100D Portable
Shaker Table.
82
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
electrical signals which are collected and digitized by a data acquisition hardware [71].
Depending on the data acquisition hardware, the module may be capable of providing
single device or all of the listed functionalities on a single multifunction device. The type
of sensor dictates the functionality of the data acquisition hardware required to acquire the
signal [71]. For example, the encoder signal which outputs a digital signal would require
the digital input/output functionality whereas, the accelerometer signal which outputs an
analogue signal would require the analogue input functionality and Analog-to-Digital
For this research experimentation, signals from the encoder, accelerometers and
pressure transducers were acquired by multifunction devices called PROSIG P8020 and
P8012. These two modules allow for fixed channel counts of 24 low speed 24-bit ADC
channels and 16 high speed 16-bit ADC channels to be recorded simultaneously. These
multifunction devices have a range of +/- 10 V and an accuracy of +/- 0.1% of the range.
The high speed and low speed modules are able to support a maximum sampling frequency
of 400k and 100k samples/sec/channel, respectively. Given the maximum engine speed
that the data will be acquired at, frequency component of the encoder signal was calculated.
sampling rate equal to twice the highest frequency of the interested component, National
83
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Instruments advise that a sampling rate equal to ten times the highest frequency of the
interested component should be used [72]. The sampling rate of the encoder was carefully
selected to ensure that the angular position of the crankshaft is being captured as precisely
as possible without any loss of information. All the vibration and pressure signals were
recorded at 100kHz/channel and the encoder data was recorded at 400kHz/channel. The
encoder signal was sampled at a much higher rate compared to the transducers as the
encoder’s data will be used to resampled the transducer data in crank angle domain. It is
very important to have data point that is as close to the timestamp as possible to when the
switch occurred. Therefore, in this application a high sampling frequency of the acquisition
system is preferred.
Prosig data acquisition system performs filtering on the signals that are to be obtain
through a built-in anti-aliasing filter. The cut-off frequency for the filtering is adjusted
automatically according to the sampling rate of the signals to ensure that aliasing effects
The fault of interest for this research is intermittent misfire. The associated FDD
experiment involves obtaining both healthy and faulty cycles under a same operating
condition and by running the engine in a condition where the chances of the fault occurring
are high. Under the normal operating range of the engine, the idle condition is considered
84
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
specifies that at idle conditions, chances of partial-burning cycles are higher, [4].
Therefore, the data was collected at the idle condition where the engine is running without
a load at 600 rpm. It is also mentioned that the chances of misfires can be reduced at the
idle condition by retarding the spark timing [4]. Therefore, spark timing was advanced
incrementally to increase the severity of the engine's instability. The following list
no load
The engine was warmed up at 1500 rpm with a load of 2 bar until the oil temperature
reached 160 F, and a stable condition is reached. Once the engine is warmed up, the engine
speed was lowered to 600 rpm, the load was removed and the appropriate spark timing
advance was applied. The data acquisition was then triggered when steady state was
Accelerometer Data
Raw accelerometer data from a three axes accelerometer located on the front valley is
85
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
shown in Figure 32 to Figure 34. Raw accelerometer data from an accelerometer located
Figure 32: Raw accelerometer data from front valley, axis parallel with front-back orientation of the
engine
86
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 33: Raw accelerometer data from front valley, axis parallel with right-left orientation of the engine
87
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 35: Raw accelerometer data from front left knock sensor
Encoder Data
Two signals were obtained from the encoder: encoder ticks and trigger signal.
88
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Encoder ticks indicate the 0.5-degree travel of the crankshaft. Figure 36 is a plot of
the raw encoder ticks data at an engine speed of 600 rpm. To verify the data, two points
that indicate the start of an on switch on consecutive occasions are displayed. The time
differences between these two points indicate the time it took for the crankshaft of the
engine set to 600 rpm to travel 0.5 degrees. Assuming that the engine is running at a
constant speed of 600 rpm the following calculation obtains the theoretical time difference
between the two points selected in Figure 33. The difference between the theoretical and
the actual time durations can be explained by the fact that actual engines are not capable of
holding the speed constant. Therefore, the encoder ticks data obtained is considered to
89
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
accurately represent the change in the rotational position of the crankshaft with respect to
time.
𝑟𝑒𝑣 𝑚𝑖𝑛 60 𝑠
∆𝑡𝑡ℎ𝑒𝑜𝑟𝑒𝑡𝑖𝑐𝑎𝑙 = 0.5° ∙ ∙ ∙ = 1.3889 × 10−4 𝑠 (4.1)
360° 600 𝑟𝑒𝑣 𝑚𝑖𝑛
1, which occurs once per revolution of the crankshaft. Figure 37 is a plot of the raw encoder
trigger data at an engine speed of 600 rpm. To verify the data, two points that indicate the
start of an on switch on consecutive occasions are displayed as shown in Figure 34. With
similar assumptions and method, the data is verified as shown in the calculations below.
90
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
With similar reasoning for the differences observed in the two time durations, the trigger
𝑚𝑖𝑛 60 𝑠 (4.3)
∆𝑡𝑡ℎ𝑒𝑜𝑟𝑒𝑡𝑖𝑐𝑎𝑙 = 1 𝑟𝑒𝑣 ∙ ∙ = 0.1000𝑠
600 𝑟𝑒𝑣 𝑚𝑖𝑛
Pressure Data
traces from all the cylinders are plotted in Figure 39 to validate the firing order of the
engine. The order of the pressure peaks is shown to align with the firing order indicated in
Section 4.1.1 which verifies that the pressure transducers were operating and placed
properly.
91
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
92
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 39: Overlay of all of the raw pressure traces from each cylinder @600RPM
In this section, the implementation of data pre-processing to transform the raw data
into a compatible form for feature extraction is described in detail. As shown in Section
4.2.3, raw data consists of three different types of measurements; encoder, pressure and
acceleration in the time domain. The final form required for feature extraction is
acceleration measurement in the crank angle domain that is segmented into individual
93
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
cycles with corresponding fault class labels. The following sections describe in detail, the
to remove noise. The sampling frequency of the vibration measurement was 100 kHz and
the frequency response of the accelerometer is 1.2 Hz to 10 kHz. Therefore, the cutoff
frequency was set to 10 kHz with a 12th order filter with phase compensation to eliminate
phase shift. Figure 40 shows the raw accelerometer signal and the filtered signal.
The purpose of the encoder measurement analysis is to obtain crank angle versus
Cylinder1. The crank angle versus time relationship is utilized to resample the pressure and
94
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
acceleration measurements in crank angle domain. The timestamps at which TDC occurs
The relationship between crank angle and time was determined by obtaining the
timestamps of the instances where the light indicator switch of the encoder was flipped
from “off” to “on”. More specifically, the instances at which the voltage was equal to the
threshold value were obtained on the rising edge. The threshold value of 3.5 V was utilized
as specified by the manufacturer. The exact timestamp was estimated through linear
interpolation of two points immediately before and after the threshold value on the rising
95
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Rotational speed of the crankshaft is derived from this crank position versus time
relationship as shown in Figure 42 to validate the alignment of the set engine speed and the
actual engine speed. It can be observed that the average engine speed aligns with the set
speed of 600RPM. However, significant fluctuations also exist in the speed profile, which
aligns with the unstable behavior described by Heywood at idle conditions [4]. The
The occurrences of TDC in Cylinder 1 referenced in time were obtained using the
trigger signal. A similar approach was used for encoder tick signal analysis to retrieve the
96
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
time at which the switch was flipped as an indicator for the occurrence of the TDC in
Cylinder 1. It is important to note that TDC occurs twice in a given engine cycle as shown
in Figure 43. Therefore, timestamps obtained by the trigger signal are not sufficient enough
to synchronize individual cycles in the data set. Figure 43 illustrates that a piston of an
engine reaches TDC once at the end of the compression stroke and again at the end of an
exhaust stroke. With this knowledge, the cycle can be synchronized with additional
Figure 44 shows an overlay of trigger signal and Cylinder 1 pressure trace. Given
97
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
can be referenced to see if this particular occurrence was at the end of a compression stoke
or the exhaust stroke. If the pressure trace at the given timestamp is close to 0 bar, the
current TDC can be concluded to have occurred upon an exhaust stroke. On the other hand,
if the pressure value is well above 0, it can be concluded that the current TDC occurred
upon a compression stroke. The timestamp associated with the exhaust stroke was chosen
as the engine event at which the cycles will be segmented at. Therefore, the timestamps at
which the TDC occurred at upon the exhaust stroke was obtained and was defined as the
start timestamps indicating the start of a new cycle. It should be noted that there exists an
offset between the occurrence of the TDC in Cylinder 1 presented by the trigger signal and
the TDC that can be inferred by using the pressure signal. This offset is as expected as
98
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 44: Overlay of trigger data and Cylinder 1 pressure trace @ 600RPM
The crank position versus time relationship was used to resample the pressure and
vibration measurements. The voltage of the transducer measurements at each crank angle
– timestamps were obtained through linear interpolation. Figure 45 and Figure 46 show the
99
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The start timestamps obtained in Section 4.3.2 is used to segment the transducer
measurements into individual cycles. The start timestamps are referenced by a specific
engine event to ensure the alignment of all the cycles obtained from different test files. The
alignment is required as the data acquisition was randomly triggered which means that each
test file starts at different positions of the crank angle that relate to different events of the
engine. In this research, an engine cycle is defined as a 720-degree rotation of the crank
angle that is aligned at 131.662 degree after the occurrence of TDC in Cylinder 1 upon an
exhaust stroke event. Each measurement per cycle consists of 1440 points with a crank
angle resolution of 0.5 degrees. Due to the fact that the offset of 131.662 degree is not
100
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
divisible by the crank angle resolution of 0.5 degrees, the start timestamp and the crank
angle timestamp never shared a common timestamp. Therefore, first crank angle timestamp
that occurs after the start timestamp is used as the start of the cycle. And the end of the
cycle is marked by the first crank angle timestamp that occurs after next start timestamp.
Pressure trace provides very accurate and insightful information regarding the
quality of the combustion occurring inside the cylinder. It would be ideal to develop a
condition monitoring tool that utilizes in-cylinder pressure data from the engine. However,
their extremely high cost. Therefore, information obtained through pressure traces in test
engines is correlated with more convenient and less costly vibration measurements. This
For this experiment, the analysis of the pressure trace was performed by calculating
the Indicated Mean Effective Pressure (IMEP). IMEP was calculated by integrating the
pressure signal from 180-degree crank angle before TDC to 180-degree crank angle after
TDC with respect to displacement volume as shown by the following equation, where 𝑝(𝑖)
is the cylinder pressure at crank angle position 𝑖, 𝑉(𝑖) is the cylinder volume at crank angle
position 𝑖, 𝑉𝑠 is the cylinder swept volume, 𝜃0 is the bottom dead center (BDC) induction
integer crank angle position, and 𝜃𝑛 is the BDC integer crank angle position [73].
101
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
𝜃𝑛
∆𝜃 𝑑𝑉(𝑖)
𝐼𝑀𝐸𝑃 = ∑ 𝑝(𝑖) (4.5)
𝑉𝑠 𝑑𝜃
𝑖=𝜃0
If the IMEP value is below the specified lower limit, it is considered to be a misfire which
Figure 47 shows a plot of all the pressure traces in one, previously defined, engine
cycle. The plot shows that at a given engine cycle, a total of 360-degree pressure trace
segment consisting of compression and power strokes required for IMEP calculation for
102
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 48 to Figure 50 are plots from two consecutive cycles for Cylinders 2, 5, and
and power strokes fully represented in the segmented engine cycle. On the other hand,
Cylinders 5 and 8 are examples of those cylinders that do not have their compression and
power strokes fully represented in the segmented engine cycle as shown in Figure 47. Also,
it can be observed that the pressure trace of Cylinder 5 flows into the next engine cycle
while the pressure trace of Cylinder 8 flows from the previous engine cycle. This infers
that in order to calculate the IMEP of the combustion that is happening in Cylinder 5 for
the specified engine cycle, the pressure trace for the following cycle is required. In a similar
manner, the pressure trace of the previous cycle is required for IMEP calculation of the
combustion happening in the Cylinder 8 for the same engine cycle. This concept was
adopted for the IMEP calculations of those cylinders that were affected by the segmentation
of the engine cycles. In addition, the very first and last cycles of every test were discarded
as accurate calculation of the IMEP values for all of the cylinder were not obtainable as
103
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
104
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
carrying over the vibrations that occurred during the full compression and power strokes
of the cylinders. The vibration measurements were not carried over due to the definition of
an engine cycle that was established earlier. This lack of data representation may lead to
difficulty in correlating the pressure to the vibration relationship of faults occurring in the
affected cylinders.
The following table summarizes the labeled cycles in the dataset obtained through
engine testing. It is very important to note the imbalance of the different conditions in the
data collected. This is primarily due to the way the faults were simulated. In previous
research conducted, misfire faults were simulated by disconnecting the spark plug which
105
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
ensures that every cycle of the specified cylinder misfires [22]. This method of simulating
the fault allows for a balanced distribution of the data to be obtained. However, in this
research the faults were induced by simulating a condition of the engine where the
that were labeled in the previous section using IMEP calculations are plotted in the
following series of figures. The fault class of the faulty cycle is 1 which indicates that the
fault has occurred in Cylinder 1. Figure 51 is an overlay of pressure traces from Cylinder
1 for three different cycles. Figure 52 to Figure 54 are the vibration data from the three-
axis accelerometer located on the front valley. Figure 55 is the vibration data from the
106
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
When observing the pressure signals, the maximum pressure level and the area
under the pressure trace of faulty cycle is noticeably smaller than that of the healthy cycles
which would numerically be translated into the IMEP values. Therefore, it can be
concluded that faulty cycles are easily identifiable amongst a mixture of healthy and faulty
Figure 51: Cylinder 1 Pressure Traces for Faulty and Healthy sample cycles
When observing all the vibration data, the difference between healthy and faulty
data is not as clear. The distinguishable characteristics such as big spikes and locations of
those big spikes do not seem to be unique for healthy cycles. When observing the spikes,
it can be seen that there does not seem to be a commonality in the locations of the spike
107
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
with respect to crank angle. In terms of amplitude, it can be noticed that Healthy 2 cycle
shows lower amplitude than the Faulty cycle in all of the vibration plots. Therefore, it can
be concluded that faulty cycles are not easily identifiable amongst a mixture of healthy and
Figure 52: Front-Back Axis Accelerometer data from Front Valley for Faulty and Healthy sample cycles
108
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 53: Right-Left Axis Accelerometer data from Front Valley for Faulty and Healthy sample cycles
109
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 54: Up-Down Axis Accelerometer data from Front Valley for Faulty and Healthy sample cycles
110
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 55: Axis Accelerometer data from Front Left Knock Sensor for Faulty and Healthy sample cycles
111
Chapter 5: Proposed FDD Strategy Implementation and
Results
detail. The purpose of the feature extraction strategy is to transform the pre-processed data,
represented in Section 4.3, into features. The theory pertaining to the proposed strategy
was provided in Chapter 3. This section focuses on elaborating on the parameters chosen
The proposed feature extraction method takes the user-defined baseline data and
faulty data and generates features that represent how different the faulty data is in
comparison to the baseline data. The proposed strategy utilizes the Extended-MSPCA to
decompose both the baseline and faulty data through wavelet packet transformation, then
transforms them into main components using PCA and generates features using statistical
In this research, the algorithm is tasked to differentiate the misfiring cycle from the
healthy data and differentiate the locations of the misfire. The difference between a misfire
112
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
occurring in one cylinder from the other is hypothesized to primarily depend on the
occurrence of the misfire with respect to the crank angle domain. This hypothesis assumes
that the differentiating characteristics of the misfire condition in comparison to the healthy
condition will occur within the compression and power strokes of a given cylinder.
Depending on the firing order for a given engine, each cylinder will fire at a different crank
angle range. This infers that the wavelet function must have enough localization in the
crank angle to be able to differentiate the same frequency components occurring 90 degrees
apart. On the other hand, the differentiation of faulty and healthy cycles may be highly
application does not exist. Consequently, the wavelet function that behaves similarly to the
signal of interest is advised to be utilized for the wavelet transform. Therefore, in the case
of fault detection and diagnosis, the ideal wavelet function to be utilized would be one that
resembles the faulty components of the signal. However, the faulty components are not
easily identifiable in the signals obtained, given the resources and time allowed for further
analysis of the signals. Therefore, a wavelet family called Daubechies, which is commonly
used for vibration in FDD applications, is utilized in this research. Daubechies wavelets
have been utilized in the previous studies conducted by Ismail, Doghri, and Narendiranath
for FDD applications on alternators and starters, internal combustion engines, and journal
Daubechies wavelets are said to be compactly supported, which means that it will
provide results that are more localized as opposed to global results. Daubechies wavelets
consist of numerous individual wavelet functions that are differentiated by its number of
vanishing moments. Wavelet db4 represents Daubechies wavelet with vanishing moments
of 4. The increase in vanishing moments implies that more complex behaviors of the signal
will be captured. In addition, Figure 56 shows that higher-order mother wavelets provide
superior frequency resolution with sharper roll-off. However, the improvements made on
For the feature extraction process component of this research, different wavelet
functions are utilized to study their effects on the performance of the FDD strategy. These
114
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
wavelet functions consist of db4, db10, and db16 were utilized. Figure 57, 55, and 56 show
the corresponding db16, db10, and db4 low pass and high pass filters that were utilized
during Extended-MSPCA.
115
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Figure 58: Low pass and high pass filter for db10
Figure 59: Low pass and high pass filter for db4
116
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Wavelet levels dictate the number of frequency bins the given signal will be
decomposed to. As wavelet levels increase, the number of frequency bins increases while
the range of each frequency is decreased. In other words, the increase in wavelet levels
leads to finer frequency resolution information. The increase in frequency resolution results
in decreased crank angle resolution as crank angle resolution is halved at each subsequent
characterization of faults in different cylinders due to the cyclic operation of the engine. In
addition, there is a maximum number of wavelet levels that can be used for a given sample
signal and the wavelet being used for the analysis. The maximum number of wavelet levels
𝑁𝑠𝑎𝑚𝑝𝑙𝑒
𝐿𝑚𝑎𝑥 < 𝑙𝑜𝑔2 ( − 1) (5.1)
𝑁𝑤𝑎𝑣𝑒𝑙𝑒𝑡
Given this relationship, the maximum wavelet level for each wavelet function is
obtained and summarized in the table below. For this research, a wavelet level of 4 will be
utilized for all three wavelets which is limited by db16’s maximum wavelet level. The
maximum level of wavelet is restricted in this research due to the short length of the dataset.
In previous studies performed by Ismail and Doghri, longer sample lengths were utilized,
which meant that higher wavelet levels could be utilized. For cycle-by-cycle monitoring,
the only method of increasing the sample length is through the increase in angular
117
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
to be taken as opposed to the single baseline data approach. For the application of this
strategy, the method of selection and the method of determining the number of baselines
required are important components of the feature extraction strategy that must be
considered further for a thorough study of the feature extraction strategy. However, in this
research, a simplified method of selection is pursued for the interest of time. The simplified
from all the healthy data set that have been collected. In this experiment, 10 different
In the previous study performed by Doghri where the FDD strategy was applied to
detect constant fault conditions, averaging method of features that belong to the same fault
condition was proposed [22]. The constant fault of an internal combustion engine can be
The purpose of the moving average is to capture the dominant characteristics of a given
fault signature. However, the application of a moving average is not recommended for
118
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
cycle-by-cycle monitoring, which is required for the detection of intermittent faults. This
is due to the implementation methods of the developed FDD strategy and the type of fault
the strategy is aimed to monitor. If averaging was performed during the development of
the FDD strategy, averaging would also be required to guarantee the success of the
classifier. If the unknown measurements are not averaged, the measurement will consist of
noise and other cycle-specific variations that the training algorithm has not seen during the
training process. The presence of noise and cycle-to-cycle variation of the unknown
measurement may affect the results of the classification tool. Therefore, implementation
methods must be taken into consideration during the development process of the FDD
strategy.
Feature Summary
From all the cycles obtained and labeled, 100 randomly selected healthy cycles and
the rest of the faulty cycles were defined as a finalized dataset, which consists of 1063
cycles, which equates to 10630 features. The number of healthy cycles was reduced to 100
cycles to even out the imbalance of data. Aside from 100 healthy cycles, 10 additional
healthy cycles were randomly selected to form the baseline dataset. From the dataset of all
the cycles, ~70%, ~20%, and ~10% of each class was randomly selected to form the
training, validation and testing subsets. The testing subset was formed by randomly
selecting ~10% of the total cycles in the dataset. For example, for the healthy condition
there are a total of 100 cycles which equate to 1000 features. To select the testing set, 10
cycles from 100 cycles were randomly chosen and the testing set was formed by 100
features that have been generated from the 10 cycles that were selected. However, to divide
119
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
the remaining data into training and validation, 70% of the features were randomly chosen
per condition to form the training set and the remainder formed the validation set. The
breakdown of total cycles and features are shown in Table 4. The method chosen to divide
the total data into subsets were chosen to ensure that the testing set will be executed in the
way that the classifier is to be used in real application. Further explanations of each dataset
The features extracted consist of 16 coefficients for each sensor signal utilized,
reduction in dimensionality from the raw signal, which consists of 1440 data points for
each sensor signal, resulting in 11520 data points. The reduction in dimensionality will
reduce the time and computational resources required for the classification algorithm.
Figure 60 and Figure 61 show the features that were extracted from accelerometers
on the valley and knock sensors, respectively. A general trend can be observed from the
120
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
features extracted from the valley accelerometers across all the fault conditions, including
the healthy condition, and subtle differences that can be observed from one condition to
the other. However, these differences are not easily translated into quantifiable
each fault class from the visual representations of the feature sets validate the need for a
sophisticated classifier algorithm that can infer subtle nuances of the features that are not
easily identifiable.
121
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
122
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
123
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The table below summarizes the time it took for the generation of features for each
wavelet type and wavelet level used. The time of feature extraction is very important for
both the training and utilization of the FDD strategy. The Extended-MSPCA algorithm
used in this research is sufficient for the purposes of evaluating the analysis method and its
ability to perform analysis on a cycle by cycle basis. However, the algorithm will require
cycle basis.
In this section, the results of the developed FDD strategy are presented. The results
will review the performances of three different classifiers, Classifier 1, 2 and 3, that have
been developed using features extracted by three different wavelet functions, db4, db10,
and db16. The hyper parameters such as number of hidden nodes, learning rate and
stopping criteria, were tuned individually for each classifier to yield the most optimal
classifier for the given dataset. For each classifier, the results are presented through four
different confusion matrices. These include the training confusion matrix, validation
confusion matrix, testing confusion matrix, and detection confusion matrix as shown in
Figure 59 – 61.
124
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
The training confusion matrix shows how well the classifier was able to learn from
the training data set. The validation confusion matrix shows how well the classifier is able
to generalize. The testing confusion matrix shows the classifier’s ability to classify the
features that the classifier has not seen during training. And the detection confusion matrix
shows the ability to correctly classify the given cycle using majority voting method.
125
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
126
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
127
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
128
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
129
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
130
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
131
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Table 6 presents the performance scores of each classifier for training, validation,
testing and detection including precision, recall and F1-sores. The following are the notable
1. The best overall detection accuracy of 96.3% was achieved by Classifier 3 which
2. All three classifiers were not able to classify any fault representations of fault
3. Majority voting technique improved the overall accuracy rate for all classifiers
The best detection accuracy of Classifier 3 indicates that the features extracted with
db16 wavelet function had the most information that could be inferred to characterize the
fault conditions. These results prove that enough information exists in the vibration
the engine’s combustion quality. The performance of Classifiers 1 and 2 indicate that
although the information was not as readily available for correlation as it were for Classifier
3, there still exist information that can be utilized for FDD strategy. Even though Classifiers
1 and 2 showed overall accuracy rate that is about 10% lower than that of Classifier 3, more
research should be performed to improve their performance rather than ruling them out as
potential options for future FDD strategy. The reason is due to the benefits of utilizing db4
or db10 wavelet functions for feature extraction. And the most appealing benefit is the
reduction in processing time. As mentioned in Section 5.1, the processing time for
132
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
extracting features using db16 is double the amount of time it takes using db4. This could
Despite having the high accuracy rates during training and validation across all
three classifiers, as shown in Table 6, none of the classifiers were able to detect faults from
fault classes 2 and 4. The root cause of this phenomenon is the lack of data for fault classes
2 and 4. The lack of representation of a given fault class in the dataset results in classifier’s
inability to learn the characteristics that are universal to that specific fault class. The
explanation for the very high performance observed during training and very low
performance observed during testing could be that the classifiers were able to learn the
characteristics of the fault classes but the extent of the engine’s variability of behavior was
not accounted for due to the limited dataset. Another explanation could be that the classifier
was overfitting the dataset that belong to fault classes 2 and 4 due to the imbalance of the
dataset. The difference between the two explanations is that the first explanation is
reflective of the dataset’s inability to provide the classifier the overall picture while the
second explanation is reflective of the classifier’s neglect of the fault classes 2 and 4. The
classifier is trained on the training dataset by updating its weights and biases to reduce the
overall accuracy. Therefore, given that fault classes 2 and 4 had very small datasets, they
were set as low priority for the classifier to learn as learning the correlation of fault class
The effects of the majority voting technique are reflective in the improvement of
overall accuracy observed from testing to detection. As discussed in Section 5.1, a total of
133
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
10 baselines were utilized for feature extraction of cycles for both training and testing
datasets. The testing confusion matrix accuracy indicates the classifier’s ability to detect
each feature representation of a single cycle. The majority voting method’s function is to
mitigate the variations that exist among the 10 feature representations extracted. It can be
argued that not all 10 baselines utilized for training ought to be utilized for testing as
utilizing fewer baselines for testing would mean shorter processing time of new cycles.
However, all 10 baselines were utilized with majority voting method in this research in
case the dataset utilized for the development was too small. In the case where the testing
accuracy is on par with the accuracy level required to for the FDD strategy the majority
134
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Recall 71.5 80.0 92.3 70.0 91.9 95.6 95.5 88.7 77.5
77.8 88.9 92.2 82.4 89.6 95.9 94.9 90.5 79.1
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Testing
Precision 73.8 0.0 85.7 0.0 84.4 84.7 89.1 98.2 80.8
Recall 64.3 0.0 88.4 0.0 86.5 90.0 95.9 68.8 97.0
68.7 0.0 87.0 0.0 85.4 87.3 92.4 80.9 88.2
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Detection
Precision 57.1 0.0 88.9 0.0 84.6 80.0 94.4 100.0 90.9
Recall 57.1 0.0 84.2 0.0 89.2 100.0 100.0 62.5 100.0
57.1 0.0 86.5 0.0 86.8 88.9 97.1 76.9 95.2
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Validation Training
Precision 88.9 94.0 96.0 100.0 92.2 98.6 97.4 97.1 93.3
Recall 83.5 78.3 96.4 100.0 95.3 98.6 96.3 94.2 90.0
86.1 85.4 96.2 100.0 93.7 98.6 96.9 95.6 91.6
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Precision 87.0 94.4 92.2 100 87.6 96.8 93.8 91.2 84.0
Classifier 2
Recall 72.3 85.0 93.4 100 90.5 94.4 95.8 89.3 81.5
79.0 89.5 92.8 100.0 89.0 95.6 94.8 90.2 82.7
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Testing
Precision 51.4 0.0 82.5 0.0 86.9 94.6 93.6 78.8 92.8
Recall 51.4 0.0 89.5 0.0 89.5 87.5 94.7 78.8 90.0
51.4 0.0 85.9 0.0 88.2 90.9 94.2 78.8 91.4
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Detection
Precision 50.0 0.0 85.0 0.0 89.7 100.0 94.1 85.7 100
Recall 57.1 0.0 89.5 0.0 94.6 87.5 94.1 75.0 100
53.3 0.0 87.2 0.0 92.1 93.3 94.1 80.0 100.0
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Validation Training
Precision 96.1 98.4 98.0 100.0 96.3 99.8 98.8 98.5 96.0
Recall 91.5 100.0 98.2 96.0 97.8 99.5 99.1 98.3 93.4
93.7 99.2 98.1 98.0 97.0 99.7 99.0 98.4 94.7
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Precision 90.4 100.0 94.0 100.0 90.3 96.3 96.6 94.5 86.3
Recall 79.2 90.0 94.5 90.0 94.7 96.3 95.2 92.0 82.0
.Classifier 3
Precision 87.7 0.0 86.5 0.0 92.6 97.5 98.8 95.7 97.8
Recall 91.4 0.0 91.1 0.0 98.1 98.8 99.4 83.8 91.0
89.51 0.0 88.74 0.0 95.27 98.15 99.10 89.36 94.28
Fault 1 Fault 2 Fault 3 Fault 4 Fault 5 Fault 6 Fault 7 Fault 8 Fault 9
Detection
Precision 87.5 0.0 90.0 0.0 97.4 100.0 100.0 100.0 100.0
Recall 100.0 0.0 94.7 0.0 100.0 100.0 100.0 87.5 100.0
93.3 0.0 92.3 0.0 98.7 100.0 100.0 93.3 100.0
135
Chapter 6: Conclusion
The purpose of this research was to develop a FDD strategy that can monitor a
research was performed in collaboration with Ford Powertrain Engineering Research and
This section will summarize the research, highlight the notable contributions, and provide
feature extraction and classifier development specific sub strategies was proposed. The
dataset required for the development of the FDD strategy was proposed to be acquired from
currently existing testing procedures and testing facilities. During this research, the dataset
function to provide insight into what is occurring inside the engine. In this research, the
combustion quality was the characteristic being monitored by the FDD strategy. Therefore,
the state of the engine’s combustion quality was obtained through the analysis of the in-
pressure measurement functioned as the ground truth. However, due to the invasive and
136
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
expensive nature of the pressure transducers, they are only available in the testing
measurement for the FDD strategy due to the non-invasive and cheaper cost of
with the dimensionality reduction and correlation of vibration data to the combustion
measurements in crank angle domain. Given the features extracted and its corresponding
between the features and the combustion quality was developed using the machine learning
that seamlessly integrates into engine testing processes to obtain high quality
engine data.
engines that allows analysis in crank angle domain on a cycle by cycle basis.
137
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
5. Development a of classifier using ANN and majority voting technique that has
shown to be able to classify new dataset for combustion quality with 96.3%
accuracy.
This research demonstrated that there are creative ways to utilized the data that are
data analysis. More specifically, the research has shown that complex relationship between
pressure and vibration of an engine can be extracted to develop a FDD strategy that is
This research was conducted to assess the feasibility of the proposed FDD strategy
in monitoring the engine’s combustion quality. The development of the proposed FDD
strategy was focused more on the successful application rather than the fine tuning of
parameters in each of the sub-strategies. Therefore, there are many aspects of the proposed
FDD strategy that can be researched to understand the capabilities of the strategy itself and
how it can be fine-tuned to improve the its performance or to be applied to different types
of faults.
Given the set of data that was collected in this research, the following are the
138
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
1. Study of different wavelet families on the existing dataset for feature extraction.
time applications.
recommended where a set of data collection protocols are developed to acquire coherent
data sets from different types of engines. This protocol is envisioned to standardize data
collection properties such as the types of measurements being collected and the preferred
location of the transducers. Data collection protocols may differ for different types of faults
being monitored. In addition, the protocol should consist of generalized rules to enable
application to different types of engines. With data from different types of engines, a
research can be conducted to see if a FDD strategy developed for one engine can be applied
139
References
EMD and HMM," in Intelligent Autonomous Systems, Springer, 2010, pp. 167-185.
1988.
[5] S. Fallah, Electric and Hybrid Vehicles - Technologies, Modeling and Control: A
[6] Nuclear Power contributors, "Otto Cycle – Otto Engine," Nuclear Power , [Online].
Available: https://www.nuclear-power.net/nuclear-
engineering/thermodynamics/thermodynamic-cycles/otto-cycle-otto-engine/.
[Accessed 2019].
I
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[8] N. Hall, "Ideal Otto Cycle," National Aeronautics and Space Administration, 05
Using EMD and HMM," in Intelligent Autonomous Systems, Springer, 2010, pp.
167-185.
[10] V. Venkatasubramanian, "A review of process fault detection and diagnosis Part
[11] V. Venkatasubramanian, "A Review of Process Fault Detection and Diagnosis Part
Engineering , 2003.
University , 2015.
II
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
Windsor, 1993.
[14] H. W. Miller, "Piston slap detection in combustion engines using unique signal
[16] R. Tailony, "Internal Combustion Engine Cold Testing: Modeling, Analysis, and
[17] L. Wei, "Modeling and Simulation of System Dynamics for Cold Test," Second
[18] B. Ray, "Engine defect defection using wavelet analysis," University of Windsor ,
2007.
[20] I. Arasaratnam and S. Habibi, "Engine Fault Detection Using Vibration Signal
III
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[21] ATS Automation , "Powertrain Final Test," ATS Automation, [Online]. Available:
https://www.atsautomation.com/en/Transportation/Testing%20Automation/Power
[22] A. Doghri, "A Real Time Fault Detection and Diagnosis System for Automotive
[23] S. Hodgins, "A Wireless Sensor for Fault Detection and Diagnosis of Internal
University, 2012.
2014.
IV
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[28] Q. Gao, C. Fan and Q. Meng, "Rotating machine fault diagnosis using empirical
mode decomposition," Mechanical System and Signal Processing, no. 22, pp. 1072-
1081, 2008.
[29] J. Wu and C. Liu, "Investigation of engine fault diagnosis using discrete wavelet
transform and neural network," Expert Systems with Applications, vol. 35, pp. 1200-
1213, 2008.
Sheffield, 2000.
[31] D. Mba and R. Rao, "Development of Acoustic Emission Technology for Condition
Engines and Rotating Structures," The Shock and Vibration Digest , vol. 38, pp. 3-
16, 2006.
V
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[33] R. B. W. Heng, "Statistical Anlysis of Sound and Vibration Signals for Monitoring
Rolling Element Bearing Condition," Elsevier: Applied Acoustics, vol. 53, pp. 211-
226, 1998.
[34] S. Zhang, Machine Learning and Deep Learning Algorithms for Bearing Fault
[35] W. Zhang, "A New Deep Learning Model for Fault Diagnosis with Good Anti-
Noise and Domain Adaptation Ability on Raw Vibration Signals," MDPI, 2017.
VI
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[40] I. Amihai, "An Industrial Case Study Using Vibration Data and Machine Learning
[41] M. A. Lilo, "Identify and Classify Vibration Fault Based on Artificial Intelligence
no. 2, 2016.
Paper, 1990.
Engineering," 2003.
[45] National Instruments, "Getting Started with Order Analysis," National Instruments
Corporation, 2007.
VII
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[47] R. Polikar, "The Engineers Ultimate guide to Wavelet Analysis: The Wavelet
https://cseweb.ucsd.edu/~baden/Doc/wavelets/polikar_wavelets.pdf. [Accessed
2019].
[48] I. Daubechies, "Where do Wavelets Come From?," IEEE, vol. 84, no. 4, pp. 510-
513, 1996.
2014.
2019].
[51] T. Villegas, M. Fuente and M. Rodriguez, "Principal Component Analysis for Fault
152, 2010.
VIII
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[53] J. Jackson, in A User's guide to principal components, New York, John Wiley and
[54] G. Box, "Some theorems on quadratic forms applied in the study of analysis of
The Annals of Mathematical Statistics , vol. 2, no. 25, pp. 290-302, 1954.
[56] S. Yoon and M. J, "Principal-Component Analysi sof Multiscale Data for Process
Monitoring and Fault Diagnosi," AlChE Journal, vol. 50, no. 11, pp. 2891-2903,
2004.
[57] S. Haykin, Neural Networks and Learning Machines, Hamilton: Pearson, 1999.
[58] Z. Benedek, "Pushrod vs. Overhead cam engines," 19 Jan 2018. [Online].
Available: https://allaboutdieselz.blogspot.com/2018/01/pushrod-vs-overhead-
IX
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
[60] R. Muscoplat, "4.6L V8 Ford firing order," Rick's Auto Repair Advice, 2009.
[Accessed 2019].
[62] S. B. Quang, "Rotary Encoder Training Material," Maruei Vietnam Precision, 2013.
[63] AVL, "Crank Angle Encoder 365-Series Prodcut Description," AVL, Feb 2010.
[64] Kistler, "Test & Measurement Pressure," Kistler, 2018. [Online]. Available:
https://www.kistler.com/?type=669&fid=541&model=download&callee=frontend
. [Accessed 2019].
Available:
X
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
https://www.avnet.com/wps/portal/abacus/solutions/technologies/sensors/pressure
ca/innovations/white-papers/06/measuring-vibration-with-accelerometers.html.
3313A1," 2010.
https://www.autozone.com/repairguides/Silverado-2008/Components-
[70] I. Dibrani, "The new V8 POwer Unit for the BMW M3," BMW Heaven, 8 April
http://www.bmwheaven.com/index2.php?option=com_content&task=view&id=1
XI
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
papers/06/introduction-to-data-acquisition.html#section-1450863631. [Accessed
Dec 2019].
[72] National Instruments, "How to Choose the Right DAQ Hardware for Your
https://www.ni.com/en-ca/innovations/white-papers/11/how-to-choose-the-right-
Birmingham, 2002.
XII
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
OBD and CAN data acquisition for light-duty vehicles, SAE, 2018, pp. 25-36.
[79] AutoTap, "OBDII: Past, Present & Future," AutoTap, [Online]. Available:
http://www.autotap.com/techlibrary/obdii_past_present_future.asp. [Accessed
DEc 2019].
[80] J. Faiz, "A Survey on Condition Monitoring and Fault Diagnosis in Line-start and
Broken Bars and Beraing using the Stator Current," IEEE Transaction on Industry
1999.
XIII
M.A.Sc Thesis McMaster University
D. Joo Mechanical Engineering
1999.
[85] A. Chen, "Internal combustion engine vibration analysis with short-term Fourier-
Processing, 2010.
XIV