0% found this document useful (0 votes)
13 views

Intelligent Fault Diagnosis of Rotating Machine Elements Using Machine Learning Through Optimal Features Extraction and Selection

Uploaded by

ali.akbari7843
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views

Intelligent Fault Diagnosis of Rotating Machine Elements Using Machine Learning Through Optimal Features Extraction and Selection

Uploaded by

ali.akbari7843
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Available online at www.sciencedirect.

com

ScienceDirect
Available online at www.sciencedirect.com

ScienceDirect
Procedia Manufacturing 00 (2021)000–000
www.elsevier.com/locate/procedia

Procedia Manufacturing 51 (2020) 266–273

30th International Conference on Flexible Automation and Intelligent Manufacturing (FAIM2021)


15-18 June 2021, Athens, Greece.
Intelligent fault diagnosis of rotating machine elements using machine
learning through optimal features extraction and selection
Syed Muhammad Tayyaba *, Eram Asgharb, Paolo Pennacchi c, Steven Chatterton d
a,b,c,d
Dipartimento di Meccanica, Politecnico di Milano,Milan, Italy
* Corresponding author. Tel.: +39-02-2399-8442; E-mail address: [email protected]

Abstract

The rolling element bearings, and gears are the main components of rotating machines and are most prone to defects which may result in
significant economic loss. The main purpose of this study is an automated diagnosis of rolling element bearings and gears defects using machine
learning (ML) technique and statistical features extracted from time domain vibration signal and spectral kurtosis. Extracted features are used to
train K- nearest neighbors (KNN) as diagnostic classifier. The significance of segmentation size for time domain raw vibrational signals for the
purpose of feature extraction is studied. This analysis is carried out by varying the window/segment length for features extraction and observing
its effect on classification accuracy. Importance of feature selection for optimal performance of KNN in defect classification is studied by
selecting most important and useful features using Genetic Algorithm (GA). Furthermore, effect of value of K on performance of KNN classifier
has been observed by varying the value of K between 1 to 10 with step size of 1. Results show the ability of KNN classifier in combination with
GA for correct and confident fault diagnosis of rotating machine elements in case of proper selection of parameters for features extraction.

© 2020The
© 2020 TheAuthors.
Authors. Published
Published by Elsevier
by Elsevier Ltd. Ltd.
This
This isisan
anopen
openaccess
access article
article under
under the BY-NC-ND
the CC CC BY-NC-ND licenselicense (https://creativecommons.org/licenses/by-nc-nd/4.0/)
https://creativecommons.org/licenses/by-nc-nd/4.0/)
Peer-review underresponsibility
Peer-review under responsibility ofof
thethe scientific
scientific committee
committee of FAIM
of the the FAIM 2021.
2020.

Keywords: K-nearest Neighbor (KNN); Rolling element bearings and gears; Fault Diagnosis; Genetic Algorithm (GA)

1. Introduction of rolling element bearings using neural network (NN) [4,5].


Principal component analysis (PCA) based feature selection
Todays’ industry uses increasingly complex machines, with scheme was presented for machine defect classification [6].
high demanding performance criteria. Early forecast of the Another approach was proposed by Su H and Chong KT [7] for
anomalies in the equipment helps to carry out timely machine condition monitoring using support vector machine
maintenance of component degradation. Therefore, performing (SVM). Moreover, Liu H et al. [8] presented deep learning
effectual condition monitoring draws significant advantages to technique for fault detection using a short-time Fourier
manufacturing industry [1,2]. Most of the time, the human transform (STFT) scheme. A hierarchical diagnosis network
intervention cannot respond efficiently and effectively to (HDN) for fault pattern recognition was introduced by Gan M
immense measurement data. Therefore, in order to minimize and Wang C [9]. Feature extraction method for the diagnosis of
the human dependency, intelligent condition monitoring bearings using unsupervised data was proposed by Oh H et al.
systems are required to be developed [3]. During last two [10]. Another hierarchical approach for bearing defect
decades, a lot of new approaches in this field have been diagnosis using adaptive deep convolutional neural network
proposed by researchers, especially the ones related to Artificial (CNN) was presented by Guo X et al. [11]. Zhang Z et al. [12]
Intelligence (AI) techniques. Most of the approaches are proposed an intelligent diagnosis and prognosis approach using
applied on rotating machine elements focusing on bearing and integration of wavelet transform and PCA. For gear box defect
gearbox defects. An approach was proposed for fault detection diagnostics: Li C et al. [13,14] proposed multimodal deep

2351-9789 © 2020 The Authors. Published by Elsevier Ltd.


2351-9789
This is an©open
2020 access
The Authors. Published
article under bytheElsevier Ltd.
CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
This is an openunder
Peer-review access article under the CC
responsibility BY-NC-ND
of the license
scientific https://creativecommons.org/licenses/by-nc-nd/4.0/)
committee of the FAIM 2021.
Peer-review under responsibility of the scientific committee of the FAIM 2020.
10.1016/j.promfg.2020.10.038
Syed Muhammad Tayyab et al. / Procedia Manufacturing 51 (2020) 266–273 267
2 Syed Muhammad Tayyab / Procedia Manufacturing 00 (2021)000–000

support vector classification (MDSVC) and a deep random and found same using average mean square and classification
forest fusion (DRFF) technique, Chen Z et al. [15,16] used error.
multiple layered neural network scheme and convolutional Many studies have been made in past few decades on the
neural network method. fault detection of mechanical components using AI techniques
To improve the accuracy and efficiency of fault diagnosis of [24,25]. Since using these techniques for diagnosing defects
rotating machinery, application of Machine Learning have various benefits if compared with the traditional
techniques in machine condition monitoring still need more mathematical and statistical modelling approaches. However,
encouragement and attention. Therefore, the goal of this there are lots of things still need to be improved for these
research is to develop an autonomous fault classification methods to make it more effective and practical to solve real
system, using appropriate artificial intelligence technique, that world applications [26]. Furthermore, Genetic Algorithm (GA)
can be used to improve the efficiency and applicability of applications with vibrational signal analysis in machine
condition monitoring technology. condition monitoring and fault diagnosis still need more
The remainder of the paper is arranged as follows: in the support and attention because of the lack of existing evidence.
next section recent literature related to fault diagnostics Vibration based conditional monitoring is most popular for
through AI techniques has been illustrated. Later, Section 3 early monitoring and identification of defects. However,
consists of a case study in which the proposed approach is localized faults produce very weak impulses in vibrational
applied to the vibrational samples of two types of rotating signals. Therefore, it is difficult to detect these faults using
machine elements separately. One is rolling element bearing existing frequency domain methods [27]. Various supervised
and the other component is gear. In section 4, detailed analysis machine learning methods have so far been used for fault
of the results obtained from the proposed approach has been identification of rolling element bearings and gears using time
carried out. The whole research work is being summarized in domain statistical features. However, the fault classification
the section 5 along with some future recommendations. through the usage of minimum features is still a challenge for
researchers.
2. Literature Review In the light of existing literature, it was observed that there
is still need to improve the quality of the features extracted
Accurate fault detection and diagnosis systems have gained from vibrational data so that they may contain appropriate and
significant importance so that the potential failures of enough information regarding machine health in order to
machinery can be managed properly. Various methods have properly train ML classifier for defect diagnosis. In this
been applied by researchers to tackle these issues. However, for research work, vibration-based monitoring technique is used
condition monitoring of rotating machine elements, vibration for fault diagnostics of rolling element bearings and gears.
monitoring is most widely used, as was published in [17]– [20]. Statistical features derived from time-domain signal and
Since the raw vibrational data is 1-dimensional time domain spectral kurtosis are extracted. Then, K- Nearest Neighbor is
series data; extracting the appropriate features as health applied upon the combination of all extracted features to
indicator is required. Those mechanical components that are classify the faults. The performance of classifier is compared
often highly loaded in rotating machinery are gears and for different values of K in order to determine its potential in
bearings (rolling element bearings in particular), responsible classifying various faults of bearings and gears. Afterwards,
for early damages in machine’s life. Rolling element bearing Genetic algorithm is used to select most suitable features for
and gears are vital part of rotating machinery and their failure optimizing the performance of KNN classifier.
cannot only cause production losses but also lead to the
financial losses. It is important to mention here that only 3. Case Study
bearing related defects are responsible for more than 40% of
the failure in industrial machines [21]. In this instance, a lot of For accurate mechanical fault diagnosis realistic and correct
research in the field of fault diagnosis has been carried out since features extraction from vibration signal; containing enough
last few decades. Qicai Zhou et al. presented a study in which information regarding machine/component’s health is of
K-mean algorithm was used to label the un-labeled signals paramount importance. For feature extraction from vibration
[18]. A comparison of the performance of ANN and KNN was data, it is to be divided into samples having same number of
carried out by Rohit S. Gunerkar et al. [19]. The study was points (dimension). The number of points in one sample will
performed on rolling element bearings based upon feature be referred as window size (𝑊𝑊) for feature extraction in this
extracted after wavelet transform. J. P. Patel did comparison paper. In order to find effect of window size for feature
between ANN and SVM considering bearings and found that extraction on classification accuracy, it is essential to set some
SVM gave better results as compared to ANN [20]. Few years reference. For this purpose, window size corresponding to one
back, a method based on spectral kurtosis (SK) and cross rotation (𝑊𝑊𝑊𝑊) is considered as reference in this paper, which
correlation was presented by Jing Tian et al [22] to diagnose can be described as:
defects and monitor the degradation of bearings in electric
motors. The method was validated by experiments using 𝑊𝑊𝑊𝑊 = 60 𝐹𝐹𝐹𝐹/𝑅𝑅 (1)
machinery fault simulator. Daniel Augusto [23] carried out
analysis to diagnose faults in water cooling system using data Where; 𝐹𝐹𝐹𝐹 is the sampling frequency of acceleration sensor
collected from real engine at all speeds to train the classifier. in 𝐻𝐻𝐻𝐻 and 𝑅𝑅 is the revolution per minute of the rotating
The performance of KNN and ANN classifiers was compared machine. The case study was carried out for both components
268 Syed Muhammad Tayyab et al. / Procedia Manufacturing 51 (2020) 266–273
Syed Muhammad Tayyab / Procedia Manufacturing 00 (2021)000–000 3

(rolling element bearings and gears) separately. Statistical two accelerometers installed at both the drive end and fan end
features derived from time-domain signal and spectral kurtosis of the motor housing, and two sampling frequencies of 12 kHz
(SK) are extracted. Initially, 16 features (root mean square, and 48 kHz were used. Four classes of data with 48 kHz
peak to peak, kurtosis, crest indicator, impulse, energy-I, sampling frequency were selected for research as mentioned in
skewness, standard deviation, variance, shape factor, margin Table-1. Experimental data for gear fault diagnosis was
factor, energy-II, spectral kurtosis mean, SK kurtosis, spectral acquired from online shared gear faults data sets [29] as shown
kurtosis skewness, spectral kurtosis standard deviation) were in Fig. 3. In each case of gears data set, the acceleration was
calculated keeping the window size 𝑊𝑊 = 400 (~ 𝑊𝑊𝑊𝑊/4 for recorded for a duration of 10 seconds with the sampling
bearings’ and ~ 𝑊𝑊𝑊𝑊 for gears’ data set). The dimension of frequency of 10 kHz. The gearbox fault data was collected in
feature matrix for rolling element bearing and gear data set was three different pinion conditions (see Table-2).
609x16x4 and 250x16x3, respectively. In order to analyze the
effect of window size 𝑊𝑊 for feature extraction on classifier
accuracy, it was increased to 800 ( ~ 𝑊𝑊𝑊𝑊/2 ) and 1200
(~ 2𝑊𝑊𝑊𝑊/3) for rolling element bearing and to 800 (~ 2 𝑊𝑊𝑊𝑊)
for gears.

Raw Vibrational Data of Rotating


Machine Elements

Preprocessing and Feature Extraction


Figure. 2: Experimental test setup for bearing data set [28]

Feature Selection

Training Testing

Generating Classification Model


KNN

Figure. 3: Experimental test setup for gear data set [29]


The Trained KNN
Table 1. Selected bearing fault classes

Classes Type of defect Fault Motor Motor


diameter Load Speed
Output the diagnosis results
(HP) (rpm)
Defect Classification
1 Normal No fault 0 1797
2 Ball 0.021” 0 1797
3 Outer Race 0.021” 0 1797
Classification Accuracy Centered
@6:00
Figure. 1: Working Methodology 4 Inner Race 0.021” 0 1797

Since the requirement of this era is to automatize the fault Table 2. Selected gear fault classes
diagnosis to avoid human intervention in this tedious task. For Classes 1 2 3
this instance, the classification technique from machine Type of Faults No fault One chipped Three consequent
learning (K- Nearest Neighbors) has been applied to diagnose tooth worn teeth
the defects of rotating machine elements. The flow of the
adopted methodology is illustrated in Fig.1. 3.2. Application of K-nearest neighbors algorithm

3.1. Experimental setup and vibrational data KNN is an algorithm that learns from all available patterns
(training data) and classifies the new patterns (testing data)
The rolling element bearing vibration signals used for based on the similarity measure between them. The similarity
analysis were downloaded from CWRU Bearing Centre [28] as measure is the minimum distance (e.g. Mahalanobis distance,
shown in Fig.2. Vibration data are collected for motor loads Euclidean distance etc.) between the data points. It requires to
from 0 to 3 hp and motor speeds from 1,720 to 1,797 rpm using set the number of nearest neighbors (k) that are considered for
Syed Muhammad Tayyab et al. / Procedia Manufacturing 51 (2020) 266–273 269
4 Syed Muhammad Tayyab / Procedia Manufacturing 00 (2021)000–000

classification based upon selected similarity measure. The tournament selection. In proportionate selection method
classification through KNN algorithm is divided into training individual fitness is divided by the average fitness of
and testing phases. The available data is divided into training population; roulette wheel is one of the most used method in
and testing samples. During the training phase, KNN model which slice of circular roulette wheel is assigned to individuals.
finds the relationship between predictors and targets of training Upon spinning it up to the “number of population”; gives the
data whereas; in testing phase, trained model is tested for its parents and child sequences which is repeated iteratively. In the
ability to predict the correct classes of testing data whose labels second method fitness scale is not required because every
are known but the model has not already been trained on this chromosome is ranked based on its own fitness, this helps in
data. preventing the early convergence. The disadvantage is that the
In this research, analysis is carried out to ascertain the effect ratio of expected values of individuals remains same even when
of K values and window size for feature extraction to improve the fitness variance is low or high. In the tournament selection
the classification accuracy of model. Euclidean distance is used two individuals are randomly selected from initial population
in this study to find the similarity between the data points which and the best individual from those two is preserved for next
is simple to implement and can give good results. Lesser the generation. In GA, selection method is of critical importance,
distance between data points, higher is the similarity. Initially, it aids in finding out the individuals which need to be
KNN model is trained and tested using all extracted features. prioritized to produce next generation (child chromosomes).
However, to reduce the dimensionality of feature matrix for the Thus, setting it carefully may lead the solutions towards desired
purpose of reducing the computational requirements and optimality level. If the selection is strong and strict, highly fit
improving the classification accuracy, optimal features are suboptimal solution may take over population. On contrary,
selected through wrapper approach. Genetic Algorithm (GA) is slow convergence is normally observed in case of weak
used for effective feature selection for isolating the best selection. In this case study, tournament selection has been
features evaluated through fitness criteria. The significance of used as selection criteria; two individuals executed from the
using GA over other methods is that it consumes less initialized population are set out for a tournament and the best
computational power i.e. no need for derivative calculations. It individual is preferred as a parent. Most discriminating feature
is more robust algorithm and has a superior global searching of GA over other search methods is crossover as it contributes
capability in complicated search spaces. On contrary, GA in maintaining diversity by introduction of new genetic content
needs preliminary tuning and typically many fitness when operated on two chromosomes to produce off-springs. To
evaluations are required to get better solutions. In this case accelerate the search in population recombination operator is
study, GA parameters have been defined carefully and applied. Simplest way of crossover is to choose a cutoff point
algorithm was run for enough number of generations to achieve randomly and exchange the genetic content by combination of
effective convergence. The fitness/ objective function of this single segment of two parents to give child sequence. There are
algorithm returns the numeric value calculated against every many methods to accomplish recombination. To achieve better
sample for randomly selected features. This selection is performance and better combinations, properly designed
controlled through the values generated in the binary encoded crossover mechanism is required thus recombination should be
genome. The performance of KNN classifier in terms of carried out with some probability called crossover rate.
minimizing the classification error by resubstitution/ Crossover rate is the ratio of number of child sequences
resubstitution loss is used as the fitness function for GA. produced in each generation to the population size. The third
GA operator is mutation; it simply alters one or more genes that
3.3. Optimization through Genetic Algorithm are lost during selection operator to maintain diversity.
Although it is secondary operator but its probability of applying
GA is used for selection of best features in order to reduce is as important as that of crossover. Most common mutation is
the dimensionality of features matrix and to optimize the the bit flip mutation in which it randomly selects two
performance of KNN. The algorithm starts by initializing the individuals and swaps them with a probability called mutation
population (design variables/ labelled data) which is later rate [30]. Moreover, in the applied algorithm of GA, diversity
evaluated based on considered fitness function. This population of the solutions has been maintained by preserving the best
of variables is user defined and needs to be set appropriately. individuals for the next generations, this phenomenon is called
GA can be used for feature selection using binary or real coded elitism. The algorithm for GA has been designed in MATLAB
individuals. In this case study the population of individuals is using the input extracted from the features (labelled data). GA
randomly generated “genomes” which is a binary encoded bit. rely on its population unlike traditional techniques. Thus,
The reason behind using the binary bits is that it helps to reduce population size is user defined and it has drastic effects on the
the likelihood of convergence within the population. The whole performance of GA. Small population size may result into
subset of variables is subjected to three GA operators; premature convergence and large population size takes
selection, crossover, and mutation. Genetic operators are based unnecessary computational time. In this case study population
on the idea of heredity of genetic characters over the size is set to be 50. The algorithm for GA is represented in Fig
generations. By the application of these operators, best 4. Initializing the population and then converting the
solutions/individuals are separated resulting in the convergence individuals into binary encoding facilitates in finding out the
of the fitness towards global best solution over the generations fitness of all individuals based on the selection
till the stopping criteria meets [30]. Most common fitness fitness/evaluation criteria. GA does not have intense
evaluation or selection methods are proportionate, ranking and mathematical requirements, so it has the tendency to handle
270 Syed Muhammad Tayyab et al. / Procedia Manufacturing 51 (2020) 266–273
Syed Muhammad Tayyab / Procedia Manufacturing 00 (2021)000–000 5

any type of fitness/ objective function and constraints. In this is essential to use appropriate size of window. It is observed
case study, the objective function of GA is to minimize the that acceptable classification accuracy can be achieved by
classification error. In every generation it selects the setting 𝑊𝑊 ≥ 𝑊𝑊𝑊𝑊/2 for rolling element bearings and 𝑊𝑊 ≥
individuals having minimum classification error and use these 2𝑊𝑊𝑊𝑊 for gears. Features extracted by using smaller windows
individuals for next generations and discard the other ones will not carry adequate information for correct fault
considering equal importance to the data of all classes. The best classification.
individuals are the offspring for current generation and parents Feature selection through GA was very useful in order to
for the next successive iteration, evolving the cycle till the find most important features for fault diagnostics. Individuals
stopping criteria is fulfilled. having the best fitness were obtained for generation size 100
(stopping criteria) and population size of 50. Crossover and
4. Results and Discussion mutation operator need to be designed appropriately with some
probability rate to attain better performance. In this case study
Performance of KNN classifier in terms of classification the crossover between two samples is being carried out at a rate
accuracy for both rotating machine elements (rolling element of 0.8 and the same rate has been used for mutation. KNN did
bearings and gears) is appended at Table 3. For rolling elements fault classification with 100% accuracy using only 04 and 03
bearings three window sizes (𝑊𝑊1= 400≈ 𝑊𝑊𝑊𝑊/4, 𝑊𝑊2= 800 ≈ features selected through GA in case of bearings and gears, b
𝑊𝑊𝑊𝑊/2 and 𝑊𝑊3 =1200 ≈ 2𝑊𝑊𝑊𝑊/3 ) were used for feature respectively. Furthermore, the performance of classifier was
extraction. All 16 extracted features were used for training the improved in terms of computational time as well. The
KNN model. For 𝑊𝑊=400, classification accuracy was observed computational time was reduced up to 73% in case of bearings
more than 90% for all values of k under observation. The and up to 48% in case of gears data set. Figures 9 and 10 show
minimum accuracy was 90.3% for K=2 and maximum reduction in computational time because of dimensionality
accuracy was 93.6% for K= 5&7. Confusion matrix for K=4 at reduction for bearings and gears data sets, respectively.
this window size is shown in Fig. 5a. By increasing the window Performance comparison of this study with few existing
size to 800, classification accuracy was improved for all values studies, who have used the Case Western Reserve University
of K under consideration. For 𝑊𝑊 = 800 , minimum Bearing data set, is given in Table 4.
classification accuracy of 97.3% was observed for K=2 and
maximum accuracy of 98.2 % was observed for K=4.
GA parameters
Confusion matrix for K=4 at this window size is shown in Fig (population size, Labelled Features
5b. Afterwards, window size was further increased to 1200. For stopping criteria)
this window size, 99.6% classification accuracy was observed
for K= 2 to k=10. Classification accuracy at this window size
( 𝑊𝑊 = 1200 ) was further improved to 100% after applying
Genetic Algorithm (GA) for features selection. GA selected
only 04 features (root means square, impulse, kurtosis and Initialize population
Shape Factor) which reduced the dimensions of features
matrix. Confusion matrix for K=4 at this window size are
shown in Fig 6a and 6b. Evaluate Fitness
For gears data set two window sizes ( 𝑊𝑊1 = 400 ≈
𝑊𝑊𝑊𝑊, 𝑊𝑊2 = 800 ≈ 2𝑊𝑊𝑊𝑊 ) were used for feature extraction.
Firstly, all 16 features were used for the classification. For
Ranking and Selection of
window size = 400, classification accuracy of above 92% was Best Fit individuals
observed for all values of K between 1-10. Maximum accuracy
was observed to be 94.6% at K=4. Confusion matrix for K=4
at this window size is shown in Fig 7. By increasing the
YES
window size to 800 classification accuracy was significantly Display Results
Stopping
improved for all values of K. Minimum classification accuracy (best features
criteria met?
was observed to be 98.2% for K=8 and 100% classification selected)
accuracy was achieved for K=2-5 (see Fig. 8a). When GA was NO

applied for feature selection, 03 features (kurtosis, skewness,


and standard deviation) were selected. Accuracy of GA (operators ) End
classification was improved to 100% for K=2-9 (see Fig 8b). Crossover and Mutation
Dimensionality of features matrix was reduced. For both data
sets it is observed that performance of KNN in terms of
classification accuracy is not much sensitive to value of K Elitism (Reserve the Best Fit
between 1-10. However, by increasing the window size for Individuals)
features extraction to 800,1200 for bearings and to 800 for
gears, performance of both classifiers was remarkably
Evaluate Final Fitness
improved. It shows that for proper feature extraction which will
carry the correct information regarding the equipment defect, it Figure. 4: Genetic algorithm for feature selection
6 Syed
SyedMuhammad
MuhammadTayyab / Procedia
Tayyab et al. /Manufacturing 00 (2021)000–000
Procedia Manufacturing 51 (2020) 266–273 271

Table 3. Classification Accuracy and Computational Time

Rolling Element Bearings Gears


ML KNN
Model
Window 400 800 1200 400 800
Size
Number 16 16 16 4(GA) 16 4(GA) Reduction 16 16 3(GA) 16 3(GA) Reductio
of of Time n of
Features (%) Time
K Classification Accuracy (%) CPU Time (Sec) Classification Accuracy CPU Time (%)
(%) (Sec)
1 90.7 98.0 99.2 100 4.52 2.24 50 92.0 99.1 99.1 2.78 1.69 39
2 90.3 97.3 99.6 100 4.09 2.12 48 93.7 100.0 100 2.68 1.61 40
3 93.4 97.7 99.6 100 4.47 2.28 49 93.8 100.0 100 2.70 1.67 38
4 92.2 98.2 99.6 100 4.05 1.84 54 94.6 100.0 100 2.77 1.80 35
5 93.6 97.7 99.6 100 4.64 1.86 60 94.2 100.0 100 2.77 1.77 36
6 92.9 97.7 99.6 100 5.60 1.98 65 92.0 99.1 100 2.58 1.66 36
7 93.6 97.7 99.6 100 5.31 2.02 62 93.7 99.1 100 2.70 1.61 40
8 93.2 97.7 99.6 100 5.97 1.59 73 92.9 98.2 100 2.64 1.38 48
9 93.2 97.7 99.6 99.6 4.83 1.66 66 92.9 99.1 100 2.61 1.36 48
10 93.3 98.0 99.6 99.6 4.58 1.75 62 92.0 98.3 99.1 2.70 1.85 32

a a

b b

Figure 5: Confusion charts for rolling elements bearings for (a) Figure 6: Confusion charts for rolling elements bearings for window
window size 400; (b) window size 800 size 1200 (a) all features; (b) with selected features
272 Syed Muhammad Tayyab et al. / Procedia Manufacturing 51 (2020) 266–273
Syed Muhammad Tayyab / Procedia Manufacturing 00 (2021)000–000 7

COMPUTATIONAL TIME FOR GEARS


(WINDOW SIZE 800)

without GA With GA

3.00
2.00

SEC
1.00
0.00
1 2 3 4 5 6 7 8 9 10
K

Figure 7: Confusion chart for gears at window size 400


Figure 10: Computational time comparison for gears

a Table 4. Comparison with existing studies

Features after Reduction

Types of Bearings used

Misclassification Rate
Extracted Features
Work /Approaches

Considered Classes
Feature Selection

Classifier
Approach
Zhang 21 Kernel 3 7 Drive SVM 0.47%
etal.,2 Principal End
013 Compone
[31] nt
Analysis
b

Jiang 16 Semi- 5 10 Drive SVMK 0.00%


etal.,2 supervised End NN 1.50%
013 kernel
[32] Marginal
Fisher
Analysis

Li 14 Linear 3 7 Drive Littlew 5.71%


etal.,2 Local End oods-
013 Tangent Paley
[33] Space SVM
Figure 8: Confusion chart for gears at window size 800 (a) Alignment
considering all features; (b) selected features

COMPUTATIONAL TIME FOR Zhu 8 None 8 10 Drive SVM 0.00%


BEARING (WINDOW SIZE 1200) etal.,2 End
Table 4. Comparison with Existing Approaches 014
Without GA With GA [34]

Baral 29 Wrapper 5 10 Drive KNN 0.01%


8.00 di search End
6.00 etal.,2
4.00 016
SEC

[21]
2.00 Ours 16 Genetic 4 4 Drive KNN 0.00 %
0.00 Algorithm End
1 2 3 4 5 6 7 8 9 10
K

Figure 9: Computational time comparison for bearings


Syed Muhammad Tayyab et al. / Procedia Manufacturing 51 (2020) 266–273 273
8 Syed Muhammad Tayyab / Procedia Manufacturing 00 (2021)000–000

Conclusion [13] Li C, Sanchez RV, Zurita G, Cerrada M, Cabrera D, Vásquez RE.


Multimodal deep support vector classification with homologous features and
its application to gearbox fault diagnosis. Neurocomputing. 2015 Nov
A case study was conducted for defect diagnosis of rotating 30;168:119-27.
machine elements (bearings and gears) using KNN classifier [14] Li C, Sanchez RV, Zurita G, Cerrada M, Cabrera D, Vásquez RE. Gearbox
and combination of statistical features extracted from time- fault diagnosis based on deep random forest fusion of acoustic and vibratory
domain signal and spectral kurtosis. Importance of appropriate signals. Mechanical Systems and Signal Processing. 2016 Aug 1;76:283-93.
segmentation of time domain raw vibrational signal for feature [15] Chen Z, Li C, Sanchez RV. Gearbox fault identification and classification
with convolutional neural networks. Shock and Vibration. 2015;2015.
extraction and subsequent feature selection through GA for [16] Chen Z, Li C, Sánchez RV. Multi-layer neural network with deep belief
defect classification was also studied. Different conditions for network for gearbox fault diagnosis. Journal of Vibroengineering. 2015 Aug
rolling element bearings and gears were considered and 15;17(5):2379-92.
observed that feature extraction has significant effect on [17] Kiral Z, Karagülle H. Simulation and analysis of vibration signals
diagnosis results. Selecting very small size of segment/window generated by rolling element bearing with defects. Tribology International.
2003 Sep 1;36(9):667-78.
for feature extraction to increase the amount of training data, [18] Orhan S, Aktürk N, Celik V. Vibration monitoring for defect diagnosis of
adversely affects the features quality and subsequently affects rolling element bearings as a predictive maintenance tool: Comprehensive case
the classification accuracy of ML classifier. Feature selection studies. Ndt & E International. 2006 Jun 1;39(4):293-8.
was not much useful for performance improvement of KNN [19] Wu JD, Hsu CC. Fault gear identification using vibration signal with
classifier in this case, because the classifier had already shown discrete wavelet transform technique and fuzzy–logic inference. Expert
systems with applications. 2009 Mar 1;36(2):3785-94.
very good performance while using all 16 features. However, [20] Vilakazi CB. Machine condition monitoring using artificial intelligence:
dimensionality of feature matrix was remarkably reduced, the incremental learning and multi-agent system approach (Doctoral
improving the classifier performance in terms of less dissertation).
computational time, thus making the demonstrated method [21] Baraldi P, Cannarile F, Di Maio F, Zio E. Hierarchical k-nearest
suitable for online monitoring and development of compact neighbours classification and binary differential evolution for fault diagnostics
of automotive bearings operating under variable conditions. Engineering
autonomous fault diagnosis systems as well. In future, the Applications of Artificial Intelligence. 2016 Nov 1;56:1-3.
comparison of GA will be carried out with other feature [22]Tian J, Morillo C, Azarian MH, Pecht M. Motor bearing fault detection
selection and dimensionality reduction methods using using spectral kurtosis-based feature extraction coupled with K-nearest
recommended optimal window size. neighbor distance analysis. IEEE Transactions on Industrial Electronics. 2015
Dec 17;63(3):1793-803.
[23] Do Carmo DA, Munaro CJ. Application of kNN and artificial neural
References network classifiers for fault detection and diagnosis in a diesel engine cooling
system. In2018 13th IEEE International Conference on Industry Applications
[1] Altmann J. The Application of Discrete Wavelet Packet Analysis for the (INDUSCON) 2018 Nov 12 (pp. 1067-1071). IEEE.
Detection and Diagnosis of Low Speed Rolling-element Bearing [24] Aherwar A. An investigation on gearbox fault detection using vibration
Faults (Doctoral dissertation, Monash University). analysis techniques: A review. Australian Journal of Mechanical Engineering.
[2] Baillie DC, Mathew J. A comparison of autoregressive modelling 2012 Jan 1;10(2):169-83.
techniques for fault diagnosis of rolling element bearings. Mechanical Systems [25] Chang CW, Lee HW, Liu CH. A review of artificial intelligence
and Signal Processing. 1996 Jan 1;10(1):1-7. algorithms used for smart machine tools. Inventions. 2018 Sep;3(3):41.
[3] Shebin EL. Fault diagnosis of rotating machinery based on vibration [26] Ali YH. Artificial Intelligence Application in Machine Condition
analysis. International Journal of Advanced Engineering and Global Monitoring and Fault Diagnosis. Artificial Intelligence: Emerging Trends and
Technology I. 2016;4(1). Applications. 2018 Jun 27:275.
[4] Li B, Chow MY, Tipsuwan Y, Hung JC. Neural-network-based motor [27] Wowk, Victor. "Machinery Vibration MeaSurement and Analysis,
rolling bearing fault diagnosis. IEEE transactions on industrial electronics. McGraW-Hil." (1999)
2000 Oct;47(5):1060-9. [28] Case Western Reserve University Bearing Data Center Website. Available
[5] Samanta B, Al-Balushi KR. Artificial neural network based fault online:https://csegroups.case.edu/bearingdatacenter/pages/download-data-file
diagnostics of rolling element bearings using time-domain features. (accessed on 18 December 2019).
Mechanical systems and signal processing. 2003 Mar 1;17(2):317-28. [29] Experimental Dataset for Gear Fault Diagnosis”, Amir Hosein Zamanian,
[6] Malhi A, Gao RX. PCA-based feature selection scheme for machine defect Southern Methodist University, Dallas, TX, 75205, USA. Available online:
classification. IEEE Transactions on Instrumentation and Measurement. 2004 https://goo.gl/TorZJq (accessed on 18 December 2019
Nov 22;53(6):1517-25. [30] Mitchell M. An introduction to genetic algorithms. MIT press; 1998.
[7] Su H, Chong KT. Induction machine condition monitorin using neural [31] Zhang, Y.,Zuo,H.,Bai,F.,2013.Classification of fault location and
network modeling. IEEE Transactions on Industrial Electronics. 2007 Feb performance degradation of a roller bearing.
5;54(1):241-9. Meas:J.Int.Meas.Confed.46(3),1178–1189.
[32] Jiang, L., Xuan, J., & Shi, T. (2013). Feature extraction based on semi-
[8] Liu H, Li L, Ma J. Rolling bearing fault diagnosis based on STFT-deep
supervised kernel Marginal Fisher analysis and its application in bearing fault
learning and sound signals. Shock and Vibration. 2016;2016.
diagnosis. Mechanical Systems and Signal Processing, 41(1-2), 113-126.
[9] Gan M, Wang C. Construction of hierarchical diagnosis network based on [33] Li, F.,Tang,B.,Yang,R.,2013.Rotating machine fault diagnosis using
deep learning and its application in the fault pattern recognition of rolling dimension reduction with linear local tangent space alignment. Meas.: J. Int.
element bearings. Mechanical Systems and Signal Processing. 2016 May Meas. Confed. 46 (8),2525–2539.
1;72:92-104. [34] Zhu, K.,Song,X.,Xue,D.,2014. A roller bearing fault diagnosis method
[10] Oh H, Jeon BC, Jung JH, Youn BD. Smart diagnosis of journal bearing based on hierarchical entropy and support vector machine with particle swarm
rotor systems: Unsupervised feature extraction scheme by deep learning. optimization algorithm. Measurement 47, 669–675.
InProceedings of the Annual Conference of the Prognostics and Health
Management Society, Denver, CO, UAS 2016 Oct (pp. 3-6).
[11] Guo X, Chen L, Shen C. Hierarchical adaptive deep convolution neural
network and its application to bearing fault diagnosis. Measurement. 2016 Nov
1;93:490-502.
[12] Zhang Z, Wang Y, Wang K. Intelligent fault diagnosis and prognosis
approach for rotating machinery integrating wavelet transform, principal
component analysis, and artificial neural networks. The international journal of
advanced manufacturing technology. 2013 Sep 1;68(1-4):763-73.

You might also like