Machine Learning Model Applications For Fault Detection and Classification in Distributed Power Networks (#1438528) - 3735102
Machine Learning Model Applications For Fault Detection and Classification in Distributed Power Networks (#1438528) - 3735102
2, 2021
1
Abstract—This paper compares various unsupervised finally cleared in the shortest amount of time to maintain
feature extraction techniques and supervised machine learning reliance and service continuity [4].
models for fault detection and classification in a power Methods for fault detection, classification, and location in
distributed generation system. The modified IEEE 34 bus test
power distribution systems in the presence of DGs and the
feeder was implemented for the study case simulated through
PowerFactory DigSILENT software. Data analysis results from intelligent agent’s incorporation have been studied and
three-phase voltages and currents collected were performed in published over the years [6, 10]. For instance, in [5] and [8],
Python. Simulation results confirm that by applying data mining has been incorporated into the protection
dimensionality reduction techniques such as feature extraction functions to extend performance limits by producing
and wavelet family selection adequately, high identification and enhanced conditions of states within the present protection
classification accuracy can be obtained, excluding the less
technology limitations over the distribution network. On the
essential characteristics and preventing the machine learning
models from overfitting or underfitting the datasets. other hand, in [11], they incorporate statistical machine
learning methods for preventive maintenance, using
Index Terms— Data mining, fault diagnosis, feature historical electrical data collected that has been turned into
extraction, machine learning. models that aim to forecast the risk of failures for
components and systems. Due to their versatility, high
I. INTRODUCTION capability, and high accuracy performances, machine
One of the most demanding high-quality services, which learning implementation for fault detection and classification
is experiencing vast and rapid development nowadays, is the in power systems has grown significantly in recent years
electricity supply [1]. Consequently, the electrical grid has [12]. In [1], [4], [6] and [18], authors have used Artificial
been defined as the most extensive engineered system Neural Networks (ANNs) for fault detection, classification,
worldwide due to its indispensability in our daily lives and and location, collecting three-phase voltages and currents as
importance to the economies and progress of countries [2, input data for performing the output predictions. In [7], two
3]. Likewise, continuity and reliability have become techniques for fault identification and classification have
essential requirements for customers that are particularly been presented based on TA-QSSVM and A-QSSVM,
susceptible to power blackouts [4]. Therefore, fault respectively, a modification of the Support Vector Machine
condition detection is critical for reliable services [5]. (SVM) algorithm; both methods are unsupervised and online
Furthermore, detecting short circuits in distribution networks with good performance during their accuracy calculation.
is much more difficult than in transmission networks because Besides, [9] introduces a protection scheme using statistical
they are typically unbalanced and asymmetrical due to the models like energy, entropy, and standard deviation for
increasing incorporation of renewable energy on the load microgrids using Wavelet Transforms (WTs) and Decision
side as distributed generation (DGs) [6]. This consequence Trees (DTs) as a discriminating function. [12] has presented
has triggered the modernization and development of the a semi-supervised machine learning approach based on co-
smart grids, incorporating modern measurement and training over a microgrid, where the harmony search
communication systems into the power systems’ real-time algorithm is implemented to identify optimal wavelet
monitoring [5, 7, 8]. Nowadays, the stakeholders’ main goal transform families during the data pre-processing step. In [3,
is to improve the power grids by making them more 14], the authors tested three machine learning algorithms and
intelligent, reliable, and sustainable. Fault detection systems used the third level of decomposition for the wavelet
yield a practical, fast, and reliable form of relaying transform in data pre-processing. The simulations showed
operations. Additionally, they should perform satisfactorily the high capability of the Random Forest algorithm (RF)
under multiple operating conditions and diverse electrical over the other algorithms. In [15], there are three families of
grid parameters. When it comes to fault detections, they are discrete wavelet transforms (DWT) for feature extraction
supposed to be detected first, then correctly classified, and over the input data (e.g., motor current). After their
extraction, classification results were performed by RF and
XGBoost machine learning algorithms. [16, 17], they
Manuscript received July 07, 2021; accepted June 05, 2022. implemented WT and DWT for feature extraction,
*Corresponding author: [email protected]
11
Cabus and Altaş: Machine Learning Model Applications for Fault Detection and Classification in Distributed Power Networks
respectively. Zero-sequence current has been collected to consisted of unbalanced loads distributed over the array of
perform fault detection in conjunction with Principal three-phases and one-phase (i.e., AN and BN) grid
Component Analysis, SVM, and the Adaboost+CART configurations; operational voltage levels are 24.9 kV and
algorithm as discriminant functions. 4.16 kV, with a total load of 1769 kW and 1044 MVAr
This research is based on the detection and classification distributed over the grid; two capacitors installed at buses
of faults in a power-distributed network. Despite the 844 and 848; and two regulators located in the line segments
previous contributions mentioned, a lack of interest was at 814-850 and 832-852, respectively. The test feeder is
observed in selecting the WT or DWT families for data pre- modelled and tested using the DIgSILENT PowerFactory
processing. Furthermore, through this research, it has been software. Additionally, a three-phase meshed topology
concluded that the appropriate selection of WFT or DWT arrangement has been incorporated between nodes 816 and
families and level of decomposition can improve the 832, with a length of 2.5km and 301 configurations [20].
performance and accuracy of the machine learning Distributed generations have been modelled and placed at
algorithms. Additionally, unsupervised dimensionality the weakest point over the feeder; two three-phase PV
reduction algorithms are applied to feature extraction systems and one generator model available in the software's
techniques. Simulation results validate that using feature static generator library of the DIgSILENT software are used
extraction techniques and wavelet packet transform selection and placed in nodes 840, 848, and 890, respectively [20-22].
effectively can achieve high identification accuracy by Fig. 1 shows the modelled IEEE 34 bus test feeder. Fault
removing the less relevant features from consideration, simulations have been carried out all over the electrical
preventing the machine learning algorithms from overfitting system and are stored in Excel files. The detection and
or underfitting the datasets. classification stages have been performed in Jupyter
The rest of the paper is organized as follows: Section II Notebook and Python 3 with the implementation of Numpy,
reviews the proposed methods. Simulation results are Skicit-learn, and Pandas libraries available in the software.
presented in Section III. Conclusions are presented in Computer information Intel (R) Core (TM) i5-6400 CPU
Section IV. @2.70Hz, RAM 8GB, X64 bits Windows 10 Enterprise.
feature extraction; additionally, the zero-sequence current TABLE II. STATISTICAL FEATURES
component has been used for ground fault detection and Feature Functions
classification. Energy (3)
R.M.S (5)
D. Wavelet Transform
A wavelet transform is a powerful tool consisting of low-
Variance (6)
and high-pass filters used in signal processing, voice
recognition, and a wide variety of applications due to its
good performance in frequency components (i.e. high and Standard Deviation (7)
low-frequency) [13]. In this paper, the features contained in
the voltage and current waveforms and zero-sequences are
extracted with the discrete wavelet transform [3, 9, 12, 13,
Kurtosis (8)
15]. The discrete wavelet transform (DWT) coefficients are
employed and collected using equation (2) derived from the
discretization of CWT [15]. Where (j, k) are integer
positives, ψ* represents the mother wavelet's complex Skewness (9)
conjugate, and x(t) is the signal analyzed.
(10
(2) Shape Factor
)
The use of several wavelet families for signal analysis has (11
Impulse Factor
been documented in previous research; however, no reason )
or technique for picking the mother wavelet has been
(12
presented. Aside from that, the mother wavelet selection and Crest Factor
)
decomposition levels may have a significant impact on how
regardless of its size the feature vectors are, which can result (13
Clearance Factor
in significantly different outcomes from the study. As a )
consequence, in this paper, a combination of the grid
searching method and the support vector machine is used to (14
Shannon’s Entropy
optimize the selection of wavelet family and grades of )
decomposition.
One of the essential procedures when dealing with data is
E. Feature Extraction that the data must be on the same scale because they might
The data collected from the simulation is raw data contain characteristics that vary significantly in magnitudes,
presented in quite large dimensions and unscaled. Thus, to units, and range, affecting the performance of the algorithms
obtain high accuracy, robust results, and low computational implemented for their analysis. Previous research has shown
complexity, the data must be as small as possible in that rescaling the data increases the accuracy and
dimensions. In this paper, the proposed method preprocesses performance of the algorithms. Therefore, the
the voltage and current signals through the discrete wavelet standardization function has been implemented, consisting
transform and extracts the most useful statistical features by of rescaling the data to have a mean of 0 and a standard
applying feature extraction techniques that might contain deviation of 1. Equation (15) shows its representation.
essential data during transient events such as faults or
perturbations [5, 9, 15, 24]. Then features are implemented (15)
to build the data-mining model using dimensionality
reduction and machine learning algorithms for fault
detection and classification. Twelve statistical features were F. Dimensionality Reduction
used for feature extraction in this paper; the dataset has 7655 Data compression is an essential subject in machine
instances, which contain three-phase voltage and current learning because it can help improve data storage,
with a 2-second duration. The details of the statistical computational efficiency, and predictive performance.
features are shown in Table II. Unsupervised dimensionality reduction techniques have
been implemented in this paper for feature extraction to
identify the most relevant and not relevant patterns
effectively, summarizing the original feature dataset from a
high-dimensional space onto a low-dimensional feature
13
Cabus and Altaş: Machine Learning Model Applications for Fault Detection and Classification in Distributed Power Networks
subspace for their processing [10, 24-27]. A summary of the TABLE IV. METRICS AND SCORING LIST
unsupervised dimensionality reduction techniques Name Functions
implemented in this paper is shown in Table III. Confusion (16
Matrix )
TABLE III. UNSUPERVISED DIMENSIONALITY REDUCTION
TECHNIQUES Accuracy
(17
Symbol Name )
PCA Principal Component Analysis Precision
KPCA Kernel Principal Component Analysis (18
)
LLE Locally Linear Embedding
Recall (19
Isomap Isometric Mapping )
DL Mini-Batch Dictionary Learning F1 (20
)
ICA Independent Component Analysis
Log-loss (21
G. Machine Learning Algorithms for Decision Making )
Machine learning algorithms aim to infer comprehensible
correlations or discover patterns between system variables in III. SIMULATION RESULTS
datasets that can be used later to forecast or comprehend
A. Wavelet family selection
system behaviors. Moreover, choosing the correct algorithm
for those tasks often depends on the amount, quality, and After calculating the zero components from the 3-phase
correlation of their data features, which involves a process of voltages and currents using equation (1), a subset of 600
trial and error [9, 24-27]. The proposed study uses five instances was collected randomly to process the best wavelet
supervised machine learning algorithms (i.e. Logistic family transform and decomposition levels using a zero-
Regression (LR), Support Vector Machine (SVM), Decision sequence voltage signal. In this paper, the Haar, db3, db4,
Trees (DT), Random Forest (RF), and K-Nearest Neighbors db6, Sym4, and Coif2 wavelets were selected and compared
(KNN)) in contrast with six unsupervised dimensionality in order to find the best wavelet family and decomposition
reduction algorithms to perform the data analysis in DWT level for signal analysis [3, 9, 12, 13, 15]. Table II: Twelve
selection, ground-fault detection, detection, and features were calculated from the subset. A proportion of
classification faults over the grid. 70:30 subset was used for training and testing models using
The machine learning algorithms’ robustness depends on the Grid Search algorithm and Support Vector Machine as a
their optimal hyperparameters’ adjustment. Therefore, the discriminant algorithm. The accuracy of detailed
proper values of the hyperparameters may boost the components at different decomposition levels is presented in
efficiency of the training model and test dataset accuracy Fig. 2. Accuracy results show that the db3 wavelet family at
[24-26]. In this paper, the Grid Search Algorithm has been the seventh decomposition level provides the most
proposed as an optimal model for hyperparameter selection. distinctive accuracy value (i.e., 96.85%); thus, the wavelet
This algorithm is based on implementing a trial and error decomposition step has been set to those values in order to
method to select the best parameters that give the best determine the features of the three-phase voltages and
accuracy from a subset of hyperparameters. However, currents for the next steps of the model.
choosing the best hyperparameters sometimes leads the
model to underfit or overfit the dataset, providing an
unsatisfactory performance. Consequently, a K-Folds Cross-
Validation model has been implemented to trade off the bias
and variance to avoid underfitting or overfitting the training
model [24-26].
H. Model Evaluation
The extracted features using pipeline models are fed into
the five machine learning algorithms. Quantifying the quality
of algorithms’ predictions is assessed by applying the
metrics and scoring listed in Table IV [26]. TN, TP, FP, and
FN mean True Negative, True Positive, False Positive, and Fig. 2. Wavelet family results
False Negative. For multiclass labels, a weighted average
B. Fault Detection Results
has been implemented; refer to [24-26]. Besides, Receiver
Operating Characteristic (ROC) curves and the Area Under The proposed fault detection algorithm is based on the
the Curve (AUC) percent have been implemented to zero-sequence voltage signal. The training and test ratios
visualize the algorithm’s performance. were chosen as 65:35, consisting of an unbalanced dataset
(i.e., no-faults and faults); twelve features were calculated
and scaled before performing dimensionality reduction. The
14
DATA SCIENCE AND APPLICATIONS, VOL. 4, NO. 2, 2021
15
Cabus and Altaş: Machine Learning Model Applications for Fault Detection and Classification in Distributed Power Networks
Fig. 12. Ground fault detection using Random Forest Fig. 15. Fault classification using Logistic Regression
Fig. 13. Ground fault detection using K-Nearest Neighbors Fig. 16. Fault classification using Support Vector Machine
Fig. 14. Ground fault detection ROC and AUC curves results
16
DATA SCIENCE AND APPLICATIONS, VOL. 4, NO. 2, 2021
17
Cabus and Altaş: Machine Learning Model Applications for Fault Detection and Classification in Distributed Power Networks
hyperparameters and reduce overfitting or underfitting in the vol. 34, no. 2, pp. 328-345, Feb. 2012, doi:
10.1109/TPAMI.2011.108.
models. The simulation results prove that some of the [12] T. S. Abdelgayed, W. G. Morsi and T. S. Sidhu, Fault Detection and
proposed models have achieved satisfactory performance in Classification Based on Co-training of Semisupervised Machine
fault detection and classification over the network studied. Learning, in IEEE Transactions on Industrial Electronics, vol. 65, no.
2, pp. 1595-1605, Feb. 2018, doi: 10.1109/TIE.2017.2726961.
[13] C., Hwan Kim and R. Aggarwal, Wavelet transforms in power
ACKNOWLEDGMENT systems. II. Examples of application to actual power system
Part of this work was presented orally at the IV. transients, in Power Engineering Journal, vol. 15, no. 4, pp. 193-202,
Aug. 2001, doi: 10.1049/pe:20010404.
International Conference on Data Science and Applications [14] H. Okumuş And F. M. Nuroğlu, Power System Event Classification
2021 with the title “Machine Learning Application for Fault Based on Machine Learning, 3rd International Conference on
Detection in Power Distributed Network”. Computer Science and Engineering (UBMK), Sarajevo, Bosnia And
Herzegovina, pp.402-405, 2018.
[15] N. Toma, R. and Kim, J.-M. Bearing Fault Classification of Induction
REFERENCES Motors Using Discrete Wavelet Transform and Ensemble Machine
[1] Karić A., Konjić T. Jahić A. Power System Fault Detection, Learning Algorithms. Appl.Sci. 2020,
Classification and Location using Artificial Neural Networks, In: 10,5251.https://doi.org/10.3390/app10155251Qwqwq
Hadžikadić M., Avdaković S. (eds) Advanced Technologies, [16] R., Papia and M., Debani, Support vector machine-based fault
Systems, and Applications II. IAT 2017. Lecture Notes in Networks classification and location of a long transmission line. Engineering
and Systems, vol 28. Springer, Cham. (2018) Science and Technology, an International Journal. 2016, 19.
https://doi.org/10.1007/978-3-319-71321-2_8 10.1016/j.jestch.2016.04.001.
[2] Y. Mo et al., Cyber–Physical Security of a Smart Grid Infrastructure, [17] Guo, Mou-Fa, Z. Xiao-Dan, C. Duan-Yu and Y. Nien-Che. Deep-
in Proceedings of the IEEE, vol. 100, no. 1, pp. 195-209, Jan. 2012, Learning-Based Earth Fault Detection Using Continuous Wavelet
doi: 10.1109/JPROC.2011.2161428. Transform and Convolutional Neural Network in Resonant
[3] H. Okumuş and F.M. Nuroglu, Wavelet Based Fault Detection and Grounding Distribution Systems. IEEE Sensors Journal.2017, PP. 1-
Classification Algorithm for a Real Distribution Feeder. EMITTER 1. 10.1109/JSEN.2017.2776238.
International Journal of Engineering Technology. 2019. 7. [18] Ray, P., Panigrahi, B.K. and Senroy, N., Hybrid methodology for
10.24003/emitter.v7i1.382. fault distance estimation in series compensated transmission line. IET
[4] Jamil, M., Sharma, S.K. and Singh, R. Fault detection and Gener. Transm. Distrib.,7:431-439.2013,https://doi.org/10.1049 /iet-
classification in electrical power transmission system using artificial gtd.2012.0243
neural network. SpringerPlus 4, 334 (2015). [19] J. Jiang et al., A Hybrid Framework for Fault Detection,
https://doi.org/10.1186/s40064-015-1080-x Classification, and Location—Part I: Concept, Structure, and
[5] H. A. Tokel, R. A. Halaseh, G. Alirezaei and R. Mathar, A new Methodology, in IEEE Transactions on Power Delivery, vol. 26, no.
approach for machine learning-based fault detection and 3, pp. 1988-1998, July 2011, doi: 10.1109/TPWRD.2011.
classification in power systems, 2018 IEEE Power & Energy Society 2141157.Qwqw
Innovative Smart Grid Technologies Conference (ISGT), Washington, [20] M. Davoudi, V. Cecchi and J. R. Agüero, Effects of stiffness factor
DC, USA, 2018, pp. 1-5, doi: 10.1109/ISGT.2018.8403343. on bus voltage variations in the presence of intermittent distributed
[6] Dehghani, Farzad & Khodnia, Fereydoun & Dehghan, Esfandiar. generation, 2015 North American Power Symposium (NAPS),
Fault location of unbalanced power distribution feeder with Charlotte, NC, USA, 2015, pp. 1-6, doi:
distributed generation using neural networks. CIRED - Open Access 10.1109/NAPS.2015.7335187.
Proceedings Journal. 2017. 1134-1137. 10.1049/oap- [21] R. C. Dugan and W. H. Kersting, Induction machine test case for the
cired.2017.0007. 34-bus test feeder -description, 2006 IEEE Power Engineering
[7] N. Shahid, S. A. Aleem, I. H. Naqvi and N. Zaffar, Support Vector Society General Meeting, Montreal, QC, Canada, 2006, pp. 4 pp.-,
Machine based fault detection & classification in smart grids, 2012 doi: 10.1109/PES.2006.1709506.
IEEE Globecom Workshops, Anaheim, CA, USA, 2012, pp. 1526- [22] K. Balamurugan, D. Srinivasan, and T. Reindl, Impact of Distributed
1531, doi: 10.1109/GLOCOMW.2012.6477812. Generation on Power Distribution Systems, Energy Procedia, Volume
[8] A. V. Masa, S. Werben and J. C. Maun, Incorporation of data-mining 25, 2012, Pages 93-100,ISSN1876-6102, https://doi.org/10.1016/
in protection technology for high impedance fault detection, 2012 j.egypro.2012.07.013.Sfds
IEEE Power and Energy Society General Meeting, San Diego, CA, [23] Sasa Mujovic, Snezana Vujosevic and Luka Vujosevic (2018) Zero-
USA, 2012, pp. 1-8, doi: 10.1109/PESGM.2012.6344893. Sequence Voltage-based Method for Determination and
[9] S. Samantaray, D. Mishra and G. Joos, A Combined Wavelet and Classification of Unloaded Overhead Line Operating Conditions at
Data-Mining Based Intelligent Protection Scheme for Microgrid, the Moment of Energization, Electric Power Components and
2018 IEEE Power & Energy Society General Meeting (PESGM), Systems,46:2,162-176,DOI:10.1080/15325008.201 8.1433252
Portland, OR, USA, 2018, pp. 1-1, doi: [24] Raschka S. and Mirjalili V., Python Machine Learning: Machine
10.1109/PESGM.2018.8586480. Learning and Deep Learning with Python, Scikit-learn, and
[10] Chen, K., Huang, C. and He, J., Fault detection, Classification and TensorFlow, Packt
location for transmission lines and distribution systems: a review on Publishing,2017,ISBN1787125939/9781787125933
the methods. High [25] Ankur A. Patel, Hands-On Unsupervised Learning Using Python,
Voltage,1:25,33,2016.https://doi.org/10.1049/hve.2016.0005. O’Reilly Media, Inc., 2019, ISBN9781492035640
[11] C. Rudin et al., Machine Learning for the New York City Power Grid, [26] https://scikit-learn.org/stable/index.html (Accessing date: 9 Feb 2021)
in IEEE Transactions on Pattern Analysis and Machine Intelligence, [27] https://medium.com/ (Accessing date: 14 Feb 2021)
[28] https://www.mathworks.com/help/ (Accessing date: 20 Feb 2021)
18