Machine Learning And Probabilistic Graphical Models For Decision Support Systems Kim Phuc Tran instant download
Machine Learning And Probabilistic Graphical Models For Decision Support Systems Kim Phuc Tran instant download
https://ebookbell.com/product/machine-learning-and-probabilistic-
graphical-models-for-decision-support-systems-kim-phuc-
tran-43881298
https://ebookbell.com/product/applied-stochastic-processes-chaos-
modeling-and-probabilistic-properties-of-numeration-systems-vincent-
granville-phd-44871872
https://ebookbell.com/product/applied-stochastic-processes-chaos-
modeling-and-probabilistic-properties-of-numeration-systems-vincent-
granville-44887996
https://ebookbell.com/product/machine-learning-a-probabilistic-
perspective-adaptive-computation-and-machine-learning-series-
illustrated-kevin-p-murphy-34631540
https://ebookbell.com/product/probabilistic-machine-learning-for-
finance-and-investing-early-release-1st-edition-deepak-
kanungo-48522166
Probabilistic Machine Learning For Finance And Investing Deepak
Kanungo
https://ebookbell.com/product/probabilistic-machine-learning-for-
finance-and-investing-deepak-kanungo-46122252
https://ebookbell.com/product/probabilistic-machine-learning-for-
finance-and-investing-1-converted-deepak-k-kanungo-56074418
https://ebookbell.com/product/hardwareaware-probabilistic-machine-
learning-models-learning-inference-and-use-cases-1st-ed-2021-laura-
isabel-galindez-olascoaga-36497074
https://ebookbell.com/product/machine-learning-and-internet-of-things-
for-societal-issues-ch-satyanarayana-46750136
https://ebookbell.com/product/machine-learning-and-deep-learning-
techniques-for-medical-science-k-gayathri-devi-47103764
Machine Learning and Probabilistic
Graphical Models for Decision
Support Systems
Editor
Kim Phuc Tran
Univ. Lille, ENSAIT, ULR 2461 - GEMTEX - Génie et Matériaux Textiles
F-59000 Lille, France
p,
p,
A SCIENCE PUBLISHERS BOOK
A SCIENCE PUBLISHERS BOOK
First edition published 2022
by CRC Press
6000 Broken Sound Parkway NW, Suite 300, Boca Raton, FL 33487-2742
and by CRC Press
4 Park Square, Milton Park, Abingdon, Oxon, OX14 4RN
Reasonable efforts have been made to publish reliable data and information, but the author and publisher cannot assume
responsibility for the validity of all materials or the consequences of their use. The authors and publishers have attempted to
trace the copyright holders of all material reproduced in this publication and apologize to copyright holders if permission to
publish in this form has not been obtained. If any copyright material has not been acknowledged please write and let us know
so we may rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or utilized
in any form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopying,
microfilming, and recording, or in any information storage or retrieval system, without written permission from the publishers.
For permission to photocopy or use material electronically from this work, access www.copyright.com or contact the
Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400. For works that are not
available on CCC please contact [email protected]
Trademark notice: Product or corporate names may be trademarks or registered trademarks and are used only for identification
and explanation without intent to infringe.
Library of Congress Cataloging‑in‑Publication Data (applied for)
The last decades have witnessed the rapid growth of advanced technologies and their application that is
leading to the fourth industrial revolution. The recent development of information and communication
technologies has engendered to add intelligence into the industrial process to drive continuous
improvement, knowledge transfer, and data driven-based decision-making. The Internet of Things (IoT)
is one of the main technologies used to enable enabling industrial organizations to rapidly automate
and digitize traditional business processes. A huge volume of data collected can feed real-time analytic
solutions provided by Artificial Intelligence (AI), Big Data Analytics, and Decision Support Systems
(DSS), which can lead to optimal industrial operations. Based on modern technologies of the IoT, the
process of collecting, transforming, and storing data from all stages of the industrial process becomes
easier and more efficient, promoting the era of Big Data. AI algorithms provide powerful tools for
exploiting the wealth of data generated in the IoT. By extracting useful information and features from
Big Data, the AI algorithms allow complex tasks such as monitoring, and optimizing the production
process to be performed smartly and efficiently. To combine human knowledge with these above results,
DSS is integrated to help managers to make better decisions in their work.
In the era of Big Data, DSS has become vital for organizations. Machine Learning, a subfield
of AI, is a useful technology to process and analyze Big Data are a useful methodology for DSS
with a combination of data dictated and human-driven analytics. DSS applications can be used in a
vast array of diverse fields, such as making operational decisions, medical diagnosis, and predictive
maintenance. There is a lot of research in the literature regarding the development and application
of DSS. In this book, the chapters are proposed in such a way as to explore every important aspect
of Machine Learning and Probabilistic Graphical Models for Decision Support Systems. This book
presents recent advancements in research, new methods and techniques, and applications of DSS with
Machine Learning and Probabilistic Graphical Models, which are very powerful techniques to extract
knowledge from Big Data effectively and interpret decisions. The book undertakes to stimulate scientific
exchange, ideas, and experiences in the field of DSS applications. Researchers and practitioners alike
will benefit from this book to enhance the understanding of machine learning, Probabilistic Graphical
Models, and, especially, their use in DSS in the context of decision making with uncertainty. The real-
world case studies in various fields with guidance and recommendations for the practical applications of
these studies are introduced in each chapter. Current researches, trends, future directions, opportunities,
etc. will be discussed, making it friendly for beginners and young researchers.
20 November 2021 Kim Phuc Tran
Univ. Lille, ENSAIT, ULR 2461 - GEMTEX - Génie et Matériaux Textiles
F-59000 Lille, France.
[email protected]
Contents
Preface iii
Acronyms xi
3. Decision Support Systems for Anomaly Detection with the Applications in Smart 34
Manufacturing: A Survey and Perspective
Quoc‑Thông Nguyen, Tung Nhi Tran, Cédric Heuchenne and Kim Phuc Tran
1 Introduction 34
2 Decision Support Systems for Smart Manufacturing 35
3 Anomaly Detection in Smart Manufacturing 37
3.1 Smart Predictive Maintenance 37
3.2 Integrated Wearable Technology 38
3.3 Production Monitoring 40
3.4 Real-time Cybersecurity 41
4 Difficulties and Challenges of Anomaly Detection Applications in Smart 42
Manufacturing
5 Perspectives for Anomaly Detection in Smart Manufacturing 43
6 Case Studies 47
6.1 Anomaly Detection in Production Monitoring 47
6.2 Anomaly Detection in Predictive Maintenance 49
7 Concluding Remarks 52
References 53
4. Decision Support System for Complex Systems Risk Assessment with Bayesian 62
Networks
Ayeley Tchangani
1 Introduction 62
2 Bayesian Technology 64
3 BN Model for Event Oriented Risk Management 64
3.1 Variables Identification 64
3.2 Relationships Identification 66
3.3 Usage of the model 67
3.4 Illustrative Case Study in Natural Risk Management 67
4 BN for Risk Management in Industrial Systems 71
5 DBN for Risk Management of Industrial Systems 75
5.1 Brief Presentation of DBN 75
5.2 Illustrative Case Study 76
6 EOOBN for Risk Management 78
6.1 Extended Object Oriented Bayesian Network 80
6.1.1 Construction of an EOOBN 80
6.1.2 Case Study 83
7 Conclusion 85
References 86
v
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
5. Decision Support System using LSTM with Bayesian Optimization for Predictive 88
Maintenance: Remaining Useful Life Prediction
Huu Du Nguyen and Kim Phuc Tran
1 Introduction 88
2 Predictive Maintenance and Remaining Useful Life Prediction 89
3 Machine Learning based Decision Support System for Predictive Maintenance 92
4 Long Short Term Memory Networks using Bayesian Optimization 93
4.1 Long Short Term Memory Networks 93
4.2 Bayesian Optimization 95
5 Decision Support System for Remaining Useful Life Prediction using LSTM with 96
Bayesian Optimization
6 A Case Study 98
7 Conclusion and Perspectives 99
References 101
6. Decision Support Systems for Textile Manufacturing Process with Machine 107
Learning
Zaohao Lu, Zhenglei He, Kim Phuc Tran, Sebastien Thomassey, Xianyi Zeng and
Mengna Hong
1 Introduction 107
2 Relevant Literatures 108
2.1 Intelligent Techniques used for Textile Process Modeling 109
2.1.1 Artificial Neural Networks 109
2.1.2 Fuzzy Logic 110
2.1.3 Fuzzy Inference System 111
2.1.4 Support Vector Machine 111
2.1.5 Gene Expression Programming 111
2.2 Decision-making of Textile Manufacturing Process 111
2.2.1 Classic Methods 112
2.2.2 Meta-heuristic Methods 112
2.2.3 Multi-criteria Meta-heuristic Methods 112
3 Case Study: Decision-making of Denim Ozonation 114
3.1 Problem Formulation 114
3.2 Methodology 115
3.2.1 ANN Model 115
3.2.2 Determining the Criteria Weights using the AHP 116
3.2.3 The Markov Decision Process 117
3.2.4 The RL Algorithm: Q-learning 118
3.3 Case Study 118
3.3.1 Results and Discussion 119
4 Conclusion 120
References 121
vi
Contents
vii
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
8. Machine Learning for Compositional Data Analysis in Support of the Decision 184
Making Process
Thi Thuy Van Nguyen, Cédric Heuchenne and Kim Phuc Tran
1 Introduction 184
2 Modeling of Compositional Data 185
3 Machine Learning for Multivariate Compositional Data 187
3.1 Principal Component Analysis 188
3.2 Clustering 191
3.3 Classification 193
3.3.1 Support Vector Machine Classification using Ilr—Transformation 194
3.3.2 Support Vector Machine Classification using Dirichlet Feature 196
Embedding Transformation
3.4 Regression 198
4 Anomaly Detection using Support Vector Data Description 203
4.1 Support Vector Data Description 203
4.2 Anomaly Detection using SVDD with Dirichlet Density Estimation 204
4.2.1 Transform CoDa using Dirichlet Density Estimation 204
4.2.2 Anomaly Detection using SVDD with Dirichlet Density-transformed Data 206
4.2.3 An Example of Anomaly Detection using SVDD 207
5 Conclusion 209
References 210
9. Decision Support System with Genetic Algorithm for Economic Statistical Design 216
of Nonparametric Control Chart
Alejandro Marcos Alvarez, Cédric Heuchenne, Phuong Hanh Tran and Alireza Faraz
1 Introduction 216
2 Background 218
2.1 Statistical Process Monitoring with Control Chart 218
2.2 Parametric and Nonparametric Control Charts 219
2.2.1 The x– Chart 219
2.2.2 The SN Chart 219
2.2.3 The SR Chart 220
2.3 Related Works 221
3 Economic Statistical Design of SN & SR Control Charts 222
4 Experiments 225
5 Results Discussion 228
6 Conclusions 231
References 232
Appendix 234
viii
Contents
11. Intellectual Support with Machine Learning for Decision-making in Garment 272
Manufacturing Industry: A Review
Yanni Xu and Xiaofen Ji
1 Introduction 272
2 Problems in Garment Manufacturing 273
3 Garment Manufacturing using Machine Learning 276
4 Popular Machine Learning Algorithms 277
5 Potential Machine Learning Applications in Garment Manufacturing 280
6 Case Study 284
7 Conclusion 287
References 288
ix
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
12. Enabling Smart Supply Chain Management with Artificial Intelligence 294
Thi Hien Nguyen, Huu Du Nguyen, Kim Duc Tran, Dinh Duy Kha Nguyen and
Kim Phuc Tran
1 Introduction 294
2 AI for Demand Forecasting 296
3 AI for Logistics 297
4 AI for Production 298
5 AI for Decision Support Systems in SCM 299
6 Blockchain Technique for SCM 301
7 Case Study 303
8 Conclusion 304
References 305
Index 311
About the Editor 317
x
Acronyms
xii
Chapter 1
Univ. Lille, ENSAIT, ULR 2461 - GEMTEX - Génie et Matériaux Textiles, F-59000 Lille, France.
Email: [email protected]
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
model, Analytic Hierarchy Process and Q-learning for supporting the decision-making of textile
manufacturing process.
Truong Thu Huong, Nguyen Minh Dan, Le Anh Quang, Nguyen Xuan Hoang, Le Thanh
Cong, Kieu Ha Phung, and Kim Phuc Tran develop in their chapter, “Anomaly Detection
enables Cybersecurity with Machine Learning techniques” an overview of Cybersecurity issues
for Industrial systems, IoT-based Industrial systems, and the cyberattack detection issues for
Industrial Control Systems.
Thi Thuy Van Nguyen, Cédric Heuchenne and Kim Phuc Tran develop in their chapter,
“Machine learning for compositional data analysis in Support of the Decision Making Process”
a review of several researches related to applying ML to compositional data, including principal
component analysis, clustering, classification, and regression. They introduced a transformation
method based on Dirichlet density estimation to transform CoDa into real data and apply this
transformed data in anomaly detection using Support Vector Data Description.
Alejandro Marcos Alvarez, Cédric Heuchenne, Phuong Hanh Tran and Alireza Faraz develop
in their chapter, “Decision support system with Genetic Algorithm for economic statistical de-
sign of Nonparametric control chart” an economic statistical design (ESD) for two nonparametric
control charts based on the sign and the Wilcoxon signed-rank tests. Genetic Algorithm based
DSS is used to find the optimal parameters of the designed charts.
Jonathan Villain, Virginie Deniau and Christophe Gransart develop in their chapter, “Jam-
ming Detection in Electromagnetic communication with Machine Learning: a survey and per-
spective” an overview on the threat of the jamming on the use of wireless communication and
they shown the interest of ML to help to counter this threat.
Yanni Xu and Xiaofen Ji develop in their chapter, “Intellectual Support with Machine Learn-
ing for Decision-making in Garment Manufacturing Industry: A Review” insights and relevant
references for both researchers and practitioners on the machine learning-based decision support
for smart manufacturing in garment industry 4.0.
Thi Hien Nguyen, Huu Du Nguyen, Kim Duc Tran, Dinh Duy Kha Nguyen, and Kim Phuc
Tran develops in their chapter, “Enabling Smart Supply Chain Management with Artificial
Intelligence” a comprehensive overview of the applications of the AI Technique in Supply Chain
Management.
3 Conclusion
The book undertakes to stimulate scientific exchanges, ideas, and experiences in the field of DSS
applications. Researchers, postgraduate students, and practitioners alike will benefit from this
book to enhance the understanding of Machine Learning, Probabilistic Graphical Models, and
their use in DSS in practice, and especially in the context of decision making with uncertainty.
The real-world case studies in various fields with guidance and recommendations for the practical
applications of these studies are introduced in each chapter.
3
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
References
[1] H.D. Nguyen, K.P. Tran, S. Thomassey and M. Hamad. Forecasting and anomaly detection
approaches using LSTM and LSTM Autoencoder techniques with the applications in supply
chain management. International Journal of Information Management, 57: 102282, 2021.
[2] K.P. Tran. Artificial intelligence for smart manufacturing: Methods and applications. Sensors,
21(16): 5584, 2021.
[3] K.P. Tran, S. Thomassey, X. Zeng, C. Yi, Z. He and J. Xu. Modeling of textile manufacturing
processes using intelligent techniques: A review. The International Journal of Advanced
Manufacturing Technology, pp. 1–29, 2021.
[4] C.W. Holsapple, R.H. Bonczek and A.B. Whinston. Foundations of Decision Support Systems.
Academic Press, 2014.
[5] J. Xu, S. Thomassey, X. Zeng, C. Yi, Z. He and K.P. Tran. Multi-objective optimization of
the textile manufacturing process using deep-q-network based multi-agent reinforcement
learning. Journal of Manufacturing Systems, 2021.
[6] K. Topuz and D. Delen. A probabilistic bayesian inference model to investigate injury severity
in automobile crashes. Decision Support Systems, p. 113557, 2021.
[7] D. Koller and N. Friedman. Probabilistic Graphical Models: Principles and Techniques. MIT
Press, 2009.
[8] J. Xu, S. Thomassey, X. Zeng, C. Yi, Z. He and K.P. Tran. A deep reinforcement learning based
multi-criteria decision support system for optimizing textile chemical process. Computers in
Industry, 125: 103373, 2021.
[9] T.H. Nguyen, K.D. Tran, P.H. Tran, A. Ahmadi Nadi and K.P. Tran. Application of Machine
Learning in Statistical Process Control Charts: A Survey and Perspective, pp. 7–42. Springer
International Publishing, Cham, 2022.
[10] M.L. Dao, D.L. Tran, M.D. Nguyen, A.Q. Le, D.T. Bui, K.P. Tran, T.H. Truong and P.B. Ta.
Detecting cyberattacks using anomaly detection in industrial control systems: A federated
learning approach. Computers in Industry, 132: 103509, 2021.
4
Chapter 2
1 Introduction
Probabilistic graphical modeling (PGM) deals with the branch of machine learning which studies
the use of probability distributions to describe a given event to make useful predictions about
it. PGM is widely used throughout machine learning and in many real-world applications. Such
techniques can be used to address problems in fields such as medicine, language processing, and
computer vision. This combination of theory and powerful applications makes PGMs one of the
most interesting topics in the modern era of artificial intelligence (AI). One major advantage
of probabilistic models is that they provide an idea about the uncertainty associated with
predictions. Such ideas related to uncertainty and confidence are of extreme utility when it
comes to sensitive and critical machine learning applications, such as clinical healthcare. To
understand probabilistic models at the abstract level, let us consider a classification problem
with N classes. If the model is probabilistic it will provide a probability for each of the N classes
for a given input, i.e., the model which provides a probability distribution over the N classes.
Usually, we consider the class with the highest probability as the output class. Typical examples
of probabilistic models in machine learning are logistic regression, hidden Markov models and
Bayesian classifiers, and neural networks with the softmax function (we will discuss in detail in
later sections). Note that logistic regression based on the sigmoid function can be considered
as an exception, as it provides the probability in relation to one class only.
Another way to understand the difference between probabilistic and non-probabilistic models
is their respective objective functions. For example, in linear regression, the objective function
is based on the squared, where the objective is to minimize the Mean Squared Error (MSE) or
Root Mean Squared Error (RMSE), the later is defined by Eq. 1.
n
1
RMSE = (y − y )2 , (yi − yi )2 (1)
n i=1
1Univ. Lille, ENSAIT, ULR 2461 - GEMTEX - Génie et Matériaux Textiles, F-59000 Lille, France.
2School of Computing & Institute of Cyber Security for Society (iCSS), University of Kent, UK.
* Corresponding author: [email protected]
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
Here, n is the total number of data samples, yi is the true label, and yi is the predicted label.
�
The intuition behind this is to calculate the RSME by predicting a given data point based on
the difference between the actual value and the predicted value. As the objective function
here is not based on probabilities, but on the absolute difference between the actual value and
the predicted value they can be considered as non-probabilistic models. Typical examples of
non-probabilistic models are Support Vectors Machines (SVMs) and @Shujun: sigmoid predicts
the probability of occurrence of a binary outcome. It is a special case of linear regression
as it predicts the probabilities of outcome using log function. We use the activation function
(sigmoid) to convert the outcome into categorical value. In regards to probabilistic models, such
as neural networks with softmax output function, the objective function is usually cross-entropy
(binary cross-entropy in case of a binary classifier), given by Eq. 2. Here, p(yi ) and yi is the
predicted label and true label of data sample i respectively. The intuition behind cross-entropy
is; if the probabilistic model predicts the true class of a data point with high confidence, the
loss will be less.
n
1n
CS = − (log (p(yi))) (2)
n i=1
where y is the dependent variable which we want to predict, and x is the independent variable,
upon which y is dependent, paramaterized by α. For example, y may be the price of a car, and x
are features that affect price, e.g., color, the number of seats, the engine size, etc. We assume
that y is a linear function of x. However, real-world events are very complicated to model
because they involve a certain amount of uncertainty. Therefore, we model such events in the
form of probability distributions, represented as p(x, y). The probabilistic aspect of modeling
has significant importance, because we cannot perfectly predict the future as the world is often
stochastic. Moreover, we need to assess the confidence o f o ur p redictions. I t i s o ften t he case
that predicting a single value is not enough, we need the system to output its beliefs about what
is going on in the event. To overcome this, we can write the probability model as a product of
conditional probabilities.
n
(4)
n
P (y, x1 , x2 , . . . , xn ) = p(y) p(xi |y)
i=1
A small number of parameters (e.g., weights in case of machine learning) can be used to describe
each factor p(xi |y).
6
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
test results, medical images, and information about medication remotely anytime 3 . Moreover,
healthcare requires responsibility in managing a large amount of healthcare-related information.
It can be done by proper modeling of information that field experts can continuously build a
strong policy of welfare. The main goal of DSS is to provide experts with information when
it is needed. Such systems provide knowledge, models, and data processing tools to help the
experts in making efficient and better decisions in many situations. The goal of such systems
is to resolve several problems in healthcare to help patients and their families, and the clinical
practitioners manage their healthcare by providing better access to these services 4 .
A lot of research has been carried in the application of DSS in healthcare. Feinleib 5 suggest
that data mining methods are promising in the application of DSS in healthcare. A prototype
of a system for self-management in healthcare to assist patients with diabetes and to track their
blood glucose levels has been developed in 6 . Goldberg et al. 7 used web-based DSS for depart-
ments related to emergency to assess the performance features source of the recommendations
generated by experts. The results show that a remote clinical decision support system decreases
time-to-trial in the decision support to clinical interventions.
In regards to the importance of DSS in healthcare, this chapter reviews some of the re-
search work on healthcare DSS based on Probabilistic Graphical Models (PGMs) 8 and machine
learning. The rest of the chapter is organized as follows: Section 2 discusses decision support
systems in Healthcare. Section 3 presents a review about the application of artificial intelli-
gence in healthcare. Section 4 discusses healthcare DSSs based on PGMs. Section 5 provides
perspectives for Healthcare DSSs based on PGMs. Section 6 provides case studies of DSS in
healthcare. Section 7 concludes the chapter.
provide timely information at the point of care to inform patient care decisions. The use cases
of decision support systems can be summarized as follow;
1. Clinical Management: DSS can alert healthcare practitioners to reach out to patients who
have not followed management schedules, or are due for a follow-up, and help identify
patients eligible for research based on specific criteria 14 .
2. Diagnosis Support: DSSs for healthcare diagnosis, known as diagnostic decision support
systems (DDSs) have traditionally provided computerized support, whereby they might be
provided an input (data/user selections), and then the output of possible diagnoses 15;16;17 .
Moreover, the healthcare industry generates a large amount of data. Consequently, DSSs
are used extensively to capture and transfer information. Therefore, in this section, we will
briefly overview various classification techniques for healthcare decision support systems. In
other words, this section summarizes some of the historical and state of the art decision support
systems in healthcare, and analyzes the success factors needed for widespread deployment, and
postulates the future trends of the field in the context of a new decision management paradigm.
9
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
Both types contain the properties of conditional probabilities and independence. However,
they can encode the different sets of independence and the conditional probabilities of the
distribution. Each type is discussed as follows.
Figure 3: Bayesian networks over three variables, encoding different types of dependencies.
1. Difficulty: Takes values 0 and 1 for minimum and maximum difficulty, respectively.
2. Intelligence: Takes values 0 and 1 for not intelligent and intelligent, respectively.
3. Research output: Takes values 1, 2 and 3 for good, average and bad research, respectively.
The edges in the graph show the dependencies in the graph. The Research Output of a re-
searcher depends on the Difficulty of the research area and the Intelligence of the researcher.
The Research output, in turn, determines the number of publications. Note that the direction
of arrows shows the cause-effect relationships. Difficulty affects the Research Output score, but
the Research Output does not influence the Difficulty. Finally, let us look at the tables asso-
ciated with each of the nodes. Formally, these are called conditional probability distributions
(CPDs), as shown in Figure 5. The CPDs for Difficulty and Intelligence are easy to compute,
because these variables are independent. The tables basically encode the probabilities of these
variables, taking values from 0 to 1. You might have noticed, the values in each of the rows must
sum to 1.
11
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
Next, let us look at the CPD for Research Output. Each row corresponds to the values that
its parent (Difficulty and Intelligence) can take, and each column corresponds to the values
that Research Output can take. Each cell has the conditional probability p(RsearchOutput =
RS|Intelligence = I, Dif f iculty = D), that is, given that the value of Intelligence and
Difficulty , what is the probability of the value of Research Output being RS. For example,
as P (ResearchOutput = RS 1 |Dif f iculty = D1 , Intelligence = I 1 ) is 0.5, that is, if the
intelligence of the researcher and the difficulty of the research area is high, then the probability
of the research output to be good is 0.5. The CPD for Research Articles is easy to understand
with the above knowledge. Because it has one parent, the conditional probabilities will be of the
form P (ResearchArticles = RA1 |ResearchOutput = RS 1 ), that is, what is the probability
of Research Articles being RA, given that the value of Research Output is RS. Each row now
corresponds to a pair of values of Research Output. Again, the row values add up to 1. An
essential requirement for BN is that the graph must be a directed acyclic graph (DAG).
Thanathornwong 20 designed a clinical decision support system to help general practitioners
assess the need for orthodontic treatment in patients with permanent dentition. Particularly, a
BN is used as the underlying model for assessing the need for orthodontic treatment. Around
one thousand permanent dentition patient’s datasets chosen from a hospital record system were
prepared in which one data element represented one participant with information for all variables
and their stated need for orthodontic treatment. The proposed system in this work provided
promising results; it showed a high classification accuracy for classifying groups into needing
and not needing orthodontic treatment.
Where C denotes the set of cliques of G, and each factor βc is a non-negative function over
cliques. The partition function � �
Z= βc (Xc ),
x1 ,...,XN c∈C
12
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
Figure 6: Undirected graphical representation of a joint probability of voting preferences over four individ-
uals. Colors illustrates the pairwise preference present in the model.
is a normalizing constant which sums the distribution to one. Hence, given a graph G, there
might be factors in the probability distribution whose scope is any clique in G, it can be a single
node, an edge, etc. It is important to note that there is no need to specify a factor for each
clique. In the example above, a factor is defined over each edge (which is a clique of two nodes).
Nevertheless, cliques over single nodes have been specified.
In regards to the application of BN and MRF Rajinikanth et al. 23 present a Firefly Algo-
rithm and Shannon Entropy (FA+SE) based multi-threshold to increase the pneumonia lesion
and implements MRF segmentation to identify the lesions with better accuracy. 24 developed a
system based on BN which uses Bayesian reasoning to compute posterior probabilities of pos-
sible diagnoses depending on the given symptoms. This system was developed for diagnosis in
Internal Medicine and now covers about 1500 diagnoses in this domain, based on thousands of
findings. Barnett et al. 25 proposed a system called DXplain which uses a modified form of the
BN. It generates a list of ranked diagnoses associated with the given symptoms. It finds its
use particularly for healthcare practitioners who lack computer expertise. It is also used as a
reference with a searchable database of diseases and clinical manifestations. SimulConsult 26 ,
utilizes BN to input data in a scalable fashion and compute probabilities, accomplishing it by
focusing specialty by specialty. It uses a statistical pattern-matching method which consists of
the onset and offset of the findings in each disease. Table 1 presents a summary of a few past
developments and applications of BN and MRF in the healthcare sector.
13
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
Figure 7: The illustration of major phases for development of DNN based healthcare systems. Figure
adopted from 37 .
The architecture of DNN mainly consists of three layers: Input, Output, and Hidden layer(s).
These networks are made of nodes called neurons. Weights and biases are the connection between
nodes of different layers, which are used to propagate the input between the nodes. DNN are
able to work with incomplete data which gives educated guesses about missing data and gets
improved with adaptive system learning. A method for training an unsupervised fashion is
autoencoders 38 . An autoencoder learns features of a dataset, typically of lower dimensions.
Autoencoder is a type of DNN that learns to reconstruct its input in the output. It has an
internal representation layer that describes a code used to represent the input, and it is made
up of an encoder that translates the input into the latent space, and a decoder that maps
the latent space to reconstructed input. A lot of improvement has been made so far in the
architecture and algorithm of DNN to make them learn without any supervised pretraining.
Such as, the use of RELU activation f (z) = max(z, 0), which learns more efficiently in a
multi-layer model.
A typical DNN is depicted in Figure 8. In this DNN the convolutional layer along with
the max-pooling layers are used for feature extraction and the dense layer is used for classifica-
tion. The output dense layer often uses a sigmoid function in the case of binary classification
and a softmax function in the case of multiclass classification. DNN have been used in several
14
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
applications, such as image classification, computer vision, activity recognition, and deep rein-
forcement learning. For example, Kharat and Dudul 39 proposed a healthcare decision support
system based on Jordan/Elman neural network for the diagnosis of epilepsy. The proposed
system obtained comparatively a high overall training accuracy 99.83% and testing accuracy of
99.92%. A decision support system based on a DNN for the classification of heart-related dis-
eases into 5 categories of heart disease with 97.5% accuracy by using multilayer perceptron with
backpropagation training algorithm is proposed in 40 . Janghel et al. 41 proposed a decision sup-
port system using an artificial neural network to classify the fetal delivery method into normal
or surgical. They primarily used three different algorithms to train the neural network: radial
basis function, back propagation algorithm, and learning vector quantization network with an
accuracy 99%, 93.75%, and 87.5% respectively. Researchers have proposed a large number of
methods to apply DSS in healthcare. For example, 42 explains the role played by the DSS.
Luque Gallego 43 describes the medical decision-support system for the mediastinal staging of
non-small cell lung cancer, which is also known as called Mediastinet. Table 2 presents summary
of selected works in the applications of DNN for healthcare.
15
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
1. Explanations: DNN are often complex and difficult to explain the reasons behind their
predictions and decisions. Explanations about the model’s decision making important for
some applications such as medical prognosis and diagnosis 52 .
3. Robustness: similar to the first point, it is hard to know about the aspect of input DNN
are using to take decisions.
PGM can help to solve these shortcomings, thereafter there is an opportunity to use these ap-
proaches to take advantage of their complementary strengths. For example, PGM provides a
practical way to represent dependence relationships between variables and spatial relations 53;54 .
DNN outperforms other approaches in classification. Thus to integrate PGM and DNN is by
representing the structure of a complex problem through PGM, followed by the use of DNNs
as classifiers for different elements of the underlying problem. The DNN, trained on labeled
data, provide an initial estimation; then these initial estimates can be combined and improved
through belief propagation in the graphical model. This approach can be used for efficient
training of the model, as each one only considers a particular dataset. Spatial analysis problems
are hybrid systems in which the above-mentioned systems can be useful. For instance, human
activity estimation, in which body parts have a certain spatial structure; this structure pro-
vides constraints that can be used by a graphical model. The spatial constraints between the
distinct elements in the model can be represented in terms of a Markov network, showing the
constraints as the local joint probabilities of neighboring elements. These elements are detected
and classified using a DNN. Another type of such problem is temporal modeling. In temporal
modeling, the outputs change and evolve over the time usually depending on the previous state,
for example, time-series. Markov chain and hidden Markov are often used to represent such
problems. In the hybrid system, DNN can be used to classify the state-based observation, and
the Markov model can be used for encoding the temporal relations. The application of such
systems is human activity recognition and speech recognition. A toy example of such hybrid
systems has been shown in Figure 9. Variational Autoencoders (VAE) are another such example.
Due to the increasing popularity of VAEs in anomaly detection, they have been used in various
fields, such as, Healthcare 55 , cybersecurity 56 and various other applications being discovered
with time. One of the important use of VAE in healthcare is anomaly detection. The idea is to
train the VAE using normal data and note down the corresponding reconstruction error. When
the VAE is subjected to anomaly data the reconstruction error is usually high. Hence, data
with reconstruction error more than that of the normal data is considered anomalous. In order
to give more in-depth details owing to the growing utility of VAEs in anomaly detection, we will
discuss a case study about VAE in healthcare anomaly detection in Section 6.2. Researchers
have proposed such hybrid architectures for human activity recognition. For example, in 57 an
architecture for the recognition of the human posture in video sequences was developed. The
proposed model consists of a convolutional neural network-based detector and a hidden markov
16
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
model (CoHMM). The integration of both models allows learning spatial and temporal depen-
dencies. The detector recognizes the different joints based on a convolutional neural network
(CNN), and uses the spatial correlations between neighboring regions through a conditional
random field (CRF) 58 . Whereas, the CoHMMs computes the best possible movement sequence
among interacting processes.
More interesting research that combines deep neural networks and graphical models include:
conditional random fields as recurrent neural networks have been summarized in 59 .
by building some relationships between the input and the outcome of interest. Recently, a
hybrid of unsupervised learning and supervised learning, known as semisupervised learning
has been proposed which is suitable for scenarios where the outcome is missing for certain
subjects. Clustering and principal component analysis (PCA) are two famous and extensively
used unsupervised learning techniques. Clustering groups data points with similar features
together into clusters, without using the labeled outcome information. Clustering algorithms
predict the cluster labels as output for the given input data point by maximizing and minimizing
the similarity of the data within and between the clusters. Most popular clustering algorithms
include k-means, hierarchical, and Gaussian mixture clustering. On the other hand, PCA plays
a key role in the dimension reduction of complex data. Especially, when the recorded data is
multi-dimensional. For example, the number of genes in a genome-wide association study. PCA
works by projecting data into a few principal component (PC) directions, without losing too
much information about the underlying data. It is sometimes recommended to use PCA for
multi-dimensional data and then use clustering for better, and efficient clustering of data.
In regards to supervised learning, it considers the subject’s outcomes together with their
features, and goes through a certain training process to determine the best outputs associated
with the inputs that are closest to the mean outcomes. Usually, the output formulations vary
and are dependent on the underlying method and problem being solved. For example, the
outcome can be the probability of getting a particular clinical event, the expected value of a
disease level, or the expected survival duration.
The application of AI especially in healthcare is well studied in literature 65 . For example,
the internet of medical things (IoMT) integrates healthcare devices, sensors and machine learn-
ing algorithms to provide new applications in healthcare 66 . Machines based on AI can add
support in healthcare by providing continuous automatic monitoring and alerting the health-
care provider or clinical practitioners through an alert system. Moreover, these devices can also
help in decision-making through DSS. One of the major advantages of this transformation is the
transition of tasks from a manual, hectic and time-consuming methodology to smart, automatic,
and time-efficient systems in healthcare. Additionally, these systems help clinical practitioners
to attend to patients in emergency cases by providing timely information. DNN has always
outperformed in healthcare by providing hybrid architectures and blended concepts like CNN,
to enable new healthcare solutions. Due to the variety of healthcare data including clinical data,
HAR data, it is difficult for humans to infer the data for decision making. Accordingly, ML
has been used in healthcare for better understanding of data and for better decision-making
process 67 . For example, Azimi et al. 68 proposed a CNN-based classifier architecture for a health
case study on an ECG classification. Tahmassebi et al. 69 proposed ML-based algorithms for
early prediction of pathological complete response (PCR) to neoadjuvant chemotherapy and
survival outcome of breast cancer patients using multiparametric magnetic resonance imag-
ing (mpMRI) data and eight different ML-based classifiers. In this regard, decision-making is
incorporated at the edge thereby sending notifications to the user in the case of disease detec-
tion. This gives timely information for decision-making at the initial stage of the healthcare
monitoring and improving the healthcare system.
In conclusion, we believe that AI has an important role to play in healthcare in the future. In
the form of machine learning, it plays a primary role in the development of precision medicine,
and healthcare solutions. Although early efforts at providing prognosis, diagnosis, and treat-
18
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
ment recommendations are challenging, it can be seen that AI will ultimately cope with these
challenges as well. Given the fast research advances in AI for imaging analysis, it can be seen
that most radiology and pathology images will be examined at some point by machines using
AI. As for now, automatic speech and text recognition systems are already employed for tasks
like patient communication and for clinical notes, and usage of such systems is continuously
increasing.
One of the greatest challenges to AI application in healthcare is to ensure their adoption in
daily clinical practice. For adoption to take place, AI systems must be approved by regulators,
and standardized to the extent that similar products work in a similar way. Such challenges
will be overcome ultimately, but they will take time for the technologies themselves to be
practical enough. As a result, we see limited use of AI in clinical practice for the coming
decade, but with the rapidly improving research can make use of such systems in real life soon.
For more interesting works about the application of artificial intelligence in DSS, we recommend
to read 70;71 .
19
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
and opportunely support people. Finally, DSS are jointly used with the AmI paradigm for
enhancing communications among health personnel such as doctors and nurses. For example, 80
introduced a DSS based on context-aware knowledge modeling aimed at facilitating the com-
munication and improving the capability to take decisions among clinical practitioners located
in different locations.
comparisons of probabilistic models, including the use of independent, local, and baseline test
datasets. Research and development of probabilistic algorithms must be vigilant to potential
dangers, including dataset shift, accidental fitting of confounders, unintended bias, the issues
of generalization to new datasets, and the unintended bad consequences of new algorithms on
health outcomes.
In summary, the key future perspectives about the implementation of probabilistic models
in healthcare are as follow.
1. The data should be regulated properly by clear regulatory policies. The proper mecha-
nism should be developed to ensure the security and privacy of data under general data
protection regulations. For example, techniques like federated learning 82 can address such
issues, but issues in federated learning like data heterogeneity, privacy leakages needed to
be addressed 83;84 .
2. Proper metrics should be developed to measure the risk and unintended harm to users by
probabilistic models for healthcare and clinical practice.
3. Proper interpretable guidance and mechanisms should be developed to understand the re-
sults of probabilistic models for healthcare and clinical practice. For example, explainable
artificial intelligence can be used to interpret the result of deep neural networks 51 .
6 Case Studies
Discriminative and generative models are widely used machine learning models for ECG clas-
sification in healthcare. For example, logistic regression, support vector machines, are popular
discriminative models and VAE and autoencoder are examples of generative models. In this
section, we provide a case study to explore the discriminative model’s graphical structure as
PGM, using logistic regression as an example for ECG classification. We also provide a case
study to explore the generative model’s graphical structure as PGM, using VAEs as an example
for ECG anomaly detection.
Figure 11 represents the probability distribution of the model when feature X is expanded.
We can see that each feature xi depends on all the previous features. This will have no effect as
the model simply treats X as given facts and it estimates P (Y |X). As mentioned earlier that
the model estimates the probability from the training data:
new data using the latent variables. It should be noted that in VAE, unlike the autoen-
coders, the distribution of Z is forced to be close to normal distribution as possible. With
VAE parametric distribution can be achieved. Hence, during the run time, we can construct
new samples from the normal distribution and feed them to the encoder function to generate
samples, as depicted in Figure 14. The main difference between traditional autoencoder and
VAE is that the former has no continuous latent space, while the latter has continuous latent
space (a sample is mapped to a probability distribution with a certain mean and variance).
Figure 13 depicts a comparison between the mapping of input data to latent space by an au-
toencoder and a VAE.
Figure 13: (a) mapping of an input to latent space by autoencoder, (b) mapping of an input to latent space
by VAE.
The main objective of this case study is to use the VAE to learn the latent representations
of the data. We will use the VAE to map the data to latent representation. Thereafter, we
will visualize features to see the model has generalized enough to learn the data clustering or
to differentiate the data as normal or not normal. Note that we do not use labels because VAE
are unsupervised machine learning approaches. To show the applicability, we will use the ECG
healthcare dataset for anomaly detection and visualize the features that the model has learned.
The reason for using VAE is to get rid of labeling the data, as labeling data can be a hectic
task. In this case study, we trained a convolutional VAE for ECG anomaly detection. We
trained the VAE on normal (a particular distribution) ECG signals so that, when not normal
(different distribution) ECG signals are fed into the VAE, the reconstruction loss is expected
23
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
to be higher than a certain threshold. The threshold is usually the reconstruction error of VAE
for normal data. If for certain data, the reconstruction error crosses the threshold, we see that
data point as not normal. In other words, via the reconstruction loss, we can keep track if an
ECG signal belongs to a particular distribution or not. The VAE has optimized over 2 losses,
the Kullback–Leibler (kl)-loss and reconstruction loss (the difference between the input ECG
and the reconstructed ECG). The kl-loss is the difference between the distribution of the latent
space and a standard Gaussian with mean zero and standard deviation one. In other words,
kl-loss is used the minimize the distance between the distribution between distinct classes yet
keep them separable. The kl loss between two distributions A and B can be calculated as the
negative sum of probability of each event in A, multiplied by the log probability of B over the
probability of the event in A, as given by Eq. 7.
B(x)
(7)
�
KL(A||B) = − A(x) log( )
x∈X
A(x)
Where, || is divergence. This is to compress the distribution of the latent space to the
standard distribution. This helps the decoder to map from every area of the latent space when
decoding the input ECG signal. Figure 14 shows a graphical representation of the VAE for new
sample generation.
Figure 14: VAE as a graphical model and its use to generate new samples.
We used the public baseline ECG dataset 86 to train and test our VAE for anomaly detection.
Figure 15 shows a scatter plot of the latent space generated by the encoder for the test dataset,
after training for 50 epochs, with stochastic gradient descent optimization. The color of each
point reflects its associated reconstruction error. In other words, it shows the marking of each
data point that has crossed the error threshold as an anomaly in dark violet color, and the
normal data point as yellow. We can clearly see one large cluster of points that seem quite on
the normal side (yellow dots), with a dark-colored cluster with a relatively high error term on
the sides. It should be noted that this VAE was just a toy example without any hyper-parameter
tuning. The performance can be enhanced by optimizing the hyper-parameters and adjusting
the layer structure of the VAE.
24
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
VAE are widely used for a variety of machine learning tasks. This case study was an example
to show VAE’s application. The applicability of VAE can be enhanced by using XAI. While
reconstructing a sample in anomaly detection methods such as class activation maps can be
utilized to tap the neurons which fire for a given input and by applying max-polling over
the activation maps we can generate a spatial saliency map which shows the regions of input
signal that contribute more tho the output signal. For a reconstruction which is marked as an
anomaly, the regions with high vales of gradient mapping will be the contributor. Hence, the
region which cause the anomalous behaviour can be trace out. The example shown in Figure 16
25
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
was introduced in 51 to trace back the regions which are responsible for predicting a particular
class. The regions in red are responsible for classification of the ECG signal in to the output class
(color weighted scale shows contribution of each region). Similar approaches can be adopted in
anomaly detection to trace and explain the regions which cause the anomalous behaviour.
7 Conclusions
In this chapter, we discussed different DSS based on probabilistic graphical methods and machine
learning. DSS were proven to be a useful tool. They help in the reduction of prescription
errors, and help in prognosis, with a higher capacity than the previously used methods. DSS
has been shown to help healthcare practitioners and providers and in a variety of decisions
making and diagnosis tasks, and as of now, they actively and efficiently support in providing
quality healthcare service. Moreover, they were proved to be useful in the standardization of
protocols, adjustments with a target, and warning systems. We noticed that both DSS based on
classical PGM and DSSs based on advanced machine learning methods are extensively used in
healthcare. However, PGM has fallen out a little due to the ubiquity of probabilistic methods
like neural networks. Nevertheless, we believe they still have the potential to be relevant in the
future, because of their explanatory and intuitive nature. They can be used for modeling casual
relationships and can be useful in learning the representation of abstract or high-level concepts.
As we saw in the chapter that combining neural networks with graphical models could be very
useful in the domain of machine learning, especially in healthcare DSS.
Meanwhile, we must take extra measures and precautions and careful analysis when creating,
implementing, and maintaining DSS. In this regard complete solutions will be required in prac-
tice, especially as DSS continue to evolve in complexity through advances in AI, interoperability,
explanations, and new sources of data.
26
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
References
[1] D. Aronsky and P.J. Haug. Diagnosing community-acquired pneumonia with a Bayesian
network. In: Proceedings of the AMIA Symposium, American Medical Informatics
Association, p. 632, 1998.
[2] G.A. Gorry and M.S. Scott Morton. A framework for management information systems, 1971.
[3] J. Horsky, J. Aarts, L. Verheul, D.L. Seger, H. van der Sijs and D.W. Bates. Clinical
reasoning in the context of active decision support during medication prescribing.
International Journal of Medical Informatics, 97: 1–11, 2017.
[5] D. Feinleib. Big Data Bootcamp: What Managers Need to know to Profit from the Big Data
Revolution. A Press, 2014.
[6] A. Sunyaev and D. Chornyi. Supporting chronic disease care quality: Design and
implementation of a health service and its integration with electronic health records. Journal
of Data and Information Quality (JDIQ), 3: 1–21, 2012.
[7] Howard S. Goldberg, Marilyn D. Paterno, Robert W. Grundmeier, Beatriz H. Rochaab, Jeffrey
M. Hoffman, Eric Tham, Marguerite Swietlik, Molly H. Schaeffer, Deepika Pabbathi, Sara J.
Deakyne, Nathan Kuppermann and Peter S. Dayan. Use of a remote clinical decision support
service for a multicenter trial to implement prediction rules for children with minor blunt
head trauma. International Journal of Medical Informatics, 87: 101–110, 2016.
[8] D. Koller and N. Friedman. Probabilistic Graphical Models: Principles and Techniques. MIT
Press, 2009.
[9] J. Wyatt and J. Liu. Basic concepts in medical informatics. Journal of Epidemiology &
Community Health, 56: 808–812, 2002.
[10] S.B. Clauser, E.H. Wagner, E.J.A. Bowles, L. Tuzzio and S.M. Greene. Improving modern
cancer care through information technology. American Journal of Preventive Medicine,
40: S198–S207, 2011.
[11] Jason Chiang, John Furler, Douglas Boyle, Malcolm Clark and Jo-Anne Manski-Nankervis.
Electronic clinical decision support tool for the evaluation of cardiovascular risk in general
practice: A pilot study. Australian Family Physician, 46: 764, 2017.
[12] P.A. Williams, R.D. Furberg, J.E. Bagwell and K.A. LaBresh. Usability testing and adaptation
of the pediatric cardiovascular risk reduction clinical decision support tool. JMIR Human
Factors, 3: e17, 2016.
27
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
[13] A.A. Montgomery, T. Fahey, T.J. Peters, C. MacIntosh and D.J. Sharp. Evaluation of computer
based clinical decision support system and risk chart for management of hypertension in
primary care: Randomised controlled trial. Bmj, 320: 686–690, 2000.
[14] R.T. Sutton, D. Pincock, D.C. Baumgart, D.C. Sadowski, R.N. Fedorak and K.I. Kroeker. An
overview of clinical decision support systems: Benefits, risks, and strategies for success.
NPJ Digital Medicine, 3: 1–10, 2020.
[15] E.S. Berner and T.J. La Lande. Overview of clinical decision support systems. In: Clinical
Decision Support Systems, Springer, pp. 3–22, 2007.
[16] S. Belciug and F. Gorunescu. Intelligent Decision Support Systems—A Journey to Smarter
Healthcare, Springer, 2020.
[17] C. Schaarup, L.B. Pape-Haugaard and O.K. Hejlesen. Models used in clinical decision support
systems supporting healthcare professionals treating chronic wounds: Systematic literature
review. JMIR Diabetes, 3: e8316, 2018.
[20] B. Thanathornwong. Bayesian-based decision support system for assessing the needs for
orthodontic treatment. Healthcare Informatics Research, 24: 22, 2018.
[21] G.R. Cross and A.K. Jain. Markov random field texture models. IEEE Transactions on Pattern
Analysis and Machine Intelligence, 25–39, 1983.
[22] C. Wang, N. Komodakis and N. Paragios. Markov random field modeling, inference &
learning in computer vision & image understanding: A survey. Computer Vision and Image
Understanding, 117: 1610–1627, 2013.
[24] Depeursinge Adrien, Vargas Alejandro, Gaillard Frédéric, Platon Alexandra, Geissbuhler
Antoine, Poletti Pierre-Alexandre and Müller Henning. Case-based lung image categorization
and retrieval for interstitial lung diseases: Clinical workflows. International Journal of
Computer Assisted Radiology and Surgery, 7(1): 97–110, 2012.
[25] G.O. Barnett, J.J. Cimino, J.A. Hupp and E.P. Hoffer. DXplain: An evolving diagnostic
decision-support system. Jama, 258: 67–74, 1987.
28
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
[27] A. Alsiddiky, W. Awwad, K. Bakarman, H. Fouad and N.M. Mahmoud. Magnetic resonance
imaging evaluation of vertebral tumor prediction using hierarchical hidden markov random
field model on internet of medical things (iomt) platform. Measurement, 159: 107772, 2020.
[28] A.B. Ashraf, S.C. Gavenonis, D. Daye, C. Mies, M.A. Rosen and D. Kontos. A multichannel
markov random field framework for tumor segmentation with an application to classification
of gene expression-based breast cancer recurrence risk. IEEE Transactions on Medical
Imaging, 32: 637–648, 2012.
[29] A. Bousse, S. Pedemonte, B.A. Thomas, K. Erlandsson, S. Ourselin, S. Arridge and B.F.
Hutton. Markov random field and gaussian mixture for segmented mri-based partial volume
correction in pet. Physics in Medicine & Biology, 57: 6681, 2012.
[31] C. Zhao, J. Jiang, Y. Guan, X. Guo and B. He. Emr-based medical knowledge representation
and inference via markov random fields and distributed representation learning. Artificial
Intelligence in Medicine, 87: 49–59, 2018.
[32] F.J. Costello, C. Kim, C.M. Kang and K.C. Lee. Identifying high-risk factors of depression in
middle-aged persons with a novel sons and spouses bayesian network model 8: 562, 2020.
[33] P. Gupta, B. Bhowmick and A. Pal. Mombat: Heart rate monitoring from face video using
pulse modeling and bayesian tracking. Computers in Biology and Medicine, 121: 103813,
2020.
[34] Z. Zhou, H. Yu and H. Shi. Human activity recognition based on improved bayesian
convolution network to analyze health care data using wearable IOT device. IEEE Access,
8: 86411–86418, 2020.
[35] O.I. Abiodun, A. Jantan, A.E. Omolara, K.V. Dada, N.A. Mohamed and H. Arshad. State-of
the-art in artificial neural network applications: A survey, Heliyon, 4: e00938, 2018.
[36] W. Liu, Z. Wang, X. Liu, N. Zeng, Y. Liu and F.E. Alsaadi. A survey of deep neural network
architectures and their applications. Neurocomputing, 234: 11–26, 2017.
[37] A. Qayyum, J. Qadir, M. Bilal and A. Al-Fuqaha. Secure and robust machine learning for
healthcare: A survey. IEEE Reviews in Biomedical Engineering, 14: 156–180, 2021.
[38] I. Goodfellow, Y. Bengio, A. Courville and Y. Bengio. Deep Learning, volume 1. MIT Press
Cambridge, 2016.
29
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
[39] P. Kharat and S. Dudul. Clinical decision support system based on jordan/elman neural
networks. In: 2011 IEEE Recent Advances in Intelligent Computational Systems. IEEE,
pp. 255–259, 2011.
[40] M. Gudadhe, K. Wankhade and S. Dongre. Decision support system for heart disease based
on support vector machine and artificial neural network. In: 2010 International Conference
on Computer and Communication Technology (ICCCT). IEEE, pp. 741–745.
[41] R. Janghel, A. Shukla, R. Tiwari and P. Tiwari. Clinical decision support system for fetal
delivery using artificial neural network. In: 2009 International Conference on New Trends
in Information and Service Science. IEEE, pp. 1070–1075, 2009.
[42] K. Rajalakshmi, S.C. Mohan and S.D. Babu. Decision support system in healthcare industry.
International Journal of Computer Applications, 26: 42–44, 2011.
[43] M. Luque Gallego. Probabilistic graphical models for decision making in medicine, 2009.
[44] O. Karan, C. Bayraktar, H. Gümüşkaya and B. Karlık. Diagnosing diabetes using neural
networks on small mobile devices. Expert Systems with Applications, 39: 54–60, 2012.
[45] H. Suresh, N. Hunt, A. Johnson, L.A. Celi, P. Szolovits and M. Ghassemi. Clinical intervention
prediction and understanding with deep neural networks, 322–337, 2017.
[47] R.D. Labati, E. Muñoz, V. Piuri, R. Sassi and F. Scotti. Deep-ecg: Convolutional neural
networks for ecg biometric recognition. Pattern Recognition Letters, 126: 78–85, 2019.
[48] K. Antczak. Deep recurrent neural networks for ecg signal denoising. arXiv preprint
arXiv:1807.11551, 2018.
[49] A. Elola, E. Aramendi, U. Irusta, A. Picón, E. Alonso, P. Owens and A. Idris. Deep neural
networks for ecg-based pulse detection during out-of-hospital cardiac arrest. Entropy,
21: 305, 2019.
[50] Y. Wu, F. Yang, Y. Liu, X. Zha and S. Yuan. A comparison of 1-d and 2-d deep convolutional
neural networks in ecg classification. arXiv preprint arXiv:1810.07088, 2018.
[51] A. Raza, K.P. Tran, L. Koehl and S. Li. Designing ecg monitoring healthcare system with
federated transfer learning and explainable AI. arXiv preprint arXiv:2105.12497, 2021.
[52] F.K. Došilović, M. Brčić and N. Hlupić. Explainable artificial intelligence: A survey. In: 2018
41st International Convention on Information and Communication Technology, Electronics
and Microelectronics (MIPRO), IEEE, pp. 0210–0215, 2018.
30
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
[53] M.J. Johnson, D. Duvenaud, A.B. Wiltschko, S.R. Datta and R.P. Adams. Structured vaes:
Composing probabilistic graphical models and variational autoencoders. arXiv preprint
arXiv:1603.06277 2 (2016), 2016.
[54] F. Le, M. Srivatsa, K.K. Reddy and K. Roy. Using graphical models as explanations in deep
neural networks, 283–289, 2019.
[55] T. Fernando, H. Gammulle, S. Denman, S. Sridharan and C. Fookes. Deep learning for
medical anomaly detection—A survey. arXiv preprint arXiv:2012.02364, 2020.
[56] Truong Thu Huong, Ta Phuong Bac, Dao Minh Long, Tran Duc Luong, Nguyen Minh Dan,
Le Anh Quang, Le Thanh Cong, Bui Doan Thang and Kim Phuc Tran. Detecting cyberattacks
using anomaly detection in industrial control systems: A federated learning approach.
Computers in Industry, 132: 103509, 2021.
[57] R.B. Zebadúa. Human body pose tracking based on spatio-temporal joints dependency
learning, 2018.
[58] X. Chu, W. Yang, W. Ouyang, C. Ma, A.L. Yuille and X. Wang. Multi-context attention for
human pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition, pp. 1831–1840, 2017.
[59] S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, D. Du, C. Huang and P.H.
Torr. Conditional random fields as recurrent neural networks. In: Proceedings of the IEEE
International Conference on Computer Vision, pp. 1529–1537, 2017.
[61] R. Shah and A. Chircu. Iot and AI in healthcare: A systematic literature review. Issues in
Information Systems, 19, 2018.
[62] S.E. Dilsizian and E.L. Siegel. Artificial intelligence in medicine and cardiac imaging:
Harnessing big data and advanced computing to provide personalized medical diagnosis and
treatment. Current Cardiology Reports, 16: 441, 2014.
[63] V.L. Patel, E.H. Shortliffe, M. Stefanelli, P. Szolovits, M.R. Berthold, R. Bellazzi and A.
Abu-Hanna. The coming of age of artificial intelligence in medicine. Artificial Intelligence
in Medicine, 46: 5–17, 2009.
[64] S. Jha and E.J. Topol. Adapting to artificial intelligence: Radiologists and pathologists as
information specialists. Jama, 316: 2353–2354, 2016.
[65] A. Qayyum, J. Qadir, M. Bilal and A. Al-Fuqaha. Secure and robust machine learning for
healthcare: A survey. arXiv preprint arXiv:2001.08103, 2020.
[66] S. Durga, R. Nag and E. Daniel Survey on machine learning and deep learning algorithms used
in internet of things (iot) healthcare. In: 2019 3rd International Conference on Computing
Methodologies and Communication (ICCMC), pp. 1018–1022, 2019.
31
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
[67] K.R. Bisaso, G.T. Anguzu, S.A. Karungi, A. Kiragga and B. Castelnuovo. A survey of machine
learning applications in hiv clinical research and care. Computers in Biology and Medicine,
91: 366–371, 2017.
[68] I. Azimi, J. Takalo-Mattila, A. Anzanpour, A.M. Rahmani, J.-P. Soininen and P. Liljeberg.
Empowering healthcare iot systems with hierarchical edge-based deep learning. In:
Proceedings of the 2018 IEEE/ACM International Conference on Connected Health:
Applications, Systems and Engineering Technologies, pp. 63–68, 2018.
[70] S. Montani and M. Striani. Artificial intelligence in clinical decision support: A focused
literature survey. Yearbook of Medical Informatics, 28: 120–127, 2019.
[71] M. Fernandes, S.M. Vieira, F. Leite, C. Palos, S. Finkelstein and J.M. Sousa. Clinical decision
support systems for triage in the emergency department using intelligent systems: A review.
Artificial Intelligence in Medicine, 102: 101762, 2020.
[72] E. Turban. Implementing decision support systems: A survey. In: 1996 IEEE International
Conference on Systems, Man and Cybernetics. Information Intelligence and Systems (Cat.
No. 96CH35929), volume 4, IEEE, pp. 2540–2545, 1996.
[73] S.B. Eom, S.M. Lee, E. Kim and C. Somarajan. A survey of decision support system
applications (1988–1994). Journal of the Operational Research Society, 49: 109–120, 1998.
[74] S. Eom and E. Kim. A survey of decision support system applications (1995–2001). Journal
of the Operational Research Society, 57: 1264–1278, 2006.
[75] M. Omichi, Y. Maki, T. Ohta, Y. Sekita and S. Fujisaku. A decision support system for
regional health care planning in a metropolitan area. Japan-hospitals: The Journal of the
Japan Hospital Association, 3: 19–23, 1984.
[76] G. Acampora, D.J. Cook, P. Rashidi and A.V. Vasilakos. A survey on ambient intelligence in
healthcare. Proceedings of the IEEE, 101: 2470–2494, 2013.
[77] R. Snyder-Halpern. Assessing health care setting readiness for point of care computerized
clinical decision support system innovations. Outcomes Management for Nursing Practice,
3: 118–127, 1999.
[78] Mohammad Raza Perwez, Naveed Ahmad, Muhammad Sajid Javaid and Muhammad Ehsan
ul Haq. A critical analysis on efficacy of clinical decision support systems in health care
32
Decision Support Systems for Healthcare based on Probabilistic Graphical Models
domain. In: Advanced Materials Research, volume 383, Trans. Tech. Publ., pp. 4043–4050,
2012.
[79] M.C. Kaptein, P. Markopoulos, B. De Ruyter and E. Aarts. Persuasion in ambient intelligence.
Journal of Ambient Intelligence and Humanized Computing, 1: 43–56, 2010.
[80] O. Anya, H. Tawfik, S. Amin, A. Nagar and K. Shaalan. Context-aware knowledge modelling
for decision support in e-health. In: The 2010 International Joint Conference on Neural
Networks (IJCNN), pp. 1–7, 2010.
[81] J. Graham. Artificial intelligence, machine learning, and the fda, 2016.
[82] M. Aledhari, R. Razzak, R.M. Parizi and F. Saeed. Federated learning: A survey on enabling
technologies, protocols, and applications. IEEE Access, 8: 140699–140725, 2020.
[83] J.H. Yoo, H. Jeong, J. Lee and T.-M. Chung. Federated learning: Issues in medical application,
2021.
[84] N. Papernot, P. McDaniel, A. Sinha and M. Wellman. Towards the science of security and
privacy in machine learning. arXiv preprint arXiv:1611.03814, 2016.
[85] Y. Zhan, J. Zhang, Z. Hong, L. Wu, P. Li and S. Guo. A survey of incentive mechanism
design for federated learning. IEEE Transactions on Emerging Topics in Computing, 2021.
[86] G.B. Moody and R.G. Mark. The impact of the mit-bih arrhythmia database. IEEE Engineering
in Medicine and Biology Magazine, 20: 45–50, 2001.
33
Chapter 3
1 Introduction
With the growth of the volume of data collected in manufacturing, Big Data offers a tremendous
opportunity in the transformation of today’s manufacturing paradigm to smart manufacturing
(SM) and helps the scientists and engineers have Artificial Intelligence (AI)-driven Industrial
Internet of Things (IIoT) solutions working in real-time and being more accurate and efficient 1 .
The development of AI algorithms and Big Data analytic helps to transfer a human experience
to technological developments, providing ways for IIoT solutions to maximize value creation
across asset and operations life-cycles while improving profitability. Machine Learning (ML)
and Deep Learning (DL) are AI subsets that are used to assess produced data and provide useful
information about the manufacturing industry 2 . Therefore, ML and DL play important roles in
the development of SM with various applications especially anomaly detection in 3 , machine fault
diagnosis 4 , intrusion detection 5 , production monitoring 6 . Furthermore, to react and adapt to
the constantly changing industrial environment, manufacturers nowadays need a support system
that allows them analyze information to come up with reasonable decisions. Decision support
systems (DSS) in this scenario integrate human talents with computer capabilities to offer
effective data administration, reporting, analytics, modeling, and planning. By integrating these
advanced technologies in manufacturing, the factories enable it to optimize the performance,
quality, control, and transparency of the manufacturing process. While there is a large amount of
research related to these technologies in the context of SM, there are still challenges in applying
them in a practical setting 7;8;9 .
1 International Research Institute for Artificial Intelligence and Data Science, Dong A University, Danang, Vietnam.
2
Univ. Lille, ENSAIT, ULR 2461 - GEMTEX - Génie et Matériaux Textiles, F-59000 Lille, France.
3 HEC Management School, University of Liège, Liège, Belgium.
The first problem is that, to have a flexible and scalable IIoT platform, the central computing
of all data collected in manufacturing is the way for growth and innovation. Thus, cloud
computing is important to empower the workloads and applications, reduce costs, and increase
release velocity, and agility. With the recent advancements in cloud computing, enterprises can
have large storage and enough computing capacity to process collected data from IIoT devices.
Traditionally, IIoT is only used for data collection and offloads computationally intensive tasks
to cloud servers. Yet, offloading computationally intensive tasks to a cloud centre may result
in a delay, due to the time needed to transmit, process, and receive a large amount of data.
This is a serious drawback in a smart factory that must perform massive analysis in real-time.
To overcome this limitation, the concept of edge computing can be used in a smart factory 10 .
Edge computing is the technology that makes it possible to quickly perform the necessary
computational task in the network edge, i.e., between data producers and the cloud centre.
The workload that is concentrated in the central cloud can be reduced. Moreover, in some
situations, traditional ML approaches require combining data at one location which not only
prevents systems from dealing with vastly distributed data and training models on the edge
nodes but also presses communication systems in factories. In that case, Federated Learning
(FL) approach can represent a good solution. In 11 , the authors mentioned that operating an
FL system can decrease the amount of bandwidth used in transferring data between the edge
and the cloud by 35%. Integrating FL with other learning frameworks is a potential method
that can solve existing problems in SM. Therefore, this chapter is the first to show a survey on
ML and DL techniques used for anomaly detection in some aspects of SM. Then, difficulties
and challenges in applying them in a real context are discussed. Finally, our perspectives for
solving these problems are recommended.
The chapter is organised as follows: In Section 2, we briefly describe the concept of DSS.
The IIoT-based background and techniques for anomaly detection in SM are presented in
Section 3. Some difficulties and challenges also mentioned in Section 4. In Section 5, we
propose an alternative approach for the decentralised system. The case studies are given in
Section 6. Section 7 provides some concluding remarks.
35
Machine Learning and Probabilistic Graphical Models for Decision Support Systems
and adapt technologies tools through simulation and information technology. In some recent
researches, a DSS includes all of the people who are in the organisation and the hardware,
software, models and data 15 . According to 16 , DSS is described as an interactive computer-
based system or subsystem designed to help decision-makers use communication technology,
data, documents, knowledge, and/or models to identify and solve problems, complete tasks in
the decision-making process, and make decisions. DSS are divided into five categories depending
on their major sources of information 17 :
1. Communication-Driven DSS is a type that uses communication technologies and net-
works to facilitate the communication and collaboration of decision-relevant. It empha-
sizes communication, collaboration, and shared decision-making support using technol-
ogy. The most fundamental level of functionality is a simple bulletin or threaded e-mail.
Communications-driven DSS allows two or more individuals to interact, share and coordi-
nate their activities.
2. Data-Driven DSS emphasize access to and manipulation of a time series of internal
company data and in some systems real-time and external data. Business intelligence
systems for operational or strategic use are most often data-driven.
3. Document-Driven DSS integrate a variety of storage and processing technologies to
provide complete document retrieval and analysis. A search engine is a powerful decision-
aiding tool associated with a document-driven DSS.
4. Model-Driven DSS emphasize access to and manipulation of a quantitative model (e.g.,
an algebraic, financial, optimization, or simulation model). Model-driven DSS use data
and parameters provided by decision makers to aid them in analyzing a situation, but they
are not usually data-intensive.
5. Knowledge-Driven DSS suggest or recommend actions to managers. These DSS are
person–computer systems with specialized problem-solving expertise. A knowledge-driven
DSS uses AI and statistical technologies. Knowledge storage and processing technologies
are the dominant component in the architecture of these systems.
As mentioned above, DSS plays an important role in SM. Promoting effective strategies to
support decision-making can not only be through a comprehensive understanding of the dynam-
ics involved, but also the possibility of enriching these processes with the valuable information
collected at different levels. The last point represents a key aspect of effective decision-making,
especially when we consider decentralization. In addition, it is necessary to analyze the amount
of information collected to avoid the overloading of decision variables and the increasing com-
plexity of the decision-making process. Another problem is how to integrate different tasks (that
is, decisions at different levels) that need to be made so that the existing system works more
efficiently. There are 2 common methods to overcome this challenge: centralized or decentral-
ized.
1. In the centralized decision-making process, a single, decision center is acquainted with all
the system information. The central node is responsible for system planning and has the
ability to manage the operations performed by all nodes on the network. The central node
makes decisions with the aim to optimize the objectives of the entire network 18 .
36
Decision Support Systems for Anomaly Detection with the Applications in Smart Manufacturing
Review at Khartoum.
Soudan Government Steamer "Dal."
About a hundred miles from Gondokoro the White Nile enters and
spills itself in a vast and appalling swamp. Of the action of this
tremendous sponge, whether beneficial in regulating the flow, or
harmful in wasting the water through evaporation, nothing need
here be said. But its aspect is at once so dismal and so terrifying
that to travel through it is a weird experience. Our steamer, with the
favouring current, made at least seven miles an hour, and, as the
moon was full, we travelled night and day. For three days and three
nights we were continuously in this horrible swamp into which the
whole of the United Kingdom could be easily packed. By day from
the roof of the high pilot-house a commanding view revealed hour
after hour, in every direction, one uninterrupted ocean of floating
vegetation spreading to far horizons. The papyrus-plant is in itself a
beautiful, graceful, and venerable thing. To travel through the sudd,
is to hate it for evermore. Rising fifteen feet above the level of the
water, stretching its roots twenty or even thirty feet below, and so
matted and tangled together that elephants can walk safely upon its
springy surface, papyrus is the beginning and end of this melancholy
world. For hundreds of miles nothing else is to be perceived—not a
mountain-ridge blue on the horizon, scarcely a tree, no habitation of
man, no sign of beast. The silence is broken only by the croaking of
innumerable frog armies, and the cry of dreary birds.
The vigorous operations of the sudd-cutters have opened, and the
constant traffic of steamers has preserved and improved, a channel
about a hundred yards wide, winding by loops and corkscrews
through the swamp. The river presents a depth of thirty feet along
this course, and greater vessels could thread its length for nearly a
thousand miles. The navigation is intricate and peculiar. Indeed, it
would seem to be an art by itself. No effort is made by the Arab
pilots, who alone are employed, to avoid collisions with the banks.
On the contrary, they rely upon them as an essential feature of their
management of the steamer. The vessel bumps regularly at almost
every corner from one cushion of sudd to the other, or plunges its
nose into the reeds and waits for the currents to carry its stern
round, bumps again and recovers its direction. Sometimes where the
twists were very sharp we would turn completely round, not once
but two or three times, and our movements round an S-curve were
even more complicated. The bumps occasionally swept us out of our
chairs and sent us sprawling on the deck. In this strange fashion we
waltzed along at full speed for about seventy or eighty hours.
Meanwhile the Nile was accomplishing its destiny. Its vast tributary
rivers, the Sobat and the Bahr-el-Ghazal, came to reinforce its flow.
The miles spread out behind us in a long succession of hundreds. At
length the sudd expanses begin to contract. Distant mountains rise
against the steel-blue sky in serrated silhouette, and gradually draw
in upon the river. Islands of earth and trees, peaks of sharp rock
break here and there the awful monotony of waving reeds. At last
the banks become firm and clear-cut walls of yellow sand, fringed in
places with palms and shady trees, and everywhere bristling with
undergrowth of thorns. We leave the wilderness of moisture, we
approach the wilderness of drought. But first, in a middle region,
vast areas of dusty scrub-covered plains, not wholly incapable of
cultivation in the rainy season, supporting always flocks and herds,
now flank both sides of the river. The camel caravans pad slowly
across them under the blaze and glitter of the heat. The mirage
begins to twist and blur the landscape with deceptive waters. At
intervals of forty or fifty miles are the stations of the Soudan
Government, each trim and regular with its public buildings, its
storehouses, the lines of beehive huts of its garrison, a tangle of
native sailing-craft, and always, or nearly always, one or two white
gunboats of war-time days now turned policemen of the river.
Thus we reach in time Fashoda—now called Kodok for old sake's
sake; and here are clusters of Shillooks who (by request) stand
pensively on one leg in their natural attitude, and smart companies
of Soudanese troops and British officers, civil and military—the
whole clear-cut under sun-blaze dry light, veiled only in dancing
dust-devils piteously whipped by strong hot winds. All this was like a
piece of the Omdurman campaign to me—the old familiar Soudan,
so often made known to British minds by pen, pencil, and
photograph during nearly twenty years of war, unfolded itself feature
by feature. Yet we were still five hundred miles south of Khartoum!
A SHELUK AT KODOK (FASHODA).
At Meshra-er-Zeraf we stopped for two days to shoot, by the
Sirdar's invitation, in the extensive game reserve, and were fortunate
in securing a buffalo and various antelope. We wandered through a
harsh country, of white sand and tussocks of coarse grass, more
grey than green, with leafless black thorn-trees densely tangled; yet
it seemed full of game. In three hours' walk on the second morning I
shot a fine waterbuck, two reed-bucks, and two of a beautiful herd
of roan antelope, who walked slowly down to water past our
ambuscade. And, be it remembered, that the pleasure and
excitement of such sport are in these lands always heightened by
the possibility that at any moment the hunters may come upon
game of much more serious quality—lion or buffalo; so that no one
cares to be more than a few yards from his heavy rifle or give his
mind wholly to the buck he stalks. Surely they are perverse,
unenterprising folk who spend fortunes each year in preserving with
so much artificial care, and to the inconvenience of other dwellers in
a small island, well-counted herds of more or less tame deer, when
in a month, and for less expense than the year's rent of their forests,
they could pursue wild animals of every kind in their natural haunts
and gain experiences that would last them all their lives.
I was so much elated by this jolly morning's sport and the near
approach of civilized conditions—for after all, contrast is an element
in pleasure—that I permitted myself to rejoice at the safe and happy
outcome of this long journey, and to exult in our complete immunity
from serious accident or illness or even fever. How extravagant were
the accounts of the dangers of African travel! How easy to avoid the
evil chances of the road! Reasonable precautions, steady exercise,
regular quinine—were these not in themselves the guarantees of
safety? Thus I reckoned, and with specious reasons, but in a bad
hour. We were not yet at our journey's end.
Twenty-four hours' steaming from Meshra-er-Zeraf brought us
near Khartoum. The character of the country was unchanged. Yellow
sand-slopes drank at the Nile brim; thorn-scrub fringed the river on
either side; but date-palms mingled even more frequently and
numerously with the vegetation, and brown mud-built villages with
brown mud-coloured populations multiplied as the miles slipped
swiftly by. At length a solitary majestic tree, beneath whose spacious
branches and luxuriant foliage a hundred persons might have found
shelter from the relentless sun—Gordon's tree—advertised us of the
proximity of Khartoum. Soon on the one bank came into view the
vast mud labyrinth of Omdurman, with forests of masts rising along
the shore, and on the other, among palm-groves ever clustering
thicker, sprang the blue and pink and crimson minarets of new
Khartoum. Khartoum—the new Khartoum, risen from its ruins in
wealth and beauty—a smiling city sitting like a queen throned at the
confluence of the Niles, the heart and centre of a far-reaching and
formidable authority, disclosed herself to the traveller's eye. Sharp to
the right turns the steamer, leaving the dull placid waters of the
sovereign river we have so long followed, and shouldering a more
turbulent current of clearer water, swings up-stream along its noble
feudatory, the Blue Nile. And passing by the side of high stone
embankments crowned by palms, the steamer enters into a modern
Oriental port and city, and is soon surrounded by its palaces, its
mosques, its warehouses and its quays.
Transcriber's Note:
Obvious typographical errors have been corrected.
Inconsistent spelling and hyphenation in the original
document have been preserved.
Welcome to our website – the perfect destination for book lovers and
knowledge seekers. We believe that every book holds a new world,
offering opportunities for learning, discovery, and personal growth.
That’s why we are dedicated to bringing you a diverse collection of
books, ranging from classic literature and specialized publications to
self-development guides and children's books.
ebookbell.com