0% found this document useful (0 votes)
8 views11 pages

Bayesian Network-Based Knowledge Graph Inference For Highway Transportation Safety Risks

Uploaded by

paulstavio
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views11 pages

Bayesian Network-Based Knowledge Graph Inference For Highway Transportation Safety Risks

Uploaded by

paulstavio
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

Hindawi

Advances in Civil Engineering


Volume 2021, Article ID 6624579, 11 pages
https://doi.org/10.1155/2021/6624579

Research Article
Bayesian Network-Based Knowledge Graph Inference for Highway
Transportation Safety Risks

Luo Wenhui , Cai Fengtian, Wu Chuna, and Meng Xingkai


Research Institute of Highway Ministry of Transport, Beijing 100088, China

Correspondence should be addressed to Luo Wenhui; [email protected]

Received 16 October 2020; Revised 20 January 2021; Accepted 18 February 2021; Published 5 March 2021

Academic Editor: Songtao Lv

Copyright © 2021 Luo Wenhui et al. ,is is an open access article distributed under the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Accurate inference of knowledge about highway transportation safety risks forms a crucial aspect of building a knowledge graph.
Based on the data related to highway transportation accidents, this study has developed a Bayesian network model. ,e initial
identification of the network nodes is through expert scoring. ,e network structure is then constructed by utilizing the prior
expert knowledge and K2 greedy search algorithm. Later, the network parameters are trained via the expectation-maximization
(EM) algorithm. Finally, knowledge about highway transportation safety risks is inferred using the junction tree algorithm. A
comparison is made between the trained conditional and actual probabilities during the network parameter training to verify the
validity of the proposed model that accords with expert experience, thereby proving the model validity. Further, its main “causal
chain” is inferred to be an improper emergency response-human failure-accident occurrence, where the probability of driver
failure is 82%, and the probability of accident occurrence is 68% by taking “a certain road traffic accident” as an example. ,ere is
consistency between the inference results and the actual accident sequence that suggests the effectiveness of the proposed
knowledge inference method.

1. Introduction highway transportation safety risk management is to predict


the risks faced in transportation in advance that can be
Knowledge graphs that contain abundant knowledge from understood as a risk inference problem in knowledge graphs.
various areas have become a hot topic across various re- Extensive research achievements concerning knowledge
search fields, with the rapid explosion of network infor- inference have been made to date:
mation. As a reliable source of all-round information and
underlying support, they have performed remarkably in the (1) Inferences based on logical rules: ,e logical rule-
safety risk management realms, such as medicine and fi- based reasoning process divides knowledge into rules
nance [1, 2]. An important area of mass casualty monitoring and facts. ,ere are forward, backward, and bidi-
has always been highway transportation. ,e top priority of rectional reasoning approaches to inference
safe highway transportation is safety risk management, and, methods. ,e Markov logic network (MLN), where
for this, building a complete knowledge graph of highway the common inference tasks include the maximum
transportation safety risk management is necessary for posterior probability estimation and the conditional
achieving risk control. Crucial to the completion and ex- probability reasoning, is a particularly fruitful model.
pansion of knowledge graphs, the knowledge inference Chen et al. [3] applied the MLN to risk management
technique—among key techniques of building knowledge in response to information security issues. First, they
graphs, such as knowledge extraction, fusion, and infer- constructed an infrastructure dependency network
ence—deduces new knowledge or identifies errors in a by the expert method that was then converted into an
knowledge graph by using certain methods based on the MLN for the calculation of node weights via a
existing knowledge in the graph. ,e ultimate aim for learning algorithm. ,e proposed model’s
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
2 Advances in Civil Engineering

innovativeness is in the introduction of new network executed by introducing the “July 16 Oil Depot
threat factors and the calculation of their possible Explosion and Fire” that yielded correct inference
impacts on the entire system. Yang et al. [4] pro- results. Seyed Hassani et al. [9] developed a Bayesian
posed an MLN-based method for joint sentiment network related to inference on knowledge graphs by
analysis of sentences to address the inadequate taking into consideration some hidden or ignored
utilization of contextual information with the information in complex social networks. ,ey
existing knowledge inference methods, as well as the identified the nodes of the Bayesian network, such as
cross-domain connection problem of sentence in- comments, avatar information, ensemble photos, or
formation, in the context of knowledge graph in- interactive information, and using the collected data,
ference. ,ey found through experimentation that trained its parameters. ,e effectiveness of their
MLN-based knowledge inference achieved rather algorithm was tested on Facebook that found the
desirable results. Liu et al. [5] put forward an en- model is highly accurate in finding information
semble learning-based MLN model, as well as its between users. Rajabi and Ataie-Ashtiani [10] pro-
learning algorithm, in response to the difficulty of posed a fuzzy Bayesian inference method, in terms of
MLN in inferring large-scale data, respecting the model modification and improvement, to address
modification and improvement of MLN models. the unavailability of parameter training data in
,ey conducted a knowledge extraction experiment conventional Bayesian inference. ,e method fused
in Google’s large-scale corpus using this method. As the information provided by experts with the
experiments proved, their method had higher pre- Bayesian network model. ,ey also developed an
cision and recall than the pipelining approach. algorithm for solving the model. Computational
(2) Bayesian network-based inference models: Bayesian results were compared with the Markov Chain
inference is a process of calculating posterior Monte Carlo- (MCMC-) based algorithm that
probability, based on conditional and prior proba- proved the effectiveness of their model and
bilities. Relating to risk management, Lu et al. [6] algorithm.
proposed a Bayesian network-based model for flood (3) Machine learning-based models: Xie et al. [11]
risk inference aiming at the problem of flood risk proposed a model that targeted the knowledge
misreporting. Here, expert scoring identified the completion issue with knowledge graphs based on
Bayesian network, and parameter learning was bag-of-words and convolutional neural network,
performed with the Monte Carlo model. ,ese where the bag-of-words model was used for the
helped in analyzing the changes in flood risk under vector representation of texts, and the convolutional
one- and two-factor uncertainties. ,e Bayesian neural network was responsible for classifying and
network built by them allowed bidirectional rea- inferring word relationships. ,ey confirmed the
soning and probability distribution inference of validity of their model through experimentation. A
arbitrary nodes. Experiments proved that their novel convolution-based model was proposed by
model was practically applicable to the risk assess- Annervaz et al. [12] that extracted relevant prior
ment and control in reservoirs. Based on a qualitative knowledge from the graphs via an attention mech-
weighted Bayesian network, Yin et al. [7] put forward anism. Experiments on public datasets demonstrated
a human factor inference model for coalmine acci- that their method was effective in enhancing the
dents in order to address the analytical deficiency of performance of deep learning models. Godin et al.
human factors in coal mine accident analysis. ,e [13] developed a ternary reward framework to cope
Bayesian network, based on typical accident cases, with the incorrect answering problem in the existing
was constructed using the Human Factors Analysis reinforcement learning-based question answering
and Classification System (HFACS) model, and the systems that established a new evaluation criterion
weight magnitudes between network nodes were by setting different rewards for wrong answers and
obtained by the expert method. ,e mutual logical no answers, thereby achieving better evaluation of
relationships between the human factors were re- model effectiveness. All of the above—completion,
flected rather accurately by their model. With a view fast retrieval, and answering problems related to
to combing the evolution process of such events, Xia knowledge graphs—can be understood as the sce-
et al. [8] proposed a dynamic Bayesian network- nario applications of knowledge inference.
based unconventional scenario inference model for (4) Hybrid models: Jiang et al. [14] developed a method
sudden disaster events. ,ey constructed a dynamic for representing knowledge based on weighted
Bayesian network with nodes such as the scenario knowledge graphs that was then combined with the
condition, handling goals, handling measures, and probabilistic graphical model to establish a medical
intrinsic variables. ,e prior probabilities were diagnosis knowledge network. A path sorting algo-
designated for root nodes, while for variables rithm-based random walk model was proposed by
without parent nodes, the expert method was Liu et al. [15] that performed knowledge inference on
employed to determine conditional probabilities. In the semantics of sentences regarding the inverse
the end, inference of accident scenario evolution was relationship from object to subject. An exhaustive
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Civil Engineering 3

search was avoided by introducing a random sam- the determination of node conditional probabilities, and (iv)
pling mechanism, and the effectiveness of their the inference of safety risk knowledge for highway trans-
knowledge inference model was experimentally portation. Overall, the modeling procedure incorporates the
verified. expert method and machine learning. Figure 1 presents the
model framework proposed in this study.
,e advantage of logical rule-based reasoning is the
intuitive inference procedure that can reflect prior knowl-
edge, and its disadvantage is the difficulty in obtaining rules 2.1. Network Node Identification. At this stage, expert
that lead to error accumulation. ,e advantage of deep scoring identifies the network nodes. Prior expert knowledge
learning-based inference models is a powerful reasoning is collected according to five categories of “drivers, vehicles,
capability, while their disadvantages are insufficient inter- roads, environment, and management,” thereby con-
pretability and data limitations. Predicting the probability of structing a matrix for node information acquisition as
accident occurrence and estimating the “causal chain” are shown in the following expression:
the foremost tasks of knowledge graph inference for highway
transportation safety risks. On the other hand, we consider Ef � 􏼐eij 􏼑m×n, (1)
the task of inferring a knowledge graph about highway
where eij (0 ≤ eij ≤ 1) denotes the confidence that the ith
transportation safety risks from the Bayesian networking
expert has, regarding the causality between the jth risk and
perspective, given the limited data collected in this study.
the fth category, i ∈ [1, n], j ∈ [1, m], f ∈ [1, 5]. Matrix Ef
Expert scoring identifies the nodes of the Bayesian network.
is traversed, and if eij ≥ a, it indicates that the jth risk and the
,e node parameters are learned via the EM algorithm, and
fth category are casually correlated.
knowledge about highway transportation safety risks is
inferred with the junction tree model.
,e contributions of this paper include the following: 2.2. Network Architecture Identification. ,e network ar-
(1) A knowledge inference framework for road trans- chitecture identification is divided into two steps, given the
portation risks is proposed, and annotated datasets limited amount and quality of training data in this study,
are provided for the research field. namely, the expert method based network architecture and
the network architecture modification by data-based
(2) During the identification of network nodes, a net- learning. For network architecture learning, the greedy
work architecture identification method based on search and conditional restriction algorithms are generally
expert scoring combined with the K2 algorithm is used, of which the methods based on greedy space search are
proposed that ensures the incorporation of expert adapted to minimum data size. K2 algorithm optimizes the
knowledge about road transportation risks by the search capability on the basis of ordinary greedy search by
network architecture. deleting the redundant edges. Hence, for network archi-
(3) Based on the Bayesian network created in this paper, tecture learning, this study adopts the K2 algorithm, whose
the probability distributions of accident occurrence core idea is to find a network structure with high score
are inferred under multiple factors, including the functions. ,e K2 scoring method is described in the fol-
driver, vehicle, road, environment, and lowing formula[16]:
management. n qi ri
D ri − 1􏼁!
,e remainder of this paper is organized as follows: F􏼒 􏼓 � 􏽙 􏽑 􏽙 Nijk !, (2)
G i�1 j�1 􏼐Nij + ri − 1􏼑 k�1
Section 2 puts forward a model framework for risk
knowledge inference, Section 3 builds a Bayesian network- where G represents the Bayesian network architecture, D stands
based knowledge inference model for road transportation for the training dataset, i denotes the ith node, n denotes the
risks, Section 4 infers knowledge about road transportation number of nodes, j denotes the jth parent node of the current
risks, and Section 5 concludes the study. node, qi is the number of parent nodes of the current node, k
represents the kth value of the current node, ri represents the
2. Creation of Risk Knowledge Inference Model number of possible values of the current node, and Nijk denotes
the number of examples in the training dataset D that corre-
,e Bayesian network may be summarized as a probabilistic sponds to the kth value of current node and the jth value of
inference network that is based on the Bayesian formula, parent node, Nij � 􏽐k�1
ri
Nijk .
where the nodes represent the random variables, and the For the K2 algorithm, its pseudocode is as in Algorithm 1
directed edges between nodes represent the internode [16].
causality. Each node has a probability distribution. As-
suming a given Bayesian network G(S, P) consists of two
parts, with S being a directed acyclic graph containing all 2.3. Network Parameter Learning. During network param-
nodes and P being a collection of conditional probability eter learning, commonly used algorithms include the
distribution tables. Construction of Bayesian network for maximum likelihood estimation (MLE) and the Bayesian
inferring highway transportation safety risk knowledge estimation and expectation-maximization (EM) algorithms,
comprises four steps, namely, (i) the network node iden- of which the MLE algorithm is generally suitable for sce-
tification, (ii) the network architecture identification, (iii) narios with large data size. When the sample size is small and
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
4 Advances in Civil Engineering

Start

Model building

Network node Network architecture


identification identification

Network parameter learning

Model inference and modification

Expert
Information input
experience

Node probability updating and


inference

N Do the inference results agree


with facts?

End

Figure 1: Flow chart of safety risk knowledge inference for highway transportation.

Input: training data D, node sequence ρ, positive integer u (u denotes the number of parent nodes)
Output: G
for i � 1 to n do
pa(Vi ) � ∅
Fold � fCH ((Vi , pa(Vi )), D)
start � true
while start (true) and |pa(Vi )| < u do
Z←pred(Vi )\pa(V), fCH ((Vi , pa(Vi ) ∪ {Z})D) it reaches the maximum node
Fnew � fCH ((Vi , pa(Vi ) ∪ {Z})D)
if Fnew > Fold then Fold � Fnew
pa(Vi ) � pa(Vi ) ∪ {Z}
else Start � false
end
end
where pred(Vi ) represents the nodes before Vi and pa(Vi ) is a collection of parent nodes.

ALGORITHM 1

the prior probability is hardly attainable, the use of the EM variables at the network inference stage are computed under
algorithm yields a good learning effect. Hence, this study exact values given for a set of evidence variables (causes).
adopts the EM algorithm to learn network parameters. Its Among common algorithms for network inference, such
algorithmic procedure can be divided into E-step and as junction tree and variable elimination, the former is
M-step (see Algorithm 2) [17]. adopted herein for network inference, owing to its easy-to-
understand and accurate inference advantages. ,e junction
tree algorithm is divided into four phases:
2.4. Network Inference. Network inference is the ultimate
objective of this study. All joint probability distributions for (1) In the initial phase, the built Bayesian network is
nodes are obtained through the network parameter learning modularized, and the parent nodes of each node are
stage. ,e probability (result) distributions for a set of query connected with undirected edges
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Civil Engineering 5

Input: observation variable Y, hidden variable Z, joint distribution P(Y, Z|θ), conditional probability distribution P(Z|Y, θ).
Output: model parameter θ
(1) Assignment of the initial values for the model parameter;
(2) E-step: θi is the value of the model parameter after the ith iteration, the calculated function of expectation on i + 1 th iteration,
Q(θ, θi ) � EZ∼P(Z|Y,θi )[log P(Y, Z|θ)] �􏽐Z P(Z|Y, θi )log P(Y, Z|θ)
(3) M-step: find Q(θ, θi ) that maximizes θ, determine the estimated value θi+1 of the i + 1 th iteration.
(4) Repeat steps 2-3 until model converges.

ALGORITHM 2

(2) ,e Bayesian network is converted into an undi- Figure 2 displays the learned network architecture. After-
rected graph, where each arrow is replaced by an ward, modification of the network structure was performed
edge based on expert experience. Figure 3 illustrates the finalized
(3) ,e graph is triangulated, and an edge is added to the network architecture.
variables in the same loop As is clear from Figure 2, the network architecture
presents a strong progressive causal relationship that was
(4) ,e triangulated graph is converted into a clustering
divided into three layers vertically. ,e relationship between
tree, where each node represents factors in the
nodes is clear, and there are no connecting lines between
variable subset
unrelated factors. ,e nodes are divided into two parts,
including observable nodes and virtual nodes. ,e observ-
3. Bayesian Network-Based Knowledge able nodes are the nodes with actual data description, while
Inference Model for Highway Transportation the virtual nodes are the nodes added for the integrity of the
Safety Risks network.
As is clear from Figure 3, the hierarchy of the network is
,e purpose of safety risk knowledge inference for road not obvious, because through the learning of the algorithm,
transportation is to achieve the “prior” control of unsafe the implicit relationship between the nodes is further
risk factors. However, these risk factors have strong un- revealed, and the overall structure of the network satisfies the
certainties, while Bayesian networks have provided requirement of the cognition.
preferable solutions to complex and uncertain problems
[16]. Hence, following this idea, a Bayesian network-based
knowledge inference model for road transportation risks, 3.3. Network Parameter Learning. Learning of various net-
with which the inference problems of safety risk knowl- work parameters was accomplished via the EM algorithm in
edge for road transportation are solved, has been pro- GeNIe 2.0. ,e learning process can be summarized into five
posed in this paper. steps:
(1) Assignment of initial values for various root nodes.
3.1. Network Node Identification. Since the focus of this (2) Establishment of correspondences between node
research is on solving the problem of risk knowledge in- names and IDs.
ference, our team has listed a total of 28 risk sources in- (3) Import of datasets.
volving “drivers, vehicles, roads [18, 19], environment, and
(4) Matching of data with network nodes.
management” based on the description of safety risks in the
accident reports over the years that have been accomplished (5) Parameter learning. ,e learning results are listed in
by taking specific risks as the network nodes. We developed a Table 2, where N implies normal and F implies
quantitative questionnaire and distributed it among ten failure.
experts within the field. When eij ≥ 0.7, a causality is con- Table 2 clearly shows that, under identical conditions,
sidered to exist between risk and category. A risk has been improper emergency response and traffic violations exhibit
chosen as an effective node if more than six experts con- the highest posterior probabilities among the driver factors,
firmed its causality with the category. By employing expert followed by fatigue driving, distracted driving, and drunk
scoring, the network nodes for road transportation risk driving, while inexperienced driving shows the lowest
knowledge have been constructed according to formula (1) posterior probability. ,ese results are in line with expert
as detailed in Table 1 that was done by selecting a total of 26 experience and perception.
observable nodes and 5 virtual nodes.

4. Inference of Safety Risk Knowledge for


3.2. Network Architecture Identification. A combination of Highway Transportation
the K2 algorithm and expert scoring has been used to
identify the Bayesian network architecture. Initially, with the Using the junction tree algorithm, Bayesian network in-
aid of GeNIe 2.0 software, learning of network structure was ference is implemented on highway transportation safety
completed based on the database using the K2 algorithm. risks. According to the Bayesian network architecture,
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
6 Advances in Civil Engineering

Table 1: Bayesian network nodes for road transportation risk knowledge.


Final state Risk category Primary risk sources
V vehicle
V1 braking system; V2 steering system; V3 light signal system; V4 tires; V5 overload or unbalanced load
D1 drunk driving; D2 fatigue driving; D3 distracted driving; D4 traffic violations; D5 improper emergency
D driver
response; D6 inexperienced driving
R1 roadsides; R2 bridge and tunnel; R3 alignment; R4 intersections; R5 pavements; R6 signs and markings;
A accident R road
R7 sight distance; R8 safety protection facilities
E environment E1 interference from traffic accidents; E2 severe weather; E3 natural disasters; E4 night lighting
M
M1 dynamic monitoring; M2 regulations; M3 private contracting
management

V D R E M

V1 V2 V3 V4 V5 R1 R2 R3 R4 R5 R6 R7 R8 M1 M2 M3

D1 D2 D3 D4 D5 D6 E1 E2 E3 E4

Observable node

Virtual node

Figure 2: Bayesian network topology for highway transportation accidents.

Improper
emergen..
Roadside risk
Poor
alignment; Interference
Road from t..
sections.. Bridge and
Distracted tunnel
driving Lighting
Unfamiliar inadequ..
driving Severe
weather
Driver factor
Natural
Poor sight disasters
Fatigue line
driving Traffic
violations Road
sections.. Dynamic
Drink and monitoring
driving Uneven
road sec.. Overload
or unbal.. Environment
Braking factor
system f..
Road factor Private
Management vehicles
Steering factor
system f.. Vehicle
factor
Intersections
Institutional?
Accident deficie..
Light signal
system f.. Tire failure

Figure 3: Final optimized network architecture.


7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Civil Engineering 7

Table 2: Driver parameter learning results.


Risk Status
Drink and driving N N N N N N F
Fatigue driving N N N N N F N
Distracted driving N N N N F N N
Traffic violated N N N F N N N
Improper emergency
N N F N N N N
response
Unfamiliar driving N F N N N N N
N 0.899 0.8125 0.3053 0.339638 0.558 0.506 0.5666
Driver factors
F 0.101 0.1875 0.6946 0.66036 0.441 0.493 0.4333

impact analysis is carried out keeping in mind the following are depicted in Figure 4 under human, vehicle, road, en-
aspects: vironment, and management factors.
From Figure 4(a), it is clear that, among the causes of
(1) Effects of the driver, vehicle, road, environment, and
accidents, the accident probability is the highest for the
management factors on the occurrence of accidents.
driver factors, with an inferential probability of 0.899, fol-
(2) Effects of drunk driving, fatigued driving, distracted lowed by the management factors, with an inferential
driving, traffic violations, improper emergency re- probability of 0.485. In contrast, road factors constitute the
sponse, and inexperienced driving on the driver least probable factors, primarily because they are indirect
risks. causes in general. According to Figure 4(b), the probability
(3) Effects of the braking system, steering system, light of accidents resulting from the failure of drivers, vehicles,
signal system, tires, truck overload, and unbalanced and management is quite high, while that caused by the
load on the vehicle risks. failure of environmental and road factors is rather low.
(4) Effects of roadsides, bridges and tunnels, alignment,
intersections, pavements, signs and markings, sight
4.2.2. Effects of Relevant Factors on the Driver Risks.
distance, and safety protection facilities on the road
Figure 5 illustrates the influences of drunk driving, fatigue
risks.
driving, distracted driving, traffic violations, improper
(5) Effects of traffic accidents, severe weather, natural emergency response, and inexperienced driving on the
disasters, and night lighting on the traffic environ- driver risks.
ment risks. From Figure 5, it is clear that, regarding driver factors,
(6) Effects of dynamic monitoring, regulations, and the failure probabilities attributed to improper emergency
private contracting on the management risks. response and traffic violations are comparatively higher at
0.694 and 0.660, respectively. In comparison, the failure
probability caused by inexperienced driving is lower. ,e
4.1. Data Description and Model Evaluation. Our team occurrence frequencies are the highest among causes of
collected 600 reports on road transportation accidents from accidents for speeding, illegal lane changing, illegal over-
the safety management websites that occurred between 2012 taking, illegal parking, and illegal emergency lane parking,
and 2019. ,ese accident reports have been sorted into three all of which can be classified as forms of a traffic violation.
categories. ,e risk factors of each accident are marked ,e probability of an accident is generally high in case of
either as 0 or as 1, where 1 indicates that the assessed risk is improper emergency response. ,e probability of accidents
an accident-causing factor, and 0 indicates that the assessed caused by inexperienced driving is rather low because of the
risk is not an accident-causing factor. ,e model evaluation strict management of drivers by road transportation
criteria are as follows: companies.

(a) Accordance with expert experience and knowledge


of the analysis of the effects of relevant factors on 4.2.3. Effects of Relevant Factors on the Vehicle Risks.
accident occurrence. ,e impacts of the braking system, steering system, light
(b) Conformance to the accident state description of the signal system, tires truck overload, and unbalanced load on
accident reports in the case analysis. the vehicle risks are illustrated in Figure 6.
As shown in Figure 6, the failure probabilities attrib-
utable to the braking system and tires are the highest among
4.2. Inference Analysis of Relevant Influencing Factors all vehicle factors—0.885 and 0.888, respectively. Braking
system faults generally refer to braking failure or poor
4.2.1. Effects of Relevant Factors on the Occurrence of braking caused by improper operation, wear of parts, or
Accidents. ,e probability distributions of accident occur- inadequate vehicle clearance, while tire faults often refer to
rence, based on the built Bayesian network model for the severe wear or blowout of tires. ,e lowest failure
highway transportation safety risks and the inference results, probability is attributable to light signal system faults
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
8 Advances in Civil Engineering

1.0 1.0

0.8 0.8

Reasoning probability value


Reasoning probability value

0.6 0.6

0.4 0.4

0.2 0.2

0.0 0.0

DREM
DM
DE
DEM

DRM

DVRE
DVE
DVEM

DVREM
DV
VE
DVM
DVRM
VREM

EM
VM
VRM
REM
VEM
VRE
RE
RM
Driver factor Vehicle Road factor Environment Management

DR

DR

DVR

VR
factor factor factor
Influence factors
Influence factors

(a) (b)

Figure 4: Effects of relevant factors on the occurrence of accidents.

0.8
1.0

0.7

0.6 0.8
Reasoning probability value

Reasoning probability value

0.5
0.6
0.4

0.3 0.4

0.2
0.2
0.1

0.0 0.0
Drink and Fatigue Distracted Traffic Improper Unfamiliar Braking system Steering Light signal Tire failure Overload or
driving driving driving violations emergency driving failure system failure system failure unbalanced load
response Influence factors
Influence factors Figure 6: Effects of relevant factors on the vehicle risks.
Figure 5: Effects of relevant driver factors.
causes of accidents, the relevant probabilities are also high. In
because accidents can often be avoided in this case as long as contrast, sight distance is the least likely cause among all road
the drivers take appropriate emergency measures. factors because such problems generally involve roadside ob-
structions or intersections, whose probabilities are rather low
according to accident statistics.
4.2.4. Effects of Relevant Factors on the Road Risks. In Fig-
ure 7, the influences of roadsides, bridges and tunnels, alignment,
intersections, pavements, signs and markings, sight distance, and 4.2.5. Effects of Relevant Factors on Management Failures.
safety protection facilities on the road risks are detailed. Figure 8 presents how dynamic monitoring, regulations, and
From Figure 7, it is clear that the failure probabilities at- private contracting affect the management risks.
tributable to pavement and alignment problems are the highest According to Figure 8, the probability of management
among all road factors—valued at 0.852 and 0.782, respectively. problems caused by improper dynamic monitoring is the
Pavement problems generally refer to slippery surfaces, potholes, highest at 0.699. At present, information about fatigued
and so forth, while alignment problems include sharp bends, driving, distracted driving, speeding, vehicle trajectories,
steep slopes, and long descents. As these are rather common and so forth can be tracked and forewarned by the dynamic
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Civil Engineering 9

1.0 4.2.6. Effects of Relevant Factors on the Traffic Environment.


Figure 9 illustrates how traffic accidents, severe weather,
natural disasters, and night lighting affect the traffic
0.8 environment.
From the figure, it is clear that the failure probability
Reasoning probability value

attributable to severe weather is the highest among all en-


0.6
vironmental factors, which is at 0.666. Severe weather
mainly refers to conditions such as heavy rain, fog, snow, or
wind warnings, which are the most common causes of traffic
0.4
environment failures. Meanwhile, given the relatively low
incidence of natural disasters, the probability of environ-
mental failures resulting from them is the lowest.
0.2
All the above analysis results accord with expert expe-
rience and knowledge, suggesting the effectiveness of the
created model.
0.0
Roadside Bridge and Poor Intersections Uneven Road Poor sight Road
risk tunnel alignment road sections line sections
sections with without
unclear or safety
4.3. Case-Based Analysis. Introducing “a certain major road
incomplete protection traffic accident” helps in carrying out an empirical analysis of
signs and facilities
markings knowledge inference. On February 20, 2019, at 19:07 h, an
Influence factors ordinary passenger bus collided with the sidewall of a tunnel
Figure 7: Effects of relevant factors on the road risks.
on the 2441 km + 100 m section of the G65 Bao-
tou–Maoming Expressway (within a tunnel in Hezuo Vil-
lage, Lingui District, Guilin), which caused 4 deaths, 3
serious injuries, 16 minor injuries, and severe vehicle
0.8 damage. ,e causes of this major road traffic accident were
analyzed as follows:
0.7
(1) Driver factors: after the bus entered the tunnel, the
0.6 driver oversped, ignoring the signs and markings. In
Reasoning probability value

response to the deceleration of the preceding vehicle,


0.5 the driver applied emergency brakes, which was an
improper emergency response, thereby losing con-
0.4
trol of his vehicle.
0.3 (2) Vehicle factor: during steering, the bus suddenly
went out of control, so the possibility of vehicle
0.2 failure cannot be ruled out.
(3) Environment factor: there was a heavy shower at the
0.1
time of the accident that made the road surface
0.0 slippery.
Dynamic monitoring Institutional deficiencies Private vehicles (4) Management factors: due to the long-term, nonlocal,
Reasoning probability value out-of-scope operation of the tourist bus, its dy-
Figure 8: Effects of relevant factors on management failures. namic vehicle monitoring terminal had been offline.
Based on the above description, the node elements for
monitoring system for road transport vehicles. A sub- this accident are detailed in Table 3.
stantial number of accidents can be avoided by the proper Based on the Bayesian network built in Figure 3, the
functioning of the vehicle monitoring system. In contrast, nodes relating to accident occurrence are mapped into a
the occurrence of an accident is generally attributed to the Bayesian network, followed by knowledge inference in
improper functioning of the dynamic monitoring system. GeNIe 2.0. As shown in Figure 10, the probabilities are
Regulations are the least likely factors causing manage- updated for each node.
ment failures. Overall, regulatory problems include Figure 10 clearly shows that the foremost cause of this
driving management and inadequate vehicle management accident was the human failure attributed to improper emer-
that is indirectly responsible for the causes of accidents in gency response and traffic violations, whose probability was
general. 82%; the second important cause was the vehicle failure
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
10 Advances in Civil Engineering

0.7

0.6

Reasoning probability value


0.5

0.4

0.3

0.2

0.1

0.0
Interference from Severe weather Natural disasters Lighting
traffic accidents inadequacy
Influence factors

Figure 9: Effects of relevant factors on the environmental risks.

Table 3: Bayesian network nodes for the accident.


Risk factor Network node
Improper emergency response
Driver
Traffic violation
Vehicle Vehicle fault
Pavement
Road
Bridge and tunnel
Environment Severe weather
Dynamic monitoring
Management
Private contracting

Accident
State 0 32%
State 1 68%

Driver factor Vehicle factor Road factor Environment factor Management factor
State 0 18% State 0 42% State 0 40% State 0 53% State 0 36%
State 1 82% State 1 58% State 1 60% State 1 47% State 1 64%

Improper emergenc.. Traffic violations Uneven road sections Bridge and tunnel
State 0 12% State 0 12% State 0 67% State 0 37%
State 1 88% State 1 88% State 1 33% State 1 63%

Vehicle failure Severe weather Dynamic monitoring Private vehicles


State 0 45% State 0 21% State 0 45% State 0 56%
State 1 55% State 1 79% State 1 55% State 1 44%

Figure 10: Probability updates and inference diagram of the accident.

attributable to vehicle fault, with a probability of 58%; besides, private contracting was 68%. Ultimately, the probability of
the probability of traffic environment problems resulting from occurrence of the accident was 0.68, which is rather high. ,ese
severe weather that was 47%; and the probability of manage- descriptions conform to the actual accident inference process
ment failure resulting from inadequate dynamic monitoring and that proves the feasibility of the proposed method.
7074, 2021, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2021/6624579 by Nat Prov Indonesia, Wiley Online Library on [04/07/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Civil Engineering 11

5. Conclusions Beijing Institute of Technology, vol. 33, no. 6, pp. 600–604,


2013.
Highway transportation safety risks involve multiple factors, [5] Y. Liu, C. Ouyang, and J. Li, “Ensemble method to joint
such as driver, vehicle, road, environment, and manage- inference for knowledge extraction,” Expert Systems with
ment. According to the results of the Bayesian network Applications, vol. 83, pp. 114–121, 2017.
inference, the primary cause of accidents is driver-related. [6] Q. Lu, P.-a. Zhong, B. Xu et al., “Risk analysis for reservoir
flood control operation considering two-dimensional un-
Meanwhile, management factors are nonnegligible as well.
certainties based on Bayesian network,” Journal of Hydrology,
,e foremost human factors are improper emergency re- vol. 589, Article ID 125353, 2020.
sponses and traffic violations, of which improper emergency [7] D. L. Yin, Z. B. Chen, J. C. Zeng et al., “Human factors in-
responses are often ignored by drivers or managers. Ad- ference of safety accidents in coal mine based on qualitative
ditionally, the foremost vehicle factor is faulty vehicles, Bayesian network with weight,” Journal of WUT (Information
which is attributed to the failure of the braking system or & Management Engineering), vol. 39, no. 1, pp. 514–518, 2017.
tires. Severe weather serves as the foremost factor for traffic [8] D. Y. Xia, X. M. Qian, and Z. P. Duan, “Scenario deduction
environment. Interference from existing accidents is another model of unconventional emergency based on dynamic
factor that is often ignored by drivers or managers. Bayesian network,” Journal of Northeastern University (Nat-
By using the Bayesian network for inference of knowl- ural Science), vol. 36, no. 6, pp. 897–902, 2015.
edge graphs oriented to highway transportation safety risks, [9] A. Seyed Hassani, M. S. Haghighi, and A. Khonsari, “Bayesian
inference of private social network links using prior infor-
the rule acquisition problem with rule-based reasoning, as
mation and propagated data,” Journal of Parallel and Dis-
well as the limitations in data size and quality can be avoided. tributed Computing, vol. 125, pp. 72–80, 2019.
With powerful reasoning, the above method can preferably [10] M. M. Rajabi and B. Ataie-Ashtiani, “Efficient fuzzy Bayesian
be integrated with the transportation safety risk knowledge inference algorithms for incorporating expert knowledge in
graphs to achieve flexible inference of accident probability in parameter estimation,” Journal of Hydrology, vol. 536,
the dynamic risk coupling scenarios. In the next phase, in pp. 255–272, 2016.
view of the dynamic nature of such knowledge graphs, we [11] R. Xie, Z. Liu, J. Jia et al., “Representation learning of
will probe deeper into the collaborative fusion between the knowledge graphs with entity descriptions,” in Proceedings of
Bayesian network-based knowledge inference and the dy- the 30th AAAI Conference on Artificial Intelligence, Phoenix,
namic knowledge graphs. AZ, USA, 2016.
[12] K. M. Annervaz, S. B. R. Chowdhury, and A. Dukkipati,
“Learning beyond datasets: knowledge graph augmented
Data Availability neural networks for natural language processing,” 2018,
https://arxiv.org/abs/1802.05930.
,e data are available through contacting the authorvia [13] F. Godin, A. Kumar, and A. Mittal, “Using ternary rewards to
e-mail: [email protected]; hyperlink (https://pan.baidu. reason over knowledge graphs with deep reinforcement
com/s/1Y9d3zGtXtowCmWKHrQ678g). learning,” 2019, https://arxiv.org/abs/1902.10236.
[14] J. Jiang, X. Li, C. Zhao, Y. Guan, and Q. Yu, “Learning and
inference in knowledge-based probabilistic model for medical
Conflicts of Interest diagnosis,” Knowledge-Based Systems, vol. 138, pp. 58–68,
,e authors declare that they have no conflicts of interest. 2017.
[15] Q. Liu, M. H. Han, L. Y. Jiang et al., “Two-tier random walk
based relational inference algorithm,” Chinese Journal of
Acknowledgments Computers, vol. 40, no. 6, pp. 1275–1290, 2017.
[16] G. F. Cooper and E. Herskovits, “A Bayesian method for the
,is work was supported by the Special Fund 2019-C507 of induction of probabilistic networks from data,” Machine
Science and Technology Innovation of Research Institute of Learning, vol. 9, no. 4, pp. 309–347, 1992.
Highway Ministry of Transport, and the Special Fund [17] S. L. Lauritzen, De EM Algorithm for Graphical Association
Project of Basic Research Expenses of Central Public Welfare Models with Missing Data, Elsevier Science Publishers,
Research Institutes 2020-9066. Amsterdam, Netherlands, 1995.
[18] C. Liu, S. Lv, X. Peng et al., “Analysis and comparison of
different impacts of aging and loading frequency on fatigue
References characterization of asphalt concrete,” Journal of Materials in
Civil Engineering, vol. 32, no. 9, Article ID 04020240, 2020.
[1] T. Yu, J. Li, Q. Yu et al., “Knowledge graph for TCM health [19] C. Liu, S. Lv, D. Jin et al., “Laboratory investigation for the
preservation: design, construction, and applications,” Artifi- road performance of asphalt mixtures modified by rock as-
cial Intelligence in Medicine, vol. 77, pp. 48–52, 2017. phalt-styrene butadiene rubber,” Journal of Materials in Civil
[2] B. Ribeiro, N. Chen, and A. Kovacec, “Shaping graph pattern Engineering, vol. 33, no. 3, Article ID 04020504, 2021.
mining for financial risk,” Neurocomputing, vol. 326-327,
pp. 123–131, 2019.
[3] Y. Chen, Y. D. Wang, J. D. Wang et al., “Method for infor-
mation system risk management based on Markov logic
networks,” Computer Engineering and Applications, vol. 52,
no. 18, pp. 104–110, 2016.
[4] L. G. Yang, S. P. Tang, and J. Zhu, “A Markov logic network
based sentence sentimental analysis method,” Transactions of

You might also like