2002 A Mounce Leak Detec ML ANN MNF FailureSensor 3LevelsDetection
2002 A Mounce Leak Detec ML ANN MNF FailureSensor 3LevelsDetection
Water Science and Technology Vol 45 No 4–5 pp 237–246 © IWA Publishing 2002
S.R. Mounce*, A.J. Day**, A.S. Wood**, A. Khan**, P.D. Widdop** and J. Machell***
* Department of Computing, Phoenix Building, University of Bradford, Bradford BD7 1DP, UK
** Department of Mechanical & Medical Engineering, University of Bradford, BD7 1DP, UK
*** Yorkshire Water, ROCC1, Western Way, Halifax Road, Bradford BD6 2LZ, UK. (E-mail:
www.laps.ac.uk)
Abstract This paper describes how hydraulic and water quality data from a distribution network may be
used to provide a more efficient leakage management capability for the water industry. The research
presented concerns the application of artificial neural networks to the issue of detection and location of
leakage in treated water distribution systems. An architecture for an Artificial Neural Network (ANN) based
system is outlined. The neural network uses time series data produced by sensors to directly construct an
empirical model for predication and classification of leaks. Results are presented using data from an
experimental site in Yorkshire Water’s Keighley distribution system.
Keywords Bursts; detection; leakage; neural networks; online monitoring; water distribution networks
Introduction
Problem definition
The issue of leakage management is of serious consideration to water companies because
major water loss can have negative economic, environmental and political consequences.
In the past, a relatively high level of leakage has been typical as the loss of treated water was
considered by water utilities on a solely economic basis and the significant reduction of
leakage level was not recognised as being economically viable. It has been estimated that
the ten major UK water companies lose on average 30 per cent of their treated water
through leaks (Butler and West, 1987). However, in the UK the Office of Water Services
(OFWAT) has in recent years imposed company specific mandatory leakage targets which
are currently reviewed annually. The water companies additionally now have a statutory
duty to conserve water and publish details of leakage reduction performance.
Consequently, the requirement for reduction in leakage levels has led to an interest in tech-
nologies for pipeline-integrity monitoring and failure detection. Current leakage manage-
ment methodologies within the water industry tends to be predominantly a manual process
which is resource intensive. For example data logging, data collection and transfer to point
of use, data analysis, reporting and then mobilisation of leakage teams (one or more leakage
teams may require several days to locate the leak). Subsequently a contractor must be
employed to effect repair. This overall process may take weeks or in extreme cases months.
Assuming an automated system performed daily data analysis, a realistic target for signifi-
cant burst lifetime would be a reduction to 2 days or less. Ideally, a state of the art detection
system should be able to:
• detect and localise bursts within an entire distribution system
• differentiate between bursts and unusual demands or system events
• differentiate between bursts which occur instantly and those which develop gradually
(leaks).
District flow metering is the most common method of leakage control in England and
Wales. Martin and Farley (1994) commented that in recent times, due to the opportunities
afforded by computer technology, continual monitoring of leakage levels through district 237
this flow data is in analysis of measured minimum night flows (NFM). The minimum night
flow is the lowest flow supplied to a hydraulically isolated supply zone. It is usually meas-
ured during the night hours between midnight and 5:00 am and includes leakage as well as a
certain minimum night consumption. Night flows are used because water use is at a mini-
mum and it is easier to identify and subtract the legitimate flows (e.g. 24 hr process loads).
If the night flow minus the legitimate flow is close to zero, the leakage must also be close to
zero. In contrast, unusual jumps in volumes will signify leakage in the absence of any other
factors. It is common to identify a nominal target nightline for each DMA. This target
implies a certain level of leakage activity with regard to the find and fix of hidden leaks. If it
is established that leakage has increased sufficiently to warrant further investigation, the
next stage involves a manual leakage detection which is carried out using methods such as
sounding, leak noise correlation and step testing.
This paper describes research into an automated computing system, based on artificial
neural network and sensor technology, for the detection and location of leakage in a water
pipeline distribution network. The methodology involves application of neural network(s)
to time series data generated by flow, pressure and other sensors (such as failure sensors
developed for the purpose). The target of the research is to automate district meter analysis
as well as allowing further localisation of the burst within the DMA prior to manual detec-
tion by leakage teams.
The complexity of a model for a water pipeline distribution network for a large urban
area is such that no single neural network can effectively monitor the behaviour of the
entire pipeline distribution network. Typically the distribution networks comprise several
DMAs with each zone containing one or more main pipelines, each of which feeds into one
or more sub-zones and finally into individual industrial and domestic consumers. Other
features such as a reservoir or pumping station may also be present. To manage such com-
plexity, it is necessary to build a number of neural networks arranged in a parallel and hier-
archical fashion to allow different types of information processing on the same data or
different subsets of data and at different levels of abstractions. This approach enables the
decomposition of a complex problem into a number of sub-problems. Each sub-problem is
then assigned to a functionally suitable network. At the top global level, level 1 (DMA
level), a neural network monitors zones and can detect leaks to within a zone (e.g. DMA)
but is unable to specify the exact position. Previous work has demonstrated the feasibility
of detection at this level using hydraulic data (Torsun et al. 1999). In the next level of
abstraction, level 2 (sub-zone level), a neural network will monitor sub-zones and can
locate leaks to within a sub-zone. Finally, at level 3 (pipe level), a neural network will mon-
itor one or more pipes and is able to accurately locate the position of the leak. Recent work
has seen attempts to link hydraulic models with on-line data from telemetry systems
(Skipworth et al., 1999; Orr et al., 1999). This type of online model could provide the pipe
level input. A very precise identification of leak location would rely on input to a neural
network of signals from mobile acoustic sensor instrumentation.
Initial work has focused on hydraulic data (from existing flow and pressure sensors) as
the principal data from which to provide features for leakage detection. Consultation with
238 domain experts, as well as analysis of historical data, had revealed hydraulic variables to be
Experimental site
Yorkshire Water’s Keighley distribution system provides water services for around 26,000
customers. It contains approximately 110 km of pipeline and seven service reservoirs. The
distribution network has more recently been partitioned into 15 DMAs. Six of these cover
the city centre and immediate environs and one (K709) has been selected as the test bed for
a leakage detection case study. As well as state of the art hydraulic sensors (flow and pres-
sure are both logged at all DMA level sites in Keighley) the facilities were available to
install failure sensors designed and manufactured at Bradford University. The zone is
effectively disconnected from neighbouring zones resulting in an isolated environment for
tests – the area is supplied solely by Highfield SR. Figure 1 illustrates Zone K709 (from the
hydraulic model) showing the pipe geometry and sensor placement. Table 1 summarises
the sensors employed for the zone.
Methods
Data acquisition
Hydraulic data for the case study network was made available by Yorkshire Water. During
the initial research period this had to be via manual download from company databases but
later the opportunity to download directly using remote telemetry access was provided. The
data consists of 15 minute values for flow and pressure. Data collection for the opacity fail-
ure sensors was via manual download on site (with data loggers providing capacity for 25
days data at reading intervals of five minutes). In either case, time stamped files of readings
form the basis for further analysis.
Data pre-processing
Once data is acquired it is then passed through a number of filters (written in Matlab or
JAVA) as follows.
Table 1 Sensors in experimental site
Albert St.
PRV Fixed Sensor Location
(flow and pressure)
Data correction. First a filter removes any rogue values. Then, to ensure a continuous
data stream an ARIMA based filter fills in any periods of missing data. These missing
entries can be due to sensor failure or temporary power outages for example.
Normalisation. The input is normalised by means of linear re-scaling with mean and
standard deviation (Z-score) to a range of 0 to 1.
Reformatting. Finally, we reformat the input stream into a tapped delay line format in
order to prepare for neural network presentation.
Training
Algorithm. In the NETLAB library the mixture model used is Gaussian with a single
covariance parameter for each component. The mixture coefficients are computed from a
group of softmax outputs, the centres are equal to a group of linear outputs, and the vari-
ances are obtained by applying the exponential function to a third group of outputs. The net-
work is trained with a scaled conjugate gradient optimiser with the error function being the
negative log likelihood of the training data:
N N ÏÔ M ¸Ô
E = - ln L = -  ln pÊË x n ˆ¯ = -  ln Ì Â pÊË x n | jˆ¯ P j ˝
Ô j =1 Ô˛
()
n =1 n =1 Ó
Ï || x - m ||2 ¸
1 Ô Ô
( )
where p x | j = d /2
exp Ì -
2
2s j
j
˝
Ê 2ps 2 ˆ Ô
Ó ˛Ô
Ë ¯
The input is a lag vector and the target is a one day ahead prediction. The prediction evalu-
ated can either be the most “likely value” (centre of the highest component) or we can take
an average across distributions.
Training data. The MDN network is trained with a period of historical data for the sensor in
order to learn the distribution. This data set should be significant (e.g. at least several
weeks, preferably months) and should show reasonable well behaved behaviour. Every so
often (either automatically or at the behest of a user) the network is re-trained with a slight-
ly updated data set. In this way, current conditions are continually built into the model. If
used to train a dedicated MDN. Experimentation revealed that a suitable number of hidden
units was 8 and a set of three Gaussians was sufficient. The main effect of variation of
hidden units was convergence time and increasing the number of Gaussians above 3 had
negligible effect on performance. There was little appreciable reduction of the error term
after around 100 complete cycles of the training set (containing 26,208 presentations). The
one day ahead prediction for the seen data was satisfactory at this stage. To test the per-
formance a test set of data was used for a period of approximately one month (26th
December 2000 to 30th January 2001). Results for this test file and supplemental sensor
data will be described in the next section.
60
50
40
flow l/s
30
20
10
0
04-May
15-May
25-May
01-Feb
12-Feb
22-Feb
03-Mar
14-Mar
24-Mar
06-Sep
16-Sep
26-Sep
03-Apr
14-Apr
24-Apr
12-Jan
22-Jan
26-Aug
04-Jun
15-Jun
25-Jun
06-Jul
16-Jul
26-Jul
06-Aug
16-Aug
07-Oct
date
Predicted v actual
0.7
0.65
Normalised flow
0.6
0.55 Actual
0.5 predicted
0.45
0.4
0.35
0.3
04/01/01
05/01/01
05/01/01
06/01/01
06/01/01
06/01/01
07/01/01
07/01/01
07/01/01
08/01/01
08/01/01
08/01/01
09/01/01
09/01/01
10/01/01
10/01/01
10/01/01
11/01/01
11/01/01
11/01/01
Date
242 Figure 4 Predicted v actual unseen data for standard conditions
0.55 Actual
0.5 predicted
0.45
0.4
0.35
0.3
30/12/00 31/12/00 01/01/01 02/01/01
Date
0.8
0.75
0.7
Normalised flow
0.65
0.6 Actual
0.55 predicted
0.5
0.45
0.4
0.35
0.3
21/01/01 22/01/01 23/01/01 24/01/01
Date
Burst
Alert
1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
29/12/00
29/12/00
29/12/00
29/12/00
30/12/00
30/12/00
30/12/00
30/12/00
31/12/00
31/12/00
31/12/00
01/01/01
01/01/01
01/01/01
01/01/01
Figure 9 Correlation for diurnal cycle between opacity and flow in Gresley Rd. sub-zone (normalised data
from Figure 10 29/12–01/01)
correlation between abnormal flows and the opacity was witnessed on several sensors dur-
ing the period. Figures 9 and 10 illustrate the output for two of the failure sensors, along
with the Gresley Road flow meter which monitors a sub-area in K709. The data has been
sampled at a 15 min interval rate. The graph shows two interesting events. Firstly, the 31st
December to 3rd January abnormal flow previously discovered is noticeable on the Gresley
Road flow meter. Note that the opacity failure meters have a visible diurnal cycle which ini-
tially increases significantly in response to the velocity increase through the pipe. Even
with 15 minute reading interval there is a noticeable phase difference for these two geo-
graphically close sensors. Secondly, observe the sudden flow surge on the 3rd of January
and the corresponding spikes in the opacity data. The source of this event is unknown,
though it is possibly industrial usage as this section of the zone has several industrial cus-
tomers. Figure 9 shows the first 4 days data of Figure 10 which has been normalised to illus-
244 trate the high correlation for the diurnal cycle. Note that the opacity cycle is lagged behind
7
0.6
6
0.5
0.4
Low Mill Lane
Flow l/s
4 Parkwood St.
Gresley Rd (flow)
0.3
3
0.2
2
0.1
1
0 0
29/12/00 30/12/00 31/12/00 01/01/01 02/01/01 03/01/01 04/01/01
date
the flow. Also, observe the high opacity response the morning following the commence-
ment of abnormal flow.
Conclusions
Many water utility companies are beginning to amass a great deal of data by means of
remote sensing of flow, pressure and other attributes. In the past, this information has
mainly been used for district level management including water audit and calibration of
hydraulic models. However, the opportunity exists to build on existing analysis of mini-
mum night flow for leakage detection using an automated neural network based pattern
recognition system. In this paper, by means of an ongoing experimental field study, we
have indicated how this may be accomplished at the district meter level. Further, we have
demonstrated that additional in-zone failure sensors can provide a set of associated signals
possessing magnitude and phase information which, with suitable quantities of example
training data, will enable further understanding and localisation of events within a DMA.
Future work will extend the system by constructing a neural network for analysis of failure
sensor output. This will require an expanded data set: to be accomplished both by continua-
tion of monitoring within the zone (along with discovery of system events) and by specific
controlled trials in the area in which leak events are simulated. A multi-input TDNN is the
expected architecture for this component of the system. Finally, analysis of an online
hydraulic model’s pipe level data will be attempted for a more refined location.
Acknowledgements
The authors wish to acknowledge the support given for this research by the Engineering and
Physical Sciences Research Council (UK) under its Water Infrastructure and Treatment
Engineering (WITE) Initiative. The authors also acknowledge the support of Yorkshire
Water and are grateful to the directors of the company for permission to use the data and
publish this paper. Finally all the authors wish to acknowledge the intellectual lead provid-
ed to this project by the late Professor I.S. Torsun.
References
Bishop, C.M. (1994). Mixture density networks. Technical report NCRG/94/004, Department of Computer
Science and Applied Mathematics, Aston University, Birmingham, UK. 245
internal report.
Martin, M. and Farley, L. (1994). Improving water management in leakage control. Proc. of Hydrotop 94,
Marseille, France.
Orr, C., Bouulos, P., Stern, C. and Liu, P. (1999). Developing real-time models of water distribution sys-
tems. Modeling and optimisation applications (Water engineering and management series, 3–4) by D.
Savic (Editor) and G. Walters (Editor). Vol. 1, Research Studies Pr.
Skipworth, P.J., Saul, A. and Machell, J. (1999). Practical applications of real time hydraulic modelling of
water distribution networks. Comadem, 1999.
Takens, F. (1981). Detecting strange attractors in fluid turbulence. Dynamical Systems and Turbulence,
Berlin, D. Rand and L. Young Eds. Springer.
Torsun, I.S., Day, A.J., Wood, A.S., Khan, A. and Mounce, S.R. (1999). Neural network for leak detection
in pipeline distribution networks. Presented at the 1st Annual WITE Workshop, part of International
Conference on Computing and Control for the water industry CCWI’99 held at University of Exeter, UK.
246