Implementation of Computer Vision Technique For Crack Monitoring in Concrete Structure
Implementation of Computer Vision Technique For Crack Monitoring in Concrete Structure
a226
C oncrete is one of the widely used displacement field during the image correlation
construction material, made of various process on digital images. They used both
cementitious materials and aggregates. There numerical studies and a real experiment to
can be several causes for failure of concrete. verify the proposed formulation and showed
Cracking in concrete indicate deterioration in that the image correlation with the finite
element formulation is computationally
https://doi.org/10.36375/prepare_u.iei.a226
efficient, accurate, and robust. In 2006, ninth of the 607,380 bridges in the U.S. were
Besnard [12] et al. introduced the concept of structurally deficient and required a $20.5
multiscale approach on top of FEM based DIC billion annual investment for fixing the
method to generate meaningful solution for a problems by 2028. While in developing
fine texture and large initial displacement countries like China and India, more civil
measurement. In 2011, Nguyen et al. [13] infrastructure is being built. To efficiently
developed a new automated method of monitor and maintain such a large number of
fracture identification and quantification based existing civil infrastructure is critical yet
on standard DIC approach. An automatic crack challenging for both safety and economic
detection system was proposed by Zhang et al. reasons.
[14] by employing a coarse-to-fine This paper presents application of computer
methodology that also included concepts like vision technique for crack identification in
Region of Aggregation (ROA) and Region of concrete cubes subjected to compressive
Belief (ROB) for segmentation and localization loading. The developed system includes
of cracks. Chambon and Moliard [19] proposed camera for image capturing, CNN for crack
a new approach to image-based crack identification from captured images
detection, GaMM, based on a multi-scale implemented through LabView and TPU. The
extraction and Markovian segmentation, which presented system can be easily implemented
reduced the percentage of false positives in for real time monitoring of concrete structures.
comparison to morphological methods that
combine thresholding and refinement by CRACK DETECTION METHODOLOGY IN PROPOSED SYSTEM
morphological analysis. Xie et al. [15] Deep learning refers to the process by which
demonstrated the potential for Deep Learning computers can imitate human behaviour. They
based pavement crack detection by applying make use of neural networks, which are multi-
ConvNets on a dataset of 500 images of size layered systems inspired by the human brain's
3264 2448, collected using a low-cost structure. One of the most popular image
smartphone. Ying and Salari [16] proposed a recognition algorithms used in deep learning is
beamlet transform-based technique for the convolutional neural network. It takes an
pavement crack detection and classification, image as input and processes it so that
which was more robust in extracting linear different aspects of the image can be
features in the presence of noise. Oliveira and distinguished from one another. They use
Correia [17] proposed an integrated system, artificial neurons, which are mathematical
CrackIT, for automatic detection and functions that measure the weighted number
characterization of cracks in flexible pavement of multiple inputs and outputs an activation
surfaces using a combination of unsupervised value and are modelled after neurons in the
learning (clustering) followed by supervised human brain.
learning (classification), thus eliminating the The conventional image processing method
need for manually labelling the samples. It was does not work well in the detection of cracks in
noted that although CrackIT was able to detect concrete structures as the intensity range of
multiple cracks in the same image, it had cracks and non-cracks in the concrete block is
difficulty in dealing with cracks less than 2 mm almost similar. Further, as the load increases
width. Based on the fact that crack pixels in on concrete specimen, the area of the crack
pavement images had distinct grayscale increases and it merges to become a longer
intensities compared to their surrounding non- crack so, area-based filtering of the non-crack
crack pixels, Cheng et al. [18] proposed a block is quite challenging. CNN improves this
pavement crack detection algorithm based on task by combining the output of different
fuzzy logic. Feng et al.[21] proposed a deep trained convolutional networks in a fusion
active learning strategy for civil infrastructure multi-Layer perceptron and the features
defect detection and classification, where they present in the feature map extracted by the
used a deep residual network (ResNet) to train convolution layer are summarized by the
a small set of images with defect labels and pooling operation.
use this low-accuracy defect detector to filter A. Convolutional Neural Network
out many non-defect images. Convolutional Neural networks, also known as
The ageing civil infrastructure (e.g., tunnels ConvNet or simply CNN, are a deep learning
and bridges) is a common problem in many algorithm, which mainly finds its applications in
developed countries such as the United States visual imagery tasks. It can take an image as
and Japan. According to (ASCE 2013), one an input and can identify and differentiate
https://doi.org/10.36375/prepare_u.iei.a226
various features from it. The Convolutional detection in concrete. A dataset of 3600
Neural networks were inspired by the human images containing cracks and non-cracks with
brain and have a structure analogous to the a dimension of 256 256 pixels was prepared.
neuron connectivity pattern in human brains. The primary network used was a single-
They are composed of multiple layers of channel convolutional neural network (SCNN)
artificial neurons, which are mathematical in which feature extraction was done using
functions that calculate a weighted sum of intermediate max-pooling layers. The model
inputs and outputs a possibility score or was trained with a learning rate of 0.0005 and
activation value. achieved an accuracy of 90.5%. For making
1) Architecture of CNN: The convolutional the model robust, data augmentation was
neural networks have many layers and try to performed using random rotations, shifts and
extract different features from the input image. zooming. This led to a drop in accuracy to
The starting layers try to find basic features 82.25%. For optimizing this model, they
and the complexity increases as we go deeper introduced a second channel, which had
into the network. Each layer has three shallow network structure and skip
dimensions, height, width and depth. connections, thus making the model to be a
2) Convolutional Layers: The convolutional Dual-channel Convolutional Neural Network.
layers try to find the high-level features from The new model was trained with 6400 images
an input image such as edges. These are the at a learning rate of 0.0005 and achieved an
starting layers of a convolutional network and accuracy of 92.25%. The average time taken
there can be more than one convolutional by each epoch was 159.031 seconds.
layer in the network. They start by extracting
the low-level features such as edges, color, EDGE COMPUTATION
etc. and as we go deeper to other Edge computing is changing the way data is
convolutional layers, the network adapts to being
extract more complex features.
3) Pooling Layer: The pooling layer tries to
reduce the spatial size of the convolved
feature to reduce the computational power
required. It also extracts the more dominant
features, that are invariant to rotation and
position. Pooling is of two types, Max Pooling,
and Average Pooling. handled, processed and then delivered from
4) Max Pooling: It returns the maximum value many devices all over the world. The
from the portion of the image covered by the tremendous growth of internet-connected
filter. devices – the IoT – alongside new applications
5) Average Pooling: It returns the average that needs real-time computing power,
value. continues to drive edge-computing systems.
6) Classification Layer: The classification layer, Faster networking technologies, like 5G
also known as the Fully Connected Layer, are wireless, are allowing edge computing systems
the final layers of the convolutional network in to accelerate the creation or support of real-
which the output of the pooling layer acts as time applications, like video processing and
the input and uses a SoftMax function to analytics, self-driving cars, AI and robotics.
calculate the final possibility probabilities of Edge computing simply provides information
the input. Complete CNN architecture is shown processing close to the edge that is data
in Fig. 1. gathered through IoT devices are passed
through the edge computing devices to act like
a gateway where data are interpreted quickly
before being pass through the internet to be
sent to a server or cloud storage for further
processing if necessary or just for data-
keeping. To achieve high-speed data
transmission, google has launched hardware
Fig. 1 Architecture of Convolutional Neural Network called Edge TPU (Tensor Processing Unit) that
can be connected by just using a USB
Kumar and Ghosh [2], used a Ducal Channel connection. TPU is an AI accelerator
Convolutional Neural Network model for crack application-specific integrated circuit (ASIC)
https://doi.org/10.36375/prepare_u.iei.a226
developed specifically for neural network TPU option is selected, then the re-trained
machine learning in the TensorFlow ML library. inception v3 edge TPU compatible model is
used for crack detection purposes. The GUI
PROPOSED COMPUTER VISION SYSTEM FOR CONCRETE also has an option using the crack detection
CRACK MONITORING model in live stream mode in which the images
Fig. 2 shows a complete block diagram of of the concrete block are captured by the
proposed computer vision system camera in real-time and sent to the crack
implemented at Heavy Structures Laboratory detection model along with the real-time load
of Nirma University for compression testing of applied on the concrete block. If live stream
concrete cubes. In this system, for crack mode is not selected then images and load cell
detection model, the previously re-trained data are read from a saved folder on the
Inception v3 model is used along with a server device. LabVIEW analyses different
LabVIEW GUI especially developed for crack characteristics of cracks like the number of
cracks and area of cracks and plots a graph of
Load vs Number of Cracks, Load vs Crack
Length, Load vs Area of Cracks and Load vs
Time. Fig. 4 presents screenshot of proposed
system showing image information like no. of
cracks. Along with crack information, preview
of images is given so that the portion of the
cube to be focused can be changed.