Instant Download (Ebook) Advanced Soft Computing Techniques in Data Science, IoT and Cloud Computing by Sujata Dash (editor), Subhendu Kumar Pani (editor), Ajith Abraham (editor), Yulan Liang (editor) ISBN 9783030756567, 3030756564 PDF All Chapters
Instant Download (Ebook) Advanced Soft Computing Techniques in Data Science, IoT and Cloud Computing by Sujata Dash (editor), Subhendu Kumar Pani (editor), Ajith Abraham (editor), Yulan Liang (editor) ISBN 9783030756567, 3030756564 PDF All Chapters
com
OR CLICK HERE
DOWLOAD EBOOK
ebooknice.com
ebooknice.com
ebooknice.com
https://ebooknice.com/product/sat-ii-success-
math-1c-and-2c-2002-peterson-s-sat-ii-success-1722018
ebooknice.com
(Ebook) Vagabond, Vol. 29 (29) by Inoue, Takehiko ISBN
9781421531489, 1421531488
https://ebooknice.com/product/vagabond-vol-29-29-37511002
ebooknice.com
ebooknice.com
ebooknice.com
ebooknice.com
Studies in Big Data 89
Sujata Dash
Subhendu Kumar Pani
Ajith Abraham
Yulan Liang Editors
Advanced Soft
Computing
Techniques in Data
Science, IoT and
Cloud Computing
Studies in Big Data
Volume 89
Series Editor
Janusz Kacprzyk, Polish Academy of Sciences, Warsaw, Poland
The series “Studies in Big Data” (SBD) publishes new developments and advances
in the various areas of Big Data- quickly and with a high quality. The intent is to
cover the theory, research, development, and applications of Big Data, as embedded
in the fields of engineering, computer science, physics, economics and life sciences.
The books of the series refer to the analysis and understanding of large, complex,
and/or distributed data sets generated from recent digital sources coming from
sensors or other physical instruments as well as simulations, crowd sourcing, social
networks or other internet transactions, such as emails or video click streams and
other. The series contains monographs, lecture notes and edited volumes in Big
Data spanning the areas of computational intelligence including neural networks,
evolutionary computation, soft computing, fuzzy systems, as well as artificial
intelligence, data mining, modern statistics and Operations research, as well as
self-organizing systems. Of particular value to both the contributors and the
readership are the short publication timeframe and the world-wide distribution,
which enable both wide and rapid dissemination of research output.
The books of this series are reviewed in a single blind peer review process.
Indexed by SCOPUS, EI Compendex, SCIMAGO and zbMATH.
All books published in the series are submitted for consideration in Web of Science.
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature
Switzerland AG 2021
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether
the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse
of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and
transmission or information storage and retrieval, electronic adaptation, computer software, or by similar
or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this book
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or
the editors give a warranty, expressed or implied, with respect to the material contained herein or for any
errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional
claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface
Overview
v
vi Preface
have emerged as powerful computational models. These have also shown significant
success in dealing with massive data analysis for a large number of applications.
This book, Advanced Soft Computing Techniques in Data Sciences, IoT and
Cloud Computing, aims to play a significant role in improvising human life to a
great extent. All the researchers and practitioners who are working in the field
of computer engineering, medicine, biology application, signal processing, and
mechanical engineering will be highly benefited.
This book would be a good collection of state-of-the-art approaches for soft
computing-based applications to various engineering fields. It will be very bene-
ficial for the new researchers and practitioners working in the field to quickly know
the best-performing methods. They would be able to compare different approaches
and can carry forward their research in the most important area of research which has
a direct impact on the betterment of the human life and health. This book would be
very useful because there is no book in the market that provides a good collection of
state-of-the-art methods of soft computing-based models for Multimedia retrieval,
Fault tolerance, System modeling, Network architecture, Web semantics, big data
analytics, time series, and biomedical and health informatics.
Objective
The purpose of this book is to report the latest advances and developments in the field
of Multimedia retrieval, Fault tolerance, System modeling, Network architecture,
Web semantics, big data analytics, time series, and biomedical and health informatics.
The book comprises the following three parts:
• Soft Computing Techniques for Internet of Things (IoT) devices
• Soft Computing Techniques in Cloud Computing and Computer Networking
• Soft Computing Techniques in Data Science
Organization
The book, “Advanced Soft Computing Techniques in Data Sciences, IoT and Cloud
Computing”, consists of 19 edited chapters and the whole contents of the book are
organized into the following three sections:
• Part 1: Soft Computing Techniques for IoT Devices. This section has focused on
Soft Computing Techniques for Internet of Things paradigms and their applica-
tion in wearable assistive devices for visually impaired people, security of IoT
devices, medical data security in healthcare systems, content-based video retrieval
systems, and also applications in intelligent farming. There are six chapters in this
Preface vii
section. The first chapter looks into the application of soft computing techniques
for designing IoT-based wearable assistive devices for visually impaired people.
The second, third, fourth, and fifth contributions focus on the security of IoT
devices and their application in medical data in healthcare systems and content-
based video retrieval systems. The sixth chapter discusses various existing soft
computing techniques and their applications for decision support in intelligent
farming.
• Part II: Soft Computing Techniques in Cloud Computing and Computer
Networking. The second part comprises five chapters. The first contribution
discusses about the hybrid cloud data protection using the machine learning
approach. The second chapter audits the analysis of LSTM networks for COVID-
19 impact prediction. The third chapter focuses on the progress and systematic
review of soft computing techniques for energy consumption and resource allo-
cation on the cloud. The fourth chapter discusses automatic segmentation and
classification of Brain Tumor from MR Images using DWT-RBFNN.
• The fifth chapter discusses on the application of unsupervised learning on
automatic localization of optic disc in Retinal Fundus Image.
• Part III: Soft Computing Techniques in Data Science. There are eight chapters in
this section. The first chapter discusses the performance evaluation of Hybrid
Machine Learning Algorithms for Medical Image Classification. The second
chapter discusses the issues and challenges of soft computing techniques for
Healthcare Decision Support Systems and the third chapter focuses on the truth
values of Modus Ponens and Modus Tollens Rule for Linguistic Truth-Valued
Propositions and Its Application in Taking Decisions in Health Care. The fourth
chapter focuses on a case study of Amazon based on the analysis of customers’
reviews using Soft Computing Classification Algorithms. The fifth chapter throws
light on Pattern Mining—FTISPAM applying Hybrid Genetic Algorithm and
the sixth chapter highlights soft computing techniques for Medical Diagnosis,
Prognosis, and Treatment. The concluding chapter explains Artificial Intelligence
Applications for COVID-19 Pandemic and the eighth chapter deliberates on the
prediction of transmittable diseases in a location using ARIMA.
Target Audiences
• Industry and people from Technical Institutes, R & D Organizations and working
in the field of machine learning, IoT, Cloud Computing, biomedical engineering,
health informatics, and related fields.
The editors would like to acknowledge the help of all the people involved in this
project and, more specifically, the reviewers who took part in the review process.
Without their support, this book would not have become a reality.
First, the editors would like to thank each one of the authors for their time,
contribution, and understanding during the preparation of the book.
Second, the editors wish to acknowledge the valuable contributions of the
reviewers regarding the improvement of quality, coherence, and content presentation
of chapters.
Last but not the least, the editors wish to acknowledge the love, understanding,
and support of their family members during the preparation of the book.
ix
Contents
xi
xii Contents
Subhendu Kumar Pani received his Ph.D. from Utkal University Odisha, India in
the year 2013. He is working as a Professor in the Department of Computer Science
and Engineering and also a Research coordinator at Orissa Engineering College
(OEC) Bhubaneswar. He has more than 16 years of teaching and research experience.
His research interests include Data mining, Big Data Analysis, web data analytics,
Fuzzy Decision Making, and Computational Intelligence. He is the recipient of 5
researcher awards. In addition to research, he has guided two Ph.D. students and 31
M.Tech students. He has published 7 patents and 51 International Journal papers (25
xiii
xiv Editors and Contributors
Scopus index). His professional activities include roles as Associate Editor, Edito-
rial board member, and/or reviewer of various International Journals. He serves as
Associate Editor/Editorial Board Member for more than 50+ International Jour-
nals/Conferences of high repute and impact. He is an Associate with no. of confer-
ence societies. He has more than 100 international publications, 5-authored books, 4
edited books, and 10 book chapters into his account. He is a fellow in SSARSC and
the life member in IE, ISTE, ISCA, OBA, OMS, SMIACSIT, SMUACEE, and CSI.
Yulan Liang holds a Ph.D. in Statistics and has been a tenured faculty at the Univer-
sity of Maryland, Baltimore (UMB) since 2008. Before that, she worked as a tenure-
track assistant professor at the School of Public Health and School of Medicine at
the University at Buffalo. Dr. Liang’s expertise is in the cutting-edge multidisci-
plinary fields including big data science, statistics, data mining, neural networks,
machine learning, Bayesian inferences, artificial intelligence, and bioinformatics
with applications to large biomedical, health science, omics, and healthcare data
for risk assessments, predictions, and medical decision-making. She has published
more than a hundred peer-reviewed journal and conference proceeding articles in
the data science fields. She was funded as PI by NSF and served as a statistics
core director of NIH P30, U01, and P20 grants and co-Investigator on multiple
NIH-funded grants. She also served as co-Investigator and biostatistician for private
organizations in health associations and other funding agencies, including the Robert
Wood Johnson Foundation, Alzheimer’s Association, and National Council of State
Boards of Nursing.
Contributors
Koushal Kumar Sikh National College, Qadian, Guru Nanak Dev University,
Amritsar, Punjab, India
Sushil Kumar Chandigarh Engineering College, Landran, India
Yogesh Kumar Department of Computer Science & Engineering, Chandigarh
Group of Colleges, Mohali, India
E. Madhusudhana Reddy Department of Computer Science and Engineering,
Guru Nanak Institutions Technical Campus, Hyderabad, India
L. Mary Gladence Sathyabama Institute of Science and Technology, Chennai,
India
A. Mathumathi Easwari Engineering College, Chennai, India
Abhishek Mehta Faculty of IT and Computer Science, Parul University, Vadodara,
Gujarat, India
Anand Nayyar Graduate School, Duy Tan University, Da Nang, Vietnam;
Faculty of Information Technology, Duy Tan University, Da Nang, Vietnam
P. Palanisamy Department of Electronics and Communication Engineering,
National Institute of Technology Tiruchirappalli, Tiruchirappalli, India
Bhagwati Prasad Pande Department of Computer Applications, LSM Govern-
ment PG College, Pithoragarh, Uttarakhand, India
S. Poopandian Infosys Pvt., Ltd., Chennai, India
J. Prakash Department of CSE, PSG College of Technology, Coimbatore, India
D. Praveena Department of Information Technology, R.M.D Engineering College,
Kavaraipettai, India
R. Priyatharshini Easwari Engineering College, Chennai, India
Gladis Pushparathi Velammal Institute of Technology, Chennai, India
Hari Mohan Rai Department of Electronics and Communication Engineering,
Krishna Engineering College, Ghaziabad, UP, India
Trupti Rathod Vidyabharti Trust College of MCA, Gujarat Technological Univer-
sity, Bardoli, Gujarat, India
N. T. Renukadevi Department of CT-UG, Kongu Engineering College, Erode,
India
Seema Sahai Amity International Business School, Amity University Uttar
Pradesh, Noida, India
M. Sanjay Sivakumar Easwari Engineering College, Chennai, India
Arindam Sarkar Department of Computer Science and Electronics, R.K.M.
Vidyamandira, Belur Math, WB, India
Editors and Contributors xvii
xix
xx Abbreviations
DT Decision Tree
DT Decision Trees
DW-LSTM Dynamic Watermarking LSTM
DWT-RBFNN Discrete Wavelet Transform–Radial Basis Function Neural
Network
ECDH Elliptic Curve Diffie–Hellman
ECG Electrocardiogram
ECG Electrocardiography
EHR Electronic Health Record
EL Ensemble Learning
ELM-IPSO Extreme learning machine-improved particle swarm optimization
EMG Electromyography
ERDIP Electronic Record Development and Implementation Program
FA False Alarm
FAST Features from Accelerated Segment Test
FCFS First Come First Serve
FCM Fuzzy c-means
FCM Fuzzy C Means
FCM Fuzzy C-Mean
FCMC Fuzzy C-Means Clustering, MLS
FDR False Discovery Rate
FFNN Feed Forward Neural Networks
FIVR Fine-grained Incident Video Retrieval
FKM Fuzzy K Means
FN False Negative
FP False Positive
FS Feature Selection
FS Fuzzy Systems
FT Fourier Transform
FTISPAM Fuzzy Time Interval Sequential Pattern Mining
FWT Fast Wavelet Transform
GA Genetic Algorithm
GA-ANN Genetic Algorithm–Artificial Neural Network
GBHSP Gene-Based Health Screening Personalization
GLCM Gray Level Co-occurrence Matrix
GLOGTH Global and Local Oriented Gabor Texture Histogram
GOA Grasshopper Optimization
GPU Graphical Processing Units
HBR Harvard Brain Repository
HCAPN Hierarchical Context-Aware Aspect-Oriented Petri Net
HGA Hybrid Genetic Algorithm
HIPAA Health Insurance Portability and Accountability Act
HOG Histogram of Oriented Gradients
HOG Histogram of the oriented gradient
HRF High-Resolution Fundus
xxii Abbreviations
TN True Negative
TP True Positive
TPU Tensor Processing Units
U.S United States
UI User Interface
UK United Kingdom
ULBP Uniform Local Binary Pattern
URL Uniform Resource Locator
US Ultrasound
USRP Universal Software Radio Peripheral
VOC Pascal Visual Object Classes
WHO World Health Organization
WLBP Weber Local Binary Pattern
YOLO You Only Look Once
Soft Computing Techniques for IoT
Devices
A Wearable Assistive Device for Safe
Travel Using Transfer Learning and IoT
for Visually Impaired People
1 Introduction
Globally the number of humans of all ages who are visually impaired is estimated to
be around 285 million. Partial or complete loss of vision is common among most of
the old age people and it is difficult for a person to deal with such kind of impairment
after experiencing niche lifestyle with perfect vision. Visually impaired people has
less ability to perform everyday task like other people so their quality of life gets
affected and lose the ability to experience the necessary things in the world. Visually
impaired people are comfortable only with their familiar places because they know
the layout of the place and they would have memorized it, but when it comes to new
environment they may find several difficulties. There are a number of technological
advancements and everyday developments which are focused on providing a better
lifestyle for handicapped and old age peoples. Although there are bright ideas and
thoughts, there are practical difficulties in implementing them as a product that can
be used every day by this section of the population.
One of the leading difficulties for visually impaired people in day to day scenarios
is to travel alone without any dependencies from others. Some of them can have the
blurry scene of a bus arriving at the bus terminal, still recognizing the bus route
remains a challenging and an impossible task for them. People also feel embarrassed
to request others to help them to identify bus numbers. Another major problem
they face is in the railway stations. It is difficult for visually impaired person to
choose which exit side of the train to use to get out from the train. Sometimes they
accidentally trip over the wrong exit side and fall down to the track.
These types of incidents has occurred several times. These all leads to rise in
demand to provide a simple and powerful solution to address these problems. Most
of the time they are present alone in the station and they find it difficult to receive
any assistance from other people. Numbers of bus recognition system ideas are
introduced, like identifying the bus route with help of smartphone using Maximum
Stable Extremal Regions (MSER) applications, or in another system histogram of
the oriented gradient (HOG) identifies the codes of texts from detected text regions
and produces output as audio notification. There are also systems introduced for
bus detection using wireless networks and satellite signals. It requires installation of
sensors and other modules in bus and bus stations along with periodic maintenance.
Most of these current systems lack performance required for real time detection
and usage along with few other technical snags. There are a number of breakthrough
researches and technologies coming out throughout the year. They highly involve
attributes like quick performance, efficient methodologies, less implementation cost
and many others. Therefore, coining enhanced solutions to all kinds of problems is
not difficult and this includes for the above stated problems of visually impaired too.
2 Review of Literature
In the research field of computer vision and in other areas a number of bus recognition
systems are proposed; however, most of them use active devices and sensors such as
Global Positioning System tracking system, RFID (Radio Frequency Identification),
Beacons etc. For vision based approaches, Wongta [1] introduced a system that
uses MSER (Maximally Stable Extremal Regions) to recognize bus numbers. Their
A Wearable Assistive Device for Safe Travel … 5
solution finds all the texts in an image rather than the required ones i.e. the bus
number. Guida et al. [2] proposed a system for bus route number which uses a number
of interlinked classifiers along with adaboost in order to identify the route number
or other elements present in the front of the bus and some corrections are done to the
extracted features to obtain rectified numeric values. The object is then converted
to Hue, Saturation, Value (HSV) colorspace and then partitioning of each numeric
or digit value. Finally Optical character Recognition (OCR) is used to identify the
digits and output is produced as voice to the user. Viola and Jones [3] proposed a
face detection framework that involved three key contributions which were integral
images, ada boost and cascading classifiers. It was a simple and powerful technique
to identify faces from binary images. Pan et al. [4] proposed a bus detection system
to help the visually impaired where Histogram of the oriented gradients is used to
extract features from bus images and a cascading Support Vector Machine model
is implemented for a bus classifier to detect the bus facade in frames of windows.
Tsai and Yeh [5] also introduced a system for bus detection to support the blind.
The system included the functionalities of detection of the moving bus, bus panel
detection and text detection from the text region of the panel. The system made use of
MAFD(Modify Adaptive Frame Differencing) and was found to have great accuracy
when features were extracted from different frames of the video. In [6] Bouhmed used
an ultrasonic sensor and camera in a walking cane to identify obstacles in the path
and inform it to the blind. The system produces output through voice. Zahir et al. [7]
developed a prototype of wearable head mountable device by customizing the Virtual
Reality glass with ultrasonic sensors and HC-SR04 because it take minimum amount
of time for detection and can also find obstacle in the longer range. The prototype
is developed using Arduino. Ani et al. [8] introduced a voice assistive text reading
system that is integrated with eyeglasses. A Camera is inbuilt with the eyeglass to
capture a image and with Tesseract-OCR text is extracted from the captured image.
Open Software E-speak is used for TTS. A method for end-to-end real-time scene
text localization and recognition is presented in [9] and the robustness of the proposed
method against noise and low contrast of characters is well demonstrated using “false
positives”. In [12] a Convolutional Neural Networks (CNN) model is developed to
detect English and Thai text from natural scene images with improved accuracy
compared to the other existing methods A method for Multi-script Text Extraction
from Natural Scenes is presented in [10] which exploits collaboration of proximity
and similarity laws to create text-group hypotheses. The problem of establishing
correspondences between a pair of images taken from different viewpoints is analysed
in [11]. There are a number of techniques and various innovative ideas proposed to
help the blind. Most of these systems are dynamic and they have less difficulty when
it comes to real time usage. Our proposed system is for real time usage and uses video
recognition rather than capturing only images. The bus arrival and waiting time are
always varying therefore every frame must be checked instead of single image snaps
which would be less efficient for the detection of bus and bus board. The system
proposed can be integrated with any of the existing ones, where the components
are almost similar or it can be developed as a separate one in which any other new
features can be extended.
6 R. Priyatharshini et al.
A visually impaired person after loss of vision limits them from indulging in any
activities that involves travelling in public transport facilities. They become hesitant
to ask help from other people. For them to able to live their life independently we
have developed Assistive Device for visually impaired to people that helps them to
travel.
A single integrated system is developed to assist the visually impaired. User can
choose the required operation by commanding through voice. Raspberry Pi serves
the processing unit for all the operations and it is connected to an external portable
power supply. An approach to accurately detect the real time bus object and to
segment the ROI for extracting and recognizing the route details has been proposed
to assist visually impaired people. An enhanced version of the You Only Look Once
algorithm is proposed in this paper which detects the objects in an image at a single
instance. It is an algorithm based on regression and it was pre trained on a number
of classes.
The algorithm initially splits the image into cells of 19 × 19 grid and each cell
is responsible for predicting 5 bounding boxes. Each of the bounding boxes can be
described using 5 descriptors as shown in (Fig. 1). They are,
1. Width (bw ): Width of the bounding box.
2. Height (bh ): Height of the bounding box.
3. Center of a bounding box (bx by ): Center coordinates of the bounding box.
4. Value c: Corresponding to a class of an object (i.e. car, traffic lights)
5. Value pc : Probability that there is an object in the bounding box.
For areas under each of the bounding box, the Convolution Neural Network (CNN)
processes to identify to which class the object under the area belongs. The boxes
with low object probability (pc ) and unknown classes (c) are removed. Areas under
high overlapping boundary regions across cells along with greater pc are the detected
objects in the image. By transfer learning the YOLO v3 model with a custom dataset,
the same above procedure is involved to detect the bus board in our system. The
detected bus board image is then used to obtain the text with help of tesseract optical
character recognition tool. It gathers the outlines into blobs by nesting. Blobs are
further ordered into lines of text and each of these text lines are drilled down into
probable terms and further into characters. Multiple passes are involved before finally
predicting the character, words and the sentences. Latest versions of tesseract use
deep learning frameworks which are accurate and fast. Output from the OCR is
filtered with regular expressions which are constrained to find at maximum three
characters length data that are either numeric or alphanumeric. The obtained data is
the bus number as bus route number usually falls under these constraints in modern
bus route naming systems. The identified bus number is informed to blind through
text to speech method (Fig. 2).
Speaker/headphones are present to give output notifications. Microphone is used
to obtain the voice input from the user. If the voice input is found to be difficult for
A Wearable Assistive Device for Safe Travel … 7
the user or if there is a lot of noise, button inputs can be added to the system. GPS can
be added as an extended feature which can inform the user where they are currently
located and to know the remaining time it would take to reach the destination.
For railway platform detection, an Ultrasonic Sensor is present at a certain angle
facing ground every time. It is also connected to the raspberry pi and it gives the
frequent depth measures from ground to the pi in which the depth identification
algorithm is executed. Speaker/headphones are present to give output notifications.
Microphone is used to obtain the voice input from the user. If the voice input is found
to be difficult for the user or if there is lot of noise, button inputs can be added to the
8 R. Priyatharshini et al.
system. GPS can be added as an extended feature which can inform the user where
they are currently located and to know the remaining time it would take to reach the
destination.
4 Proposed Methodology
The Proposed system mainly consists of two sub modules that are chosen for different
scenarios by the user. A methodology to detect the real time object (bus) and segment
the number plate (region of interest) has been proposed for recognizing the route
number in a bus. In this approach, there are three sub modules involved. First the bus
object is recognized and the bus board is segmented for which the core functionality
is based on Convolution Neural Network and transfer learning. For the next module,
tesseract optical character recognition engine is used to recognize the text and finally
with regular expressions, the bus route number is recognized (Fig. 3).
This subsystem deals with identifying the bus board in real time. A high definition
wide angle camera is mounted to the head wear and is connected to the pi. The camera
is used to capture the bus arrival scene and from the video frames, ROI detection and
extraction takes place.
More than 400 images were used to create a customized model. Sample images
from training dataset is shown in Fig. 4. The images obtained were in a daylight envi-
ronment. Preprocessing is done to remove the blurry and dark images before initiating
the training process. The region of interests were marked and their labeling was given.
Pascal visual object classes (VOC) is a format for providing object detection data, i.e.
A Wearable Assistive Device for Safe Travel … 9
images with bounding boxes. Using the annotation tool, the bus boards are marked
with bounding boxes in each image manually and these values are saved in a xml file.
The bus boards are the Region of Interests for our model. The process involved in
building the model for bus board detection module using transfer learning is shown
in Fig. 5.
In order to ensure that the trained custom models have better detection accuracy,
transfer learning from a pretrained YOLO v3 model was involved in the training.
Transfer learning helps us to construct precise models in a timesaving way. Using
transfer learning instead of starting afresh, we start from patterns that have been
learned while addressing a different problem. Pre-trained model are used to initiate
the transfer learning process. A pre-trained model is the one that is used to address a
problem which is similar to a problem that we want to work out. A lot of computa-
tional costs are involved in training such models therefore models from well estab-
lished literatures are used (e.g. MobileNet, VGG, Inception, YOLO) as shown in
Fig. 6.
10 R. Priyatharshini et al.
You Only Look Once (YOLO) is a fully convolutional network model and its
outputs are generated by applying a 1 × 1 kernel on a feature map. In YOLO v3, the
detection is done by applying 1 × 1 detection kernels on feature maps of three
different sizes at three different places in the model network. These places are known
as strides. Each of these strides are used for processing large, medium and small
images respectively which is shown in Fig. 7.
Transfer learning with annotated images was completed along with a pretrained
YOLOv3 model. The obtained customized model is saved for later usage in bus
recognition and bus board identification. After training is done, the model is evaluated
for accuracy. They are identified based on the decrease in the validation loss. In most
cases, the lower or less is the loss, the more accurate the model will be in detecting
required objects. However, some models may experience overfitting and have lower
losses. Therefore, to ensure that the best model is picked for our custom detection, we
evaluate with mAP (mean Average Precision) of all the trained models saved in the
detection folder. The detection accuracy of the model is improved with better mAP.
The detection mechanism in a deep learning module mainly has two phases. In the
first phase, an image is taken as an input and a number of blocks or boxes are formed
around the possible entities with statistical features to label objects. Then this output
is predicted to detect the object with its class name.
The feature to identify the bus object comes from the pretrained YOLO v3 model
which has a Convolutional Neural Network as underlying layer and there is no
requirement to train the bus images separately for its identification process. The
initial phase consists of a number of convolution layers and pooling layers through
which the image pixel values move in an array format. Activation functions are
present in the middle and through all of these, feature learning takes place. The
next phase has flattened and fully connected layers with weights and before the final
output classification is given soft maxing is done as shown in Fig. 8. The system
moves to next processing only when the bus object is present in the video frame.
Results of bus detection module is shown in Fig. 9.
Our customized model is used for detection of bus boards from the video frames. For
every frame, the detection algorithm is performed. The algorithm initially recognizes
14 R. Priyatharshini et al.
the bus. Then in the image, segmentation as 13 × 13 grids is done and each cell has
a maximum of five bounding boxes. Prediction for each of the bounding box areas is
made and scores are provided. If it is identified to be below the threshold value from
the trained model, they are neglected. For the areas with overlapping bounding box
boundaries, features are collected together and prediction scores are given. Along
with them, confidence score is also present which gives distinction between that
class (ROI) and rest of the background image. The bus board (Region Of Interest)
is finally detected if it is present in the image. If there are multiple bus boards in
the frame, the same technique is applied and all of them are extracted and they are
saved as separate grayscale images which are used for the next operation i.e., Optical
Character Recognition (OCR). It is necessary that the detected bus board image
must not be blurry or dark as it makes it difficult for OCR operation. The results of
segmented bus board is shown in Fig. 10.
For obtaining text from an image. Tesseract OCR is used in our system. The latest
recognition engine of tesseract is based upon Long Short Term Memory (LSTM). For
the system pytesseract module is chosen as it acts as a replacement for the command
line tesseract with specified configuration arguments. As all of the other modules
are python based, we also make use of Tesseract OCR in python as pytesseract.
A Wearable Assistive Device for Safe Travel … 15
Tesseract makes a two step approach for text detection. It makes use of adaptive
recognition where in the initial step it identifies the pattern in letters, pixels, words
and sentences, hence recognizing the characters. If it is not sure or successful in
finding some characters, in the next step it tries to fill in with the character or word
that matches the word or sentence context. In character recognition step the outlines
are arranged into blobs by the nesting process. These blobs are then structured into
lines of text and each of these text lines are broken into possible words and further into
characters as shown in Fig. 11. Multiple passes are involved before finally predicting
the character, words and the sentences. The process involves a number of iterations
before final text is produced. In each iteration, the accuracy of recognition improves
and finally finely converted text data is obtained.
In our system all the information present in the bus board image is obtained by
operating the tesseract tool on the image. This obtained text data is stored as string
type data for further processing. For obtaining text with good accuracy, quality images
are required from the previous module.
Regular expressions are defined to obtain the numeric or alphanumeric bus number
from the text data stored in a text data format. A regular expression is a special text
string for describing a search pattern. Strings of text are compared to the pattern
in order to identify the string that matches the logical pattern defined by regular
expression. On the basis of these comparisons the regular expressions can be used to
identify strings of text that meet specific requirements or to validate that strings meet
a required pattern. It is very important to define regular expressions properly because
even if a single letter or number is missing, the route number becomes ambiguous.
The process involved in extracting the bus route and converting it in to speech output
is shown in Fig. 12.
The constraint laid for our regular expression is to work in a way such that it
only extracts maximum 3 characters that are either numeric or alphanumeric and are
continuous characters present one after another. The constraint is based in such a
way because the bus numbers in the modern transport system are in numeric or in
alphanumeric format and their characters length are usually two or three characters
long. This constraint can be adjusted based on the locality requirements.
A Wearable Assistive Device for Safe Travel … 17
An additional module, a database can be added to the system so that with the
extracted bus number, all other information regarding the bus can be provided to the
user. The bus number serves as the primary key with which the database is queried
to provide details like destination location, stops present in the middle, ticket cost
and other necessary information.
This subsystem involves detection of railway platform when train stops at the stations.
An ultrasonic sensor is mounted to the headwear like cap/hat at a certain angle facing
the ground. Initially the user’s height data is loaded in pi memory. This height along
with a small variable length is added together and the value is stored as a prefixed
threshold value. The variable length is the difference in height from platform to the
train door. An ultrasonic sensor measures the distance based on the working given
in Fig. 13.
The transmitter sends out waves which travels at the speed of sound waves
(0.034 cm/µs) and the receiver obtains them when they are reflected back by an
object or surface that is present in their path. The time difference (t) is noted and
along with the known speed of the sound, distance value can be calculated (v*t/2)
in the pi. The algorithm for the platform detection mainly identifies whether this
measured distance exceeds the prefixed threshold value. If so, the user is notified
quickly through speakers to get down on the other side of the train. Ultrasonic sensor
is very much responsive therefore this operation takes place quickly. It is a light
weight and simple process to assist the blind for safe exit from trains. The same
methodology can be extended in future also for other transports. Some ambiguity
may arise if there are obstacles present between the sensor and the ground/surface.
This can be overcome by adding multiple ultrasonic sensors to the system as shown
in Fig. 14.
A single integrated system is developed and the whole system is attached to a cap to
assist the visually impaired. User can choose the required operation by commanding
through voice or using a button. Raspberry Pi serves the processing unit for all the
operations and it is connected to an external portable power supply.
For bus identification, a High Definition wide angle camera is used and the real
time feed starts once bus number identification operation is chosen to obtain the bus
number and all other details.
For railway platform detection, an Ultrasonic Sensor is present at a certain angle
facing ground every time. It is also connected to the raspberry pi and it gives the
frequent depth measures from ground to the pi in which the depth identification
algorithm is executed. Speaker/headphones are present to give output notifications.
Microphone is used to obtain the voice input from the user. If the voice input is found
to be difficult for the user or if there is lot of noise, button inputs can be added to the
system. GPS can be added as an extended feature which can inform the user where
they are currently located and to know the remaining time it would take to reach the
destination as shown in Fig. 15.
The period which we have now passed over appears to have been
that in which the democratical cast of Athenian public life was first
brought into its fullest play and development, as to judicature,
legislation, and administration.
The great judicial change was made by the methodical
distribution of a large proportion of the citizens into distinct judicial
divisions, by the great extension of their direct agency in that
department, and by the assignment of a constant pay to every
citizen so engaged. It has been already mentioned that even under
the democracy of Kleisthenês, and until the time succeeding the
battle of Platæa, large powers still remained vested both in the
individual archons and in the senate of Areopagus: which latter was
composed exclusively of the past archons after their year of office,
sitting in it for life,—though the check exercised by the general body
of citizens, assembled for law-making in the ekklesia, and for judging
in the heliæa, was at the same time materially increased. We must
farther recollect, that the distinction between powers administrative
and judicial, so highly valued among the more elaborate
governments of modern Europe, since the political speculations of
the last century, was in the early history of Athens almost unknown.
Like the Roman kings,[659] and the Roman consuls before the
appointment of the prætor, the Athenian archons not only
administered, but also exercised jurisdiction, voluntary as well as
contentious,—decided disputes, inquired into crimes, and inflicted
punishment. Of the same mixed nature were the functions of the
senate of Areopagus, and even of the annual senate of Five
Hundred, the creation of Kleisthenês. The stratêgi, too, as well as
the archons, had doubtless the double competence—in reference to
military, naval, and foreign affairs—of issuing orders and of
punishing by their own authority, disobedient parties: the imperium
of the magistrates, generally, enabled them to enforce their own
mandates as well as to decide in cases of doubt whether any private
citizen had or had not been guilty of infringement. Nor was there
any appeal from these magisterial judgments; though the
magistrates were subject, under the Kleisthenean constitution, to
personal responsibility for their general behavior, before the people
judicially assembled, at the expiration of their year of office,—and to
the farther animadversion of the ekklesia, or public deliberative
assembly, meeting periodically during the course of that year: in
some of which ekklesiæ, the question might formally be raised for
deposing any magistrate, even before his year was expired.[660] Still,
in spite of such partial checks, the accumulation, in the same hand,
of powers to administer, judge, punish, and decide civil disputes,
without any other canon than the few laws then existing, and
without any appeal,—must have been painfully felt, and must have
often led to corrupt, arbitrary, and oppressive dealing: and if this be
true of individual magistrates, exposed to annual accountability, it is
not likely to have been less true of the senate of Areopagus, which,
acting collectively, could hardly be rendered accountable, and in
which the members sat for life.[661]
I have already mentioned that shortly after the return of the
expatriated Athenians from Salamis, Aristeidês had been impelled,
by the strong democratical sentiment which he found among his
countrymen, to propose the abolition of all pecuniary qualification for
magistracies, so as to render every citizen legally eligible. This
innovation, however, was chiefly valuable as a victory and as an
index of the predominant sentiment: notwithstanding the enlarged
promise of eligibility, little change probably took place in the fact,
and rich men were still most commonly chosen. Hence the
magistrates, possessing the large powers administrative and judicial
above described,—and still more the senate of Areopagus, which sat
for life,—still belonging almost entirely to the wealthier class,
remained animated more or less with the same oligarchical interest
and sympathies, which manifested themselves in the abuse of
authority. At the same time the democratical sentiment among the
mass of Athenians went on steadily increasing from the time of
Aristeidês to that of Periklês: Athens became more and more
maritime, the population of Peiræus augmented in number as well
as in importance, and the spirit even of the poorest citizen was
stimulated by that collective aggrandizement of his city to which he
himself individually contributed. Before twenty years had elapsed,
reckoning from the battle of Platæa, this new fervor of democratical
sentiment made itself felt in the political contests of Athens, and
found able champions in Periklês and Ephialtês, rivals of what may
be called the conservative party, headed by Kimon.
We have no positive information that it was Periklês who
introduced the lot, in place of election, for the choice of archons and
various other magistrates, but the change must have been
introduced nearly at this time, and with a view of equalizing the
chances of office to every candidate, poor as well as rich, who chose
to give in his name, and who fulfilled certain personal and family
conditions ascertained in the dokimasy, or preliminary examination.
But it was certainly to Periklês and Ephialtês that Athens owed the
elaborate constitution of her popular dikasteries, or jury courts
regularly paid, which exercised so important an influence upon the
character of the citizens. These two eminent men deprived both the
magistrates and the senate of Areopagus of all the judicial and penal
competence which they had hitherto possessed, save and except the
power of imposing a small fine. This judicial power, civil as well as
criminal, was transferred to numerous dikasts, or panels of jurors
selected from the citizens; six thousand of whom were annually
drawn by lot and sworn, and then distributed into ten panels of five
hundred each, the remainder forming a supplement in case of
vacancies. The magistrate, instead of deciding causes, or inflicting
punishment by his own authority, was now constrained to impanel a
Welcome to our website – the ideal destination for book lovers and
knowledge seekers. With a mission to inspire endlessly, we offer a
vast collection of books, ranging from classic literary works to
specialized publications, self-development books, and children's
literature. Each book is a new journey of discovery, expanding
knowledge and enriching the soul of the reade
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
ebooknice.com