
Node Fault Prediction and Data

Routing over IoT Networks

EC 491: UG PROJECT

MEMBERS
Dharmesh Mahajan - 21095036
Nikhil Agarwal - 21095079
Samar Singh Randhawa - 21095101
Chakshu Sharad - 21095143

Under the supervision of:

Dr. Om Jee Pandey

DEPARTMENT OF ELECTRONICS ENGINEERING


INDIAN INSTITUTE OF TECHNOLOGY (BHU) VARANASI
CERTIFICATE

This is to certify that the UG Project entitled “Node Fault Prediction and Data
Routing over IoT Networks”, submitted by Dharmesh Mahajan (21095036),
Nikhil Agarwal (21095079), Samar Singh Randhawa (21095101) and Chakshu
Sharad (21095143) to the Department of Electronics Engineering, Indian
Institute of Technology (Banaras Hindu University) Varanasi, in partial
fulfilment of the requirements for the award of the degree of “Bachelor of
Technology” in Electronics Engineering, is an authentic work carried out at the
Department of Electronics Engineering, Indian Institute of Technology (Banaras
Hindu University) Varanasi under my supervision and guidance, vide the
project grant as acknowledged.

Dr. Om Jee Pandey


Assistant Professor
Department of Electronics Engineering,
Indian Institute of Technology (BHU) Varanasi

DECLARATION

We hereby declare that the work presented in this project titled “Node Fault
Prediction and Data Routing over IoT Networks” is an authentic record of
our own work carried out at the Department of Electronics Engineering, Indian
Institute of Technology (Banaras Hindu University) Varanasi, as a requirement
for the award of the degree of Bachelor of Technology in Electronics
Engineering, submitted to the Indian Institute of Technology (Banaras Hindu
University) Varanasi under the supervision of Dr. Om Jee Pandey, Department
of Electronics Engineering, Indian Institute of Technology (Banaras Hindu
University) Varanasi. It does not contain any part of any work that has been
submitted for the award of any degree either at this Institute or at any other
University/Deemed University without proper citation.

Dharmesh Mahajan Nikhil Agarwal


(21095036) (21095079)

Samar Singh Randhawa Chakshu Sharad


(21095101) (21095143)

ABSTRACT

With the rapid expansion of IoT networks, ensuring reliable data transmission
and network resilience has become imperative. This project addresses these
challenges through a novel approach integrating node fault prediction and data
routing optimization using machine learning models. Leveraging advanced
algorithms, we develop predictive models to identify potential node failures in
real time, enhancing network reliability. Additionally, we optimize data routing
strategies to minimize latency and maximize efficiency, thus improving overall
network performance. Through the synergy of machine learning techniques and
IoT infrastructure, our project aims to enhance the robustness and efficiency of
IoT networks, paving the way for seamless operation across various applications
and domains.

Contents
Abstract

1. Introduction

2. Detailed Literature Survey of Existing Technologies
   2.1. Node Fault Prediction Techniques
        2.1.1. Statistical Methods
        2.1.2. Machine Learning Models
        2.1.3. Deep Learning Approaches
        2.1.4. Ensemble Methods
   2.2. Data Routing Optimization Strategies
        2.2.1. Static Routing Algorithms
        2.2.2. Dynamic Routing Protocols
        2.2.3. Machine Learning-Based Routing Optimization
        2.2.4. Multipath Routing
        2.2.5. Energy-Efficient Routing
   2.3. Integration of Node Fault Prediction with Data Routing
        2.3.1. Fault-Aware Routing Strategies
        2.3.2. Security-Enhanced Fault Management

3. Preliminary Results and Insights
   3.1. Node Fault Detection
        3.1.1. Random Forest + XGBoost + Neural Network
        3.1.2. Random Forest + XGBoost + AdaBoost
        3.1.3. LOF + XGBoost + Random Forest
   3.2. Data Routing
        3.2.1. Grey Wolf Optimization
        3.2.2. Q-Learning
        3.2.3. LEACH

4. Challenges, Conclusions & Future Scope
   4.1. Challenges
   4.2. Conclusions

5. Bibliography

Chapter 1
INTRODUCTION

The rapid growth of Internet of Things (IoT) devices has brought about a
new age of connectivity and data sharing. However, the rising complexity
of IoT networks poses significant challenges in maintaining reliable data
transmission and network resilience. To address these issues, our project
aims to develop innovative methods for predicting node failures and
optimizing data routing within IoT networks.

Our project has two main objectives: to use machine learning models for
real-time prediction of potential node failures and to improve data routing
strategies to reduce latency and increase efficiency. By combining these
key aspects, we strive to improve the reliability and performance of IoT
networks, ensuring smooth operations across various applications and
industries.

This report provides a detailed overview of our approach, covering the
methodology, experimental setup, results, and conclusions. We believe our
project marks a meaningful step in tackling the challenges of reliability and
efficiency in IoT networks, and we aim for our findings to contribute to the
progress of this rapidly growing field.

Chapter 2
Detailed Literature Survey of Existing Technologies

1. Node Fault Prediction Techniques:

   a. Statistical Methods:
      i. Traditional statistical approaches, such as regression analysis and
         time series forecasting, have been extensively utilized for node fault
         prediction in IoT networks.
      ii. These methods leverage historical data and statistical patterns to
          predict potential faults in network nodes with varying degrees of
          accuracy and reliability.

   b. Machine Learning Models:
      i. Recent advancements in machine learning have revolutionized the
         field of node fault prediction in IoT networks.
      ii. Techniques such as decision trees, random forests, support vector
          machines, and neural networks have shown promising results in
          identifying anomalous behavior and predicting node failures with
          high accuracy.

   c. Deep Learning Approaches:
      i. Deep learning techniques, such as convolutional neural networks
         (CNNs) and recurrent neural networks (RNNs), have gained traction
         in recent years for node fault prediction in IoT networks.
      ii. These models leverage the hierarchical structure of data to capture
          complex patterns and relationships, enabling more accurate and
          robust predictions.

   d. Ensemble Methods:
      i. Ensemble learning techniques, such as bagging, boosting, and
         stacking, combine multiple base models to improve prediction
         performance.
      ii. By aggregating the predictions of diverse models, ensemble methods
          can mitigate the weaknesses of individual models and achieve higher
          accuracy in node fault prediction tasks.


2. Data Routing Optimization Strategies:

   a. Static Routing Algorithms:
      i. Conventional routing algorithms, such as shortest path routing and
         flooding, provide basic routing functionalities but may not be optimal
         for dynamic IoT environments.
      ii. These algorithms typically do not adapt to changes in network
          conditions and may result in suboptimal data transmission paths.

   b. Dynamic Routing Protocols:
      i. Dynamic routing protocols, such as AODV (Ad-hoc On-Demand
         Distance Vector) and DSR (Dynamic Source Routing), dynamically
         adjust routing paths based on real-time network conditions.
      ii. These protocols offer greater flexibility and efficiency compared to
          static routing algorithms, enabling more responsive and adaptive
          data routing in IoT networks.

   c. Machine Learning-Based Routing Optimization:
      i. Emerging research explores the integration of machine learning
         techniques into routing optimization strategies.
      ii. Reinforcement learning and evolutionary algorithms are used to
          dynamically optimize routing decisions based on network
          performance metrics, such as latency, throughput, and energy
          consumption.

   d. Multipath Routing:
      i. Multipath routing techniques utilize multiple concurrent paths to
         transmit data packets, improving fault tolerance, load balancing, and
         throughput in IoT networks.
      ii. By distributing traffic across diverse paths, multipath routing
          strategies can mitigate the impact of network failures and
          congestion, enhancing overall reliability and performance.

   e. Energy-Efficient Routing:
      i. Energy-efficient routing algorithms minimize energy consumption in
         IoT devices by optimizing data transmission paths and reducing
         unnecessary communication overhead.
      ii. These algorithms aim to prolong the battery life of IoT devices while
          maintaining acceptable levels of network performance, making them
          particularly suitable for resource-constrained environments.

3. Integration of Node Fault Prediction with Data Routing:

   a. Fault-Aware Routing Strategies:
      i. Fault-aware routing strategies integrate node fault prediction
         techniques into the routing decision-making process to improve
         network reliability and resilience.
      ii. Predictive models for node fault detection are used to identify
          potential failures in real time, allowing routing protocols to
          dynamically adapt and avoid faulty nodes.

   b. Security-Enhanced Fault Management:
      i. Security-enhanced fault management techniques address the security
         implications of node faults, ensuring that fault recovery operations
         do not compromise network security.
      ii. These techniques incorporate encryption, authentication, and access
          control mechanisms to protect sensitive data and prevent
          unauthorized access during fault handling.

Chapter 3
Preliminary Results and Insights

Node Fault Detection

In our comprehensive investigation into node fault detection within IoT
networks, we adopted a Voting Classifier method, leveraging the combined
strengths of multiple machine learning algorithms: Random Forest, XGBoost,
Neural Network, AdaBoost, and Local Outlier Factor (LOF). This
ensemble-based approach was chosen to enhance the robustness and accuracy of
fault detection by integrating the capabilities of diverse methodologies.

The Voting Classifier aggregates predictions from its constituent models,
offering a more balanced and comprehensive analysis of network faults. Each
model contributes its unique strengths to the ensemble, with Random Forest
providing robust performance through its decision tree ensembles, XGBoost
offering precision through gradient boosting, Neural Networks capturing
complex non-linear relationships, AdaBoost focusing on iterative improvement,
and LOF identifying outliers in localized contexts. By combining these diverse
perspectives, the Voting Classifier enhances overall prediction reliability.

Random Forest, known for its capability to handle large datasets and reduce
overfitting, contributes strong baseline predictions to the ensemble. XGBoost, a
sophisticated gradient boosting technique, excels in optimizing predictions by
addressing misclassified instances. Neural Networks bring their ability to model
intricate, non-linear relationships, enabling the detection of subtle patterns
within the data. Meanwhile, AdaBoost, an adaptive boosting method, iteratively
refines weak learners, further improving the ensemble's accuracy. LOF,
operating on a density-based principle, identifies local anomalies and
deviations, providing critical insights into network irregularities.

By integrating these models, the Voting Classifier capitalizes on their
complementary strengths, offering a comprehensive solution for node fault
detection. This approach not only enhances fault detection accuracy but also
contributes to improved resilience, reliability, and overall performance within
IoT networks. Through this study, we aim to advance the field of IoT network
management and fault mitigation strategies by showcasing the efficacy of
ensemble learning methodologies in complex network environments.

1. Random Forest + XGBoost + Neural Network

   a. This ensemble leverages the strengths of three powerful algorithms for
      predictive modeling. Random Forest builds multiple decision trees and
      combines their outputs to improve accuracy and reduce overfitting,
      excelling in handling diverse datasets. XGBoost is a fast, scalable
      gradient boosting algorithm with advanced regularization techniques
      (L1, L2) to prevent overfitting, ideal for capturing fine-grained feature
      interactions. Neural Networks learn complex, non-linear patterns
      through interconnected layers, making them highly effective for
      high-dimensional data and tasks like image and speech recognition.
      Together, they provide a robust, versatile, and scalable solution for
      tackling diverse and complex predictive modeling challenges.

b. Code

Python

# Imports for data splitting, the constituent models, and evaluation metrics
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
from xgboost import XGBClassifier

# Split the cleaned data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(
    X_cleaned, y_cleaned, test_size=0.2, random_state=20)

# Initialize individual classifiers
rf_model = RandomForestClassifier(random_state=42)
xgb_model = XGBClassifier(random_state=42, use_label_encoder=False,
                          eval_metric='logloss')

# Create a pipeline for the neural network (scaling helps MLP convergence)
nn_pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('classifier', MLPClassifier(random_state=42, max_iter=500))
])

# Create a Voting Classifier with the three classifiers.
# 'hard' votes on predicted class labels; 'soft' would average predicted probabilities
voting_clf = VotingClassifier(estimators=[
    ('rf', rf_model),
    ('xgb', xgb_model),
    ('nn', nn_pipeline)
], voting='hard')

# Function to train and evaluate a model
def train_and_evaluate_model(model, model_name):
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    # Compute metrics
    accuracy = accuracy_score(y_test, y_pred)
    precision = precision_score(y_test, y_pred, average='weighted')
    recall = recall_score(y_test, y_pred, average='weighted')
    f1 = f1_score(y_test, y_pred, average='weighted')

    # Print metrics with 10 decimal places
    print(f"{model_name} Accuracy: {accuracy:.10f}")
    print(f"{model_name} Precision: {precision:.10f}")
    print(f"{model_name} Recall: {recall:.10f}")
    print(f"{model_name} F1-Score: {f1:.10f}")

# Train and evaluate the Voting Classifier
train_and_evaluate_model(voting_clf, "Voting Classifier")

c. Results

2. Random Forest + XGBoost + AdaBoost:

   a. This ensemble combines three highly effective algorithms, each
      contributing unique strengths to the predictive modeling process.
      Random Forest builds a collection of decision trees, using bagging to
      reduce overfitting and increase robustness by averaging predictions,
      making it ideal for handling various data types and complex patterns.
      XGBoost is a powerful gradient boosting algorithm known for its speed
      and performance; with advanced regularization techniques like L1 and
      L2, it excels at capturing subtle feature interactions and ensuring that
      the model generalizes well to new data. AdaBoost enhances weak
      learners by focusing on the mistakes made by previous models,
      adapting iteratively to misclassified data points, which helps improve
      accuracy, especially when weak models are used initially. Together,
      these models form a highly adaptable, scalable, and precise ensemble
      capable of handling diverse predictive tasks while maintaining
      performance and minimizing overfitting.

b. Code:

Python

# Imports for data splitting, the constituent models, and evaluation metrics
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier, AdaBoostClassifier, VotingClassifier
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
from xgboost import XGBClassifier

# Split the cleaned data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(
    X_cleaned, y_cleaned, test_size=0.2, random_state=20)

# Initialize individual classifiers
rf_model = RandomForestClassifier(random_state=42)
adaboost_model = AdaBoostClassifier(random_state=42)
xgb_model = XGBClassifier(random_state=42, use_label_encoder=False,
                          eval_metric='logloss')

# Create a Voting Classifier with the three classifiers.
# 'hard' votes on predicted class labels; 'soft' would average predicted probabilities
voting_clf = VotingClassifier(estimators=[
    ('rf', rf_model),
    ('adaboost', adaboost_model),
    ('xgb', xgb_model)
], voting='hard')

# Function to train and evaluate a model
def train_and_evaluate_model(model, model_name):
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    # Compute metrics
    accuracy = accuracy_score(y_test, y_pred)
    precision = precision_score(y_test, y_pred, average='weighted')
    recall = recall_score(y_test, y_pred, average='weighted')
    f1 = f1_score(y_test, y_pred, average='weighted')

    # Print metrics with 10 decimal places
    print(f"{model_name} Accuracy: {accuracy:.10f}")
    print(f"{model_name} Precision: {precision:.10f}")
    print(f"{model_name} Recall: {recall:.10f}")
    print(f"{model_name} F1-Score: {f1:.10f}")

# Train and evaluate the Voting Classifier
train_and_evaluate_model(voting_clf, "Voting Classifier")

c. Results

3. LOF + XGBoost + Random Forest:

   This ensemble combines three powerful algorithms, each bringing a
   distinct advantage to the predictive modeling process. LOF (Local Outlier
   Factor) is an unsupervised anomaly detection technique that identifies
   outliers by comparing the local density of data points with that of their
   neighbors. By using LOF, the model can filter out noisy data points,
   ensuring that the training set is free from anomalies and improving the
   overall model's robustness. XGBoost is a high-performance gradient
   boosting algorithm, renowned for its ability to model complex interactions
   between features with speed and efficiency. Its advanced regularization
   techniques (L1 and L2) help to control overfitting, making it especially
   effective for handling structured, high-dimensional data. Random Forest,
   on the other hand, is an ensemble method that constructs multiple decision
   trees and averages their predictions, which helps to reduce variance and
   overfitting while handling a wide range of data patterns effectively.
   Together, these three models form a powerful, robust, and scalable
   ensemble that not only performs well on complex predictive tasks but also
   ensures clean, outlier-free data is used in training, enhancing overall
   performance and accuracy.

a. Code:

Python

# Imports for the custom LOF wrapper, the ensemble, and evaluation metrics
import numpy as np
from sklearn.base import BaseEstimator, ClassifierMixin
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.neighbors import LocalOutlierFactor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.utils.validation import check_X_y, check_array, check_is_fitted
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
from xgboost import XGBClassifier

# Initialize individual classifiers
rf_model = RandomForestClassifier(random_state=42)
xgb_model = XGBClassifier(random_state=42, use_label_encoder=False,
                          eval_metric='logloss')

class LOFClassifier(BaseEstimator, ClassifierMixin):
    """Wrapper for Local Outlier Factor to work as a classifier"""
    def __init__(self, n_neighbors=20):
        self.n_neighbors = n_neighbors
        self.lof = LocalOutlierFactor(n_neighbors=self.n_neighbors,
                                      novelty=True)

    def fit(self, X, y=None):
        # Validate inputs; y is optional because LOF itself is unsupervised
        if y is not None:
            X, y = check_X_y(X, y)
        else:
            X = check_array(X)
        self.lof.fit(X)
        self.classes_ = np.array([0, 1])  # Assuming binary classification
        return self

    def predict(self, X):
        check_is_fitted(self)
        X = check_array(X)
        # Convert -1 (outlier) to 0 and +1 (inlier) to 1
        return (self.lof.predict(X) == 1).astype(int)

    def fit_predict(self, X, y=None):
        return self.fit(X, y).predict(X)

# Initialize the LOF pipeline
lof_pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('classifier', LOFClassifier(n_neighbors=20))
])

# Create a Voting Classifier with the three classifiers.
# 'hard' votes on predicted class labels; 'soft' would average predicted probabilities
voting_clf = VotingClassifier(estimators=[
    ('rf', rf_model),
    ('xgb', xgb_model),
    ('lof', lof_pipeline)
], voting='hard')

# Function to train and evaluate a model
def train_and_evaluate_model(model, model_name):
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    # Compute metrics
    accuracy = accuracy_score(y_test, y_pred)
    precision = precision_score(y_test, y_pred, average='weighted')
    recall = recall_score(y_test, y_pred, average='weighted')
    f1 = f1_score(y_test, y_pred, average='weighted')

    # Print metrics with 10 decimal places
    print(f"{model_name} Accuracy: {accuracy:.10f}")
    print(f"{model_name} Precision: {precision:.10f}")
    print(f"{model_name} Recall: {recall:.10f}")
    print(f"{model_name} F1-Score: {f1:.10f}")

# Train and evaluate the Voting Classifier
train_and_evaluate_model(voting_clf, "Voting Classifier")

b. Results:

Comparison of the Methods:

Data Routing:
Data routing involves selecting an optimal path for data transmission from its
source to destination within a network. This process faces various challenges
influenced by network type, channel characteristics, and performance metrics.
In wireless sensor networks (WSNs), data collected by sensor nodes is typically
relayed to a base station for processing, analysis, and subsequent actions. In
larger networks, direct transmission from sensor nodes to the base station is
often impractical due to factors like distance, energy consumption, and
transmission speed. Hence, multi-hop routing algorithms are employed to relay
data packets through intermediate nodes until they reach the base station. To
address these challenges, we have opted to utilize the Grey Wolf Optimization
method for data routing.

Setup:
It is important to note that the following data routing techniques are employed
for a dynamic network where nodes change their position by a certain value
after each network round.

Grey Wolf Optimization:

In wireless sensor networks and decentralized IoT systems, efficient resource
utilization is essential. An objective function guides next-hop selection based on
parameters like hop count, residual energy, traffic, distance, and buffer size.
These parameters are critical for subsequent node selections, aiming to
minimize traffic and enhance fault tolerance. This approach integrates
seamlessly with the proposed fault detection techniques for overall system
efficiency.

The hierarchy is as follows:

The Grey Wolf Optimizer (GWO) algorithm is inspired by grey wolves in their
natural habitat. Alpha (α), Beta (β), Delta (δ), and Omega (ω) are the four types
of wolves found in a pack, and each level of the hierarchy has its own
responsibilities. Alpha wolves are the leaders of the pack, followed by Beta
wolves. The third level of the hierarchy is that of the Delta wolves, and the
remaining wolves, which are not part of the upper levels, are Omega wolves.
Encircling, hunting, and attacking are the three main behaviours of the wolves.
This method has the following steps to be implemented:
1. The leader encircles the prey:
   D_α = | C_α · X_α − X |

2. Hunting and attacking the prey:
   X_1 = X_α − A_1 · D_α

3. Updating the positions of the following wolves:
   X_n(t+1) = (1 / (n−1)) · Σ_{i=1}^{n−1} X_i(t),   n = 2, 3, ..., m

where the coefficient vectors are:

   A = 2a · r_1 − a
   C = 2 · r_2
   a = 2 (1 − t²/T²)

Random vectors r_1 and r_2 lie in the range [0, 1]. A and C are coefficient
vectors that produce the encircling of the prey and control the trade-off between
the exploration and exploitation phases; because of them, the wolves do not
always move in the same direction. Whenever |A| < 1, the wolves in the pack
attack to hunt; otherwise, they keep searching for the prey. X is the position
vector of the prey, X_i is the position vector of a grey wolf, and D_i is a vector
that depends on the location of the target, where i ∈ {α, β, δ}; t is the current
iteration and T is the maximum number of iterations. The algorithm iterates
these update steps, re-ranking α, β and δ by fitness each round, until t reaches
T, as sketched below.
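In the report the implementation of these updates appears only as a figure, so the
following is a minimal Python sketch of one GWO iteration (our illustration, not
the project's exact code). It assumes each wolf is a candidate coefficient vector
(c1, ..., c4) stored as a row of a NumPy array, and it averages the three
leader-guided candidates X_1, X_2, X_3:

Python

import numpy as np

def gwo_step(positions, alpha, beta, delta, t, T, rng):
    """One iteration of the GWO position update for every wolf in the pack."""
    a = 2 * (1 - (t * t) / (T * T))           # decaying coefficient, as defined above
    new_positions = np.empty_like(positions)
    for i, X in enumerate(positions):
        candidates = []
        for leader in (alpha, beta, delta):   # X_alpha, X_beta, X_delta
            r1 = rng.random(X.shape)
            r2 = rng.random(X.shape)
            A = 2 * a * r1 - a                # A = 2*a*r1 - a
            C = 2 * r2                        # C = 2*r2
            D = np.abs(C * leader - X)        # encircling: D = |C * X_leader - X|
            candidates.append(leader - A * D) # hunting:    X_k = X_leader - A * D
        new_positions[i] = np.mean(candidates, axis=0)
    return new_positions

# Usage: positions is an (m, dim) array of candidate coefficient vectors;
# alpha, beta, delta are the three best rows by the cost function.
rng = np.random.default_rng(0)
positions = rng.random((10, 4))
alpha, beta, delta = positions[0], positions[1], positions[2]
positions = gwo_step(positions, alpha, beta, delta, t=1, T=50, rng=rng)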

This method is important because it finds suitable coefficients for the objective
function, which is ultimately responsible for the result. The cost of the link
between node i and node j is given by:

   Cost_{i,j} = (c1 · d_{i,j}) + (c2 · H_j) + [c3 · (ValidTraffic / T_{i,j})] +
                [c4 · (E_initial / E_j) · (BufferCapacity / B_j)]

Here, the coefficients c1, c2, c3 and c4 are calculated using the GWO method;
d_{i,j} is the distance between the nodes, H_j is the hop count of node j, and T,
E and B denote traffic status, energy, and buffer status respectively. The best
path is then determined from the score of the fitness function at each hop.

The cost function given above is implemented as follows:
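The project's implementation is embedded as a figure in the report; below is a
minimal sketch of the link-cost computation under the formula above. The Node
fields and the names valid_traffic, e_initial and buffer_capacity are illustrative
assumptions, not identifiers from the project's code:

Python

import math
from dataclasses import dataclass

@dataclass
class Node:
    position: tuple      # (x, y) coordinates
    hop_count: int       # H_j: hops from this node to the base station
    traffic: float       # T_{i,j}: current traffic status of the node
    energy: float        # E_j: residual energy
    buffer: float        # B_j: free buffer space

def link_cost(node_i, node_j, c, valid_traffic, e_initial, buffer_capacity):
    """Cost_{i,j} = c1*d_{i,j} + c2*H_j + c3*(ValidTraffic/T_{i,j})
                  + c4*(E_initial/E_j)*(BufferCapacity/B_j)."""
    c1, c2, c3, c4 = c                                # coefficients found by GWO
    d_ij = math.dist(node_i.position, node_j.position)
    return (c1 * d_ij
            + c2 * node_j.hop_count
            + c3 * (valid_traffic / node_j.traffic)
            + c4 * (e_initial / node_j.energy) * (buffer_capacity / node_j.buffer))

def path_cost(path, c, valid_traffic, e_initial, buffer_capacity):
    """Total cost of a candidate route: the sum of its link costs."""
    return sum(link_cost(u, v, c, valid_traffic, e_initial, buffer_capacity)
               for u, v in zip(path, path[1:]))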

When the network and all its nodes are initialized, we can run these functions to
find the best path to send data in the network.

Network Parameters:

In this method, we need to find the cost of all possible paths and then compare
them to find the path with minimum cost.

Since this is a dynamic network, we need to keep updating the position of each
node. Moreover, every node that is within the transmission range of another
node must be connected. For this, we create a graph and add these connections
as edges; the resulting graph is then fed as input to the cost functions, as in the
sketch below.
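A minimal sketch of this connectivity step, assuming networkx, 2-D coordinates,
and an illustrative transmission_range parameter (none of these names are taken
from the project's code):

Python

import math
import networkx as nx

def build_connectivity_graph(positions, transmission_range):
    """positions: {node_id: (x, y)}; connects every pair of nodes in range."""
    G = nx.Graph()
    G.add_nodes_from(positions)
    ids = list(positions)
    for idx, u in enumerate(ids):
        for v in ids[idx + 1:]:
            if math.dist(positions[u], positions[v]) <= transmission_range:
                G.add_edge(u, v)   # a feasible one-hop link
    return G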
For example, we have taken 100 nodes along with one base station (id = -1),
which is the destination node, and the node with id = 3 as the source node. The
network is simulated for 50 rounds, that is, the nodes change their positions 49
times after initialization. We obtain an optimal path for each of these rounds.
One such example is shown below.
Result for the example:

For this result, we calculated the time delay and the energy consumed using the
formulas below (given in code) and compared them with the Q-Learning and
LEACH (Low-Energy Adaptive Clustering Hierarchy) algorithms.

Energy Consumed:
For every message transmission between two nodes, the energy consumed is
calculated as follows:
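The exact formula is shown only as a figure in the report. A first-order radio
model is the usual choice for this kind of per-hop accounting, so we sketch that
here; the constants and function names below are assumptions rather than the
project's values:

Python

# First-order radio model (assumed; the report's exact constants are not
# reproduced in the text). Energy grows linearly with message size and
# quadratically with transmission distance.
E_ELEC = 50e-9     # J/bit consumed by transmitter/receiver electronics
E_AMP = 100e-12    # J/bit/m^2 consumed by the transmit amplifier

def transmit_energy(k_bits, distance):
    """Energy for one node to send k bits over the given distance."""
    return E_ELEC * k_bits + E_AMP * k_bits * distance ** 2

def receive_energy(k_bits):
    """Energy for the next hop to receive k bits."""
    return E_ELEC * k_bits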

Q-Learning:
Q-learning is a reinforcement learning algorithm that operates without a model
of the environment, earning it the term "model-free." It evaluates the value of
taking a specific action in a particular state and can manage problems with
stochastic transitions and rewards without requiring adjustments.

This algorithm identifies an optimal policy that maximizes the expected total
reward over successive steps, starting from the current state. Given sufficient
exploration time and a partially random policy, the algorithm can determine the
best action-selection strategy. The term "Q" represents the function the
algorithm computes, which estimates the expected rewards for an action taken
in a specific state.
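To make the mapping onto routing concrete (the report's Q-learning code is not
reproduced in the text): treating each node as a state and each neighbour as a
possible next-hop action, the standard tabular update looks as follows. The
hyperparameter values are illustrative assumptions:

Python

import random
from collections import defaultdict

ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1   # learning rate, discount, exploration

Q = defaultdict(float)   # Q[(state, action)] -> estimated return (the "Q" function)

def choose_next_hop(state, neighbours):
    """Epsilon-greedy policy: mostly exploit the best-known hop, sometimes explore."""
    if random.random() < EPSILON:
        return random.choice(neighbours)
    return max(neighbours, key=lambda a: Q[(state, a)])

def q_update(state, action, reward, next_state, next_neighbours):
    """Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max((Q[(next_state, a)] for a in next_neighbours), default=0.0)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])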

LEACH:
LEACH (Low-Energy Adaptive Clustering Hierarchy) is a hierarchical protocol
where most nodes send data to cluster heads, which then aggregate, compress,
and forward the information to the base station (sink). To decide if a node will
act as a cluster head in a given round, each node uses a stochastic algorithm.
The protocol assumes that every node's radio can directly communicate with the
base station or the nearest cluster head; however, operating the radio at full
power continuously would result in energy inefficiency.
NM-LEACH (Novel Modified LEACH) is a modified version of LEACH,
which has better energy efficiency but higher complexity.
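The stochastic election can be sketched with the standard LEACH threshold
T(n) = P / (1 − P · (r mod 1/P)), applied to nodes that have not served as a
cluster head in the last 1/P rounds; this is our illustration with an assumed P,
not the report's code:

Python

import random

def elects_itself(round_num, p=0.05, was_head_recently=False):
    """LEACH cluster-head election for one node in round `round_num`."""
    if was_head_recently:                 # recent heads sit out for 1/p rounds
        return False
    threshold = p / (1 - p * (round_num % round(1 / p)))
    return random.random() < threshold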

Comparison Results:

Evidently, the proposed method has lower time delay and more consistent
energy consumption for dynamic networks than the other methods, whose
behaviour becomes unpredictable in later rounds.

Chapter 4
Challenges, Conclusions & Future Scope

Challenges
1. Data Quality and Availability: Obtaining high-quality data that
accurately reflects the behavior of IoT devices and network topology can
be challenging. Incomplete, noisy, or biased data may hinder the
performance of ML models.
2. Scalability and Resource Constraints: IoT networks often consist of
numerous interconnected devices with limited computational resources
and bandwidth. Developing scalable ML models that operate efficiently
within these constraints is a significant challenge.
3. Real-time Processing and Adaptation: IoT networks operate in dynamic
environments where network conditions, device behaviors, and fault
patterns may change rapidly. Developing ML models capable of real-time
processing and adaptation to evolving conditions is essential for effective
node fault detection and data routing.
4. Security and Privacy Concerns: IoT networks are susceptible to security
threats and privacy breaches, especially when sensitive data is involved.
ML models must be robust against adversarial attacks and designed with
privacy-preserving mechanisms to protect sensitive information.
5. Integration and Deployment: Integrating ML-based node fault detection
and data routing algorithms into existing IoT infrastructure poses
challenges. Ensuring seamless deployment and compatibility with
existing network protocols and hardware platforms is crucial for practical
implementation.
6. Model Selection and Tuning: Choosing the appropriate ML algorithms
and architectures for node fault detection and data routing tasks
requires careful consideration. Model selection involves evaluating
trade-offs between accuracy, interpretability, and computational
complexity. Furthermore, hyperparameter tuning is essential to optimize
model performance.

Conclusions
In conclusion, our investigation into Node Fault Detection and Data
Routing in IoT Networks utilized a Voting Classifier integrating different
model combinations. We tested three configurations: Random Forest
with XGBoost and Neural Network, Random Forest with XGBoost and
AdaBoost, and LOF with XGBoost and Random Forest. Among these, the
combination of Random Forest, XGBoost, and AdaBoost emerged as the
most effective, achieving the highest accuracy and precision, thereby
proving to be the optimal solution for our objectives. In Data Routing
optimization, we implemented Grey Wolf Optimization, Q-Learning and
LEACH methods, each offering distinct advantages in routing
optimization strategies. Despite encountering challenges such as data
quality and scalability, our findings underscore the potential of
ML-driven approaches in enhancing fault detection accuracy and
optimizing data routing decisions within IoT networks. Moving forward,
collaborative efforts and ongoing research will be essential in refining
these techniques, thereby paving the way for more resilient and efficient
IoT networks capable of meeting the evolving demands of modern
connectivity.

Chapter 5
Bibliography

1. Zhang, Y., & Wang, Y. (2018). "A Survey on Data Routing Optimization
   Strategies in IoT Networks." IEEE Access, 6, 19896-19907.
2. Wang, L., & Li, S. (2017). "A Survey on Gray Wolf Optimization Algorithms
   for Optimization Problems." Journal of Computers, 12(9), 1051-1060.
3. Hansen, L. K., & Salamon, P. (1990). "Neural Network Ensembles." IEEE
   Transactions on Pattern Analysis and Machine Intelligence, 12(10), 993-1001.
4. Tan, Y., et al. (2021). "An Overview of Deep Learning-Based Fault Detection
   Methods in IoT Networks." IEEE Internet of Things Journal, 8(9), 7233-7250.
5. XGBoost Documentation. "XGBoost: A Scalable Tree Boosting System."
   Retrieved from https://xgboost.readthedocs.io/en/latest/.
6. scikit-learn Documentation. "sklearn.neighbors.LocalOutlierFactor." Retrieved
   from https://scikit-learn.org/stable/modules/generated/sklearn.neighbors.LocalOutlierFactor.html.
7. Dietterich, T. G. (2000). "Ensemble Methods in Machine Learning." In
   International Workshop on Multiple Classifier Systems (pp. 1-15). Springer,
   Berlin, Heidelberg.
8. MathWorks File Exchange. "Gray Wolf Optimizer (GWO)." Retrieved from
   https://www.mathworks.com/matlabcentral/fileexchange/59126-gray-wolf-optimizer-gwo.
9. GeeksforGeeks. "Q-Learning in Python." Retrieved from
   https://www.geeksforgeeks.org/q-learning-in-python/.
10. "Low-Energy Adaptive Clustering Hierarchy (LEACH)." Retrieved from
    https://www.sciencedirect.com/science/article/pii/S2665917423002192.
11. Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer.
