ICT515A001
Md Rakibul Islam
Student ID: 34795345
Table of Contents
Abstract
1. Introduction
2. Literature Review of Machine Learning Algorithms
2.1 K-Nearest Neighbors (KNN)
2.2 K-Means Clustering
2.3 Support Vector Machine (SVM)
3. Comparative Analysis
4. Conclusion
Bibliography
Abstract
Detecting HDD failures before they happen is crucial to avoid losing data and maintain
operational stability. This article examines how three machine learning techniques—K-Nearest
Neighbors (KNN), K-Means Clustering, and Support Vector Machine (SVM)—are used to
predict hard disk drive (HDD) failures based on SMART attributes. We examine how KNN classifies
failure instances by their proximity to similar data points, how K-Means Clustering identifies
anomalies through unsupervised learning, and how SVM constructs hyperplanes that separate
failure from non-failure data in high-dimensional spaces. By examining
previous research, we emphasise the benefits and drawbacks of individual methods and
determine that hybrid strategies, which incorporate these algorithms, could result in better
outcomes for detecting HDD failures early.
1. Introduction
As high-performance computing (HPC) and cloud systems expand, hardware failures, particularly
those of hard disk drives (HDDs), present significant challenges. Conventional fault-tolerance
techniques such as periodic checkpointing and replication are no longer adequate given the
increasing complexity of these systems. Since 1994, manufacturers have been
using SMART (Self-Monitoring, Analysis, and Reporting Technology) to anticipate HDD
failures by observing internal and environmental characteristics. These characteristics are
examined to raise a SMART flag that indicates a possible malfunction, giving users time to back
up their data.
This paper examines the use of three machine learning algorithms, KNN, K-Means Clustering, and
SVM, for predicting HDD failures, assessing their accuracy and trustworthiness and their
potential to enhance system reliability.
2. Literature Review of Machine Learning Algorithms
2.1 K-Nearest Neighbors (KNN)
Wang et al. (2014) applied KNN to HDD failure prediction based on SMART attributes. KNN's
reliance on distance metrics enables it to identify relationships between drives by analysing
comparable patterns in failure data. The algorithm worked best on
smaller datasets or when there was a distinct separation between failure and non-failure data.
Despite being simple, KNN faces a significant challenge: the computational complexity
increases with the dataset size. KNN's accuracy also depends on choosing an appropriate value
for k, the number of nearest neighbours, which may need to be tuned for optimal results.
Advantages
Easy to Use: K-Nearest Neighbors (KNN) is straightforward and intuitive, making it a common
first choice for newcomers.
Adaptable for Non-Linear Decision Boundaries: It effectively deals with intricate, non-linear
connections without making any assumptions about the data distribution.
No need for a Training Phase: KNN is a lazy learning algorithm that doesn't need any specific
training phase, so it can be easily applied once the data is ready.
Limitations
Prone to Noise: KNN is sensitive to noisy data and irrelevant features, which decreases its
predictive accuracy.
Computationally Expensive: As the dataset grows, the algorithm becomes more computationally
intensive because it must calculate the distance between each new query point and every stored
training point.
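To make this concrete, the following minimal sketch shows how a KNN classifier might be applied to SMART-style attributes. It assumes a scikit-learn environment; the synthetic features, labels, and the choice of k = 5 are illustrative placeholders, not values from Wang et al.

```python
# A minimal sketch of KNN classification on SMART-style data, assuming
# scikit-learn; feature values and labels are synthetic placeholders.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Hypothetical SMART attributes, e.g. reallocated sectors, seek error rate, temperature.
X = rng.normal(size=(500, 3))
y = (X[:, 0] + 0.5 * X[:, 1] > 1.0).astype(int)  # 1 = failure, 0 = healthy

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# Distance-based methods are scale-sensitive, so standardise the features first.
scaler = StandardScaler().fit(X_train)

# k (n_neighbors) controls the bias-variance trade-off discussed above.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(scaler.transform(X_train), y_train)
print("Test accuracy:", knn.score(scaler.transform(X_test), y_test))
```

Because KNN defers all computation to prediction time, each call to score above recomputes distances to every stored training point, which is the scaling limitation noted earlier.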
2.2 K-Means Clustering
K-Means Clustering is an unsupervised learning algorithm that divides data into clusters
according to their similarity. While typically used for clustering, it has also been applied
effectively to anomaly detection, which is beneficial for identifying early indications of HDD
malfunctions.
Queiroz et al. (2017) utilised K-Means Clustering on SMART data to identify abnormalities that
could suggest upcoming HDD malfunctions. By categorising the data points into distinct
clusters, anomalies, which frequently indicate irregular or malfunctioning activities, can be easily
detected. Nevertheless, K-Means' effectiveness depends on the initial centroid placement and the
specified number of clusters. Selecting the wrong value for k can result in imprecise clustering
and, consequently, missed failures or false alarms.
Advantages
Effective with Big Data: K-Means is computationally efficient and manages large datasets
effectively, which is why it is commonly used for unsupervised learning.
Versatile: Because of its capacity to cluster similar data points, it is commonly utilised for
tasks such as anomaly detection, data compression, and pattern recognition.
Limitations
Assumes Spherical Clusters: The algorithm assumes that clusters are spherical and uniformly
sized, which may not hold for all datasets, resulting in less-than-optimal clustering.
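The following minimal sketch illustrates this anomaly-detection use of K-Means, assuming scikit-learn; the synthetic data, the choice of three clusters, and the 98th-percentile distance cut-off are illustrative assumptions rather than settings from Queiroz et al.

```python
# A minimal sketch of K-Means-based anomaly detection; data are synthetic
# stand-ins for SMART attribute vectors, and all parameters are assumptions.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 3))   # mostly "healthy" drive readings
X[:10] += 6.0                   # a few injected outliers to detect

X_std = StandardScaler().fit_transform(X)

# The number of clusters k must be chosen up front, as noted above.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=1).fit(X_std)

# Flag points that lie far from their assigned centroid as potential anomalies.
distances = np.linalg.norm(X_std - kmeans.cluster_centers_[kmeans.labels_], axis=1)
threshold = np.percentile(distances, 98)  # an assumed cut-off for illustration
anomalies = np.where(distances > threshold)[0]
print("Suspected anomalous drives:", anomalies)
```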
2.3 Support Vector Machine (SVM)
Rincón et al. (2017) utilised SVM to categorise HDDs according to SMART attributes. Using
kernel functions, SVM can handle both linear and non-linear data, enhancing its effectiveness in
complex settings. One drawback of SVM is its high computational cost, especially with large
datasets, since training requires solving a quadratic optimisation problem.
Advantages
Effective in High-Dimensional Spaces: SVM can achieve high accuracy in high-dimensional spaces,
constructing hyperplanes that separate failure from non-failure instances.
Kernel Flexibility: Kernel functions allow SVM to handle both linear and non-linear decision
boundaries.
Limitations
Computationally Expensive: Training requires solving a quadratic optimisation problem, which is
time-consuming on large datasets.
Parameter Sensitivity: Performance depends on selecting an appropriate kernel function and
tuning its parameters.
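As a hedged illustration, the sketch below trains an SVM with an RBF kernel on synthetic data with a non-linear class boundary, assuming scikit-learn; the kernel and parameter values are demonstration defaults, not settings from Rincón et al.

```python
# A minimal sketch of SVM classification with an RBF kernel; feature values
# and labels are synthetic placeholders, assuming a scikit-learn environment.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(2)
X = rng.normal(size=(500, 4))
y = (np.sum(X**2, axis=1) > 4.0).astype(int)  # a deliberately non-linear boundary

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=2, stratify=y
)
scaler = StandardScaler().fit(X_train)

# The RBF kernel lets SVM separate non-linear data, at the cost of the
# quadratic optimisation noted above; C and gamma are assumed defaults.
svm = SVC(kernel="rbf", C=1.0, gamma="scale")
svm.fit(scaler.transform(X_train), y_train)
print("Test accuracy:", svm.score(scaler.transform(X_test), y_test))
```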
3. Comparative Analysis
When evaluating KNN, K-Means Clustering, and SVM, all three algorithms possess distinct
advantages and disadvantages, which make them appropriate for various facets of HDD failure
forecasting.
• KNN: Effective in situations with intricate, non-linear decision boundaries, but it becomes
computationally costlier as the dataset grows because it must calculate distances for each new
data point.
• K-Means: Despite its computational efficiency and scalability to large datasets, K-Means
may encounter difficulties with non-spherical clusters or inadequate centroid
initialisation.
• SVM: Can achieve high accuracy in high-dimensional spaces, but its training phase can be
time-consuming, especially with large datasets, because of the computational expense of solving
the underlying quadratic optimisation problem.
Application Suitability
KNN: Most suitable for problems where neighbouring data points carry meaningful relationships.
It excels in medical diagnostics and other pattern-recognition tasks with limited data.
K-Means: An efficient method for both anomaly detection and data clustering. Its ability to
recognise anomalies or group data with comparable trends makes it ideal for large, unlabelled
datasets.
SVM: Best for tasks that demand well-defined decision boundaries. It performs exceptionally
well in tasks that require handling high-dimensional data, like image recognition and
classification.
The selection of k influences KNN's effectiveness. Choosing a small number of neighbours can
result in overfitting, whereas selecting a large number can result in underfitting.
K-Means necessitates the upfront determination of the cluster quantity (k) and is greatly affected
by the initial centroids' placements. Inadequate initialisation may lead to less-than-optimal
clusters.
SVM's success depends on choosing the proper kernel function and its parameters. While it can
deal with intricate non-linear data, performance may suffer if parameters are not optimised.
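A common way to manage these sensitivities is cross-validated grid search. The sketch below, assuming scikit-learn, tunes k for KNN and C and gamma for SVM; the parameter grids and synthetic data are illustrative assumptions.

```python
# A minimal sketch of hyperparameter selection via cross-validated grid
# search, assuming scikit-learn; the parameter grids are illustrative.
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

rng = np.random.default_rng(3)
X = rng.normal(size=(300, 3))
y = (X[:, 0] > 0.5).astype(int)

# Search over k for KNN: small k risks overfitting, large k underfitting.
knn_search = GridSearchCV(
    KNeighborsClassifier(), {"n_neighbors": [3, 5, 7, 11, 15]}, cv=5
)
knn_search.fit(X, y)
print("Best k for KNN:", knn_search.best_params_)

# Search over kernel parameters for SVM.
svm_search = GridSearchCV(
    SVC(kernel="rbf"), {"C": [0.1, 1, 10], "gamma": ["scale", 0.1, 1.0]}, cv=5
)
svm_search.fit(X, y)
print("Best SVM parameters:", svm_search.best_params_)
```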
4. Conclusion
This article examined three machine learning techniques (KNN, K-Means Clustering, and SVM) in
terms of their use for predicting HDD failures in advance. KNN is a good option for
smaller datasets or those with non-linear boundaries, while K-Means Clustering is beneficial for
detecting anomalies in large datasets without supervision. Even though SVM requires a lot of
computational power, it provides excellent accuracy for tasks involving classification in high-
dimensional spaces.
No one algorithm stands out as universally better for detecting HDD failures. Instead, a hybrid
method that leverages the benefits of these algorithms could yield improved performance. An
example would be utilising K-Means Clustering for data pre-processing and clustering, then
proceeding with SVM for classification or KNN for refining predictions, as sketched below.
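As a minimal sketch of one possible reading of this hybrid idea, assuming scikit-learn, K-Means could first cluster the drives and the cluster assignment be appended as an extra feature before SVM classification; the two-stage design and all values are illustrative assumptions, not a prescribed method.

```python
# A minimal sketch of a hypothetical hybrid pipeline: K-Means clustering as a
# pre-processing step whose cluster labels augment the features fed to an SVM.
# Data are synthetic placeholders; assumes a scikit-learn environment.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(4)
X = rng.normal(size=(400, 3))
y = (X[:, 0] + X[:, 1] > 1.0).astype(int)

# Stage 1: cluster the drives and append the cluster id as an extra feature.
labels = KMeans(n_clusters=3, n_init=10, random_state=4).fit_predict(X)
X_aug = np.column_stack([X, labels])

# Stage 2: SVM classification on the augmented features.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
scores = cross_val_score(clf, X_aug, y, cv=5)
print("Cross-validated accuracy:", scores.mean())
```

For simplicity, the clustering step here is fitted on all the data; a production pipeline would fit it inside the cross-validation loop to avoid leakage.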
Bibliography
[1] Y. Wang, E. W. M. Ma, T. W. S. Chow, and K.-L. Tsui, "A Two-Step Parametric Method for
Failure Prediction in Hard Disk Drives," IEEE Transactions on Industrial Informatics, vol. 10,
no. 1, pp. 419–430, Feb. 2014, doi: https://doi.org/10.1109/tii.2013.2264060.
[2] A. Tahir, F. Chen, A. A. Almazroi, and N. F. Janbi, "SWEP-RF: Accuracy sliding window-based
ensemble pruning method for latent sector error prediction in cloud storage computing," Journal
of King Saud University - Computer and Information Sciences, vol. 35, no. 8, p. 101672, Sep.
2023, doi: https://doi.org/10.1016/j.jksuci.2023.101672.
[3] D. Liu et al., "Predicting Hard Drive Failures for Cloud Storage Systems," Algorithms and
Architectures for Parallel Processing, pp. 373–388, 2020, doi:
https://doi.org/10.1007/978-3-030-38991-8_25.
[4] C.-J. Su and S.-F. Huang, "Real-time big data analytics for hard disk drive predictive
maintenance," Computers & Electrical Engineering, vol. 71, pp. 93–101, Oct. 2018, doi:
https://doi.org/10.1016/j.compeleceng.2018.07.025.