0% found this document useful (0 votes)

11 views

FINAL

nbjbjhb

Uploaded by

aditya roy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

FINAL

nbjbjhb

Uploaded by

aditya roy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

APPLICATION OF MACHINE LEARNING IN PREDICTING THE

MECHANICAL PROPERTIES OF API STEELS

A PROJECT REPORT

SUBMITTED IN PARTIAL FULFILLMENT OF THE

REQUIREMENTS FOR THE AWARD OF THE DEGREE
OF
BACHELOR OF TECHNOLOGY
IN
PRODUCTION AND INDUSTRIAL ENGINEERING
SUBMITTED BY:

AAKARSHIT TAWRA (2K21/PE/01)

ADITYA ROY (2K21/PE/02)
ISHANT CHAUDHRY (2K21/PE/23)
KAUSTUBH KRISHNA CHAUKIYAL (2K21/PE/31)

Under the supervision of

Prof. RS MISHRA

DELHI TECHNOLOGICAL UNIVERSITY

(Formerly Delhi College of Engineering)
Bawana Road, Delhi-110042

DEC 2024
CANDIDATE’S DECLARATION

We, AAKARSHIT TAWRA (2K21/PE/01) ADITYA ROY (2K21/PE/02) ISHANT CHAUDHRY (2K21/PE/23)
KAUSTUBH KRISHNA CHAUKIYAL (2K21/PE/31) students of B. Tech. Pro Engineering, hereby declare that the
project Dissertation titled "Application of machine learning in predicting the yield strength of API steels"
which is submitted by us to the Department of Delhi Technological University, Delhi in partial fulfillment of
the requirement for the award of the degree of Bachelor of Technology, is original and not copied from any
source without proper citation. This work has not previously formed the basis for the award of any Degree,
Diploma Associateship, Fellowship or other similar title or recognition.

Place: Delhi AAKARSHIT TAWRA (2K21/PE/01)

Date: ADITYA ROY (2K21/PE/02)

ISHANT CHAUDHRY (2K21/PE/23

KAUSTUBH KRISHNA CHAUKIYAL (2K21/PE/31)

CERTIFICATE

I hereby certify that the Project Dissertation titled “Application of machine learning in predicting the yield
strength of API steels” which is submitted by AAKARSHIT TAWRA (2K21/PE/01) ADITYA ROY (2K21/PE/02)
ISHANT CHAUDHRY (2K21/PE/23) KAUSTUBH KRISHNA CHAUKIYAL (2K21/PE/31) B.Tech. Production and
industrial Engineering, Delhi Technological University, Delhi in partial fulfillment of the requirement for the
award of the degree of Bachelor of Technology, is a record of the project work carried out by the students
under my supervision. To the best of my knowledge this work has not been submitted in part or full for any
Degree or Diploma to this University or elsewhere.

Place: Delhi

Date: SUPERVISER
ABSTRACT

The significance of machine learning has grown exponentially across diverse fields, including the mechanical
industry. API steels being a subgroup of high-strength low-alloy (HSLA) steels have been designed for use in
the petroleum industry. In this research, the application of machine learning models to estimate the mechanical
properties of API steels was explored. Both non-linear and linear machine learning models were employed to
predict the tensile strength and yield strength of API steels. The models were evaluated using different
performance metrics on test samples, which produced promising results. The results exhibit the effectiveness of
machine learning techniques in predicting mechanical properties, making them a valuable tool for researchers
and engineers in the materials industries.
ACKNOWLEDGEMENT

We would like to express our sincere gratitude to all those who have contributed to the successful completion
of this research project. First and foremost, we extend our heartfelt appreciation to our supervisor Prof. RS
MISHRA for his guidance, support, and valuable insights throughout the entire research process. His expertise
and encouragement have been instrumental in shaping our work.

We are also immensely grateful to the participants who provided the necessary data for this study. Their
cooperation and willingness to share their knowledge and experiences have been invaluable. Furthermore, we
would like to acknowledge the contributions of the various research papers, websites, and handbooks from
which we sourced relevant information. Their work has laid the foundation for our research and enriched our
understanding of the subject matter.

Last but not least, we would like to express our gratitude to our families for their unwavering support and
understanding throughout this endeavor. Their love, patience, and encouragement have been a constant source
of motivation.

In conclusion, we acknowledge the collective efforts of all those who have contributed to this research project.
Their support and contributions have been essential in our journey towards achieving our research objectives.

AAKARSHIT TAWRA (2K21/PE/01)

ADITYA ROY (2K21/PE/02)
ISHANT CHAUDHRY (2K21/PE/23)
KAUSTUBH KRISHNA CHAUKIYAL (2K21/PE/31)
CONTENTS

CANDIDATE’S DECLARATION......................................................................................ii
CERTIFICATE....................................................................................................................iii
ABSTRACT......................................................................................................................... iv
ACKNOWLEDGEMENT................................................................................................... v
CONTENTS......................................................................................................................... vi
LIST OF TABLES.............................................................................................................viii
LIST OF SYMBOLS AND ABBREVIATIONS............................................................... ix
CHAPTER 1 INTRODUCTION.........................................................................................1
1.1 HSLA STEELS............................................................................................................................... 2
1.2 API STEELS.................................................................................................................................. 4
1.3 MACHINE LEARNING.................................................................................................................. 7

CHAPTER 2 LITERATURE REVIEW...........................................................................10

2.1 YIELD STRENGTH...................................................................................................................... 10
2.2 TENSILE STRENGTH...................................................................................................................12
2.3 OTHER MECHANICAL PROPERTIES........................................................................................... 14

CHAPTER 3 METHODOLOGY......................................................................................16
3.1 DATA COLLECTION.................................................................................................................... 16
3.2 DATA ANALYSIS......................................................................................................................... 20
3.3 FEATURE SELECTION.................................................................................................................22
3.4. MODEL TRAINING....................................................................................................................23
3.4.1 Multiple regression........................................................................................................... 24
3.4.2. Decision Tree.................................................................................................................... 28
3.4.3 Random Forest.................................................................................................................. 30
3.4.4 Extreme gradient boosting................................................................................................31
3.5 MODEL EVALUATION................................................................................................................ 32

CHAPTER 4 RESULT AND DISCUSSION.................................................................... 35

CHAPTER 5 CONCLUSIONS......................................................................................... 36
RERFERENCES................................................................................................................ 37
APPENDIX
LIST OF TABLES

Table 3.1 Chemical Composition of API

steels

Table 3.2 Mechanical Properties of

LIST OF SYMBOLS AND ABBREVIATIONS

Symbols
∑ Summation
Coefficients
Residual sum of squares
Penalty term in regularization of multiple regression
Abbreviations
ANN Artificial Neural Network
API American Petroleum Institute
ARB Accumulative Roll Bonding
ASM American Society of Mechanical Engineers
ASTM American Society for Testing and Materials
CART Classification and Regression Tree
CEN European Committee for Standardization
ECB European Chemical Bulletin
ER Ensemble Regression
GPR Gaussian Process Regression
HEA High Entropy Alloys
HSLA High-Strength Low-Alloy
MAE Mean Absolute Error
MAPE Mean Absolute Percentage error
ML Machine Learning
MMC Metal Matrix Composites
MSE Mean Squared Error
NLP Natural Language Processing
R2 Score Coefficient of Determination
RF Random Forest
SAE Society of Automotive Engineers
SVR Support Vector Regression
TS Tensile Strength
UTS Ultimate Tensile Strength
w.r.t with respect to
XGBoost Extreme Gradient Boosting
YS Yield Strength
1

CHAPTER 1 INTRODUCTION
Purpose of the Study

The primary purpose of this study is to leverage machine learning techniques to predict the
mechanical properties of API steels, specifically yield strength and tensile strength, with high
accuracy and efficiency. By developing data-driven models, the study aims to provide a
cost-effective, time-saving alternative to traditional experimental methods for evaluating material
properties.

This research also focuses on exploring the relationship between the chemical composition of API
steels and their mechanical behavior. By identifying the key factors influencing strength, the
study seeks to optimize material design for the petroleum industry, ensuring safety, durability, and
performance in critical applications such as pipelines, offshore platforms, and drilling equipment.

Motivation for the Study

The motivation for this project stems from the challenges faced in traditional material testing and
the evolving needs of the energy sector:
1.Challenges in Traditional Testing: Conventional methods to determine mechanical
properties, such as tensile and yield strength, rely on extensive laboratory experiments. These
methods are not only time-consuming and expensive but also often impractical when dealing
with large datasets or repeated testing.
2.Need for Precision in the Petroleum Industry: API steels are extensively used in the
petroleum industry, where safety and performance are paramount. Pipelines and other
infrastructure are subjected to extreme pressures, temperatures, and corrosive environments,
requiring materials with reliably predicted mechanical properties to avoid catastrophic failures.
3.Advancements in Machine Learning: Machine learning offers a modern approach to
address these challenges by enabling faster, more accurate predictions of material properties using
historical data. The ability to model complex relationships between chemical composition and
mechanical behavior motivates the integration of data-driven techniques in material science.
4.Demand for Innovation in Material Design: As the energy sector evolves, there is a
growing demand for optimized materials that balance strength, weight, and corrosion
resistance. This study aligns with that demand by providing insights into material behavior,
helping manufacturers design steels that meet specific industry needs.
2
1.1 HSLA STEELS

High-strength low-alloy (HSLA) steels represent a specialized class of steels that have been
developed to meet the growing demand for materials that combine high strength, durability, and cost
efficiency. Unlike traditional carbon steels, HSLA steels do not rely solely on carbon content for their
mechanical properties. Instead, their strength and performance are achieved through the precise
addition of microalloying elements and advanced processing techniques.

HSLA steels are characterized by their ability to provide superior mechanical properties while
maintaining a relatively low weight. This makes them an integral material in industries that require
components to withstand high loads, resist wear, and perform reliably under challenging conditions.
The term “high-strength low-alloy” itself encapsulates the essential features of these steels: they offer
high mechanical strength compared to conventional carbon steels, and they achieve this through low
levels of alloying elements (typically less than 5% by weight).

The genesis of HSLA steels can be traced back to the need for materials that could overcome the
limitations of traditional steels. Conventional carbon steels, while versatile, have a trade-off between
strength and ductility, with higher strength often resulting in reduced formability and weldability.
HSLA steels were developed to break this compromise, offering a unique combination of attributes.
One of the most significant breakthroughs in HSLA steel development was the introduction of
microalloying, where minute quantities of elements such as vanadium, niobium, and titanium are
added. These elements refine the grain structure of the steel, leading to significant improvements in
strength and toughness. This grain refinement is typically achieved during controlled rolling
processes, where precise temperature and deformation control allow for the production of steels with
exceptional performance characteristics.

In addition to their enhanced strength, HSLA steels are notable for their ability to maintain toughness
across a wide range of temperatures. This toughness ensures that the material can absorb energy and
resist fractures even in cold environments, making it an ideal choice for applications in regions with
extreme weather conditions.

Another defining feature of HSLA steels is their ability to resist atmospheric and environmental
corrosion. While traditional steels may require protective coatings or treatments to achieve similar
levels of resistance, HSLA steels inherently offer better durability due to the specific alloying
elements used in their composition. This property significantly enhances their longevity and reduces
maintenance requirements in structures and components exposed to harsh conditions.

HSLA steels are classified into several categories based on their mechanical properties,
3
chemical composition, and processing methods. Here are some common classification
systems for HSLA steels:

1. ASTM: The American Society for Testing and Materials (ASTM) has several
standards for HSLA steels based on their yield strength, tensile strength, and elongation.
For example, ASTM A572 and A588 are HSLA steels with minimum yield strengths of 50
ksi and 70 ksi, respectively.

2. SAE: The Society of Automotive Engineers (SAE) classifies HSLA steels based on
their minimum yield strength and composition. For example, SAE J1392 defines several
grades of HSLA steels, such as 050XLK, 060XLK, and 070XLK, which have minimum
yield strengths of 50 ksi, 60 ksi, and 70 ksi, respectively.

3. API: The American Petroleum Institute (API) has standards for HSLA steels used in
oil and gas industry, like API 5L X70 and X80, which have minimum yield strengths of 70
ksi and 80 ksi, respective

4. CEN: The European Committee for Standardization (CEN) classifies HSLA steels
based on their mechanical properties and chemical composition. For example, EN 10149
specifies several grades of HSLA steels with minimum yield strengths ranging from 315
MPa to 700 MPa.

Overall, the classification of HSLA steels is based on their intended application and the
required mechanical properties, and the standards organizations have developed various
systems to ensure that HSLA steels meet the specific requirements for their end use.
4
1.2 API STEEL
API steels, also known as American Petroleum Institute steels, are a group of high-strength
low-alloy (HSLA) steels [2] that are specifically designed for use in the petroleum industry [3].
These steels are developed and tested by the American Petroleum Institute (API) to meet certain
standards for strength, toughness, and durability in harsh environments.

API steels are often made with a mixture of alloying elements including nickel, molybdenum,
and chromium, which enhance the strength and corrosion resistance of the steel [4]. They are also
often designed to withstand extreme temperatures, pressures, and corrosive environments commonly
encountered in the oil and gas industry [5], [6].

API steels are commonly used in a range of applications including pipelines, offshore platforms,
and drilling equipment, where their high strength and toughness make them ideal for withstanding
the demanding conditions of the industry.

API steels are named according to a standardized system that includes both a letter and a number.
The letter indicates the type of steel, while the number indicates the minimum yield strength of the
steel in ksi [7].

For example, in the case of X56, the "X" indicates that this is a type of high-strength, low- alloy
steel, while the "56" indicates that the minimum yield strength of the steel is 56 ksi .

The letter designation system for API steels includes several different types of steel, including:

● X: High-strength, low-alloy steel

● L: Low-alloy steel

● C: Carbon-manganese steel

● P: Chromium-molybdenum steel

● Q: Quenched and tempered high-strength steel

The number that follows the letter designation specifies the minimum yield strength of the steel in
5
ksi. For example, X70 steel has a minimum yield strength of 70 ksi, while L80 steel has a minimum
yield strength of 80 ksi.

It's important to note that API steel grades also have other requirements beyond yield strength, such
as maximum hardness and minimum toughness, that must be met in order to be used in specific
applications in the petroleum industry.

API steels are a versatile material that can be used in a wide range of harsh environments, making
them ideal for use in the petroleum industry. The following are some of the most common API steel
grades and their general characteristics:

● H40: A low-strength, carbon-manganese steel with a minimum yield strength of 40 ksi,

primarily used for casing and tubing in non-corrosive or mildly corrosive environments.

● J55: A low to medium-strength carbon steel with a minimum yield strength of 55 ksi,
primarily used for casing and tubing in mildly corrosive environments.

● K55: A medium-strength carbon steel with a minimum yield strength of 55 ksi, similar to
J55 but with slightly higher mechanical properties and a lower sulfur content.
● N80: A medium-strength, low-alloy steel with a minimum yield strength of 80 ksi,
primarily used for casing and tubing in moderate to highly corrosive environments.

● L80: A medium-strength, low-alloy steel with a minimum yield strength of 80 ksi,

primarily used for casing and tubing in moderately to highly corrosive environments.

● P110: A high-strength, low-alloy steel with a minimum yield strength of 110 ksi, primarily
used for casing and tubing in highly corrosive environments or wells with high pressure and high
stress.

● X52, X60, X65, and X70: High-strength, low-alloy steels with minimum yield strengths of
52 ksi, 60 ksi, 65 ksi, and 70 ksi, respectively, primarily used for pipelines and other transmission
applications.

● Q125: A high-strength, quenched and tempered steel with a minimum yield strength of
6
125 ksi, primarily used for drilling equipment in highly demanding applications such as deepwater
drilling.

It's important to note that each API steel grade has specific mechanical and chemical properties that
must be met in order to be approved for use in the petroleum industry, and that these properties can
vary depending on the specific application and environment.

API steels are used in a variety of applications throughout the petroleum industry, from upstream
exploration and production to downstream refining and petrochemical processing. Specifically, these
steels are used in a variety of applications such as:

1. Oil and Gas Pipelines: API steels are commonly used in the construction of pipelines for
the transportation of oil and gas. The high strength and toughness of these steels make them ideal for
withstanding the high pressures and stresses that can occur during pipeline operation [8].

2. Offshore Platforms: API steels are used in the construction of offshore platforms, which
must withstand harsh environmental conditions including high winds, waves, and corrosive
saltwater. The strength, toughness, and corrosion resistance of API steels make them well-suited for
this application.

3. Drilling Equipment: API steels are also used in the manufacture of drilling equipment
such as drill pipes [9], casing, and tubing [10]. These components must be able to endure the
extreme conditions of drilling operations, including high pressures, temperatures, and corrosive
fluids.

4. Refineries and Petrochemical Plants: API steels are used in the construction of equipment
such as storage tanks, pressure vessels [11], and heat exchangers [12] in refineries and
petrochemical plants [13]. The high strength and corrosion resistance of these steels make them
ideal for use in these harsh environments.
7
Overall, API steels play a critical role in the petroleum industry, helping to ensure the safety,
reliability, and efficiency of the equipment and infrastructure used in the exploration, production,
transportation, and processing of oil and gas.

1.3 MACHINE LEARNING

Machine Learning (ML) is a subfield of artificial intelligence (AI) that focuses on creating systems
capable of learning from data and improving their performance over time without being explicitly
programmed for specific tasks. At its core, machine learning revolves around the development of
algorithms and models that enable computers to identify patterns, make predictions, or take actions
based on input data. Unlike traditional programming, where explicit instructions dictate every
action, machine learning allows systems to infer rules and relationships directly from data, making
it highly versatile and adaptive.

Key Components of Machine Learning

1. Data: Data is the foundation of machine learning. The quality and quantity of data
significantly impact the accuracy and performance of a machine learning model. Data can be
structured (e.g., spreadsheets) or unstructured (e.g., images, text).
2. Algorithms: Algorithms are the mathematical procedures used to process data and
extract patterns. Examples include linear regression, decision trees, and neural networks. The
choice of algorithm depends on the problem being solved and the type of data available.
3. Features: Features are the individual measurable properties or characteristics of the data.
For example, in a dataset about houses, features might include the number of bedrooms, square
footage, and location. Feature selection and engineering are critical for model performance.
4. Model: A model is the output of the learning process. It represents the mathematical
relationships learned from the data. Once trained, the model can be used to make predictions or
decisions.

Types of Machine Learning

1. Supervised Learning: In supervised learning, the model is trained on labeled data, where
each input comes with a corresponding output. The goal is to map inputs to outputs accurately.
Examples: Spam detection, predicting house prices.
Algorithms: Linear regression, decision trees, support vector machines.
2. Unsupervised Learning: In unsupervised learning, the data is not labeled, and the model is
tasked with identifying patterns or clusters within the data.
8
Examples: Customer segmentation, anomaly detection.
Algorithms: K-means clustering, principal component analysis (PCA).
3. Semi-Supervised Learning: Combines elements of both supervised and unsupervised
learning by using a small amount of labeled data along with a larger set of unlabeled data.
Examples: Fraud detection, medical image classification.
4. Reinforcement Learning: In reinforcement learning, an agent learns by interacting with an
environment and receiving feedback in the form of rewards or penalties.
Examples: Self-driving cars, game-playing AI like AlphaGo.

Importance of Machine Learning

Machine learning has transformed numerous fields by enabling systems to process and analyze
large amounts of data faster and more accurately than humans. It has become an essential tool in
applications ranging from daily conveniences like personalized recommendations on streaming
platforms to critical tasks such as diagnosing diseases or optimizing supply chains.
1. Automation: Machine learning reduces the need for manual intervention by automating
complex processes.
2. Scalability: Models can process vast datasets and adapt to new data, making them
scalable for diverse applications.
3. Improved Decision-Making: By identifying patterns in data, machine learning provides
actionable insights for better decision-making in industries like finance, healthcare, and
manufacturing.

Given below are some of the applications of Machine Learning:

1. Image and Speech Recognition: Machine learning techniques have been successfully applied in
areas such as image recognition, object detection, and speech recognition [15]. These applications are
used in various fields like autonomous vehicles, medical imaging, and voice assistants.

2. Natural Language Processing (NLP): NLP techniques facilitate the comprehension and
processing of human language by machines. Applications include language translation, sentiment
analysis, text summarization, and chatbots [16].
9
3. Fraud Detection: Machine learning algorithms can identify anomalies and patterns in large
datasets, making them valuable for fraud detection in finance [17], insurance [18], and e-commerce
sectors [19].

4. Recommendation Systems: Machine learning powers recommendation systems used in

streaming platforms, social media, and e-commerce. These systems analyze user preferences and
behavior to provide personalized recommendations [20]

5. Healthcare and Medicine: Machine learning is revolutionizing healthcare with applications
such as disease diagnosis, drug discovery, medical imaging analysis, and personalized medicine
[21].

6. Predictive Analytics: Machine learning models have the capability to analyze past data and
generate forecasts in various domains, including finance, sales, marketing, and demand forecasting
[22].

7. Autonomous Systems: Machine learning plays a crucial role in autonomous systems like
self-driving cars, drones, and robots [23]. These systems use ML algorithms to perceive and
understand the environment, make decisions, and navigate.

These are just a few examples of how machine learning is being applied across different
industries. As technology advances and data availability increases, the potential for machine
learning applications continues to expand, driving innovation and solving complex problems.

Challenges in Machine Learning

Despite its transformative potential, machine learning faces several challenges:

1. Data Quality: Models require clean, well-labeled data, and poor-quality data can lead to
inaccurate predictions.
2. Overfitting and Underfitting: A model that is too complex may perform well on training
data but fail on new data (overfitting), while a model that is too simple may fail to capture
important patterns (underfitting).
3. Interpretability: Complex models like deep neural networks are often “black boxes,”
making it difficult to interpret how decisions are made.
10

4. Computational Resources: Training machine learning models, especially on large datasets, can
require significant computational power and time.

Future of Machine Learning

As technology advances and data availability grows, machine learning is expected to play an even
greater role in shaping the future. Developments in areas such as deep learning, natural language
processing, and reinforcement learning are opening up new possibilities in AI, from more
human-like interactions in virtual assistants to the automation of scientific discoveries.

CHAPTER 2 LITERATURE REVIEW

2.1 YIELD STRENGTH

Bhandari et al. [24] in his work presents a study on the application of machine learning (ML)
methods to predict the yield strength of high entropy alloys (HEAs) at high temperatures. HEAs
have gained significant attention due to their promising properties and potential applications in
structural materials. The authors utilize the random forest (RF) regressor model to predict the
yield strengths of MoNbTaTiW and HfMoNbTaTiZr at temperatures of 800°C, 1200°C, and
1500°C. The predicted results are compared with experimental data, and the accuracy and
effectiveness of the ML model are evaluated. The research paper explores an interesting and
relevant topic in the field of materials science and engineering. The use of machine learning
methods to predict the mechanical properties of HEAs, such as yield strength, is a valuable
approach that can save time and costs associated with experimental trials. The paper is
well-structured, providing clear sections for the introduction, computational methods, results, and
discussion. Choudhury et al. [25] discussed about a machine intelligence-based model for
predicting the mechanical properties of low carbon steels. The authors address the need for an
automated prediction model that can evaluate the mechanical properties without the need for
11
experimental processes. They propose a model based on machine learning techniques to predict
the elongation and yield strength of low carbon steels produced through various
thermomechanical processes. The authors emphasize that the mechanical

properties of steels depend on compositions, microstructure, and processing techniques. They

discuss the need to understand the influence of thermomechanical processing and control of
compositions to achieve desired mechanical properties. The authors provide a comprehensive
review of related works in the field of predicting mechanical properties of steels using different
techniques, such as artificial neural networks (ANN) and soft computing tools. They acknowledge
the scope for improvement in terms of accuracy and computational cost reduction. The
computational methods section describes the implementation of a multiple linear regression
model using Python and various libraries. The authors use a dataset from the ASM handbook,
which includes different features related to compositions and mechanical properties of low carbon
steels. They discuss statistical model and its application for predicting yield strength and
elongation. Normalization of the dataset is performed to ensure better performance of the linear
regression model. The authors present density plots to show the distribution of points for yield
strength and elongation, which resemble a normal distribution. They also calculate the correlation
coefficients between variables to understand their relationships. The results and discussion section
evaluates the performance of the proposed model using root-mean- square error (RMSE) and
R-squared values. The authors discuss feature selection and demonstrate that a subset of features
has a significant impact on the prediction of elongation and yield strength. They compare the
performance of the model using different splits of the dataset and highlight the best RMSE and
R-squared values achieved. The accuracy of the model is validated through real vs. predicted
plots and residual plots, which show minimal errors between the predicted and actual values. The
authors also employ k- fold cross-validation to evaluate the model's performance on unseen data.
Veeresham et al.[26] showed the application of machine learning (ML) techniques in predicting
the yield strength of nitrogen-doped (CoCrFeMnNi)100-x-Nx high entropy alloys (HEAs) is a
significant advancement in materials science. This study successfully utilized a linear regression
12
model to accurately predict the yield strength of nitrogen-doped HEAs subjected to specific
thermomechanical processing conditions. The ML approach offers a faster and more
cost-effective alternative to experimental methods for predicting material properties, saving
valuable time and resources. By analyzing a comprehensive dataset that included experimental

data and thermophysical calculations, the researchers identified key features that influence yield
strength. The correlations revealed that Tensile test temperature had the strongest impact,
followed by the nitrogen content, cold rolling, entropy, Co content, grain size, Fe content, and
melting temperature. Understanding these relationships provides insights into modifying the yield
strength of nitrogen-doped CoCrFeMnNi HEAs for specific applications. The ML model
demonstrated high accuracy, with a coefficient of determination (R2) of 95.54% and a low mean
absolute error (MAE) of 33.10. The predicted yield strength values closely matched the
experimental values, confirming the reliability of the model. Notably, the model accurately
predicted the yield strength of a nitrogen-doped HEA that underwent cold rolling and annealing,
with an error of only 1.36%. This study highlights the potential of ML techniques in optimizing
thermomechanical processing parameters and predicting material properties for HEAs. The ability
to design and develop HEAs with superior properties using ML models and material parameters
has significant implications for various industries. Overall, this research showcases the power of
ML in accelerating the discovery and design of advanced materials with desired mechanical
properties, ultimately leading to more efficient and cost-effective material development processes.

2.2 TENSILE STRENGTH

Xu et al. [27] presented in his study about predicting the tensile properties of AZ31 magnesium alloys using
machine learning techniques. The authors explain the rules employed for data collection, such as handling
missing attribute values and excluding certain processing methods. They provide details on the ANN and
SVM models used, including the architecture, activation functions, optimization strategies, and
13
hyperparameters. The dataset exploration section presents the characteristics of the collected data, including
the distribution of yield strength (YS), ultimate tensile strength (UTS), and tensile elongation (EL). The
authors calculate the Pearson correlation coefficients to assess the relationship between individual attributes
and the output properties. The research paper provides a comprehensive study on predicting the tensile
properties of AZ31 magnesium alloys using machine learning techniques. The authors successfully
demonstrate the applicability of ANN and SVM models for this task and discuss the implications of their
findings. The paper is well-organized and provides sufficient details on the methodology and results.

However, it would benefit from further discussion on the limitations and potential future directions of the
research. Chun et al. [28] attempted to provide a simple, accurate, and cost-effective method to predict the
residual tensile strength of corroded steel structures. The researchers used FEM to obtain tensile strength data
for artificially corroded plates, generated using a spatial autocorrelation model. The corroded surface data and
material properties were used as input, and the tensile strength was used as the output to train the ANN model.
The accuracy of the model was validated using leave-one-out cross-validation. The research paper confirms
that the proposed ANN approach outperforms previous methods in terms of accuracy. The comparison
between FEM results and experimental data shows a good agreement, validating the FEM model used to
develop the ANN. The study demonstrates that the ANN model can predict tensile strength without the need
for additional tensile tests or FEM analysis. The research paper presents an innovative application of artificial
neural networks for predicting the tensile strength of corroded steel plates. The proposed ANN model offers a
simple, accurate, and cost-effective method to assess the residual tensile strength of corroded steel structures.
The results demonstrate the superiority of the ANN approach compared to previous methods. Sami et al. [29]
demonstrated on the prediction of the tensile and compressive strength of concrete using various machine
learning algorithms. The authors highlight the importance of concrete strength in determining the durability
and performance of structures and the challenge of optimizing the constituent proportions to achieve
high-strength concrete. The goal of the research is to develop accurate and efficient prediction models to
replace time-consuming and resource-intensive laboratory tests. To address this need, the authors employ
different machine learning algorithms, including tree regression models, regression models, ensemble
regression (ER), support vector regression (SVR), and Gaussian process regression (GPR), to predict the
tensile and compressive strength of concrete. The models are trained and tested using a dataset compiled from
journal publications. The results of the study demonstrate the effectiveness of the machine learning models in
predicting concrete strength. The exponential Gaussian process regression (GPR) model exhibits the highest
performance and accuracy among the models considered. It achieves an impressive R2 of 0.98, RMSE of
14
2.412 MPa, and MAE of 1.6249 MPa for predicting the compressive strength of concrete using eight input
variables during the training phase. In the testing phase, the model maintains its accuracy with an R2 of 0.99,
RMSE of 0.0025134 MPa, and MAE of 0.0016367 MPa. Similarly, the GPR model performs well in
predicting the tensile strength of concrete with an R2, RMSE, and MAE of 0.99, 0.00049247 MPa, and
0.00036929 MPa, respectively. Najjar et al. [30] in his study presented a detailed investigation of the
mechanical properties and prediction modeling of aluminum nanocomposites. The research is significant as it
addresses the growing interest in metal matrix composites (MMCs), specifically aluminum metal matrix

composites (Al- MMCs), due to their exceptional properties and wide range of applications. In mechanical
properties, we observe a significant enhancement in the yield strength (YS), ultimate tensile strength (UTS),
and hardness of the composites compared to the unreinforced aluminum. The UTS and YS values show a
notable increase after the initial ARB cycles, and further improvements are achieved with subsequent cycles.
The highest UTS enhancement is achieved in the composite with 4% SiC, reaching 445% after 9 ARB cycles.
These findings highlight the effectiveness of the ARB technique and the addition of SiC particles in enhancing
the mechanical properties of Al-MMCs. The research paper provides a comprehensive study on the
fabrication and characterization of aluminum nanocomposites reinforced with µ-SiC particles using the
accumulative roll bonding (ARB) technique. The experimental results highlight the improvement in the
mechanical properties of the composites with increasing ARB cycles and SiC content. Additionally, the
proposed machine learning model based on the modified random vector functional link using the Growth
Optimizer Algorithm shows promising potential for accurately predicting the tensile properties of the
composites. The findings presented in this paper contribute to the field of metal matrix composites and offer
valuable insights for researchers and practitioners in materials science and engineering

2.3 OTHER MECHANICAL PROPERTIES

Stoll. et. el. [31] addressed the growing need for data-driven methods in materials science due to the
expanding amount of material data from experiments and simulations. The paper explores the
potential of machine learning (ML) techniques in predicting material properties and facilitating
material characterization. The paper also discusses the evolution of materials science paradigms, from
experimental investigations to analytical equations, computational simulations, and data-driven
science. It emphasizes the value of large data volumes in discovering hidden correlations and patterns
15
that may not be apparent in smaller datasets. However, the authors acknowledge the challenges
posed by handling large amounts of data and the limitations of small datasets commonly encountered
in materials science due to expensive and time-consuming data acquisition processes. The authors
examine different ML approaches based on SPT data and discuss a case study involving the prediction
of tensile properties using ML models trained on SPT data. The goal is to determine whether a ML
model can accurately predict tensile properties based on SPT measurements. Shaheen et al. [32]
developed a novel approach to predict the strength and stiffness reduction factors for HSS at elevated
temperatures using machine learning techniques, considering the effect of material chemical
composition. The authors highlight that no prior studies are available in the open literature regarding
the prediction of HSS mechanical properties at elevated temperatures by machine learning, making
their research contribution unique. The development of the artificial neural network (ANN) is
explained, highlighting the use of a multilayer perception model with feed-forward back-propagation
for supervised learning. The three-layer structure of the ANN, including the input layer, hidden layers,
and output layer, is described. The authors provide equations defining the elevated temperature
reduction factors for ultimate tensile strength, effective yield strength, 0.2% proof strength, and
Young's modulus, which were adopted throughout the analysis in the paper. The authors give
equations defining the elevated temperature reduction factors for ultimate tensile strength, effective
yield strength,0.2 validation strength, and Young's modulus, which were espoused throughout the
analysis in the paper. The authors declare that they've no given contending fiscal interests or particular
connections that could have appeared to impact the work reported in this paper.
16
CHAPTER 3 METHODOLOGY

3.1 DATA COLLECTION

The data for this research project was collected from multiple sources, including research
papers, websites, and handbooks. These diverse sources were utilized to ensure a
comprehensive and well-rounded dataset for analysis.

Research papers served as a valuable source of scholarly articles, providing in-depth

studies, empirical data, and theoretical frameworks related to the research topic. These
papers were obtained from reputable academic journals and conferences, ensuring the
reliability and credibility of the information extracted.

Websites were another important source of data, offering a wide range of resources such as
17

industry reports, technical specifications, and case studies. Care was taken to select
authoritative websites from reputable organizations, academic institutions, and government
agencies, making sure the gathered data is accurate and relevant.

Handbooks, reference materials, and technical manuals also played a crucial role in data
collection. These resources provided essential background information, industry standards,
and practical guidelines related to the subject matter. The handbooks were carefully chosen
based on their relevance and reputation within the field.

Overall, the data collection process involved meticulous gathering, organization, and
verification of information from diverse sources. By utilizing a combination of research
papers, websites, and handbooks, we aimed to ensure the completeness and reliability of
the dataset, allowing for a robust analysis and meaningful conclusions to be drawn from
the research findings.
18
19
20

3.2 DATA ANALYSIS

During the analysis of the collected data, various exploratory data analysis techniques were
employed to gain insights into the dataset. Descriptive analysis was conducted to
summarize and understand the distribution, central tendency, and variability of the different
variables.

Pearson correlation coefficient was utilized to conduct correlation analysis in order to

examine the relationships between variables [33]. The analysis facilitated the exploration of
both the intensity and direction of the linear correlation between sets of variables. The Fig.
3.2 shows the correlation plot based on the value of pearson correlation cofficients. The
correlation coefficients were calculated, and their significance was assessed to identify any
statistically significant associations.

Additionally, outlier analysis was conducted to identify observations that deviated

significantly from the overall pattern of the data. Data distribution and potential outliers were
visualized using distribution plots and box plots. Outliers, which are data points that lie far
outside the expected range, were examined and their potential impact on the analysis was
21
considered.

During the analysis, it was observed that some features exhibited a substantial number of
outliers, indicating potential anomalies or errors in the data. These outliers were carefully
examined to determine their validity and potential impact on the analysis results. Similarly,
certain features were found to have minimal variation, suggesting a lack of diversity or
limited usefulness in the analysis. These features were noted and their implications were
taken into account during subsequent modeling and interpretation stages.

By performing descriptive analysis, correlation analysis, and outlier analysis, a

comprehensive understanding of the dataset was obtained. The analysis yielded significant
insights into the distribution of data, interrelationships among variables, and the
identification of outliers and instances of low variability. The findings from these analyses
served as a foundation for further data processing, modeling, and interpretation in the
research study.
22

3.3 FEATURE SELECTION

Feature selection is a process in machine learning where we identify and select the most
important features (variables) from a dataset that significantly contribute to the performance of a
predictive model. This step is crucial in simplifying the model, improving accuracy, and reducing
overfitting by removing irrelevant or redundant data.

Why Is Feature Selection Important?

1. Improves Model Performance: By focusing only on the most relevant features, the model can
better learn patterns, resulting in higher accuracy and efficiency.
2. Reduces Overfitting: Too many irrelevant features can cause the model to fit noise rather than
meaningful patterns in the data. Feature selection helps prevent this.
3. Enhances Interpretability: A model with fewer features is easier to understand and interpret,
especially in research or industrial applications.
4.Decreases Computational Cost: Fewer features mean less complexity, which reduces the time
and computational power needed for training the model.

Methods of Feature Selection

1.Filter Methods: Ranks features based on statistical measures such as correlation, variance, or
mutual information.
Examples: Pearson correlation coefficient, Chi-square test, ANOVA.
2.Wrapper Methods: Involves using a machine learning model to evaluate subsets of features
and selecting the subset that gives the best performance.
Examples: Forward selection, backward elimination, recursive feature elimination (RFE).
3.Embedded Methods: These methods are built into the machine learning algorithm itself. The
model automatically identifies the most relevant features during training.
Examples: Lasso regression, decision tree feature importance, and regularization techniques.

Feature Selection in This Project

In the project, feature selection likely involved analyzing the dataset of API steels to identify
which chemical compositions or material properties most strongly influenced yield and tensile
strength. The process could include:
23
1.Eliminating Redundant Features : Variables like physical properties that showed little to no
variation were removed because they didn’t provide new information to the model.
2.Handling Outliers : Features such as chromium (Cr) and nickel (Ni) may have contained
significant outliers. These were removed for linear regression models to improve robustness but
retained for non-linear models, which can handle outliers better.
3.Retaining Informative Features : Key features (e.g., hardness, carbon content) that strongly
impacted the mechanical properties were retained for model training.
By performing feature selection, the project ensured that only the most relevant data
contributed to the predictive models, enhancing their accuracy and efficiency.

3.4 MODEL TRAINING

Model training is the process of teaching a machine learning algorithm to recognize patterns and
relationships in data by using a labeled or unlabeled dataset. During training, the algorithm learns
from the input data to optimize its parameters and improve its predictive capabilities for new,
unseen data.

Various machine learning algorithms were employed in this study to predict the yield strength of
API steels. These algorithms included multiple regression, Lasso regression, Ridge regression,
decision tree, and random forest. Each algorithm has its unique approach and characteristics that
contribute to the predictive modeling process.

Model Training in Your Project

In the project, several machine learning models were trained to predict the yield strength and
tensile strength of API steels. Here’s how the process likely unfolded:
1. Dataset Preparation:
• The dataset included chemical compositions (e.g., carbon, manganese) and mechanical
properties (e.g., yield strength).
• Data was cleaned, standardized, and divided into training and testing sets.
2. Algorithms Used:
2.1 Linear Models:
•Multiple Regression: A simple approach for linear relationships between features and target
variables.
24
2.2 Regularized Models:
•Lasso Regression: Reduces overfitting by shrinking irrelevant coefficients to zero.
•Ridge Regression: Penalizes large coefficients to enhance model stability.
2.3 Non-linear Models:
•Decision Trees: Captured complex relationships using a tree-like structure.
•Random Forest: Combined multiple decision trees for better accuracy.
2.4 Boosting Models:
•XGBoost: Built models sequentially to correct errors from previous iterations.
3. Training and Optimization:
• Models were trained on a portion of the dataset using algorithms designed to minimize
errors (e.g., Mean Squared Error for regression problems).
• Regularization techniques (e.g., L1, L2 penalties) and hyperparameter tuning (e.g., tree
depth, learning rate) were applied to improve model performance.
4. Evaluation During Training: Metrics such as R² Score (explaining variability) and Mean
Absolute Error (MAE) (average error magnitude) were calculated to assess how well the models
fit the data.

5. Output:The trained models, particularly Random Forest and XGBoost, provided high
accuracy in predicting the mechanical properties of API steels.

3.4.1 LINEAR MODELS

3.4.1.1 MULTIPLE REGRESSION

Multiple regression is a linear regression technique that aims to establish a relationship between
the dependent variable (yield strength and tensile strength) and multiple independent variables
(features). It presupposes a linear relationship and estimates the coefficients of the predictor
variables to predict the target variable [35]. Fig. 3.4 shows the implementation of ridge regression
in python.
25
26

3.4.2 REGULARIZED MODEL

3.4.2.1 LASSO REGRESSION

Lasso Regression (Least Absolute Shrinkage and Selection Operator) is a type of linear
regression that incorporates L1 regularization. This technique not only minimizes the
residual sum of squares (RSS) to fit the model but also adds a penalty proportional to the
absolute values of the regression coefficients.
27

Key Features of Lasso Regression

1. Feature Selection:
• Lasso forces some coefficients to shrink to exactly zero, effectively removing irrelevant
features from the model. This makes it particularly useful when dealing with
high-dimensional datasets where not all features contribute meaningfully.
2. Regularization:
• By penalizing large coefficients, Lasso reduces overfitting and improves the model’s
ability to generalize to unseen data.
3. Sparsity:
• Lasso creates sparse models by retaining only the most important predictors, making
the model simpler and more interpretable.

3.4.2.2 RIDGE REGRESSION

Ridge Regression is another type of linear regression that incorporates L2 regularization.

Unlike Lasso, Ridge does not set coefficients to exactly zero but shrinks them toward zero by
adding a penalty proportional to the square of the coefficients.

Key Features of Ridge Regression

1.Regularization: Ridge penalizes large coefficients, preventing the model from overfitting the
training data.
2.Handles Multicollinearity: Ridge is particularly effective when predictors are highly
correlated, as it distributes the effect among correlated variables rather than arbitrarily selecting
one.
28
3.Does Not Perform Feature Selection: Unlike Lasso, Ridge does not remove features but
reduces their influence by shrinking coefficients closer to zero.

3.4.3 NON LINEAR MODEL

3.4.3.1 DECISION TREE

Decision trees are non-linear models that utilize a hierarchical structure of decision rules to make
predictions. They partition the feature space into smaller regions and assign a prediction value to
each region. Decision trees can capture complex relationships and interactions among the features
[38].

A common approach for creating decision trees is CART (Classification and Regression Trees).
The CART algorithm divides the data into two subsets, each of which is divided along a single
feature. To maximize the gain in homogeneity or decrease in impurity in the resulting subsets, the
feature and splitting threshold are selected. A measure of the output values' variability is used to
determine the impurity of a subset. Entropy and Gini impurity are frequently used impurity
measures for classification tasks, whereas mean squared error (MSE) is frequently used for
regression assignments [39]. The cost function that the CART algorithm minimizes in order to
determine the optimal split of a given node is shown below :
29
n = Records in the dataset

nleft = Records in the left subtree

nright = Records in the right subtree

The initial root node of the CART algorithm represents the entire dataset. The method selects the feature and
the splitting threshold at each level of the tree in order to reduce impurity or improve the homogeneity of the
generated subgroups. Up until it meets a stopping requirement, such as a minimum number of samples in each
leaf node or a maximum tree depth, the algorithm divides the data again into smaller subsets [40].

Once the tree is built, it is possible to make predictions based on new data by moving through the tree from
root to leaf nodes. Based on the value of the feature in the input data, the algorithm applies the decision rule
corresponding to the splitting feature at each node and then proceeds down the relevant branch. The prediction
is then made using the leaf node's related output value.

.
30
3.4.3.2 RANDOM TREE
Random forest is a technique in ensemble learning that utilizes the power of multiple decision
trees to formulate predictions. It leverages the principle of "wisdom of the crowd" by aggregating
the estimations of individual trees. Random forest can improve the predictive accuracy and handle
non-linear relationships effectively [41].

In Random Forest regression, individual trees are constructed by utilizing a randomized subset of
the training data along with a randomized subset of the input features [42]. Enhancing the model's
performance is achieved by reducing the correlation among the trees. Averaging the forecasts of
each individual tree yields the prediction of the Random Forest model [43]. In comparison to other
regression models, Random Forest has a number of advantages, such as the ability to handle large
datasets with numerous input variables, the ability to recognize and manage non-linear
relationships between input features and the target feature, and the ability to deal with outliers and
missing data [44]. On the other hand, due to the quantity of decision trees generated, it is not
interpretable [45] and takes additional training time and memory [46]
31
3.4.4 BOOSTING MODEL

3.4.4.1 Extreme gradient boosting

Extreme Gradient Boosting (XGBoost) is an enhancement of gradient boosting technique shown

in Fig. 3.11 that combines a number of different decision trees for improved performance and
accuracy. Because XGBoost builds decision trees using the gradient boosting framework, it can
operate with a variety of data features and create models that are more accurate and robust [47].

To create predictions about the target variable in XGBoost regression, the method first constructs
a single decision tree. The residual difference between the estimated values and the real values is
then calculated, and this error serves as the target variable for the subsequent tree. Until a
predetermined number of trees have been produced or the error has been minimized, the
algorithm repeats this procedure, creating one tree at a time and adding it to the ensemble.
Fig.3.12 shows the implementation of XGBoost using python.

The key benefits of XGBoost are its performance, speed, and scalability. It is built to effectively
handle massive datasets with millions of rows and thousands of columns. In order to lessen
overfitting and increase generalization, XGBoost can additionally tolerate missing values and
features using built-in regularization.

But XGBoost also has significant shortcomings. When working with huge datasets, it can be
computationally expensive and demands precise hyperparameter adjustment. Additionally,
XGBoost is a "black-box" model, which makes it challenging to identify and understand the
fundamental connections between the independent and the predicted variable.
32

By employing these diverse algorithms, the study aimed to explore and compare their
performance in predicting the yield strength of API steels. Each algorithm brings its own
strengths and characteristics to the analysis, providing a comprehensive understanding of their
effectiveness and suitability for the given dataset.

3.5 MODEL EVALUATION

Performance metrics play a vital role in assessing the efficiency of machine learning models.
These metrics provide quantitative measures of how well a model performs in terms of its
predictions and generalization ability. Performance metrics commonly employed in regression
problems comprise R-squared (R2) score, Mean Absolute Error (MAE), Root Mean Squared
Error (RMSE), and the Mean Squared Error (MSE).

Mean Absolute error (MAE): MAE indicates that on average, how far the predicted values are
from actual values, without taking their direction into consideration. It is calculated by taking the
absolute difference between the real and estimated values and then averaging them. The Mean
Absolute Error (MAE) shares the same units as the feature that is being estimated. The lower the
value of MAE, the better the model’s performance, since it means that the model's estimations are
closer to the real values [48].
33

Mean Absolute Percentage Error (MAPE): MAPE is a metric that quantifies the average
percentage difference between the estimated and true values. To calculate the MAPE, the absolute
percentage difference between the estimated and true values is obtained, and the resulting values
are averaged. MAPE is expressed as a percentage. The model's performance improves as the
value of MAPE decreases, since it shows that the model's predictions exhibit a higher level of
accuracy when compared to the actual values in terms of percentages [49].

R-squared (R2) Score: The R2 score evaluates how much of the variability in the target variable
can be accounted for by the independent variables incorporated in the model, ranging between 0
and 1. A higher R2 value indicates a better model’s performance. The calculation of R2 involves
subtracting the ratio of the sum of squares of the residuals to the total sum of squares from 1. A R2
score of 1 denotes that the model fits the data perfectly, whereas a score of 0 indicates that the
model fails to explain any variation in the target variable [50].
34

It is important to choose the appropriate performance metrics on the basis of specific problem and
the type of data being analyzed. These metrics help researchers and practitioners understand the
strengths and weaknesses of the models, compare different algorithms, and make informed
decisions about model selection and improvement. By quantifying the model's performance,
performance metrics offer valuable insights into the effectiveness and reliability of ML models.
35
CHAPTER 4 RESULT AND DISCUSSION

The Results and Discussion section presents the findings and analysis of the predictive models
developed for the mechanical properties of API steels. The objective of this study was to explore
the effectiveness of various machine learning algorithms in predicting the yield strength and
tensile strength of API steels. In this section, we showcase the outcomes of the trained models and
discuss their performance in terms of accuracy, reliability, and interpretability. Furthermore, we
analyze the importance of different features in predicting the mechanical properties and offer a
deeper understanding of the fundamental connections among the composition and properties of
API steels. The results obtained from this study contribute to the understanding and prediction of
the mechanical behavior of API steels, enabling better decision-making and optimization in the
selection and utilization of these materials in various applications.

The feature importance analysis provides valuable insights into the contribution of each input
variable in predicting the mechanical properties of API steels. Feature importance represents the
relative influence or significance of each feature in the predictive model. It helps us understand
which variables have the most impact on the outcome variable, such as the yield strength of API
steels. The feature importance is calculated based on the model's internal mechanism, such as the
weights assigned to features in linear regression or the split points in decision trees. By examining
the feature importance values, we can identify the key factors that affect the yield strength of API
steels and prioritize them for further investigation or optimization. Additionally, feature
importance analysis aids in the interpretability of the model by highlighting the most influential
features, enabling researchers and engineers to acquire a more profound comprehension of the
underlying relationships between the chemical composition and mechanical properties of API
steels.
36

CHAPTER 5 CONCLUSIONS
IN THIS CHAPTER, WE WILL CONCLUDE THE DIFFERENT TYPES OF ALGORITHMS THAT WE WILL USE TO
PREDICT THE YIELD STRENGTH AND TENSILE STRENGTH OF API STEEL AND OUT OF THESE, WHICH IS THE
MOST EFFECTIVE METHOD/MODEL .
37
REFERENCES

[1] ‘High-strength low-alloy steel’. https://en.wikipedia.org/wiki/High-strength_low-alloy_steel (accessed May 20,

2023).

[2] G. Ananta Nagu and T. K. G Namboodhiri, ‘Eﬀect of heat treatments on the hydrogen embrittlement
susceptibility of API X-65 grade line-pipe steel’, Bull. Mater. Sci, vol. 26, no. 4, pp. 435–439, 2003.

[3] A. K. Das, ‘The present and the future of line pipe steels for petroleum industry’, Center for Bioinformatics and
Molecular Biostatistics, vol. 25, no. 1–3, pp. 14–19, Jan. 2010, doi: 10.1080/10426910903202427.

[4] ‘5 Common Alloying Elements’. https://www.metalsupermarkets.com/5-common-alloying- elements/ (accessed

Apr. 15, 2023).

[5] W.E. White and G. I. Ogundele, ‘Inﬂuences of Dissolved Hydrocarbon Gases and Variable Water Chemistries on
Corrosion of an API-L80 Steel’, CORROSION, vol. 43, no. 11, pp. 665–673, 1987.

[6] R. Elgaddafi, R. Ahmed, and S. Shah, ‘Modeling and experimental studies on CO2-H2S corrosion of API carbon
steels under high-pressure’, J Pet Sci Eng, vol. 156, pp. 682–696, 2017, doi: 10.1016/j.petrol.2017.06.030.

[7] ‘API Steel Grade’. https://www.drinol.com/technology/api-steel-grade.html (accessed May 20, 2023).

[8] F. O. Kolawole, S. K. Kolawole, J. O. Agunsoye, J. A. Adebisi, S. A. Bello, and S. B. Hassan, ‘Mitigation of Corrosion
Problems in API 5L Steel Pipeline-A Review’, J. Mater. Environ. Sci, vol. 9, no. 8, pp. 2397–2410, 2018, [Online]. Available:
http://www.jmaterenvironsci.com!

[9] M. Ziomek-Moroz, ‘Environmentally assisted cracking of drill pipes in deep drilling oil and natural gas wells’, J
Mater Eng Perform, vol. 21, no. 6, pp. 1061–1069, Jun. 2012, doi: 10.1007/ s11665-011-9956-6.

[10] P. D. Thomas, ‘Steels for Oilwell Casing and Tubing - Past, Present and Future’, JOURNAL OF PETROLEUM
TECHNOLOGY, pp. 495–500, May 1963, [Online]. Available: http://onepetro.org/JPT/
article-pdf/15/05/495/2213849/spe-527-pa.pdf

[11] R. L. Amaro, E. S. Drexler, and A. J. Slifka, ‘Fatigue crack growth modeling of pipeline steels in high pressure
gaseous hydrogen’, Int J Fatigue, vol. 62, pp. 249–257, 2014, doi: 10.1016/ j.ijfatigue.2013.10.013.

[12] ‘Overview of API 660 - Shell-and-Tube Heat Exchangers’. https://inspectioneering.com/tag/ api+660 (accessed
May 21, 2023).
38

[13] C. Subramanian, ‘Localized pitting corrosion of API 5L grade A pipe used in industrial fire water piping applications’,
Eng Fail Anal, vol. 92, pp. 405–417, Oct. 2018, doi: 10.1016/ j.engfailanal.2018.06.008.

[14] ‘Types of Machine Learning’. https://www.javatpoint.com/types-of-machine-learning (accessed May 21, 2023).

[15] K. Noda, Y. Yamaguchi, K. Nakadai, H. G. Okuno, and T. Ogata, ‘Audio-visual speech recognition using deep
learning’, Applied Intelligence, vol. 42, no. 4, pp. 722–737, Jun. 2015, doi: 10.1007/s10489-014-0629-7.

[16] P. M. Nadkarni, L. Ohno-Machado, and W. W. Chapman, ‘Natural language processing: An introduction’, Journal of
the American Medical Informatics Association, vol. 18, no. 5. pp. 544–551, Sep. 2011. doi: 10.1136/amiajnl-2011-000464.

[17] J. Perols, ‘Financial statement fraud detection: An analysis of statistical and machine learning algorithms’, Auditing:
A Journal of Practice & Theory, vol. 30, no. 2, pp. 19–50, May 2011, doi: 10.2308/ajpt-50009.

[18] C. Gomes, Z. Jin, and H. Yang, ‘Insurance fraud detection with unsupervised deep learning’,
Journal of Risk and Insurance, vol. 88, no. 3, pp. 591–624, Sep. 2021, doi: 10.1111/jori.12359.

[19] J. Nanduri, Y. Jia, A. Oka, J. Beaver, and Y. W. Liu, ‘Microsoft uses machine learning and optimization to reduce
e-commerce fraud’, INFORMS Journal on Applied Analytics, vol. 50, no. 1, pp. 64–79, Jan. 2020, doi:
10.1287/inte.2019.1017.

[20] F. O. Isinkaye, Y. O. Folajimi, and B. A. Ojokoh, ‘Recommendation systems: Principles, methods and evaluation’,
Egyptian Informatics Journal, vol. 16, no. 3. Elsevier B.V., pp. 261–273, Nov. 01, 2015. doi: 10.1016/j.eij.2015.06.005.

[21] A. Rajkomar, J. Dean, and I. Kohane, ‘Machine Learning in Medicine’, New England Journal of Medicine, vol. 380,
no. 14, pp. 1347–1358, Apr. 2019, doi: 10.1056/nejmra1814259.

[22] S. Makridakis, E. Spiliotis, and V. Assimakopoulos, ‘Statistical and Machine Learning forecasting methods: Concerns
and ways forward’, PLoS One, vol. 13, no. 3, Mar. 2018, doi: 10.1371/journal.pone.0194889.

[23] S. Y. Choi and D. Cha, ‘Unmanned aerial vehicles using machine learning for autonomous ﬂight; state-of-the-art’,
Advanced Robotics, vol. 33, no. 6, pp. 265–277, Mar. 2019, doi: 10.1080/01691864.2019.1586760.

[24] U. Bhandari, M. R. Rafi, C. Zhang, and S. Yang, ‘Yield strength prediction of high-entropy alloys using machine
learning’, Mater Today Commun, vol. 26, Mar. 2021, doi: 10.1016/ j.mtcomm.2020.101871.

[25] A. Choudhury, ‘Prediction and Analysis of Mechanical Properties of Low Carbon Steels Using Machine Learning’,
Journal of The Institution of Engineers (India): Series D, vol. 103, no. 1, pp. 303–310, Jun. 2022, doi:
10.1007/s40033-022-00328-y.
[26] M. Veeresham, R. Jain, U. Lee, and N. Park, ‘Machine learning approach for predicting yield strength of
nitrogen-doped CoCrFeMnNi high entropy alloys at selective thermomechanical processing conditions’, Journal of
Materials Research and Technology, vol. 24, pp. 2621–2628, May 2023, doi: 10.1016/j.jmrt.2023.03.146.

[27] X. Xu, L. Wang, G. Zhu, and X. Zeng, ‘Predicting Tensile Properties of AZ31 Magnesium Alloys by Machine Learning’,
JOM, vol. 72, no. 11, pp. 3935–3942, Nov. 2020, doi: 10.1007/ s11837-020-04343-w.

[28] C. N. N. Karina, P. jo Chun, and K. Okubo, ‘Tensile strength prediction of corroded steel plates by using machine
learning approach’, Steel and Composite Structures, vol. 24, no. 5, pp. 635–641, Aug. 2017, doi:
10.12989/scs.2017.24.5.635.

[29] B. H. Ziyad Sami et al., ‘Feasibility analysis for predicting the compressive and tensile strength of concrete using
machine learning algorithms’, Case Studies in Construction Materials, vol. 18, Jul. 2023, doi: 10.1016/j.cscm.2023.e01893.
39

[30] I. M. R. Najjar, A. M. Sadoun, M. A. Elaziz, H. Ahmadian, A. Fathy, and A. M. Kabeel, ‘Prediction of the tensile
properties of ultrafine grained Al–SiC nanocomposites using machine learning’, Journal of Materials Research and
Technology, vol. 24, pp. 7666–7682, May 2023, doi: 10.1016/j.jmrt.2023.05.035.

[31] A. Stoll and P. Benner, ‘Machine learning for material characterization with an application for predicting
mechanical properties’, GAMM Mitteilungen, vol. 44, no. 1, Mar. 2021, doi: 10.1002/ gamm.202100003.

[32] M. A. Shaheen, R. Presswood, and S. Afshan, ‘Application of Machine Learning to predict the mechanical
properties of high strength steel at elevated temperatures based on the chemical composition’, Structures, vol. 52, pp.
17–29, Jun. 2023, doi: 10.1016/j.istruc.2023.03.085.

[33] ‘Pearson correlation coefficient’. https://en.wikipedia.org/wiki/ Pearson_correlation_coefficient (accessed May 20,

2023).

[34] S.-W. Choi, ‘The Eﬀect of Outliers on Regression Analysis: Regime Type and Foreign Direct Investment’, Quart J Polit
Sci, vol. 4, pp. 153–165, 2009, doi: 10.1561/100.00008021_supp.

[35] M. M. Wagner, A. W. Moore, and R. M. Aryel, ‘Combining Multiple Signals for Biosurveillance’, in Handbook of
Biosurveillance, Academic Press, 2006, pp. 235–242.

[36] Induraj, ‘How to derive B0 and B1 in Linear Regression- Part2’, 2020. https://
induraj2020.medium.com/how-to-derive-b0-and-b1-in-linear-regression-4d4806b231fb (accessed Apr. 15, 2023).

Exploring Bentley STAAD.Pro CONNECT Edition, 3rd Edition
From Everand
Exploring Bentley STAAD.Pro CONNECT Edition, 3rd Edition
Prof. Sham Tickoo
5/5 (3)
Computational Simulation Tools in Engineering
From Everand
Computational Simulation Tools in Engineering
V. Ramesh Kumar,
No ratings yet
Service: Audi A6 2011 Audi A7 Sportback 2011
No ratings yet
Service: Audi A6 2011 Audi A7 Sportback 2011
160 pages
Prediction of Mechanical Properties For Deep Drawing Steel by Deep Learning
No ratings yet
Prediction of Mechanical Properties For Deep Drawing Steel by Deep Learning
10 pages
mathematics-12-01153-v2
No ratings yet
mathematics-12-01153-v2
22 pages
Xiong 2020
No ratings yet
Xiong 2020
9 pages
BTP Quater1 Group17
No ratings yet
BTP Quater1 Group17
29 pages
articles
No ratings yet
articles
8 pages
Ams 2011
No ratings yet
Ams 2011
4 pages
Artificial intelligence for the prediction of tensile properties by using microstructural parameters in high strength steels
No ratings yet
Artificial intelligence for the prediction of tensile properties by using microstructural parameters in high strength steels
13 pages
Materials 16 07354
No ratings yet
Materials 16 07354
15 pages
Study On The Correlation of Toughness With Chemica
No ratings yet
Study On The Correlation of Toughness With Chemica
7 pages
Engineering Critical Assessment (ECA) for Offshore Pipeline Systems
From Everand
Engineering Critical Assessment (ECA) for Offshore Pipeline Systems
Dr. T Sri
No ratings yet
Ibtesam' Thesis
No ratings yet
Ibtesam' Thesis
126 pages
1-s2.0-S1359645424010115-main
No ratings yet
1-s2.0-S1359645424010115-main
13 pages
Lee 2020 IOP Conf. Ser.: Mater. Sci. Eng. 967 012031
No ratings yet
Lee 2020 IOP Conf. Ser.: Mater. Sci. Eng. 967 012031
9 pages
Modeling of Steelmaking Process With Effective Machine Learning Techniques
No ratings yet
Modeling of Steelmaking Process With Effective Machine Learning Techniques
10 pages
Shaheen2023d_StructJ
No ratings yet
Shaheen2023d_StructJ
14 pages
Training Facility Norms and Standard Equipment Lists: Volume 1---Precision Engineering or Machining
From Everand
Training Facility Norms and Standard Equipment Lists: Volume 1---Precision Engineering or Machining
Fook Yen Chong
No ratings yet
Steel 4340
No ratings yet
Steel 4340
6 pages
Materials & Design: Chunyuan Cui, Guangming Cao, Yang Cao, Jianjun Liu, Zishuo Dong, Siwei Wu, Zhenyu Liu
No ratings yet
Materials & Design: Chunyuan Cui, Guangming Cao, Yang Cao, Jianjun Liu, Zishuo Dong, Siwei Wu, Zhenyu Liu
12 pages
Tensile Strength Prediction in Monel 400 Weldments Using Classification and Regression Algorithms in Machine Learning
No ratings yet
Tensile Strength Prediction in Monel 400 Weldments Using Classification and Regression Algorithms in Machine Learning
13 pages
A Review
No ratings yet
A Review
15 pages
Model For Mechanical Properties of Hot-Rolled Steels: Master Thesis
No ratings yet
Model For Mechanical Properties of Hot-Rolled Steels: Master Thesis
90 pages
Engineering in Aerospace Technologies
From Everand
Engineering in Aerospace Technologies
Nanda Iyengar
No ratings yet
Workshop Practice Manual
From Everand
Workshop Practice Manual
Jatinder Madan
No ratings yet
1 s2.0 S0308016120301319 Main
No ratings yet
1 s2.0 S0308016120301319 Main
8 pages
1-s2.0-S2666759224000805-main
No ratings yet
1-s2.0-S2666759224000805-main
13 pages
Training Facility Norms and Standard Equipment Lists: Volume 2---Mechatronics Technology
From Everand
Training Facility Norms and Standard Equipment Lists: Volume 2---Mechatronics Technology
Fook Yen Chong
No ratings yet
Waste to Energy in the Age of the Circular Economy: Compendium of Case Studies and Emerging Technologies
From Everand
Waste to Energy in the Age of the Circular Economy: Compendium of Case Studies and Emerging Technologies
Asian Development Bank
5/5 (1)
GAMM-Mitteilungen - 2021 - Stoll - Machine Learning For Material Characterization With An Application For Predicting
No ratings yet
GAMM-Mitteilungen - 2021 - Stoll - Machine Learning For Material Characterization With An Application For Predicting
21 pages
Machine Learning in Industry
100% (3)
Machine Learning in Industry
202 pages
Predicting Microstructure-Dependent Mechanical Properties In
No ratings yet
Predicting Microstructure-Dependent Mechanical Properties In
32 pages
Fin Irjmets1697356356
No ratings yet
Fin Irjmets1697356356
4 pages
ANSYS Workbench 2023 R2: A Tutorial Approach, 6th Edition
From Everand
ANSYS Workbench 2023 R2: A Tutorial Approach, 6th Edition
Prof. Sham Tickoo
No ratings yet
10 1108 - Jedt 11 2021 0637
No ratings yet
10 1108 - Jedt 11 2021 0637
29 pages
Environmental Safeguard Monitoring Field Kit: Project Implementation Directorate, Nepal
From Everand
Environmental Safeguard Monitoring Field Kit: Project Implementation Directorate, Nepal
Asian Development Bank
No ratings yet
0197
No ratings yet
0197
12 pages
ANSYS Workbench 2021 R1: A Tutorial Approach, 4th Edition
From Everand
ANSYS Workbench 2021 R1: A Tutorial Approach, 4th Edition
Prof. Sham Tickoo
No ratings yet
Predicting Material Properties Using Machine Learning for Accelerated Materials Discovery
No ratings yet
Predicting Material Properties Using Machine Learning for Accelerated Materials Discovery
9 pages
Heat-loss cycle prediction in steelmaking plants through artificial neural network
No ratings yet
Heat-loss cycle prediction in steelmaking plants through artificial neural network
13 pages
1822 b.e Cse Batchno 7
No ratings yet
1822 b.e Cse Batchno 7
60 pages
1-s2.0-S2238785424020192-main
No ratings yet
1-s2.0-S2238785424020192-main
27 pages
SOLIDWORKS Simulation 2018: A Tutorial Approach
From Everand
SOLIDWORKS Simulation 2018: A Tutorial Approach
Prof. Sham Tickoo
No ratings yet
AutoCAD MEP 2023 for Designers, 7th Edition
From Everand
AutoCAD MEP 2023 for Designers, 7th Edition
Prof. Sham Tickoo
No ratings yet
ANSYS Workbench 2019 R2: A Tutorial Approach, 3rd Edition
From Everand
ANSYS Workbench 2019 R2: A Tutorial Approach, 3rd Edition
Prof. Sham Tickoo
No ratings yet
Micro-Cutting: Fundamentals and Applications
From Everand
Micro-Cutting: Fundamentals and Applications
Dr. Kai Cheng
No ratings yet
18
No ratings yet
18
12 pages
CE1FD31949E74EA2BF2FD7B1EE557B9A-mark
No ratings yet
CE1FD31949E74EA2BF2FD7B1EE557B9A-mark
3 pages
Materials 16 05977
No ratings yet
Materials 16 05977
30 pages
MetModel- Microstructural Evolution Model for Hot Rolling and Prediction of Final Product Properties
No ratings yet
MetModel- Microstructural Evolution Model for Hot Rolling and Prediction of Final Product Properties
5 pages
AutoCAD Plant 3D 2021 for Designers, 6th Edition
From Everand
AutoCAD Plant 3D 2021 for Designers, 6th Edition
Prof. Sham Tickoo
No ratings yet
Flow Simulation Using SOLIDWORKS 2023
From Everand
Flow Simulation Using SOLIDWORKS 2023
Prof. Sham Tickoo
No ratings yet
Hardness prediction of high entropy alloys with machine learning and material descriptors selection by improved genetic algorithm
No ratings yet
Hardness prediction of high entropy alloys with machine learning and material descriptors selection by improved genetic algorithm
11 pages
s41524-024-01426-z
No ratings yet
s41524-024-01426-z
11 pages
Review On Application of Machine Learning in Predicting Mechanical Properties of Metals
No ratings yet
Review On Application of Machine Learning in Predicting Mechanical Properties of Metals
9 pages
Methodologies for Assessing Pipe Failure Rates in Advanced Water Cooled Reactors
From Everand
Methodologies for Assessing Pipe Failure Rates in Advanced Water Cooled Reactors
IAEA
No ratings yet
AutoCAD Electrical 2020 for Electrical Control Designers, 11th Edition
From Everand
AutoCAD Electrical 2020 for Electrical Control Designers, 11th Edition
Prof. Sham Tickoo
No ratings yet
AutoCAD Plant 3D 2018 for Designers, 4th Edition
From Everand
AutoCAD Plant 3D 2018 for Designers, 4th Edition
Prof. Sham Tickoo
No ratings yet
0 100029991
No ratings yet
0 100029991
48 pages
Jonathan Tenner1999, Optimisation of The Heat Treatment of Steel Using Neural Networks
No ratings yet
Jonathan Tenner1999, Optimisation of The Heat Treatment of Steel Using Neural Networks
337 pages
3c3033c5f2e58adafb23c44f1df5c655827ca9d9IT%20Endsem%202022
No ratings yet
3c3033c5f2e58adafb23c44f1df5c655827ca9d9IT%20Endsem%202022
2 pages
use-of-machine-learning-in-predictive-maintainance
No ratings yet
use-of-machine-learning-in-predictive-maintainance
5 pages
PREDICTIVE MAINTENANCE UNLEASHING APPLICATIONS OF MACHINE LEARNING A COMPREHENSIVE EXPLORATION
No ratings yet
PREDICTIVE MAINTENANCE UNLEASHING APPLICATIONS OF MACHINE LEARNING A COMPREHENSIVE EXPLORATION
5 pages
pdf
No ratings yet
pdf
7 pages
CD 4054
No ratings yet
CD 4054
13 pages
A. B. C. D. E. : Name Student ID Score
No ratings yet
A. B. C. D. E. : Name Student ID Score
11 pages
Wheel_Chock_Sizing_Guide
No ratings yet
Wheel_Chock_Sizing_Guide
1 page
Acti9 Ic60 - A9F54206
No ratings yet
Acti9 Ic60 - A9F54206
3 pages
526 SLC
No ratings yet
526 SLC
15 pages
4.3 Rates A Levels Chemistry
No ratings yet
4.3 Rates A Levels Chemistry
18 pages
All-Round Protection From The Refinery To The Engine: Hydac Dieselprotection
No ratings yet
All-Round Protection From The Refinery To The Engine: Hydac Dieselprotection
8 pages
Model AC55 Generator Set: Cummins Series
No ratings yet
Model AC55 Generator Set: Cummins Series
7 pages
Creep, Shrinkage Etc
No ratings yet
Creep, Shrinkage Etc
80 pages
DB-10 Bridge Plug
No ratings yet
DB-10 Bridge Plug
2 pages
Case Study Starbucks
No ratings yet
Case Study Starbucks
7 pages
Eurofighter World 2016-07 PDF
No ratings yet
Eurofighter World 2016-07 PDF
27 pages
JFEBEAR Running Manual (Rev.2)
No ratings yet
JFEBEAR Running Manual (Rev.2)
13 pages
Dust Collection Maintenance
No ratings yet
Dust Collection Maintenance
20 pages
How Meta-Lax Works
No ratings yet
How Meta-Lax Works
1 page
Design Calculation - G+1 Accommodation Block Building-R0
100% (1)
Design Calculation - G+1 Accommodation Block Building-R0
41 pages
ZF Astronic Error Codes
No ratings yet
ZF Astronic Error Codes
10 pages
DBA Interview Questions With Answers Part1
100% (1)
DBA Interview Questions With Answers Part1
134 pages
Em-Trak R300 AIS Receiver Product Brochure - MM
No ratings yet
Em-Trak R300 AIS Receiver Product Brochure - MM
4 pages
Metallurgical Insights 3
No ratings yet
Metallurgical Insights 3
3 pages
16 Wall Types
100% (3)
16 Wall Types
22 pages
Optional Gasket Kit: "134-S" Series
No ratings yet
Optional Gasket Kit: "134-S" Series
47 pages
Subsea - Page 1
100% (3)
Subsea - Page 1
39 pages
EDC Lab Manual
100% (1)
EDC Lab Manual
48 pages
Two-Time Logistic Map
No ratings yet
Two-Time Logistic Map
9 pages
Section 6: Foundation: 6.1 General
No ratings yet
Section 6: Foundation: 6.1 General
13 pages
Department of Electrical Engineering Rajasthan College of Engineering For Women Rajasthan Technical University, Kota
No ratings yet
Department of Electrical Engineering Rajasthan College of Engineering For Women Rajasthan Technical University, Kota
6 pages
Ball Valves, 6 In. Through 60 In. (150 MM Through 1,500 MM) : AWWA Standard
100% (1)
Ball Valves, 6 In. Through 60 In. (150 MM Through 1,500 MM) : AWWA Standard
52 pages
HD 465 605-7 Komatsu
No ratings yet
HD 465 605-7 Komatsu
8 pages

FINAL

Uploaded by

FINAL

Uploaded by

APPLICATION OF MACHINE LEARNING IN PREDICTING THE

MECHANICAL PROPERTIES OF API STEELS

SUBMITTED IN PARTIAL FULFILLMENT OF THE

AAKARSHIT TAWRA (2K21/PE/01)

Under the supervision of

DELHI TECHNOLOGICAL UNIVERSITY

Place: Delhi AAKARSHIT TAWRA (2K21/PE/01)

Date: ADITYA ROY (2K21/PE/02)

ISHANT CHAUDHRY (2K21/PE/23

KAUSTUBH KRISHNA CHAUKIYAL (2K21/PE/31)

AAKARSHIT TAWRA (2K21/PE/01)

CHAPTER 2 LITERATURE REVIEW...........................................................................10

CHAPTER 4 RESULT AND DISCUSSION.................................................................... 35

Table 3.1 Chemical Composition of API

Table 3.2 Mechanical Properties of

Motivation for the Study

●​ X: High-strength, low-alloy steel

●​ Q: Quenched and tempered high-strength steel

●​ H40: A low-strength, carbon-manganese steel with a minimum yield strength of 40 ksi,

●​ L80: A medium-strength, low-alloy steel with a minimum yield strength of 80 ksi,

1.3 MACHINE LEARNING

Key Components of Machine Learning

Types of Machine Learning

Importance of Machine Learning

Given below are some of the applications of Machine Learning:

4.​ Recommendation Systems: Machine learning powers recommendation systems used in

Challenges in Machine Learning

Despite its transformative potential, machine learning faces several challenges:

Future of Machine Learning

CHAPTER 2 LITERATURE REVIEW

2.1 YIELD STRENGTH

properties of steels depend on compositions, microstructure, and processing techniques. They

2.2 TENSILE STRENGTH

2.3 OTHER MECHANICAL PROPERTIES

3.1 DATA COLLECTION

Research papers served as a valuable source of scholarly articles, providing in-depth

3.2 DATA ANALYSIS

Pearson correlation coefficient was utilized to conduct correlation analysis in order to

Additionally, outlier analysis was conducted to identify observations that deviated

By performing descriptive analysis, correlation analysis, and outlier analysis, a

3.3 FEATURE SELECTION

Why Is Feature Selection Important?

Methods of Feature Selection

Feature Selection in This Project

3.4 MODEL TRAINING

Model Training in Your Project

3.4.1 LINEAR MODELS

3.4.1.1 MULTIPLE REGRESSION

3.4.2 REGULARIZED MODEL

3.4.2.1 LASSO REGRESSION

Key Features of Lasso Regression

3.4.2.2 RIDGE REGRESSION

Ridge Regression is another type of linear regression that incorporates L2 regularization.

Key Features of Ridge Regression

3.4.3 NON LINEAR MODEL

nleft = Records in the left subtree

nright = Records in the right subtree

3.4.4.1 Extreme gradient boosting

Extreme Gradient Boosting (XGBoost) is an enhancement of gradient boosting technique shown

3.5 MODEL EVALUATION

[1]​ ‘High-strength low-alloy steel’. https://en.wikipedia.org/wiki/High-strength_low-alloy_steel (accessed May 20,

[4]​ ‘5 Common Alloying Elements’. https://www.metalsupermarkets.com/5-common-alloying- elements/ (accessed

[7]​ ‘API Steel Grade’. https://www.drinol.com/technology/api-steel-grade.html (accessed May 20, 2023).

[14]​ ‘Types of Machine Learning’. https://www.javatpoint.com/types-of-machine-learning (accessed May 21, 2023).

[33]​ ‘Pearson correlation coefficient’. https://en.wikipedia.org/wiki/ Pearson_correlation_coefficient (accessed May 20,

You might also like

● X: High-strength, low-alloy steel

● Q: Quenched and tempered high-strength steel

● H40: A low-strength, carbon-manganese steel with a minimum yield strength of 40 ksi,

● L80: A medium-strength, low-alloy steel with a minimum yield strength of 80 ksi,

4. Recommendation Systems: Machine learning powers recommendation systems used in

[1] ‘High-strength low-alloy steel’. https://en.wikipedia.org/wiki/High-strength_low-alloy_steel (accessed May 20,

[4] ‘5 Common Alloying Elements’. https://www.metalsupermarkets.com/5-common-alloying- elements/ (accessed

[7] ‘API Steel Grade’. https://www.drinol.com/technology/api-steel-grade.html (accessed May 20, 2023).

[14] ‘Types of Machine Learning’. https://www.javatpoint.com/types-of-machine-learning (accessed May 21, 2023).

[33] ‘Pearson correlation coefficient’. https://en.wikipedia.org/wiki/ Pearson_correlation_coefficient (accessed May 20,