Iron Ore Quality Prediction Using Machine Learning

The document discusses predicting the percentage of silica concentrate in iron ore using machine learning algorithms. It describes variables in an iron ore froth flotation process and the dataset used. Various regression algorithms are compared to determine the best model for predicting silica concentrate without using iron concentrate as a feature.

Uploaded by

rkgadalhat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views

Iron Ore Quality Prediction Using Machine Learning

Uploaded by

rkgadalhat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

International Journal of Scientific Research in Engineering and Management (IJSREM)

Volume: 06 Issue: 06 | June - 2022 Impact Factor: 7.185 ISSN: 2582-3930

Iron ore Quality prediction Using Machine Learning

Chetan Kumar G S1, Kavana K M2
1Assitant Professor, Department of Master of Computer Application,UBDTCE,Davangere
2Kavana K M,PG Student, Department of Master of Computer Application,UBDTCE,Davangere

ABSTRACT:

The main goal of this project is to predict how much impurity Research Objectives
is in the ore concentrate. The% of Silica is measured in a lab
experiment it takes at least one hour for the process engineers
to have this value. As this impurity is measured every hour, if 1. To evaluate the feasibility of using machine learning
we can predict how much silica (impurity) is in the ore algorithms to predict in real-time the percentage of silica
concentrate, we can help the engineers, giving them early concentrate of froth flotation processing plant.
information to take actions (empowering!). Hence, they will
be able to take corrective actions in advance (reduce impurity,
if it is the case) and also help the environment (reducing the 2. Model selection: The project finds out which variable
amount of ore that goes to tailings as you reduce silica in the associated with iron ore extraction is statistically significant.
ore concentrate).

1.Introduction
3. Estimate: The project will propose a model to predict
The approach is simple. It aims whether we can predict the percentage of silica concentrate in froth flotation
silica concentrate without iron concentrate and approached
with simple way of developing the model with concentrate and
model without concentrate and compare the performance of 2.LITERATURE REVIEW
model using various regression metric like R^2 or MAE and Column Process DESCRIPTION OF
drawing conclusion based on the results. VARIABLES IN FORTH
PLANT
When multiple dependent variables exist in a regression Date date of the measurement
model, this task is called as multi-target regression. In this % Iron Feed % of Iron that comes
case, a multi-output regressor is employed to learn the from the iron ore that is
mapping from input features to output variables jointly. In this
being fed into the
study, multi-target regression technique is implemented for
quality prediction in a mining process to estimate the amount flotation cells
of silica and iron concentrates in the ore at the end of the % Silica Feed % of silica (impurity) that
process. In the experimental studies, different regressors that comes from the iron ore
use Random Forest, AdaBoost, k-Nearest Neighbors and that is being fed into the
Decision Tree algorithms separately in the background were flotation cells
compared to determine the best model. Coefficient of Starch Flow Starch (reagent) Flow
determination (R 2 ) measure was used as the evaluation measured in m3/h
metric. There are some studies that predict iron concentrate Amina Flow Amina (reagent) Flow
and silica concentrate separately. However, this Model measured in m3/h
provides a new contribution to the field by calculating these Ore Pulp Flow t/h
two values jointly since they have a great correlation. Ore Pulp pH pH scale from 0 to 14
Our Approaches is whether Ore Pulp Density Density scale from 1 to 3
1. % Iron Concentrate is correlated with % Silica Concentrate kg/cm³
2.Predict the % silica concentrate without using % iron Flotation Column 01 Air Air flow that goes into
concentrate . Flow the flotation cell
3. If it is correlated and we can predict both % Iron and Silica measured in Nm³/h
concentrate at same time using power of ML and DL . Flotation Column 02 Air Air flow that goes into

© 2022, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM14486 | Page 1

International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 06 Issue: 06 | June - 2022 Impact Factor: 7.185 ISSN: 2582-3930

Flow the flotation cell dataset from data analytic practitioners. Data scientists
measured in Nm³/h compete to build the best model for both descriptive and
Flotation Column 03 Air Air flow that goes into predictive analytic. It however allows individual to access
Flow the flotation cell their dataset in order create models and also work with other
measured in Nm³/h data scientist to solve various real world analytics problems.
The input dataset used in developing this model has been
Flotation Column 04 Air Air flow that goes into
downloaded from Kaggle. The dataset contains design
Flow the flotation cell
characteristics of iron ore froth flotation processing plant
measured in Nm³/h which were put together within three months. This is nicely
Flotation Column 05 Air Air flow that goes into organized using common format and a standardized set of
Flow the flotation cell associate features of iron ore froth flotation system.
measured in Nm³/h
Flotation Column 06 Air Air flow that goes into Structure of Dataset
Flow the flotation cell
measured in Nm³/h The dataset contains 24 columns representing the
Flotation Column 07 Air Air flow that goes into measurements, 737,453 samples exist. The 24 columns include
the date and time of the measurement, which will not be used
Flow the flotation cell
as an input feature. The last columns of the dataset represent
measured in Nm³/h
the targets of this prediction task: the percentages of iron ore
Flotation Column 01 Froth level in the and silica concentrate, which are highly inversely correlated.
Level flotation cell measured in Our goal is to predict silica concentrate without the use of iron
mm (millimeters) concentrate. The other 21 columns will be used as features for
Flotation Column 02 Froth level in the predicting the target value. Description of each feature can be
Level flotation cell measured in found in Table above
mm (millimeters)
Flotation Column 03 Froth level in the
Level flotation cell measured in
2.2 Proposed Solution
mm (millimeters)
Flotation Column 04 Froth level in the Over the past two decades, there has been an upsurge of
Level flotation cell measured in academic research work within froth flotation process
mm (millimeters) fraternity. Though, a significant number of the plant
Flotation Column 05 Froth level in the processing problems are being successfully modelled using
Level flotation cell measured in machine learning algorithms but other unresolved issues and
mm (millimeters) impediment still remain.
Flotation Column 06 Froth level in the
Level flotation cell measured in
mm (millimeters) Random ForestRegressor
Flotation Column 07 Froth level in the
This method basically trains a number of classifying decision
Level flotation cell measured in
trees on various different subsamples. It benefits from
mm averaging mechanism to improve the predictive accuracy and
%Iron Concentrate % of Iron which to control over-fitting. Training samples are randomly selected
represents how much with replacement. The size of each new training set is the same
iron is presented in the as the original dataset. That is to say, a chosen instance is likely
end of the flotation to be chosen again and again as an element of distinct subsets.
process As input parameters, the number of trees in the algorithm and
% Silica Concentrate % of silica which maximum depth should be determined initially. The change in
represents how much their values may affect the performance and predictive power
iron is presented in the of the algorithm. Therefore, all possible parameters in the range
end of the flotation for the size of the dataset are given to the method and tested.
process The parameters leading to best results become candidates to be
2.1 Source of Data used. This method performs efficiently without causing too
much computational cost.
Kaggle is an online community for descriptive analysis and
predictive modelling. It collects variety of research fields’
© 2022, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM14486 | Page 2
International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 06 Issue: 06 | June - 2022 Impact Factor: 7.185 ISSN: 2582-3930

3.1BlockDiagram
3. THEORETICAL ANALYSIS

When multiple dependent variables exist in a regression model,

this task is called as multi-target regression. In this case, a Data set DataPre- Cheki Data
multi-output regressor is employed to learn the mapping from processing
3.2 SoftwareDesigning ng Visual
input features to output variables jointly. In this study, multi- Null ization
target regression technique is implemented for quality
prediction in a mining process to estimate the amount of silica
and iron concentrates in the ore at the end of the process.
PCA Feature Train And Spli
In this study, two inter-dependent single target regression tasks
Scaling Test Split ttin
are transformed into a multiple output regression problem for
Data g
quality prediction in a mining process.

In the previous models have been conducted to estimate silica

concentrate with or without taking iron concentrate as input
parameter. In this aspect, the problem is a single-target Deployem
regression problem. However, this study that focuses on the ent
estimation of both iron and silica concentrates simultaneously
as output variables. We compared different multi-target
regressors that use Random Forest, AdaBoost, XGBOOST
,RIDGE and Decision Tree algorithms separately in the Jupyter NotebookEnvironment
background. Coefficient of determination (R^2) metric and
MSE was used to evaluate predictive performance of the SpyderIde
regression methods for the mentioned data.
Machine LearningAlgorithms
The prediction error is defined as the difference between its
actual outcome value and its predicted outcome value.In this Python(pandas,numpy,matplotlib,seaborn,sklearn)
study, two metrics were used to compare models: - RMSE and
HTML
MAE. RMSE (root mean squared error) is calculated . This is
computed by taking the differences between the target and the Flask
actual algorithm outputs, squaring them and averaging over all
classes and internal validation samples . We developed this loan status prediction by using the Python
language which is a interpreted and high level programming
MAE (mean absolute error/deviation) is calculated as MAE language and usng the Machine Learning algorithms. for
This gives the magnitude of the average absolute error . coding we used the Jupyter Notebook environment of the
Anaconda distributions and the
Spyder,itisanintegratedscientiﬁcprogramminginthe
pythonlanguage.

For creating an user interface for the prediction we used the

Flask. It is a micro web framework written in Python. It is
classiﬁed as a microframework because it does not require
particulartools or libraries.It
hasnodatabaseabstractionlayer,formvalidation,oranyother
componentswherepre-existingthird-
partylibrariesprovidecommonfunctions,andascripting language
to create a webpage is HTML by creating the templates to use
in th functions ofthe Flask andHTML.

International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 06 Issue: 06 | June - 2022 Impact Factor: 7.185 ISSN: 2582-3930

4.EXPERIMENTALINVESTIGATION 4. it can increase the thermal conductivity, change the

adhesive viscosity and increase the flame retardancy.
Below is the image of data set and it has totally 737453 data
points and 24 attributes. In this blog , we used the first 21 5. due to the fine grain size and reasonable distribution
attributes as independent variables and the last two attributes of silica fume, it can effectively reduce and eliminate
(% iron and % silica concentrate) as target variables. precipitation and stratification.

6. pure silicon powder, low content of impurities, stable

physical and chemical properties, so that the curing
5. RESULT material has good insulation properties and arc
resistance.
Dataset:-
DISADVANTAGES
In this analysis, we evaluate the predictive performance of the
aforementioned ML models. The values for RMSE, MSE, and 1. dry shrinkage.
R2 for all models are reported in Figure below. Low values for 2. it is easy to produce temperature cracks.
RMSE and MSE, and high values for R2 indicate better model 3. Silica fume requires a high amount of water and
performance, respectively. For simplicity of discussion, needs to be used with a superplasticizer.
RMSE is used as the primary metric of performance. In 4. The price of silica fume is relatively high compared
addition, both the testing and validation performance is to cement and fly ash.
reported, which facilitates the discussion on overfitting in the 5. Silica fume will increase the autogenous shrinkage of
models. These performance measures are plotted as boxplots the cement slurry, and the amount of inclusion will
to illustrate the range and variance of the error. exceed 5%, which may increase the risk of
cracking.It is easy to cause cracks in mortar and
r2_score mse rmse concrete and need concrete maintenance..
% silica 0.99417 0.002450 0.049504
concentrate 7.CONCLUSION
without using %
iron concentrate  This Project presents a simple mathematical model to
. predict the quality prediction in a mining process
0.99391 0.001925 0.043886 from the early time test results. In this study, the slica
% silica
concentrate characteristic with date is modeled by a
concentrate with
Random forest regression mathematical equation.
using % iron
Early age test data are being used in this case to get
concentrate .
reliable values of the 20 seconds silica prediction.
Herein, a simple and practical approach has been
described for prediction of quality prediction in a
6. ADVANTAGES AND DISADVANTAGES mining process and the proposed technique can be
used as a reliable tool for assessing the mining
ADVANTAGES process from quite early test results. This will help in
making quick decision at site and reduce delay in the
1. Silica is basically impurity in iron ore and execution time of large construction projects.
by predicting the impurity in ore we can help the  To predict the silica(impurity) % in the ore
engineers in the plant to take measurements in early concentrate in a less time we are building a predictive
stages of manufacturing. To help the environment by analytics system in that we are applying various
reducing the amount of ore that goes to tailing as you machine learning algorithms and find the best
reduce silica in the ore concentrate. accurate model. Here web application will be used to
2. silica fume is a kind of neutral inorganic filler with display the prediction . The web application is built
very stable physical and chemical properties. It does by using flask framework and it is integrated with
not contain crystalline water, does not participate in trained ML model.
the curing reaction, and does not affect the reaction
mechanism.

3. good infiltration for various kinds of resin, good

adsorption performance, easy to mix, no
agglomeration phenomenon.

International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 06 Issue: 06 | June - 2022 Impact Factor: 7.185 ISSN: 2582-3930

8. REFERENCES Prediction in a Mining Process. Retrieved September

7, 2019, from Kaggle.com website:
 Breiman, L., &Schapire, R. (2001). Random Forests. https://www.kaggle.com/edumagalhaes/quality-
IEEE, 45, 5–32. Retrieved from prediction-in-a-mining-process
https://www4.stat.ncsu.edu/~lu/ST7901/reading%20  Li, C., Sun, H., Bai, J., & Li, L. (2010). Innovative
materials/Breiman2001_Article_Random Forests.pdf. methodology for comprehensive utilization of iron
 https://ieeexplore.ieee.org/abstract/document/8907120 ore tailings. Journal of Hazardous Materials
.
 Eduardo Magalhães Oliveira. (2017). Quality

Sg270-Bv-Saf-010 - 27apr2011 2
No ratings yet
Sg270-Bv-Saf-010 - 27apr2011 2
90 pages
N5 Mathematics
No ratings yet
N5 Mathematics
24 pages
Chapter 1
No ratings yet
Chapter 1
38 pages
Accomplishment Stories Assignment - Yogesh Nizzer
No ratings yet
Accomplishment Stories Assignment - Yogesh Nizzer
4 pages
Krish - Moving Laggards To Early Adopters, NcKinsey & Co
No ratings yet
Krish - Moving Laggards To Early Adopters, NcKinsey & Co
18 pages
Enbridge Case Study Paper - 2018
100% (1)
Enbridge Case Study Paper - 2018
16 pages
Fact Sheet, National Diploma, Mechanical Engineering N1-N6 V240521
No ratings yet
Fact Sheet, National Diploma, Mechanical Engineering N1-N6 V240521
3 pages
N5 Mechanotechnics August 2022 Question Paper
No ratings yet
N5 Mechanotechnics August 2022 Question Paper
9 pages
Digital Transformation - AI _ Data
No ratings yet
Digital Transformation - AI _ Data
35 pages
3Cs Principle Document Lean Model
No ratings yet
3Cs Principle Document Lean Model
9 pages
Q317 Full Issue PDF
No ratings yet
Q317 Full Issue PDF
124 pages
Shewhart, Walter A. - Economic Control of Quality Manufactured Product-American Society For Quality (ASQ) (1980) - 250-274
No ratings yet
Shewhart, Walter A. - Economic Control of Quality Manufactured Product-American Society For Quality (ASQ) (1980) - 250-274
25 pages
McKinsey Launches New Product Suite To Help Clients Scale AI
No ratings yet
McKinsey Launches New Product Suite To Help Clients Scale AI
5 pages
Hypothesis-Driven Problem Solving: - Most Effective Way To Solve A Labyrinth Problem? - Usually To Start From The Goal..
No ratings yet
Hypothesis-Driven Problem Solving: - Most Effective Way To Solve A Labyrinth Problem? - Usually To Start From The Goal..
13 pages
Lecciones para Construir Mentalidades Ágiles
No ratings yet
Lecciones para Construir Mentalidades Ágiles
7 pages
DS Ebook - Technical Due Diligence 1
No ratings yet
DS Ebook - Technical Due Diligence 1
6 pages
Maintenance Strategies
No ratings yet
Maintenance Strategies
11 pages
Understanding Tool: Mckinsey 7S Model
No ratings yet
Understanding Tool: Mckinsey 7S Model
9 pages
Industry Supply Curve
No ratings yet
Industry Supply Curve
11 pages
Mechanotechnics N4 QP Nov 2019
No ratings yet
Mechanotechnics N4 QP Nov 2019
8 pages
PESTLE Analysis Templates
No ratings yet
PESTLE Analysis Templates
4 pages
Metals Mining Industry in India 2023-2028 Part-II
No ratings yet
Metals Mining Industry in India 2023-2028 Part-II
37 pages
Candidate Guide India
No ratings yet
Candidate Guide India
26 pages
Corporate Brochure Worldsensing
No ratings yet
Corporate Brochure Worldsensing
8 pages
KPMG's DT Playbook For BITS Pilani
No ratings yet
KPMG's DT Playbook For BITS Pilani
34 pages
Firm Learning - Chart Type Selection
No ratings yet
Firm Learning - Chart Type Selection
1 page
Immediate download Automated Machine Learning for Business R. Larsen ebooks 2024
100% (3)
Immediate download Automated Machine Learning for Business R. Larsen ebooks 2024
66 pages
KPMG Lean Six Sigma
No ratings yet
KPMG Lean Six Sigma
16 pages
Systems Engineering
No ratings yet
Systems Engineering
67 pages
Lean_Six_Sigma_Green_Belt_Program
No ratings yet
Lean_Six_Sigma_Green_Belt_Program
6 pages
The McKinsey Model
No ratings yet
The McKinsey Model
2 pages
Cloud Computing: Post Graduate Program in
No ratings yet
Cloud Computing: Post Graduate Program in
20 pages
5-Rare_20Raw_20Material_20Issues
No ratings yet
5-Rare_20Raw_20Material_20Issues
41 pages
PowerPoint Template - 01Jan2025
No ratings yet
PowerPoint Template - 01Jan2025
216 pages
Big Brand Retail Story PDF
No ratings yet
Big Brand Retail Story PDF
10 pages
2024 JMP Discovery Summit - Advanced Decision SHL Rev3
No ratings yet
2024 JMP Discovery Summit - Advanced Decision SHL Rev3
17 pages
Understanding Capstone Project DQLab
No ratings yet
Understanding Capstone Project DQLab
48 pages
2011.12 - Korean Polysilicon Players
No ratings yet
2011.12 - Korean Polysilicon Players
9 pages
Using 360 Degree Evaluation Methods
No ratings yet
Using 360 Degree Evaluation Methods
26 pages
Managing Uncertainty in Supply Chain Safety Inventory
No ratings yet
Managing Uncertainty in Supply Chain Safety Inventory
39 pages
1b - McKinsey - How-Pharma-Can-Accelerate-Business-Impact-From-Advanced-Analytics PDF
No ratings yet
1b - McKinsey - How-Pharma-Can-Accelerate-Business-Impact-From-Advanced-Analytics PDF
10 pages
Stakeholder Engagement Strategy
No ratings yet
Stakeholder Engagement Strategy
12 pages
Mckinsey'S 7'S Framework: Prof. DR - Salim G Sonekhan - Sdmcet-Dharwad
No ratings yet
Mckinsey'S 7'S Framework: Prof. DR - Salim G Sonekhan - Sdmcet-Dharwad
5 pages
What Matters Most? Eight Priorities For Ceos in 2024
100% (1)
What Matters Most? Eight Priorities For Ceos in 2024
95 pages
NXP 2012 Analyst Day Final
100% (2)
NXP 2012 Analyst Day Final
109 pages
3M Env Solutions
No ratings yet
3M Env Solutions
170 pages
McKinsey Quarterly: New Tools For Negotiators
No ratings yet
McKinsey Quarterly: New Tools For Negotiators
12 pages
Using Predictive Analytics To Optimize Asset Maintenance in The Utilities Industry
100% (1)
Using Predictive Analytics To Optimize Asset Maintenance in The Utilities Industry
6 pages
Handout 1 Min313
No ratings yet
Handout 1 Min313
8 pages
Lecture I Innovation Management v05 Final
No ratings yet
Lecture I Innovation Management v05 Final
76 pages
Building MECE Hypotheses With Decision Trees
No ratings yet
Building MECE Hypotheses With Decision Trees
21 pages
Hearts & Minds - How To Win Hearts & Minds - POSTER (Excellent Pres of HSE Ladder, Beh Change)
100% (1)
Hearts & Minds - How To Win Hearts & Minds - POSTER (Excellent Pres of HSE Ladder, Beh Change)
1 page
Determining Optimal Lot Size
No ratings yet
Determining Optimal Lot Size
6 pages
Screener Templete Final
No ratings yet
Screener Templete Final
16 pages
Diagnostic Review Process - GUIDE
No ratings yet
Diagnostic Review Process - GUIDE
10 pages
1 s2.0 S1876610215012424 Main PDF
100% (1)
1 s2.0 S1876610215012424 Main PDF
6 pages
Human Ressources
No ratings yet
Human Ressources
81 pages
Quality Prediction in A Mining Process
No ratings yet
Quality Prediction in A Mining Process
4 pages
Purities Prediction in A Manufacturing Froth Flotation Plant The Deep Learning Techniques
No ratings yet
Purities Prediction in A Manufacturing Froth Flotation Plant The Deep Learning Techniques
12 pages
11.ABM SoftSensor MachineLearning DeepLearning
No ratings yet
11.ABM SoftSensor MachineLearning DeepLearning
13 pages