Soil Nutrient Analysis (1)
Soil Nutrient Analysis (1)
Modeling
Raji Gupta Shiv Prakash Singh Rishabh Gupta
dept.of Information dept.of Information Technology dept.of Information Technology
Technology Ajay Kumar Ajay Kumar Garg Ajay Kumar Garg
Garg Engineering Engineering College Engineering College
College
𝑠𝑝. 𝑠ℎ𝑖𝑣. 2202@𝑔𝑚𝑎𝑖𝑙. 𝑔𝑢𝑝𝑡𝑎. 𝑟𝑖𝑠ℎ2501@𝑔𝑚𝑎𝑖𝑙.
Ghaziabad, India Ghaziabad, India
𝑟𝑎𝑗𝑖𝑔𝑢𝑝𝑡𝑎1003@𝑔𝑚𝑎𝑖𝑙.
Ghaziabad, India
𝑐𝑜𝑚 𝑐𝑜𝑚
𝑐𝑜𝑚
Shikha Agarwal
Saurabh dept.of Information Technology
dept.of Information Ajay Kumar Garg
Technology Ajay Kumar Engineering College
Garg Engineering
𝑠ℎ𝑖𝑘ℎ𝑎𝑎𝑔𝑙03@𝑔𝑚𝑎𝑖𝑙.
Ghaziabad, India
College
𝑐𝑜𝑚
𝑠𝑎𝑢𝑟𝑎𝑏ℎ2113076@𝑔𝑚𝑎𝑖𝑙.
Ghaziabad, India
𝑐𝑜𝑚
Abstract— Agriculture is a crucial part of India's economy, becomes imperative. Use of modern technical and logistic
contributing over 17% to the nation's GDP(Gross Domestic approach with an intention to cater the shortcomings is the goal
Product), engaging over 60% of the workforce but still the to be achieved. Considering its vital role in the life of masses,
effort-to-yield ratio of the sector remains significantly low. in the
Analyzing soil properties, such as temperature, pH, humidity,
water retention capacity, etc., helps determine soil quality,
which is crucial for optimal agricultural productivity. This
assessment later recommended crops that align with soil
conditions and external environments, maximizing yield
potential, and sustainable land management thereby
preventing degradation.
Similar to most studies aimed at advancing agriculture, the
main objective of this research is to drive development by
implementing ML-based models for crop classification. A
model trained using various algorithms like K-Nearest
Neighbors (KNN) algorithm, Support Vector Classification
(SVC), and more can accurately analyze soil characteristics.
By leveraging the strengths of each algorithm, such as
Gradient Boosting ability to reduce bias and Random Forest's
robustness, this ensemble model improves the prediction of
suitable crops.
I. INTRODUCTION
𝑤 .𝑓 + 𝑤 .𝑓 + . . . +𝑤 .𝑓 + 𝑏 = 0
if exists, are removed to avoid overfitting the model.
1 2 2 6 6
1 2
separated values to train and test the data. The ratio of train to
test dataset is usually 80:20, allowing to use approximately and b is bias attached.
80% of the dataset for training purposes and achieve ● Once such an optimal line is found, the new sample
precision-based data models. This ensures that the model vector features are plugged into the equation to check
performs efficiently on unseen data and allows testing its which side it falls. Very similar to,
efficiency at fullest. Later, cross-validation techniques are if the equation is greater than 0, it is Crop A.If the
used to avoid overfitting of the model. equation is less than 0, it is Crop B.
(𝑝 )
current point and all the other points in the dataset.
𝑖
where 𝑝 is the proportion of items in a group
These distances are arranged in ascending order
.
𝑖
2
𝑑 =
(𝑥2 − 𝑥 ) + 2(𝑦 −
belonging to class i.
2 2
𝑖 2 𝑖
–
𝐸 = 𝑦 − 𝐹
difference iteratively. multiple prediction is selected.The label produced
(𝑥 )
maximum times as the output is chosen as the final
𝑖 𝑖 𝑜 𝑖
𝑡ℎ
where 𝐸 = error residue at trees are trained over a random sample of the data.
𝑖
stage
𝑦 = true value (here,
�
�label crop)
𝑖
𝐹 (𝑥 ) = initial predicted value
𝑜 𝑖
IV. RESULT
V. CONCLUSION