
Student Performance Evaluation in Educat

This document discusses using data mining techniques like clustering and prediction to evaluate student performance in education. It focuses on using the K-means clustering algorithm and classification models. The K-means algorithm is used to cluster similar groups of students. Classification is then used to predict student performance based on personal, social, and other variables. The goal is to identify students with low academic achievement early so teachers can provide more support to improve performance. A naive Bayesian classifier is proposed to classify students due to its ability to handle large datasets and provide reasonably accurate results.

IJSRD - International Journal for Scientific Research & Development | Vol. 3, Issue 10, 2015 | ISSN (online): 2321-0613

Student Performance Evaluation in Education Sector using Prediction and Clustering Algorithms
Solankar Punam Anil¹, Jagatap Trupti Baban², Rupnawar Sachin Hanumant³, Shitole Vibhavari Jayvant⁴, Prof. Kumbhar S. L.⁵
¹,²,³,⁴Student, ⁵Assistant Professor
¹,²,³,⁴,⁵Department of Computer Engineering
¹,²,³,⁴,⁵SBPCOE Indapur
Abstract— Data mining is a crucial step in discovering previously unknown information in large relational databases. It uses various techniques and algorithms, such as association rules, clustering, classification and prediction, each of which has its own characteristics and behaviour. This paper focuses on clustering and prediction. The amount of data stored in educational databases is increasing rapidly. A database for a particular set of students was collected, clustering and prediction were carried out on it in detail, and results were produced. The K-means clustering algorithm is used to place similar students in the nearest possible cluster. In India, performance in higher education is a turning point for all students. This academic performance is influenced by various factors, so it is important to develop a predictive data mining model of student performance that distinguishes high learners from slow learners.
Key words: Data Mining, Clustering, Classification, Predictive Model

I. INTRODUCTION
The ability to predict a student's performance is very important in the educational sector. Student performance depends on various factors, including personal, social and other variables. A promising tool for this objective is data mining, which is simply the process of extracting useful information from large amounts of data. Different techniques are used to mine unknown data, such as supervised and unsupervised learning, pattern mining, clustering, classification, prediction and association rules. The K-Means algorithm, on which this paper concentrates, is a partitional clustering technique and an unsupervised learning algorithm. Clustering is the process of grouping a given set of patterns into disjoint clusters, such that patterns in the same cluster are alike and patterns belonging to two different clusters are different.
This paper also uses another data mining technique, classification. Classification is a predictive data mining technique: it makes predictions about values of data using known results found from other data. Classification maps data into predefined sets of classes. It is often referred to as supervised learning because the classes are determined before the data are examined. Effective prediction of student performance needs models that include all personal, social and other variables. Predicting student performance with high accuracy helps to recognize students with low academic achievement at an early stage. The students so identified can then be given more attention by the teacher so that their performance improves in future.
The remainder of the paper is organized as follows. Section II reviews related work. Section III describes the proposed work and the data set. Section IV presents the data processing methods: clustering, the K-Means algorithm, naive Bayes classification and the implementation of the mining model. Section V gives the conclusion.

II. RELATED WORK
Brijesh Kumar Baradwaj and Saurabh Pal (International conference on computer supported) address predicting students' future learning behaviour with the use of student modeling. The goal is achieved by building student models that gather the learner's characteristics, including information such as their knowledge, behaviour and motivation to learn. Learning is also measured through the learner's user experience and overall satisfaction.
Suchita Borkar and K. Rajeswari studied the use of educational data mining for discovering patterns that help improve student performance, and identified the attributes that affect students' academic performance. In that paper a clustering algorithm is used to handle large amounts of data, and higher prediction accuracy and processing speed are reported.
S. Ganga and Dr. T. Meyyappan describe educational data mining as a research area concerned with applying data mining, machine learning and statistics to information generated in educational settings. At a high level, the field seeks to develop methods for exploring this data.
Clustering algorithms are categorized into unsupervised and semi-supervised strategies according to whether prior knowledge about the clusters and the clustering process is available to improve the clustering of student data. Our work focuses on the K-Means algorithm as a centroid-based unsupervised model. Prediction of academic performance can be achieved with various algorithms, and systematic commentaries exist on the clustering techniques applied in educational data mining and on their implications.
Pooja Thakar, Anil Mehta and Manisha conducted a study on performance analysis and prediction in educational data mining.

All rights reserved by www.ijsrd.com 229


Student Performance Evaluation in Education Sector using Prediction and Clustering Algorithms
(IJSRD/Vol. 3/Issue 10/2015/054)

III. PROPOSED WORK

A. Data Set:
We are collecting data from an engineering college. The data show students' academic performance together with personal information such as economic and family problems. The recorded data also include each student's family background, student details, subject marks and semester-wise percentages.

Fig. 1: Analysis of student performance

IV. DATA PROCESSING METHODS

A. Clustering:
Cluster analysis, or clustering, is the task of grouping a set of objects in such a way that objects in the same group (cluster) are more similar to each other than to those in other groups. It is a main task of exploratory data mining and a common technique in statistical analysis, and it is also useful in machine learning, pattern recognition, image analysis, bioinformatics and information retrieval.

B. K-Means Clustering Algorithm:
K-Means is one of the unsupervised learning algorithms used to solve the clustering problem. The algorithm is simple and easy to understand. The main idea is to define K centers, one for each cluster. These centers should be placed carefully, because different locations lead to different results. The better choice is to place them as far away from each other as possible. The next step is to take each point belonging to the given data set and associate it with the nearest center. When no point is pending, this first step is complete and an early grouping is done. At this point K new centroids are recalculated as the barycenters of the clusters resulting from the previous step. With these K new centroids, a new binding is made between the same data set points and the nearest new center. This generates a loop, in which the K centers change their location step by step until they no longer move. The algorithm thus aims at minimizing an objective function known as the squared error function.

C. Naive Bayes:
In machine learning, naive Bayes classifiers are a family of simple probabilistic classifiers based on applying Bayes' theorem with strong (naive) independence assumptions between the features. Naive Bayes classifiers are highly scalable, requiring a number of parameters linear in the number of variables (features/predictors) in a learning problem. Maximum-likelihood training can be done by evaluating a closed-form expression, which takes linear time, rather than by the expensive iterative approximation used for many other types of classifiers. A naive Bayesian model is easy to build, with no complicated iterative parameter estimation, which makes it particularly useful for very large datasets. Despite its simplicity, the naive Bayesian classifier often does surprisingly well and is widely used, and on some problems it can outperform more sophisticated classification methods.

D. Implementation of Mining Model
In implementing the mining model, various algorithms and techniques such as classification, clustering, regression, neural networks, artificial intelligence, association rules, genetic algorithms and decision trees are used for knowledge discovery from databases. Alongside K-Means clustering, classification is one of the problems most frequently studied by data mining and machine learning (ML) researchers. It consists of predicting the value of a categorical class attribute based on the values of the other (predictor) attributes. There are different classification methods. In the present study we use the Bayesian classification algorithm.
Bayes classification is based on the conditional probability of Bayes' rule. Bayes' rule is a technique for estimating the likelihood of a property given a set of data as evidence or input. Bayes' theorem is as follows:

$$P(H \mid X) = \frac{P(X \mid H)\,P(H)}{P(X)}$$

where H is the hypothesis (for example, that a tuple belongs to a given class) and X is the observed evidence.
The approach is called "naive" because it assumes independence between the various attribute values. Naive Bayes classification can be viewed as both a descriptive and a predictive type of algorithm. The probabilities are descriptive and are then used to predict the class membership of a target tuple.
The naive Bayes approach has several advantages. It is easy to use. Unlike other classification approaches, it requires only one scan of the training data. It easily handles missing values by simply omitting the corresponding probability. It also requires only a small amount of training data to estimate the parameters (means and variances of the variables) necessary for classification: because independent variables are assumed, there is no need to determine the entire covariance matrix, only the variances of the variables for each class. In spite of their naive design and apparently over-simplified assumptions, naive Bayes classifiers have worked quite well in many complex real-world situations.

V. CONCLUSION
By applying naive Bayesian classification and the K-Means clustering technique to a student database, students' academic performance can be predicted. The data predicted from the student database are useful for improving student performance. This study shows student performance and makes it easy to identify students with low marks and poor performance. According to the present study, the performance of the
students does not always depend on their own efforts. According to our investigation, other factors also influence students' performance. This proposal will improve on the insights offered by existing methods.

REFERENCES
[1] Ali Buldu, Kerem Üçgün. Data mining application on students' data. Procedia Social and Behavioral Sciences 2, 5251–5259, 2010.
[2] Singh, Randhir. An Empirical Study of Applications of Data Mining Techniques for Predicting Student Performance in Higher Education, 2013.
[3] Baha Sen, Emine Ucar. Evaluating the achievements of computer engineering department of distance education students with data mining methods. Procedia Technology 1, 262–267, 2012.
[4] Baradwaj, Brijesh Kumar, and Saurabh Pal. Mining Educational Data to Analyze Students' Performance. arXiv preprint arXiv:1201.3417, 2012.
[5] Castro, Félix, et al. Applying data mining techniques to e-learning problems. Evolution of teaching and learning paradigms in intelligent environment. Springer Berlin Heidelberg, 183-221, 2007.
[6] Huebner, Richard A. "A survey of educational".
[7] Ramaswami, M., and R. Bhaskaran. A CHAID based performance prediction model in educational data mining. arXiv preprint arXiv:1002.1144, 2010.
[8] Pool, Lorraine Dacre, Pamela Qualter, and Peter J. Sewell. "Exploring the factor structure of the CareerEDGE employability development profile." Education + Training 56.4 (2014): 303-313.
[9] Saranya, S., R. Ayyappan, and N. Kumar. "Student Progress Analysis and Educational Institutional Growth Prognosis Using Data Mining." International Journal of Engineering Sciences & Research Technology, 2014.
[10] Hicheur Cairns, Awatef, et al. "Towards Custom-Designed Professional Training Contents and Curriculums through Educational Process Mining." IMMM 2014, The Fourth International Conference on Advances in Information Mining and Management. 2014.
[11] Archer, Elizabeth, Yuraisha Bianca Chetty, and Paul Prinsloo. "Benchmarking the habits and behaviors of successful students: A case study of academic-business collaboration." The International Review of Research in Open and Distance Learning 15.1 (2014).
[12] Arora, Rakesh Kumar, and Dharmendra Badal. "Mining Association Rules to Improve Academic Performance." (2014).
[13] Peña-Ayala, Alejandro. "Educational data mining: A survey and a data mining-based analysis of recent works." Expert Systems with Applications 41.4 (2014): 1432-1462.
[14] Potgieter, Ingrid, and Melinde Coetzee. "Employability attributes and personality preferences of postgraduate business management students." SA Journal of Industrial Psychology 39.1 (2013): 01-10.
[15] Jantawan, Bangsuk, and Cheng-Fa Tsai. "The Application of Data Mining to Build Classification Model for Predicting Graduate Employment." International Journal of Computer Science and Information Security (2013).
[16] Azhar Rauf, Sheeba. "Enhanced K-Mean Clustering Algorithm to Reduce Number of Iterations and Time Complexity." Middle-East Journal of Scientific Research, Vol. 12 (7), pp. 959-963, 2012.
[17] Jaideep Vaidya. "Privacy Preserving K-Means Clustering over Vertically Partitioned Data." In Proceedings of SIGKDD '03, Washington, DC, USA, August 24-27, 2003.
[18] N. Sivaram. "Applicability of Clustering and Classification Algorithms for Recruitment Data Mining." International Journal of Computer Applications, Vol. 4(5), July 2010.
[19] Md. Hedayetul Islam Shovon. "Prediction of Student Academic Performance by an Application of K-Means Clustering Algorithm." International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 2(7), July 2012.
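
As a supplementary illustration of the K-Means procedure described in Section IV.B, the loop of assigning points to the nearest center and recalculating the centroids can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`sq_dist`, `kmeans`, `squared_error`), the farthest-point choice of initial centers and the sample (attendance %, marks %) values are all hypothetical and introduced only for illustration.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iter=100, seed=0):
    """Cluster `points` (tuples of floats) into k groups.

    Returns (centroids, labels), where labels[i] is the cluster index of points[i].
    """
    rng = random.Random(seed)
    # Assumed initialisation: one random center, then repeatedly the point
    # farthest from the centers chosen so far ("as far away as possible").
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )

    labels = None
    for _ in range(max_iter):
        # Assignment step: bind every point to its nearest center.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centers no longer move
        labels = new_labels
        # Update step: each centroid becomes the barycenter of its cluster.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster is empty
                centroids[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return centroids, labels


def squared_error(points, centroids, labels):
    """Objective minimised by K-Means: total squared distance to assigned centers."""
    return sum(sq_dist(p, centroids[lab]) for p, lab in zip(points, labels))


# Hypothetical records: (attendance %, marks %) for six students.
students = [(55.0, 40.0), (60.0, 45.0), (58.0, 42.0),
            (90.0, 85.0), (88.0, 80.0), (92.0, 88.0)]
centroids, labels = kmeans(students, k=2)
print(labels, squared_error(students, centroids, labels))
```

The stopping test compares successive assignments rather than successive centroid positions. The two are equivalent here, since unchanged assignments give unchanged barycenters.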

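
As a supplementary illustration of the naive Bayes approach described in Section IV, the sketch below trains a classifier for categorical attributes in a single scan of the training data and skips missing values when predicting. It is a hedged example rather than the authors' method: the function names (`train`, `predict`), the add-one (Laplace) smoothing and the sample attributes (attendance, family problem, previous result) are assumptions that do not appear in the paper.

```python
import math
from collections import Counter, defaultdict


def train(rows, labels):
    """Build count tables in one pass over the training data.

    rows   : list of tuples of categorical values (None marks a missing value)
    labels : list of class labels, one per row
    """
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)  # (class, attribute index) -> value counts
    domains = defaultdict(set)           # attribute index -> values seen in training
    for row, y in zip(rows, labels):
        for i, v in enumerate(row):
            if v is None:
                continue  # a missing value contributes no count
            value_counts[(y, i)][v] += 1
            domains[i].add(v)
    return class_counts, value_counts, domains


def predict(model, row):
    """Return the class with the highest posterior under the independence assumption."""
    class_counts, value_counts, domains = model
    total = sum(class_counts.values())
    best_class, best_score = None, -math.inf
    for y, n_y in class_counts.items():
        score = math.log(n_y / total)  # log prior P(class)
        for i, v in enumerate(row):
            if v is None or i not in domains:
                continue  # omit the probability of a missing attribute
            counts = value_counts[(y, i)]
            # Add-one smoothing (an assumption) avoids zero probabilities.
            p = (counts[v] + 1) / (sum(counts.values()) + len(domains[i]))
            score += math.log(p)  # log P(attribute value | class)
        if score > best_score:
            best_class, best_score = y, score
    return best_class


# Hypothetical training records: (attendance, family problem, previous result).
rows = [("high", "no", "good"), ("high", "no", "average"), ("high", "yes", "good"),
        ("low", "yes", "poor"), ("low", "no", "poor"), ("low", "yes", "average")]
labels = ["pass", "pass", "pass", "fail", "fail", "fail"]
model = train(rows, labels)
print(predict(model, ("high", "no", "good")))
print(predict(model, ("low", None, "poor")))
```

Working in log space keeps the product of many small probabilities from underflowing. For numeric attributes such as marks, a Gaussian variant that estimates a mean and variance per class would be the closer match to the parameters mentioned in the text.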