International Journal of Engineering Research & Science (IJOER), ISSN: 2395-6992, Vol-2, Issue-4, April 2016
Missing Data Imputation Methods in Classification Contexts
Juheng Zhang
Department of Operations and Information Systems, University of Massachusetts Lowell, Lowell, MA
Abstract: We examine different imputation methods that deal with missing data in classification contexts and compare the performance of the methods in an experimental study. We investigate the performance of the methods under the assumption that data are missing at random. We find that, as the number of missing holes in the data increases, the imputation methods deteriorate and their misclassification rates increase. We also examine the scenario where data are missing because of the strategic behaviors of data providers. We find that imputation methods play an important role in deterring strategic behaviors of data providers and minimizing the misclassification rate.
Keywords: missing data, imputation method, classification.
I. INTRODUCTION
In many empirical studies, data are missing for various reasons. Missing data may be caused by negligence of data collectors, poor experimental designs or procedures, or even deliberate hiding by data providers. The two general assumptions about missing data are: data missing at random and data missing strategically. The randomly-missing-data assumption holds that the missing values of an attribute are related neither to the values themselves nor to the values of other attributes. For instance, when a specific home address is missing in U.S. census data, the omission is likely due to a random reason. Under the strategically-missing-data assumption, data are missing for strategic reasons. For instance, an insurance applicant can purposely hide her/his smoking or drinking habits when applying for health insurance in the hope of a more likely approval. Another example is limited information disclosure in financial markets [5, 6]: certain companies strategically hide information from investors. Missing data are a common problem in many research fields such as economics, marketing, health, statistics, psychology, and education.
Missing data can lead to a number of problems [8]. High statistical power requires a large amount of data. When data are missing, the sample size decreases dramatically if only observations with complete data are used. Empirical studies found that if two percent of the values in a data set are missing at random, then eighteen percent of the total data can be lost when observations having a missing value are removed. Missing data thus decrease statistical power.
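The eighteen-percent figure can be reproduced with a short simulation of complete-case deletion. The attribute count is our assumption, not stated in the text; about ten attributes makes the numbers line up, since the expected retention under random holes is (1 - p)^k:

```python
import numpy as np

rng = np.random.default_rng(0)
n_rows, n_attrs = 10_000, 10  # the attribute count is a hypothetical choice
p_missing = 0.02              # 2% of cells missing completely at random

# Punch random holes into the data matrix.
mask = rng.random((n_rows, n_attrs)) < p_missing

# Complete-case analysis keeps only rows with no missing cell.
kept = (~mask.any(axis=1)).mean()

# Expected retention is (1 - p)^k, so the loss is 1 - 0.98**10, about 18%.
print(f"rows lost: {1 - kept:.1%}")
```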
In this study, we consider different imputation methods that are designed for either randomly missing data or strategically missing data. We compare the performance of the imputation methods in classification contexts under the assumption of data missing at random. We also examine the imputation methods when data providers act strategically and data are hidden intentionally. In the following section, we review related research and briefly discuss different imputation methods.
II. LITERATURE REVIEW
In the statistics field, a few imputation methods such as the Average Method, the Similarity Method, and the Regression Method have gained widespread acceptance. These methods normally consider attributes with continuous values. They have become conventional methods for dealing with missing data, and we see adaptations of these methods in different fields. We refer readers to the survey papers [4, 9] for a detailed discussion of these conventional methods, although we discuss some of them below. There are also imputation techniques unique to classification problems. Several papers [2, 3, 7] summarize different linear discriminant methods for handling missing data and compare their performance. Perhaps the simplest imputation method is the Average Method, also known as marginal mean imputation. The method was first mentioned in the study [11]. For each missing value on a given variable, the Average Method finds the estimated value by calculating the mean over the cases with observed data on that variable. The Similarity Method finds the observation that is most similar to the record with a missing value, as measured on the attributes that are not missing, and uses the actual value in the most similar record to replace the missing value. Proponents of the Similarity Method argue that the method improves accuracy since it uses realistic values, and that it also preserves the shape of the distribution. The underlying principle of the Similarity Method can also be applied to discrete variables; this variant is called Hot-deck imputation, which has become popular in survey research. (A data set is "hot" if it is currently being used for imputing a score.) Hot-deck imputation replaces a missing value with the actual scores of a similar case. If there are several equally similar cases, the method randomly chooses one of them. The Regression Method, also called the conditional mean imputation method, is another statistical imputation method. It uses a regression equation to calculate an estimate of the true value. Assume only one variable has missing values on some cases. Using the cases with complete data, the method regresses that variable on all of the other variables and then uses the regression equation to generate substitutes for the cases with missing data. The substitutes are predicted values for the missing data. According to the study [1], the Regression Method generates predicted values that preserve deviations from the mean and the shape of the distribution, and it does not attenuate correlations between variables as much as mean substitution.
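The three conventional methods can be sketched in a few lines each. This is a minimal illustration, assuming a numeric array with NaN marking missing cells; the function names are ours, and the regression sketch assumes only one column contains missing values, as in the description above:

```python
import numpy as np

def average_impute(X):
    """Marginal mean imputation: replace each NaN with its column mean."""
    X = X.copy()
    col_means = np.nanmean(X, axis=0)
    idx = np.where(np.isnan(X))
    X[idx] = np.take(col_means, idx[1])
    return X

def similarity_impute(X):
    """Similarity (hot-deck style) imputation: copy the value from the most
    similar complete row, with similarity measured on the attributes that
    are observed in the incomplete row."""
    X = X.copy()
    complete = X[~np.isnan(X).any(axis=1)]
    for i, row in enumerate(X):
        miss = np.isnan(row)
        if not miss.any():
            continue
        d = np.linalg.norm(complete[:, ~miss] - row[~miss], axis=1)
        donor = complete[np.argmin(d)]
        X[i, miss] = donor[miss]
    return X

def regression_impute(X, j):
    """Conditional mean imputation for column j: regress column j on the
    remaining columns using complete cases, then predict the missing cells."""
    X = X.copy()
    miss = np.isnan(X[:, j])
    others = np.delete(X, j, axis=1)
    A = np.column_stack([np.ones(X.shape[0]), others])
    beta, *_ = np.linalg.lstsq(A[~miss], X[~miss, j], rcond=None)
    X[miss, j] = A[miss] @ beta
    return X
```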
Another category of imputation methods [12-14] assumes that data are missing strategically, hidden by data providers who try to game the decision maker's decision rules. The imputation methods proposed in the studies [12, 13] include the D and DNeg methods. These methods were designed for classification problems. The decision maker may use the D or DNeg method to impute missing values and minimize misclassification rates when faced with strategic data providers. The DNeg method was designed to thwart negative data providers from gaining a positive classification when they intentionally hide information. The D method considers not only negative data providers but also positive ones. Using the D method, the decision maker can deter negative data providers' gaming behaviors and also give positive data providers an incentive to reveal information. The D method is more conservative than the DNeg method in the sense that it considers both positive and negative data providers, while the DNeg method accounts only for negative ones.
III. EXPERIMENT DESIGN
We compare eight methods: Average, Regression, Similarity, D, DNeg, AvgNeg, RegNeg, and SimNeg. The Average, Regression, Similarity, D, and DNeg methods are as discussed in the section above. The AvgNeg, RegNeg, and SimNeg methods are revised versions of the original Average, Regression, and Similarity methods, respectively, in which only negative training samples are used for imputing missing values. We first start with the case of randomly missing data and then examine strategically missing data. The parameters of the experimental design are listed in Table 1.
TABLE 1. SUMMARY OF EXPERIMENTAL DESIGNS

Treatment                     | Parameter
Replications                  | 30
Dimensionality                | 3, 4, 5, 6, 7
Training set size             | 20, 100, 200, 1000
Testing set size              | 20, 100, 200, 1000
Randomly missing data percent | 1%, 2%, 3%, 4%, 5%
Data providers' methods       | D, Average, Regression, Similarity, DNeg, AvgNeg, RegNeg, SimNeg
Decision maker's methods      | D, Average, Regression, Similarity, DNeg, AvgNeg, RegNeg, SimNeg
Outcomes                      | TotMisc, PosMisc, NegMisc, Notclassified, TotStrMisc, PosStrMisc, NegStrMisc, StrPos, StrNeg
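The negative-only variants (AvgNeg, RegNeg, SimNeg) restrict imputation to one class's training samples. A minimal sketch of the AvgNeg idea, assuming NaN marks missing cells and y holds +1/-1 labels; the function name is ours:

```python
import numpy as np

def class_conditional_mean_impute(X, y, use_class=-1):
    """Sketch of the AvgNeg idea: fill every missing cell with the column
    mean computed from one class's training rows only (here the negative
    class), rather than from all rows as in the plain Average Method."""
    X = X.copy()
    ref = X[y == use_class]            # e.g. negative training samples only
    col_means = np.nanmean(ref, axis=0)
    idx = np.where(np.isnan(X))
    X[idx] = np.take(col_means, idx[1])
    return X
```

RegNeg and SimNeg restrict the Regression and Similarity methods to the negative training samples in the same way.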
We use 30 replications for each case. The number of attributes ranges from 3 to 7. We use different training and testing set sizes: 20, 100, 200, and 1000. In the randomly missing data case, we consider various percentages of missing holes: 1%, 2%, 3%, 4%, and 5%. In the strategically missing data case, data providers may use one of the eight methods to hide information: D, Average, Regression, Similarity, DNeg, AvgNeg, RegNeg, SimNeg. The decision maker chooses one of the eight methods to impute missing values. We use different misclassification measurements: TotMisc is the misclassification rate over all data providers, PosMisc is the misclassification rate over positive records, NegMisc is the misclassification rate over negative records, and Notclassified is the rate of records that are not classified. To stabilize the variance of the misclassification rates in the statistical tests [10], we map each performance measure to 2 arcsin(sqrt(misclassification rate)).
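The variance-stabilizing mapping can be written directly; a minimal helper (the name is ours):

```python
import math

def stabilize(rate):
    """Variance-stabilizing transform applied before the statistical tests:
    map a misclassification rate r in [0, 1] to 2 * arcsin(sqrt(r))."""
    return 2.0 * math.asin(math.sqrt(rate))
```

For example, a raw rate of 1.19% maps to roughly 0.2186 on the transformed scale.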
IV. EMPIRICAL RESULTS
We first conduct an ANOVA on the misclassification rate (TotMisc) for the randomly missing data case. We see that all experimental factors are significant, as are all two-way interaction effects, at the 0.0001 significance level. The ANOVA results are shown in Table 2.
TABLE 2. TWO-WAY ANALYSIS FOR DEPENDENT VARIABLE: MISCLASSIFICATION IN RANDOM CASE

Source          | DF     | Sum of Squares | Mean Square | F Value | Pr > F
Model           | 214    | 3798.52        | 17.75       | 1013.14 | <.0001
Error           | 191785 | 3360.07        | 0.02        |         |
Corrected Total | 191999 | 7158.59        |             |         |

R-Sq = 0.53   Coeff Var = 44.46   Root MSE = 0.13   Rate Mean = 0.30

Main effects:
Source                   | DF | Anova SS | Mean Square | F Value  | Pr > F
n (dimensionality)       | 4  | 98.43    | 24.61       | 1404.59  | <.0001
Train (training size)    | 3  | 4.71     | 1.57        | 89.63    | <.0001
Test (testing size)      | 3  | 187.58   | 62.53       | 3568.79  | <.0001
Ram (missing percent)    | 4  | 1546.32  | 386.58      | 22065.10 | <.0001
Mp (imputation method)   | 7  | 1691.19  | 241.60      | 13789.90 | <.0001

Two-way interaction effects:
Source     | DF | Anova SS | Mean Square | F Value | Pr > F
n Train    | 12 | 2.93     | 0.24        | 13.94   | <.0001
n Test     | 12 | 3.15     | 0.26        | 14.97   | <.0001
n Ram      | 16 | 13.56    | 0.85        | 48.37   | <.0001
n Mp       | 28 | 91.22    | 3.26        | 185.95  | <.0001
Train Test | 9  | 2.13     | 0.24        | 13.49   | <.0001
Train Ram  | 12 | 1.43     | 0.12        | 6.78    | <.0001
Train Mp   | 21 | 47.44    | 2.26        | 128.94  | <.0001
Test Ram   | 12 | 5.45     | 0.45        | 25.92   | <.0001
Test Mp    | 21 | 5.57     | 0.27        | 15.13   | <.0001
Ram Mp     | 28 | 96.68    | 3.45        | 197.08  | <.0001
Table 2 shows that the percentage of randomly missing data in a data set is a significant factor in the misclassification rate. Next, we study the impact of the percentage of randomly missing data on the performance measurements in detail. The misclassification rates for different percentages of randomly missing data are provided in Table 3.
TABLE 3. STATISTICS OF RANDOMLY MISSING HOLES

Missing Percent | TotMisc | PosMisc | NegMisc | NotClassified
1%              | 0.15957 | 0.21862 | 0.01514 | 0.00000
2%              | 0.24331 | 0.33372 | 0.02756 | 0.00006
3%              | 0.30635 | 0.42139 | 0.03891 | 0.00009
4%              | 0.36456 | 0.50299 | 0.04877 | 0.00026
5%              | 0.41469 | 0.57368 | 0.05807 | 0.00033
As shown in Table 3, the misclassification rate over positive records is higher than that over negative records. In addition, all four misclassification rates, TotMisc, PosMisc, NegMisc, and NotClassified, increase as the missing percentage increases. The results in percentage format are provided in Table 4. In percentage format, the misclassification rate is 0.64% when 1% of the data are missing and increases to 4.24% when the percentage of missing holes increases to 5%.
TABLE 4. STATISTICS OF RANDOMLY MISSING HOLES IN PERCENT

Missing Percent | TotMisc | PosMisc | NegMisc | NotClassified
1%              | 0.640%  | 1.190%  | 0.010%  | 0.00%
2%              | 1.470%  | 2.760%  | 0.020%  | 0.00%
3%              | 2.330%  | 4.370%  | 0.040%  | 0.00%
4%              | 3.290%  | 6.190%  | 0.060%  | 0.00%
5%              | 4.240%  | 8.000%  | 0.080%  | 0.00%
We plot the trend of the misclassification rate against the missing percentage in Fig. 1. The top line is the misclassification rate over positive records, and TotMisc is the average of the positive and negative misclassification rates. The rate of non-classified records stays at zero as the percentage of missing holes increases.
In the random case, data are missing randomly; that is, the information is not hidden strategically. The principal still can choose one of the eight methods to impute missing information. Next, we consider the case where data are missing strategically. We simulate the case where agents select a method for determining which attributes to hide and, simultaneously, a decision maker chooses from the eight methods to impute estimates for the missing data of all agents. As before, we conduct an ANOVA test to examine the effects of the factors and their interactions on the misclassification rates and provide the results in Table 5. The hiding strategy of data providers is denoted as Ma. As shown in Table 5, the hiding strategies of data providers are significant, and all other experimental factors are significant at the 0.0001 level.
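The strategic simulation can be sketched as a grid over provider and decision-maker choices. In this sketch, hide, impute, and classify are placeholders (our names) standing in for the concrete procedures compared in the paper, not the actual implementations:

```python
import numpy as np

METHODS = ["D", "Average", "Regression", "Similarity",
           "DNeg", "AvgNeg", "RegNeg", "SimNeg"]

def run_cell(hide, impute, classify, X, y):
    """Evaluate one (Ma, Mp) cell of the design: agents hide attribute
    values with method Ma, the decision maker imputes with method Mp,
    and misclassification is measured on the resulting records."""
    X_hidden = hide(X, y)        # agents strategically blank some cells
    X_filled = impute(X_hidden)  # decision maker fills the missing holes
    y_hat = classify(X_filled)
    tot_misc = float(np.mean(y_hat != y))            # TotMisc
    pos_misc = float(np.mean(y_hat[y == 1] != 1))    # PosMisc
    neg_misc = float(np.mean(y_hat[y == -1] != -1))  # NegMisc
    return tot_misc, pos_misc, neg_misc

# Every pairing of a provider strategy Ma with an imputation method Mp
# would then be evaluated over the 30 replications, e.g.:
# results = {(ma, mp): run_cell(HIDE[ma], IMPUTE[mp], classify, X, y)
#            for ma in METHODS for mp in METHODS}
```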
TABLE 5. TWO-WAY ANALYSIS FOR DEPENDENT VARIABLE: MISCLASSIFICATION RATE IN THE STRATEGICALLY MISSING CASE

Source          | DF     | Sum of Squares | Mean Square | F Value | Pr > F
Model           | 271    | 103371.3       | 381.4       | 15298.5 | <.0001
Error           | 306928 | 7652.747       | 0.0249      |         |
Corrected Total | 307199 | 111024.1       |             |         |

R-Sq = 0.931   Coeff Var = 10.579   Root MSE = 0.158   Rate Mean = 1.493

Main effects:
Source                  | DF | Anova SS  | Mean Square | F Value | Pr > F
n (dimensionality)      | 4  | 78.763    | 19.691      | 789.74  | <.0001
Train (training size)   | 3  | 66.17     | 22.057      | 884.62  | <.0001
Test (testing size)     | 3  | 4.944     | 1.648       | 66.1    | <.0001
Ma (providers' method)  | 7  | 2751.8    | 393.114     | 15766.6 | <.0001
Mp (imputation method)  | 7  | 42912.157 | 6130.308    | 245868  | <.0001

Two-way interaction effects:
Source     | DF | Anova SS | Mean Square | F Value | Pr > F
n Train    | 12 | 19.43    | 1.619       | 64.94   | <.0001
n Test     | 12 | 1.015    | 0.085       | 3.39    | <.0001
n Ma       | 28 | 430.728  | 15.383      | 616.97  | <.0001
n Mp       | 28 | 1277.847 | 45.637      | 1830.37 | <.0001
Train Test | 9  | 0.56     | 0.062       | 2.5     | 0.0075
Train Ma   | 21 | 68.872   | 3.28        | 131.54  | <.0001
Train Mp   | 21 | 156.013  | 7.429       | 297.96  | <.0001
Test Ma    | 21 | 1.672    | 0.08        | 3.19    | <.0001
Test Mp    | 21 | 13.682   | 0.652       | 26.13   | <.0001
Ma Mp      | 49 | 25245.6  | 515.218     | 20663.8 | <.0001
We conduct Tukey's range tests for the decision maker's methods. The summary of these test results can be found in Table 6. Table 6 shows that when the decision maker uses the D or DNeg method, NegMisc is the lowest and PosMisc is the highest. More specifically, we see that NegMisc is 0 for the D method and 0.013 for the DNeg method (or 0.004% in terms of the actual rate before the mapping), which are lower than 0.402 (or 3.98%) for Similarity, 0.324 (or 2.6%) for RegNeg, 0.322 (or 2.58%) for Regression, 0.283 (or 1.99%) for Average, 0.06 (or 0.09%) for AvgNeg, and 0.031 (or 0.02%) for SimNeg. The rate of positive or negative agents who act strategically is the same regardless of the decision maker's method. Therefore, if a decision maker is more averse to negative-class risk, her best strategy is the D method.
TABLE 6. THE COMPARISON RESULTS OF TUKEY'S RANGE TEST
(Means of the transformed rates, sorted in descending order within each measure.)

TotMisc | PosMisc | NegMisc | NotClassified
2.205   | 2.192   | 0.402   | 1.902
2.086   | 1.893   | 0.324   | 1.902
2.065   | 1.629   | 0.322   | 1.902
2.062   | 1.283   | 0.283   | 1.902
1.128   | 0.783   | 0.060   | 0.992
        | 0.185   | 0.031   | 0.895
0.167   |         | 0.013   | 0.822
0.158   |         | 0.000   |
V. CONCLUSION
We compare eight different imputation methods in the case where data are missing at random and in the case where data are missing strategically. We find that as the percentage of missing data increases, the performance of all eight imputation methods decreases. When data are missing strategically, the D or DNeg method gives the lowest misclassification rate.
REFERENCES
[1] Allison, P.D. Missing data. Woburn, MA, USA: Sage Publications Inc., 2001.
[2] Chan, L.S., and Dunn, O.J. The treatment of missing values in discriminant analysis-1. The sampling experiment. Journal of the
American Statistical Association, 67, 338 (1972), 473-477.
[3] Chan, L.S., Gilman, J.A., and Dunn, O.J. Alternative approaches to missing values in discriminant analysis. Journal of the American
Statistical Association, 71, 356 (1976), 842-844.
[4] Donders, A.R.T., van der Heijden, G., Stijnen, T., and Moons, K.G.M. Review: A gentle introduction to imputation of missing values.
Journal of Clinical Epidemiology, 59, 10 (2006), 1087-1091.
[5] Healy, P.M., and Palepu, K.G. The challenges of investor communication: The case of CUC International, Inc. Journal of Financial Economics, 38, 2 (1995), 111-140.
[6] Hirshleifer, D., and Teoh, S.H. Limited attention, information disclosure, and financial reporting. Journal of Accounting and Economics, 36, 1-3 (2003), 337-386.
[7] Jackson, E.C. Missing values in linear multiple discriminant analysis. Biometrics, 24, 4 (1968), 835-844.
[8] Roth, P.L. Missing data: A conceptual review for applied psychologists. Personnel Psychology, 47, 3 (1994), 537-560.
[9] Schafer, J.L., and Graham, J.W. Missing data: Our view of the state of the art. Psychological Methods, 7, 2 (2002), 147-177.
[10] Stam, A., and Joachimsthaler, E.A. Solving the classification problem in discriminant analysis via linear and nonlinear programming
methods. Decision Sciences, 20, 2 (1989), 285-293.
[11] Wilks, S.S. Moments and distributions of estimates of population parameters from fragmentary samples. The Annals of Mathematical
Statistics, 3, 3 (1932), 163-195.
[12] Zhang, J. Linear discrimination with strategic missing values. Information Systems and Operations Management, Gainesville, FL,
USA: University of Florida, 2011.
[13] Zhang, J., Aytug, H., and Koehler, G.J. Discriminant analysis with strategically manipulated data. Information Systems Research, 25,
3 (2014), 654-662.
[14] Zhang, J., Liu, X., and Li, X. Support vector regression for handling strategically hidden data. University of Massachusetts Lowell, 2015.