DM Missing Value
Techniques
Missing Not at Random (MNAR) means that the probability of a value being missing depends
on the value itself. For example, the people with the least education are the ones who
leave the education question blank, or the sickest people are the most likely to drop out
of the study.
MNAR is called “non-ignorable” because the missing data mechanism itself has to be
modeled as you deal with the missing data. You have to include some model for why the
data are missing and what the likely values are.
“Missing Completely at Random” (MCAR) and “Missing at Random” (MAR) are both considered
“ignorable” because we don’t have to model the missing data mechanism itself when we
deal with the missing data.
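The difference between an ignorable and a non-ignorable mechanism can be seen in a small simulation. The sketch below is only an illustration: the `health` score, the 30% drop rate, and the logistic drop-out rule are all assumptions made up for this example, not taken from any real study.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Hypothetical health score for each participant (higher = healthier).
health = rng.normal(loc=50.0, scale=10.0, size=n)

# MCAR: every value has the same 30% chance of being missing.
mcar_missing = rng.random(n) < 0.30

# MNAR: the chance of being missing depends on the value itself.
# Here the sickest participants (lowest scores) are the most likely to drop out.
p_missing = 1.0 / (1.0 + np.exp((health - 45.0) / 5.0))
mnar_missing = rng.random(n) < p_missing

true_mean = health.mean()
mcar_mean = health[~mcar_missing].mean()   # mean of the values still observed
mnar_mean = health[~mnar_missing].mean()

print(f"true mean:          {true_mean:.2f}")
print(f"observed mean MCAR: {mcar_mean:.2f}")
print(f"observed mean MNAR: {mnar_mean:.2f}")
```

Under MCAR the observed mean stays close to the true mean, while under MNAR it is pushed upward because the low scores are the ones that disappear. This is why the MNAR mechanism has to be modeled rather than ignored.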
The two types of deletion are:
• Listwise deletion
• Pairwise deletion
• It is recommended that these deletion techniques
only be used when the data set contains few
missing values.
Listwise Deletion
• When any column in a row is empty or NaN, listwise
deletion deletes the entire row.
• As a result of listwise deletion, the data set
shrinks.
Pairwise Deletion
• Pairwise deletion attempts to reduce the data loss that
happens in listwise deletion.
• It calculates the correlation for every pair of variables
using all rows in which both variables of that pair are
observed. A row with a missing value in one column therefore
still contributes to the correlations that do not involve that
column.
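The contrast with listwise deletion can be sketched in pandas. This is a minimal illustration on a made-up data frame (the column names and values are hypothetical). It relies on `DataFrame.corr()` excluding missing values pair by pair, which is its default behavior.

```python
import numpy as np
import pandas as pd

# Hypothetical data with missing values scattered across columns.
df = pd.DataFrame({
    "age":    [25, 32, 47, 51, 38, 29],
    "income": [30, 42, np.nan, 80, 55, 35],
    "score":  [3.1, np.nan, 4.0, 4.4, np.nan, 2.9],
})

# Listwise: drop every row with any missing value, then correlate.
listwise_corr = df.dropna().corr()

# Pairwise: corr() uses, for each pair of columns, all rows where
# both columns are observed.
pairwise_corr = df.corr()

# Number of rows actually used for each pair of columns.
observed = df.notna().astype(int)
pair_counts = observed.T @ observed

print("rows kept by listwise deletion:", len(df.dropna()))
print(pair_counts)
```

Each pair of columns can end up with a different number of rows behind its correlation, which is the price paid for keeping more of the data.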
Listwise Deletion using Python
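A minimal sketch of listwise deletion with pandas, assuming the data is held in a `DataFrame`. The example data frame and its column names are hypothetical; `dropna()` with its default arguments removes every row that contains at least one missing value.

```python
import numpy as np
import pandas as pd

# Hypothetical data set with missing entries in two columns.
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Cara", "Dev"],
    "age":  [29, np.nan, 41, 35],
    "city": ["Pune", "Delhi", None, "Mumbai"],
})

# Listwise deletion: drop any row that has at least one missing value
# (the defaults are axis=0 and how="any").
complete_rows = df.dropna()

rows_removed = len(df) - len(complete_rows)
print(complete_rows)
print("rows removed:", rows_removed)
```

Comparing `len(df)` with `len(complete_rows)` shows how much the data set shrinks, which is worth checking before relying on the reduced data.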