Data Transformation by Andy Field
Data Transformation by Andy Field
Square root transformation (Xi ): Taking the square root of large values has more of an Positive skew,
effect than taking the square root of small values. Consequently, taking the square root unequal variances
of each of your scores will bring any large scores closer to the centre – rather like the log
transformation. As such, this can be a useful way to reduce positive skew; however, you still
have the same problem with negative numbers (negative numbers don’t have a square root).
Reciprocal transformation (1/Xi ): Dividing 1 by each score also reduces the impact of Positive skew,
large scores. The transformed variable will have a lower limit of 0 (very large numbers will unequal variances
become close to 0). One thing to bear in mind with this transformation is that it reverses
the scores: scores that were originally large in the data set become small (close to
zero) after the transformation, but scores that were originally small become big after the
transformation. For example, imagine two scores of 1 and 10; after the transformation they
become 1/1 = 1, and 1/10 = 0.1: the small score becomes bigger than the large score
after the transformation. However, you can avoid this by reversing the scores before the
transformation, by finding the highest score and changing each score to the highest score
minus the score you’re looking at. So, you do a transformation 1/(XHighest
Xi ).
Reverse score transformations: Any one of the above transformations can be used to Negative skew
correct negatively skewed data, but first you have to reverse the scores. To do this, subtract
each score from the highest score obtained, or the highest score + 1 (depending on
whether you want your lowest score to be 0 or 1). If you do this, don’t forget to reverse the
scores back afterwards, or to remember that the interpretation of the variable is reversed:
big scores have become small and small scores have become big!