Week2-1
Week2-1
LECTURE 4
Chapter 2-Data Preprocessing
• Median:
• Middle value if odd number of values, or average of the middle two
values otherwise
+ width
• Estimated by interpolation (for grouped data):
–
• Mode
• Value that occurs most frequently in the data
• Unimodal, bimodal, trimodal
• Empirical formula:
• Boxplot
• Data is represented with a box
• The ends of the box are at the first and third
quartiles, i.e., the height of the box is IRQ
• The median is marked by a line within the box
• Whiskers: two lines outside the box extend to
Minimum and Maximum
Properties of Normal Distribution Curve