4- Lect-Finding Z- Score, Percentiles and Quartiles,
• IT WOULD BE USUAL FOR SOMEONE OF (L.B.J)'S HEIGHT TO PLAY FOR THE MIAMI HEAT, BECAUSE IT LIES
WITHIN THE RANGE.
EXAMPLE
• ME.L ON THE MIAMI HEAT: LET'S CONSIDER THAT THE HEIGHT OF (ME.L) IS X = 72
$Z = \frac{X - \mu}{\sigma}$
$Z = \frac{72 - 80.0}{3.3} = \frac{-8}{3.3} \approx -2.42$ → UNUSUAL (IT MEANS IT WOULD BE RARE, BUT NOT
IMPOSSIBLE, TO BE ON THE TEAM WITH THAT HEIGHT)
THE LARGER THE Z-SCORE IS, THE RARER THE PIECE OF DATA IS.
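THE CALCULATION ABOVE CAN BE SKETCHED IN PYTHON AS FOLLOWS. THE FUNCTION NAMES `z_score` AND `is_unusual` ARE HYPOTHETICAL, AND THE CUTOFF OF 2 FOR "UNUSUAL" IS AN ASSUMPTION (A COMMON RULE OF THUMB), NOT SOMETHING STATED ON THIS SLIDE.

```python
def z_score(x, mean, std_dev):
    """Return how many standard deviations x lies from the mean."""
    return (x - mean) / std_dev


def is_unusual(z, cutoff=2.0):
    # ASSUMPTION: a value counts as unusual when |z| exceeds the cutoff.
    # The cutoff of 2 is a common rule of thumb, not taken from the slide.
    return abs(z) > cutoff


# Values from the example: X = 72, mean = 80.0, standard deviation = 3.3
z = z_score(72, 80.0, 3.3)
print(round(z, 2))    # -2.42
print(is_unusual(z))  # True
```

BECAUSE `is_unusual` USES THE ABSOLUTE VALUE, A Z-SCORE OF +2.42 WOULD BE FLAGGED IN THE SAME WAY AS -2.42.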
RARENESS OF THE DATA
• THE LARGER THE Z-SCORE IN TERMS OF ABSOLUTE VALUE, THE RARER THE DATA VALUE.
• WHAT IS THE RELATIONSHIP BETWEEN RARENESS OF THE DATA AND THE Z-SCORE?
• DO YOU KNOW HOW TO CALCULATE Z-SCORE?
QUARTILES
• QUARTILES BREAK YOUR DATA INTO QUARTERS.
• THIS IS VERY SIMILAR TO THE MEDIAN: THE MEDIAN BREAKS YOUR DATA IN THE MIDDLE, INTO TWO HALVES.
1ST QUARTILE: Q1 MARKS OFF THE BOTTOM 25% OF THE DATA. BUT SHOULD THE DATA BE ORDERED OR NOT
ORDERED? WHAT DO YOU THINK? IT SHOULD BE ORDERED. SO Q1 MARKS OFF THE LOWEST 25% OF THE SORTED DATA
VALUES.
• Q1 → BOTTOM 25% OF SORTED DATA
2ND QUARTILE:
Q2 (MEDIAN 𝑴) → BOTTOM 50% OF SORTED DATA
3RD QUARTILE:
Q3 → BOTTOM 75% OF SORTED DATA
WE ARE CUTTING OUR DATA INTO 4 PIECES, SO WE NEED ONLY 3 QUARTILES. THERE IS NO 4TH QUARTILE;
WE JUST NEED Q1, Q2 & Q3.
EXAMPLE(QUARTILES-1)
• 1, 3 , 6 , 10, 15, 21, 28, 36
ANSWER:
• WHAT IS THE FIRST THING WE CHECK IN THE DATA? WE NEED TO SEE WHETHER IT IS SORTED.
• GO AHEAD AND SEE IF YOU CAN FIND THE MEDIAN?
• 1, 3 , 6 , 10, 15, 21, 28, 36 M=?
• 1, 3, 6, 10, 15, 21, 28, 36 → $M = \frac{10 + 15}{2} = \frac{25}{2} = 12.5$
• FINDING THE MEDIAN GIVES US THE 2ND QUARTILE. THE MEDIAN OF THE LEFT HALF IS THE 1ST
QUARTILE, AND THE MEDIAN OF THE RIGHT HALF IS THE 3RD QUARTILE. HERE THE LEFT HALF IS
1, 3, 6, 10, SO Q1 = 4.5, AND THE RIGHT HALF IS 15, 21, 28, 36, SO Q3 = 24.5.
EXAMPLE(QUARTILES-2)
1 , 3 , 6 , 10 , 15 , 21 , 28 , 36 , 39
ANSWER:
• 1 , 3 , 6 , 10 , 15 , 21 , 28 , 36 , 39
• HERE THE MEDIAN IS THE MIDDLE VALUE, 15. WHEN WE CALCULATE THE FIRST AND THIRD
QUARTILES, WE DO NOT INCLUDE THE 15; WE JUST EXCLUDE IT FROM THE CALCULATIONS.
• WE IMAGINE THERE IS NO 15 AND SPLIT THE REMAINING DATA INTO TWO HALVES.
• LEFT HALF: 1, 3, 6, 10 → Q1 = 4.5
• MIDDLE VALUE: M = 15
• RIGHT HALF: 21, 28, 36, 39 → Q3 = 32
• Q1=4.5 M=15 Q3=32
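THE TWO QUARTILE EXAMPLES ABOVE CAN BE CHECKED WITH A SHORT PYTHON SKETCH OF THE "MEDIAN OF EACH HALF" METHOD USED IN THIS LECTURE. THE FUNCTION NAMES `median` AND `quartiles` ARE HYPOTHETICAL. OTHER TEXTBOOKS AND SOFTWARE USE SLIGHTLY DIFFERENT QUARTILE RULES, SO RESULTS FROM OTHER TOOLS MAY DIFFER.

```python
def median(sorted_values):
    """Median of a list that is already sorted."""
    n = len(sorted_values)
    mid = n // 2
    if n % 2 == 1:
        return sorted_values[mid]  # odd count: the middle value
    # even count: average of the two middle values
    return (sorted_values[mid - 1] + sorted_values[mid]) / 2


def quartiles(values):
    """Return (Q1, M, Q3) using the median-of-halves method."""
    data = sorted(values)  # quartiles only make sense on sorted data
    n = len(data)
    half = n // 2
    lower = data[:half]
    # For an odd count the middle value is excluded from both halves.
    upper = data[half + 1:] if n % 2 == 1 else data[half:]
    return median(lower), median(data), median(upper)


print(quartiles([1, 3, 6, 10, 15, 21, 28, 36]))      # (4.5, 12.5, 24.5)
print(quartiles([1, 3, 6, 10, 15, 21, 28, 36, 39]))  # (4.5, 15, 32.0)
```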
PERCENTILES
• PERCENT IS OUT OF 100.
• PERCENTILES SEPARATE THE DATA INTO 100 PARTS (JUST LIKE THE 3 QUARTILES SEPARATE IT
INTO 4 PARTS). THEREFORE THERE ARE 99 PERCENTILES.
• PERCENTILES TELL YOU WHERE YOU STAND. FOR EXAMPLE, IF YOU ARE AT THE 50TH PERCENTILE ON A TEST, IT
MEANS THAT YOU SCORED BETTER THAN 50% OF THE OTHER STUDENTS, NOT THAT YOU SCORED 50% ON THE WHOLE
TEST. IT COMPARES YOU WITH EVERYONE ELSE.
$\text{PERCENTILE OF } 87 = \frac{39}{54} \times 100 \approx 72 \rightarrow P_{72}$ (THE 72ND PERCENTILE)
PERCENTAGE IS YOUR SCORE, BUT
PERCENTILE GIVES YOUR PLACE IN THE CLASS.
YOU CAN GO BACK AND FORTH IN THE EQUATION. FOR EXAMPLE, I CAN GIVE YOU THE PERCENTILE AND ASK YOU TO FIND THE
PERCENTAGE.
IN THIS EXAMPLE YOU SCORED BETTER THAN ABOUT 72% OF THE PEOPLE.
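THE PERCENTILE CALCULATION CAN BE SKETCHED IN PYTHON AS FOLLOWS. IT ASSUMES THAT 39 IS THE NUMBER OF SCORES BELOW 87 AND THAT 54 IS THE TOTAL NUMBER OF SCORES; THE SLIDE SHOWS ONLY THE NUMBERS, SO THAT READING IS AN ASSUMPTION. BOTH FUNCTION NAMES ARE HYPOTHETICAL.

```python
def percentile_rank(count_below, total):
    # ASSUMPTION: percentile = (number of values below) / (total values) * 100
    return count_below / total * 100


def count_below_from_percentile(percentile, total):
    # Going "back and forth": solve the same equation for the count.
    return percentile / 100 * total


p = percentile_rank(39, 54)
print(round(p))                                   # 72
print(round(count_below_from_percentile(p, 54)))  # 39
```

THE SECOND FUNCTION SIMPLY REARRANGES THE SAME EQUATION, WHICH IS ONE WAY TO READ "GOING BACK AND FORTH".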
PERCENTILE
• WHAT IS ….
• P25= Q1
• P50= M
• P75= Q3
WHAT IS IQR?
• IT IS THE INTERQUARTILE RANGE: THE DIFFERENCE BETWEEN THE THIRD AND FIRST QUARTILES, THAT IS, HOW WIDE
THE RANGE BETWEEN THEM IS.
• IQR = Q3 - Q1
• MIDDLE 50% OF THE DATA
BOX PLOT
• FIVE NUMBER SUMMARY
• MINIMUM
• Q1
• MEDIAN
• Q3
• MAXIMUM
[FIGURE: BOX PLOT LABELED MIN, Q1, M, Q3, MAX]
EXAMPLE
• 1 , 4 , 5 , 5 , 7 , 9 , 12 , 13 , 13 , 15 , 21
• MINIMUM:
• Q1:
• MEDIAN:
• Q3:
• MAXIMUM:
EXAMPLE
• 1 , 4 , 5 , 5 , 7 , 9 , 12 , 13 , 13 , 15 , 21
• MINIMUM: 1
• Q1: 5
• MEDIAN: 9
• Q3: 13
• MAXIMUM: 21
[FIGURE: BOX PLOT WITH MIN = 1, Q1 = 5, M = 9, Q3 = 13, MAX = 21]
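THE FIVE NUMBER SUMMARY FOR THIS EXAMPLE CAN BE REPRODUCED WITH A SHORT PYTHON SKETCH. IT USES `statistics.median` FROM THE STANDARD LIBRARY AND THE SAME MEDIAN-OF-HALVES RULE FOR Q1 AND Q3 AS THE QUARTILE EXAMPLES; THE FUNCTION NAME `five_number_summary` IS HYPOTHETICAL.

```python
from statistics import median


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum)."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]
    # For an odd count the middle value is left out of both halves.
    upper = data[half + 1:] if n % 2 == 1 else data[half:]
    return data[0], median(lower), median(data), median(upper), data[-1]


data = [1, 4, 5, 5, 7, 9, 12, 13, 13, 15, 21]
print(five_number_summary(data))  # (1, 5, 9, 13, 21)
```

THESE FIVE VALUES ARE EXACTLY THE POINTS A BOX PLOT DRAWS: THE TWO WHISKER ENDS, THE TWO BOX EDGES, AND THE LINE INSIDE THE BOX.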
OUTLIER
• WHAT DOES AN OUTLIER MEAN?
• HOW DO WE IDENTIFY AN OUTLIER? IS 21 IN THE PREVIOUS EXAMPLE AN OUTLIER?
• THERE IS A MATHEMATICAL WAY TO IDENTIFY AN OUTLIER.
HOW TO FIND AN OUTLIER?
1- FIRST, WE NEED TO FIND THE IQR
• IQR IN THE PREVIOUS EXAMPLE = Q3 - Q1 → 13 - 5 = 8
2- NEXT, CALCULATE 1.5*(IQR) → 1.5*(8) = 12
• TAKE THAT NUMBER, SUBTRACT IT FROM Q1, AND ADD IT TO Q3
Q1 – 1.5*(IQR)
Q3 + 1.5*(IQR)
• ANY NUMBER OUTSIDE THAT RANGE IS AN OUTLIER
• 5 – 12 = -7
• 13 + 12 = 25
• IS THERE ANYTHING IN YOUR DATA LESS THAN -7 OR GREATER THAN 25?
• NO
• THEN THERE ARE NO OUTLIERS
[FIGURE: BOX PLOT WITH MIN = 1, Q1 = 5, M = 9, Q3 = 13, MAX = 21, AND OUTLIER LIMITS AT -7 (5 - 12) AND 25 (13 + 12)]
• IF INSTEAD OF 21 WE HAD 32, THEN 32 WOULD BE AN OUTLIER
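THE OUTLIER CHECK ABOVE CAN BE SKETCHED IN PYTHON AS FOLLOWS. Q1 = 5 AND Q3 = 13 ARE TAKEN FROM THE EXAMPLE; THE FUNCTION NAMES `outlier_limits` AND `find_outliers` ARE HYPOTHETICAL.

```python
def outlier_limits(q1, q3, k=1.5):
    """Return the (lower, upper) limits Q1 - k*IQR and Q3 + k*IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def find_outliers(values, q1, q3):
    """Return every value that falls outside the outlier limits."""
    low, high = outlier_limits(q1, q3)
    return [v for v in values if v < low or v > high]


data = [1, 4, 5, 5, 7, 9, 12, 13, 13, 15, 21]
print(outlier_limits(5, 13))       # (-7.0, 25.0)
print(find_outliers(data, 5, 13))  # []

# Replace the maximum 21 with 32; Q1 and Q3 stay the same.
changed = data[:-1] + [32]
print(find_outliers(changed, 5, 13))  # [32]
```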
WHAT IS Z-SCORE