Exercise 1
Exercise 1
Q1. To find out the means marks of 200 students in the variable “read”.
Descriptives
Mean
readv 52.23
Descriptives
Mean
writev 52.775
Descriptives
Standard Deviation
math 9.368
Q4. Calculate the variance for the variable “science”.
Descriptives
Variance
science 98.028
Q5. Calculate the max and min marks for the variable “socst”.
Descriptives
Minimum Maximum
socst 26 71
Descriptives
Range
writev 36
Q7 To find the number of boys and girls OR the frequency for the variable “gender”.
gender
Cumulative
Frequency Proportion Cumulative Proportion
Frequency
Q8 Create the frequency table for the following variables: race, schtyp, prog, ses
RACE
race
Cumulative
Frequency Proportion Cumulative Proportion
Frequency
schtyp
Cumulative
Frequency Proportion Cumulative Proportion
Frequency
public
168 168 0.84 0.84
school
private
32 200 0.16 1
school
PROG
prog
Cumulative Cumulative
Frequency Proportion
Frequency Proportion
UG 45 45 0.225 0.225
ses
Cumulative Cumulative
Frequency Proportion
Frequency Proportion
Q2 To draw a pie chart showing the percentage of following categories: Race, ses, schtyp, prog
Below each chart, mention which category has the highest and lowest percentage.
Q3 To create a bar plot showing count of each of the following category and write MIN and MAX count for them:
gender, ses, prog, schtyp, race
GENDER SES
MAX – PRIVATE MAX - RESEARCH
SCHTYP PROG
RACE
Q4 Make a clustered bar plot showing the preference of the programme for each gender, and ascertain which
programme is preferred by which gender.
Q5 Make a clustered bar plot showing the preference of the school for each gender, and ascertain which school is
preferred by which gender.
Q6 Make a clustered bar plot indicating the preference of the programme for SES.
Q7 Make a clustered bar plot showing the preference of the programme for each RACE, and ascertain which
programme is preferred by which RACE.
Q8 Make a clustered bar plot showing the preference of the programme based on the schooling of student.
Q9 Make a clustered bar plot showing the preference of the type of schooling for each gender.
SCATTER PLOT
Eg. We need to see how performance in read varies with that in write.
The spread of the scatter indicates how strong the bonding between any two variables is. If the spread is large, then
the bonding is weak and if the spread is small, then the bonding will be strong.
Q2 Make a scatter plot between maths and science and write the interpretation.
BOX PLOT
After making the box plot, count how many circles are above the top and bottom horizontal lines. Those are the
number of outliers for that variable.
Q1 Create a box plot and indicate the number of outliers for the following variable: read, write, math, science, social
science
READ
The lower value of box is q1 AND UPPER VALUE IS Q3.
HISTOGRAM
This is the frequency polygon which shows the frequency for each mark.
Each bar shows how many students have secured that many marks.
Q1. Make a histogram for the following variables: MATHS, SCIENCE, SOCST, and from the histogram which range of
marks have been secured by the max and min.
Socst =
To create a new variable “TOTAL”, which is addition of marks ion read, write, maths and social science:
menu>data>prepare>derive>click on empty space> make the formula
Q: Make a box plot for total to figure out the presence of outliers.
Q: Make a scatter plot to see the variation of the reading score with total.
Q: Create a new variable average which is the average of marks for each student. Make a histogram then interpret
the max and min range.
MAX – 50-60
MIN- 30-40
Q: Make a box plot to figure out the presence of outliers in the variable “average”.
Q: Make a scatter plot to see the variation of “total” with “average” and write your interpretation.