AASHISH YDV B.SC 1st Stats Practical
AASHISH YDV B.SC 1st Stats Practical
(STATISTICS)
:AASHISH YADAV
:B.Sc (I)
PRACTICAL 1
1) Draw the histogram and frequency polygon of the above data. From the
histogram, obtain approximated value of mode.
2) For the above weights, prepare a cumulative frequency table and draw the
less than ogive. Hence obtain approximate value of media.
Objective: To find the approximate value of mode and to draw less than and
more than ogive for the given frequencies and also to prepare a histogram for
the same.
Theory:
Histogram: A histogram is a graph that shows the frequency of numerical data
using rectangles. It is used to summarize discrete or continuous data that are
measured on an interval scale.
Ogive: The curve obtain by plotting cumulative frequencies is called a
cumulative frequency curve or ogive.
~THERE ARE TWO TYPES OF OGIVE:-
Less than type
More than type
Less Than Ogive: It represents the cumulative frequency below each data
point. The curve starts from the left and rises with each data point
More Than Ogive: It shows the cumulative frequency above each data point.
The curve starts from the right and descends with each data point. Both ogives
are useful for analyzing the cumulative distribution of a dataset and
understanding how many observations fall below or above certain values.
PROCEDURE:
x<-c(42,74,40,82,115,41,61,75,83,63,53,110,76,84,50,67,65,78,77
,56,95,68,69,104,80,79,79,54,73,59,81,100,66,49,77,90,84,76,42,64,69,70,80,7
2,50,79,52,103,96,51,86,78,94,71)
#HISTOGRAM
hist<-hist(x,xlab="weights",xlim=c(35,130),ylim=c(0,10))
#POLYGON
x.axis=c(min(hist$breaks),hist$mids,max(hist$breaks))
y.axis=c(0,hist$counts,0)
lines(x.axis,y.axis,type='l')
bins=seq(40,120,by=10);bins
xcut=cut(x,bins,right=FALSE);xcut
xf=table(xcut);xf
z=cbind(xf);z
xcumfre<-cumsum(z);xcumfre
cumfre<-cbind(xcumfre);cumfre
cumgp<-c(0,cumsum(z));cumgp
plot(bins,cumgp,main="ogive curve",xlab="weights",ylab="frequency")
lines(bins,cumgp,col="blue")
par(new=TRUE)
CALCULATION:-
x<-c(42,74,40,82,115,41,61,75,83,63,53,110,76,84,50,67
,65,78,77,56,95,68,69,104,80,79,79,54,73,59,81,100,66,49,77,90,84,76,4
2,64,69,70,80,72,50,79,52,103,96,51,86,78,94,71)
> #HISTOGRAM
> hist<-hist(x,xlab="weights",xlim=c(35,130),ylim=c(0,10))
> #POLYGON
> x.axis=c(min(hist$breaks),hist$mids,max(hist$breaks))
> y.axis=c(0,hist$counts,0)
> lines(x.axis,y.axis,type='l')
>
> bins=seq(40,120,by=10);bins
[1] 40 50 60 70 80 90 100 110 120
> xcut=cut(x,bins,right=FALSE);xcut
[1] [40,50) [70,80) [40,50) [80,90) [110,120) [40,50) [60,70)
[8] [70,80) [80,90) [60,70) [50,60) [110,120) [70,80) [80,90)
[15] [50,60) [60,70) [60,70) [70,80) [70,80) [50,60) [90,100)
[22] [60,70) [60,70) [100,110) [80,90) [70,80) [70,80) [50,60)
[29] [70,80) [50,60) [80,90) [100,110) [60,70) [40,50) [70,80)
[36] [90,100) [80,90) [70,80) [40,50) [60,70) [60,70) [70,80)
[43] [80,90) [70,80) [50,60) [70,80) [50,60) [100,110) [90,100)
[50] [50,60) [80,90) [70,80) [90,100) [70,80)
8 Levels: [40,50) [50,60) [60,70) [70,80) [80,90) [90,100) ... [110,120)
> xf=table(xcut);xf
xcut
[40,50) [50,60) [60,70) [70,80) [80,90) [90,100) [100,110)
[110,120)
5 8 9 15 8 4 3 2
> z=cbind(xf);z
xf
[40,50) 5
[50,60) 8
[60,70) 9
[70,80) 15
[80,90) 8
[90,100) 4
[100,110) 3
[110,120) 2
>
> xcumfre<-cumsum(z);xcumfre
[1] 5 13 22 37 45 49 52 54
> cumfre<-cbind(xcumfre);cumfre
xcumfre
[1,] 5
[2,] 13
[3,] 22
[4,] 37
[5,] 45
[6,] 49
[7,] 52
[8,] 54
> cumgp<-c(0,cumsum(z));cumgp
[1] 0 5 13 22 37 45 49 52 54
>plot(bins,cumgp,main="ogive curve",xlab="weights",ylab="frequency")
> lines(bins,cumgp,col="blue")
> par(new=TRUE)
>
> #MORE THAN OGIVE
> bins2<-rev(bins);bins2
[1] 120 110 100 90 80 70 60 50 40
> y<-rev(z);y
[1] 2 3 4 8 15 9 8 5
> cumgp2=c(0,cumsum(y));cumgp2
[1] 0 2 5 9 17 32 41 49 54
>plot(bins2,cumgp2,main="ogive
curve",xlab="weights",ylab="frequency")
> lines(bins2,cumgp2,col="red")
> abline(v=73,col="red")
Result:-
i) The first graph shows the histogram of the given data with the
frequency polygon
ii) The second graph shows the Ogive of the cumulative frequency of the
data.
Result:
Graph:
Histogram
ogive curve
50
40
frequency
30
20
10
0
40 60 80 100 120
weights
Interpretation:-
i)Here the histogram shows the mode of the given data which is approx 75
ii)The intersection between the less than and more than ogive shows the
median of the data and here the median is approx 75.