Ipsita Panda-Biostats Assignment

The document covers key concepts in biostatistics, including the chi-square test, measures of central tendency (mean, median, mode), and various types of graphs. It explains how to calculate chi-square values and expected frequencies, as well as how to determine mean, median, and mode from data sets. Additionally, it describes different graphical representations such as bar graphs, pie charts, box plots, histograms, ogives, and scatter plots.

Uploaded by

shrutigupta4142

BIOSTATISTICS ASSIGNMENT

NAME- IPSITA PANDA

EN. NO.- A35204122006
COURSE CODE- BIOT213
COURSE TITLE- BIOSTATISTICS
Topic: Measures of central tendency, graphs & chi square test.

1. CHI SQUARE TEST

- A chi square (χ2) statistic is a measure of the difference between the
observed (O) and expected (E) frequencies of the outcomes of a set of events
or variables.
- Chi square depends on the size of the difference between the observed and
expected values, the degrees of freedom and the sample size.
- It can also be used to test the goodness of fit between an observed distribution
and a theoretical distribution of frequencies.
- Formula used is:

χ2 = Σ (O − E)^2 / E

O = observed values
E = expected values
Σ = sum over all classes
- Goodness of fit: chi square provides a way to test how well a sample of data
matches the (known or assumed) characteristics of the larger population that
the sample is intended to represent. This is known as goodness of fit.

Example 1: In a flowering plant, white flowers (B) are dominant over red flowers (b) and
short plants (E) are dominant over tall plants (e). When two double heterozygote
(BbEe) plants were crossed, the resulting phenotypes were observed (O) as follows: white &
short (206), red & short (83), white & tall (65) and red & tall (30).

- According to Mendel’s dihybrid cross, the F2 phenotypic ratio should be 9:3:3:1.
- Therefore, the null hypothesis is that the observed phenotypes follow the ratio 9:3:3:1.
- To calculate each expected value, the formula used is:
Expected value = (ratio term (e.g. 9) × total number of observations) / 16.

(expected value calculation)

- The next step is to calculate the chi square value for each class
and then sum all the values.

(chi square= 4.32)
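
The two steps above (expected values, then the chi square sum) can be sketched in Python. This is a minimal illustration that uses the observed counts from Example 1. The variable names are chosen here for readability and do not come from the assignment.

```python
# Observed counts from Example 1, in the order:
# white & short, red & short, white & tall, red & tall
observed = [206, 83, 65, 30]
ratio = [9, 3, 3, 1]           # Mendelian dihybrid ratio (null hypothesis)

total = sum(observed)          # total number of plants observed
parts = sum(ratio)             # 16

# Expected count for each class = ratio term * total / 16
expected = [r * total / parts for r in ratio]

# Contribution of each class, (O - E)^2 / E, and their sum
contributions = [(o - e) ** 2 / e for o, e in zip(observed, expected)]
chi_square = sum(contributions)

print(expected)                # [216.0, 72.0, 72.0, 24.0]
print(round(chi_square, 2))    # 4.32
```

The result agrees with the chi square value of 4.32 reported above.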

- Another method is to calculate the p value (the probability of obtaining results
at least as extreme as the observed results, based on the assumption
that the null hypothesis is correct). Degrees of freedom = number of classes − 1.

(Formula to calculate P value.)

- Then calculate the test statistic (equal to the chi square value) using the following
formula,

(Formula to calculate test statistics.)

- Now calculate the critical chi square value using the following formula,
(Formula to calculate critical chi square value.)

- Final output will be as follows-

- Since the critical chi square value is greater than the test statistic (the chi square
value), the null hypothesis is accepted, that is, it cannot be rejected.
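
The p value and the critical value for Example 1 can be checked with the Python standard library. This is a rough sketch with two assumptions that are not stated in the text: the significance level is 0.05, and the closed-form upper-tail probability used here is valid only for 3 degrees of freedom (4 classes − 1). The function names are illustrative.

```python
import math

def chi2_upper_tail_df3(x):
    """Upper-tail probability P(X >= x) for chi square with 3 degrees of freedom.

    This closed form holds for df = 3 only.
    """
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)

def critical_value_df3(alpha, lo=0.0, hi=100.0):
    """Find the x whose upper-tail probability equals alpha, by bisection."""
    for _ in range(100):
        mid = (lo + hi) / 2
        if chi2_upper_tail_df3(mid) > alpha:
            lo = mid      # tail still too large, move right
        else:
            hi = mid      # tail too small, move left
    return (lo + hi) / 2

chi_square = 4.32   # test statistic from Example 1
alpha = 0.05        # assumed significance level (not given in the text)

p_value = chi2_upper_tail_df3(chi_square)
critical = critical_value_df3(alpha)

print(round(p_value, 2))     # approximately 0.23
print(round(critical, 2))    # approximately 7.81
print("reject H0" if chi_square > critical else "do not reject H0")
```

Under these assumptions the test statistic is below the critical value and the p value is above 0.05, so both routes lead to the same decision.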

2. MEASURES OF CENTRAL TENDENCY

I. MEAN

- It is the ratio of the sum of all the observations to the number of observations.

- In a given set of data of size n,
{x1, x2, x3, …, xn}
the mean (x “bar”) is given by the following formula:

Mean = (x1 + x2 + … + xn) / n

Numerator- sum of all the observations (x1 + x2 + … + xn)

Denominator- n, that is, the number of observations.
- In case of different frequencies- let there be ‘n’ items in a set, x1,
x2, x3, …, xn, with corresponding frequencies f1, f2, f3, …, fn. Then the
mean will be:

Mean = (f1x1 + f2x2 + … + fnxn) / (f1 + f2 + … + fn)

- In case of continuous distribution- the first step is to calculate the mid value of
each class interval and the second step is to apply the frequency formula for the
mean given above, using the mid values as x.
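
The three cases of the mean can be sketched as short Python functions. The data below are hypothetical values chosen only to illustrate the arithmetic, and the function names are not from the assignment.

```python
def mean_raw(values):
    """Mean of ungrouped data: sum of observations / number of observations."""
    return sum(values) / len(values)

def mean_frequency(values, freqs):
    """Mean when each value x has a frequency f: sum(f * x) / sum(f)."""
    return sum(f * x for x, f in zip(values, freqs)) / sum(freqs)

def mean_grouped(class_limits, freqs):
    """Mean of a continuous distribution: use class mid values as x."""
    midpoints = [(low + high) / 2 for low, high in class_limits]
    return mean_frequency(midpoints, freqs)

# Hypothetical data, for illustration only
print(mean_raw([2, 4, 6, 8]))                                   # 5.0
print(mean_frequency([1, 2, 3], [2, 3, 5]))                     # 2.3
print(mean_grouped([(0, 10), (10, 20), (20, 30)], [2, 5, 3]))   # 16.0
```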
II. MEDIAN

- The median is the value of the observation that divides the entire data set into two equal
parts, on the condition that the data are arranged in ascending or descending
order.
- Median is a positional average which locates the centre of the observations.
- If the number of observations “n” is odd, there will be a unique median: the ((n+1)/2)th
observation from either end of the ordered data.
- If the number of observations “n” is even, there is no middle observation, but the
median is defined by convention as the average of the (n/2)th and (n/2 + 1)th
observations.
- In case of discrete frequency distribution – the first step is to arrange the data in
ascending or descending order, then find the cumulative frequencies. Then divide
the total frequency N by 2 (N/2). The value whose cumulative frequency is the first
to be greater than N/2 is the median of the data.
- In case of continuous frequency distribution – the first step is to find the
cumulative frequencies and then apply the formula:

Median = L + [(N/2 – Cf) / F] × H

L= lower class limit of median class

N/2= half of the total frequency
Cf= cumulative frequency of class before the median class
F= frequency corresponding to median class
H= class width
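
The median rules can be sketched in Python as follows. The sketch assumes the usual positions for ungrouped data (the middle observation for odd n, the average of the two middle observations for even n) and the grouped formula Median = L + [(N/2 − Cf) / F] × H. All data are hypothetical.

```python
def median_raw(values):
    """Median of ungrouped data."""
    data = sorted(values)
    n = len(data)
    mid = n // 2
    if n % 2 == 1:
        return data[mid]                       # the ((n+1)/2)th observation
    return (data[mid - 1] + data[mid]) / 2     # average of the two middle ones

def median_discrete(values, freqs):
    """Median of a discrete frequency distribution.

    Returns the first value whose cumulative frequency is greater than N/2.
    """
    half = sum(freqs) / 2
    running = 0
    for x, f in sorted(zip(values, freqs)):
        running += f
        if running > half:
            return x

def median_grouped(lower_limits, freqs, width):
    """Median of a continuous distribution with equal class widths."""
    half = sum(freqs) / 2
    cf_before = 0                              # Cf: cumulative frequency so far
    for lower, f in zip(lower_limits, freqs):
        if cf_before + f >= half:              # this is the median class
            return lower + (half - cf_before) / f * width
        cf_before += f

# Hypothetical data, for illustration only
print(median_raw([7, 1, 5]))                                  # 5
print(median_raw([1, 3, 5, 7]))                               # 4.0
print(median_discrete([1, 2, 3, 4], [2, 4, 3, 1]))            # 2
print(median_grouped([0, 10, 20, 30], [3, 5, 8, 4], 10))      # 22.5
```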

III. MODE

- Mode is the most frequently occurring item of the series.

- Unlike the mean and median, which are calculated from the values or their positions in
the data set, the mode simply identifies the value that appears most frequently.
- Types of mode- unimodal (single mode), bimodal (two modes), trimodal
(three modes) and ill-defined (multiple modes).
- Mode in case of ungrouped data- the value which occurs the greatest
number of times.
- In case of grouped data- formula used is:
Mode = l + [(f1 – f0) / (2f1 – f0 – f2)] × h

- l – lower limit of modal class
- f0 – frequency of the class preceding the modal class
- f1 – frequency of modal class
- f2 – frequency of the class succeeding the modal class
- h – width of modal class
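
A short Python sketch of both cases follows. For ungrouped data it uses `statistics.multimode`, which returns every value tied for the highest count. For grouped data it applies the formula above and assumes equal class widths and a single modal class. The data and function name are hypothetical.

```python
from statistics import multimode

def mode_grouped(lower_limits, freqs, width):
    """Mode of grouped data: l + (f1 - f0) / (2*f1 - f0 - f2) * h."""
    i = freqs.index(max(freqs))                        # modal class index
    f1 = freqs[i]
    f0 = freqs[i - 1] if i > 0 else 0                  # preceding class
    f2 = freqs[i + 1] if i < len(freqs) - 1 else 0     # succeeding class
    return lower_limits[i] + (f1 - f0) / (2 * f1 - f0 - f2) * width

# Hypothetical data, for illustration only
print(multimode([8, 3, 8, 5, 3, 8]))       # [8]      (unimodal)
print(multimode([1, 1, 2, 2, 3]))          # [1, 2]   (bimodal)
print(mode_grouped([0, 10, 20, 30], [2, 6, 10, 6], 10))   # 25.0
```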

Example 2: A sample of birthweights (g) of live-born infants in a private hospital in San
Diego, California in a one-week period is given in the following table:

To calculate the mean, median and mode, the following steps are to be followed-

(fig: calculation of mean)

(fig: calculation of median)

(fig: calculation of mode)
Therefore, the final answers are as follows:

Example 3: Consider the data set in the given table, which consists of white blood cell
counts taken on admission of patients entering a small hospital in Allentown, Pennsylvania
on a given day. Compute the mean, median and mode.

In order to calculate mean, median and mode following steps are to be performed:

(Fig: calculation of mean)

(Fig: calculation of median)

(Fig: calculation of mode)
Therefore, the final answers are as follows:

(Mean=10.77, median= 8, mode= 8)

3. GRAPHS
A graph can be defined as a pictorial representation or diagram that presents data or
values in an organized manner.

I. BAR GRAPHS
A bar graph or bar chart is a visual presentation of grouped data made
up of horizontal or vertical rectangular bars whose lengths are proportional to
the values they represent.

II. PIE CHARTS

A pie chart is a way of summarizing a set of nominal data or displaying the
different values of a given variable (e.g. percentage distribution). This type
of chart is a circle divided into a series of segments. The area of each segment is
the same proportion of the circle as the category is of the whole data set.
III. BOX PLOT
When we display the data distribution in a standardized way using the five-number
summary- minimum, Q1 (first quartile), median, Q3 (third quartile) and
maximum, it is called a box plot.
The ends of the box are the upper & lower quartiles, so the box spans the
interquartile range. A line inside the box marks the median and the two
lines outside the box are the whiskers extending to the highest and lowest
observations.
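
The five-number summary behind a box plot can be computed with Python's `statistics.quantiles`. This is a sketch on hypothetical data. Several quartile conventions exist, and the "inclusive" method used here is only one of them, so other tools may give slightly different Q1 and Q3.

```python
from statistics import quantiles

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a data set."""
    data = sorted(values)
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    return min(data), q1, median, q3, max(data)

# Hypothetical data, for illustration only
print(five_number_summary([1, 2, 3, 4, 5, 6, 7, 8, 9]))
# (1, 3.0, 5.0, 7.0, 9)
```

The interquartile range spanned by the box is Q3 − Q1, which is 4.0 for this data set.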

IV. HISTOGRAM
A histogram is a graphical representation of a grouped frequency distribution
with continuous classes. It is an area diagram and can be defined as a set of
rectangles with bases along the intervals between class boundaries and
with areas proportional to the frequencies in the corresponding classes.

V. OGIVE
The ogive is defined as the cumulative frequency distribution graph of a series. It
shows the data values on the horizontal axis and either the cumulative frequencies,
the cumulative relative frequencies or the cumulative per cent frequencies on the
vertical axis.
Two methods of ogive are:- (i) less than ogive- the frequencies of all preceding
classes are added to the frequency of a class. (ii) greater than ogive- the
frequencies of all succeeding classes are added to the frequency of a class.
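
The two kinds of cumulative frequency plotted in an ogive can be sketched with `itertools.accumulate`. The class frequencies below are hypothetical and serve only to show how the running totals are formed.

```python
from itertools import accumulate

# Hypothetical class frequencies, listed from the lowest class to the highest
freqs = [2, 5, 8, 4, 1]

# Less than ogive: each class plus all preceding classes
less_than = list(accumulate(freqs))

# Greater than ogive: each class plus all succeeding classes
greater_than = list(accumulate(reversed(freqs)))[::-1]

print(less_than)       # [2, 7, 15, 19, 20]
print(greater_than)    # [20, 18, 13, 5, 1]
```

The less than values are usually plotted against the upper class boundaries and the greater than values against the lower class boundaries.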

VI. SCATTER PLOT

The scatter diagram graphs pairs of numerical data, with one variable on each
axis, to show their relationship. It is used when we have paired numerical
data, or when there are multiple values of the dependent variable for a single
value of the independent variable.
Example 4: Following are the weights of 57 children in a day care.
i) Bar graph:

ii) Pie chart:

iii) Box plot

iv) Histogram

v) Ogive

vi) Scatter plot
