0% found this document useful (0 votes)
42 views30 pages

Basic Statistics

This document provides an overview of basic statistics concepts. It discusses descriptive statistics which deals with collecting and presenting data through measures of central tendency and variation. It also discusses inferential statistics which deals with predictions and inferences based on analyzing collected data. The document defines key terms like population, sample, and variable. It describes methods of collecting data and presenting data through tables, graphs, and distributions. It also explains measures of central tendency, position, and variation used to summarize data.

Uploaded by

King Harlequin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views30 pages

Basic Statistics

This document provides an overview of basic statistics concepts. It discusses descriptive statistics which deals with collecting and presenting data through measures of central tendency and variation. It also discusses inferential statistics which deals with predictions and inferences based on analyzing collected data. The document defines key terms like population, sample, and variable. It describes methods of collecting data and presenting data through tables, graphs, and distributions. It also explains measures of central tendency, position, and variation used to summarize data.

Uploaded by

King Harlequin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 30

BASIC STATISTICS

A WORKTEXT

Prepared by: DSB


Introduction
 For most people, the word “statistics” is a scary thing that
must be avoided as much as possible. They think of
statistics as a collection of numbers and formulas that
have vague meanings.
Statistics
 Is branch of applied mathematics which deals with the
collection, organization, presentation, analysis and
interpretation of data. Statisticians apply appropriate
methods in collecting and analyzing data. The guide the
design of a research study then analyze the results.
Descriptive Statistics
 Deals with the collection and presentation of data and collection of
summarizing values to describe its group characteristics. The most
common summarizing values are the measures of central tendency
and variation.

Inferential Statistics
 Deals with predictions and inference based on the analysis and
interpretation of the results of the information gathered by the
statisticians. Some of the common statistical tools of inferential
statistics are the r-test, z-test, analysis of variance
Variable
 Is a numerical characteristics or attribute associated with the
population that is being studied.

Types of Variables
 CATEGORICAL AND QUALITATIVE – are classified according to
some attributes or categories

Ex. Gender, Eye Color, Political Reference, Religion, Blood Type,


Civil Status, Year Level, Course, Profession and Socio Economic
Status.
 NUMERICAL – VALUED OR QUANTITATIVE VARIABLES
– are variables that are classified according to numerical
characteristics such as height, age pulse rate, number of children and
speed. Numerical – valued variables are often grouped into class
intervals.
Ex. Age in Year – 5-9, 10-14, 15-19, and 20 & above.
Height in cm – 100-149, 150-199, 200-249.
Grade in Math – 1.00-1.49, 1.50-1.99, 2.00-2.49, 2.50-2.99, 3.00-
3.49, 3.50-3.99.

 Discrete – is a variable whose obtained by counting.


Ex. Number of children, Number of Persons with Blue Eyes, number of patients with
TB.
 Continuous – is a variable whose values are obtained by measuring.
Ex. Temperature, Distance, Area, Density, Age, Height, Weight.
Scales Measurement
 Measurements are usually obtained when gathering data by any
method. Examples of measurement.

 In selecting the statistical tool to be used for drawing


inferences on a random sample, the type of measurement scale
must be carefully chosen. Measurements are classified into four
scale.
 NOMINAL SCALE – classifies elements into two or more categories
or classes, the numbers indicating that the elements are different but
not according to order or magnitude.

 ORDINAL SCALE – ranks individuals in terms of the degree which


they possess a characteristics of interest.

 INTERVAL SCALE – in addition to ordering scores from high to low.


It also establishes a uniform unit in the scale so that any equal
distance between two scores is of equal magnitude.

 RATIO SCALE – is a measurement scale. In addition to being


interval scale, that also has an absolute zero in scale
Definition of Terms
 POPULATION – defined as groups of people, animals, places, things
or ideas to which any conclusions based on characteristics of sample
will be applied.

 SAMPLE – is a subgroup of the population.

 PARAMETER – a numerical measure that describe a characteristics


of a population.
METHODS OF COLECTING DATA
 INTERVIEW METHOD
 DIRECT METHOD – researcher personally interviews the respondent.
 INDIRECT METHOD – uses telephone, gadget or any tool to communicate
with the respondent.
 QUESTIONARE METHOD
 A questionnaire is a list of well planned questions written on paper which can be
either personally administered or mailed by the researcher to the respondents
using any of the following forms:
a. Response Guided Type – respondent is guided in making reply.
b. Recall Type
c. Recognition Type
d. Dichotomous Type
e. Multiple Choice Type
f. Free Response Type
g. Rating Scale Type
 EMPIRICAL OBSERVATION METHOD – obtaining data through seeing,
hearing, testing, touching and smelling.
a. PARTICIPANT OBSERVATION – observer participate in the activities of the group
being studied.
b. NON-PARTICIPANT – observer is a bystander only.
c. CONTROLLED OBSERVATION – observation is done in a laboratory as in the case of
experimental studies

 TEST METHOD – widely u sed in Psychological Research and Psychiatry.


Standard tests are used because of their validity, reliability and usability.

Ex. Aptitude Test, IQ Test, and Achievement Test.

 REGISTRATION METHOD – examples are gathered using this method are


those that are obtained from the National Statistics Office, Land
Transportation Office, Department of Education, CHED, etc.
 MECHANICAL DEVICES – can be used for social and educational research
data gathering such as camera, projector, video tape, tape recorder, etc.

PRESENTATION OF DATA
 TEXTILE FORM – also called the graph method combines text and
figures in a statistical report. Presents data in paragraph form and
becomes effective when the objective is to call the reader’s attention
to some data that require special emphasis.

 TABULAR FORM – A more effective device in presenting data is means of


statistical table. This method presents data in rows and columns. It is more
convenient and understandable
 GRAPHICAL FORM – it is the most effective way of presenting
statistical data because important relationship are brought about
clearly.

TYPES OF GRAPH
 BAR GRAPH – it can be used to organize data or information visually.

 CINEGRAPH – this type of graphical form is the most practical and


effective device which shows a general trend pattern or changes over
a given time.

 CIRCLE GRAPH/ PIE GRAPH – It is consist of a circular region divided into


sections that do not overlap and each section represents a part or percentage
of the whole being considered.
 PICTOGRAPH – it is very effective tool for attracting attention since it
uses pictures or symbols to indicate the message of the numerical
obtained information

FREQUENCY DISTRIBUTION
 The tabular arrangement of the data by moving categories or
classes and their corresponding frequencies.
 HISTOGRAM – it shows the vertical bars representing class intervals or
categories of a frequency distribution.
 FREQUENCY DISTRIBUTION – it is contracted by plotting. The
frequencies against the corresponding class marks connecting
successive points by means of straight line and allowing both trials to
teach the horizontal axis by adding an extra mark to laid of the
distribution.

 CUMMULATIVE FREQUENCY – the data of each of this graph is taken


from cumulative frequencies of a frequency distribution, which are
tabular arrangements of cumulative frequencies in either ascending
or descending order for the class boundaries.
MEASURES OF CENTRAL TENDENCY
 MEAN – is the sum of the values of the variables divided by the
number of observations.
Formula: Where, = mean; Xi =observation; N = total number of observations

 MEDIAN – median of the grouped data arranged in an array (increasing


or decreasing order of magnitude) is the middle observation of an odd
number of items on the arithmetic arrange of two middle values when
the number of items in the distribution is even.
 MODE – the mode for ungrouped data is defined as the value that
appears with the highest number frequency.

FOR GROUPED DATA:


 MEAN – to compute the mean of grouped data we need to determine
the midpoint of each class interval. The mean assumed that the class
mark of each class is the average value of items falling on that class.

METHODS TO COMPUTE THE MEAN


 LONG METHOD

 SHORT METHOD
 LONG METHOD
Formula:

 SHORT METHOD
Formula:
MEDIAN OF GROUPED DATA
Formula:

Solution:
1. Get the median class.
2. Determine the value of the cumulative frequency below the median class.
3. Determine the tower class, the frequency of the median of the class width.
4. Substitute the values in the formula.
MODE OF GROUPED DATA
Formula:
MEASURES OF POSITION
 QUARTILES – these are measures of position which like the median
separates the distribution into four equal parts. It is also an extension
of median concept in that they are values which divide a set of data
into equal parts.

 QUARTILES OF UNGROUPED DATA – the quartiles are the score points


which divide the distribution into four equal parts. 25% of the data fall below
the first quartile (Q1), 50% of the data are below the second quartile (Q2),
75% of the data are less than the thick quartile (Q3).
 QUARTILES OF GROUPED DATA – computing the values of the
quartiles is similar to computing the value of the median when n is
multiplied to k and divided by 4 instead on 2.

Formula:
MEASURES OF VARIATION
 The measures of variation or dispersion indicates the degree or
extent to which numerical values are dispersed or spread out about
the average value in a distribution.

DEFINITION OF TERMS
 RANGE – the simplest to compute is the difference but the target and
the lonest values in the set of numerical data.

 UNGROUPED DATA – range = highest value – lowest value

 GROUPED DATA – range = upper boundary of the highest class interval –


lower boundary of the lowest class interval
 INTERQUARTILE – it is found by finding the differences between
the values of the third quartile or upper quartile and the first quartile
or lower quartile. QR = Q3 – Q1

 SEMI-INTERQUARTILE RANGE ORQUARTILE DEVIATION


– it indicates the variation or dispersion of the values covering the
middle 50% of the distribution of data is found by getting half of the
value or distance between the third quartile or the lower quartile.
SQR/QD = Q3 – Q1/ 2

NOTE: The SQR/QD is an appropriate measure of variation if the median is


used as the measure of central tendency and especially, if the distribution is
skewed.
 MEAN DEVIATION/ AVERAGE DEVIATION – It is the average of
the absolute deviations of the individual values of a set of numerical
data from either the mean, median or the mode among the three, the
mean is the most preferred and commonly used measure of central
tendency for computing the mean deviation or average deviation.

Formula:
Procedure for Ungrouped Data
1. Average the values from the lowest to the highest or
vice-versa.

2. Compute the value of the mean.

3. Find the individual absolute value for each deviation


from the mean.

4. Find the sum of the absolute value in step 3.

5. Substitute the values in the formula and solve.


Procedure for Grouped Data
1. Compute the mean of the distribution.

2. Subtract the mean from each of the midst and write the
absolute value of the results under the column Ix-x primeI.

3. Find the products of items under column f and items under


column Ix-x primeI.

4. Add the product in step 3 to obtain the values of fIx-X


primeI.

5. Divide the sum obtained in step 4.


VARIANCE
 It is the average of squared deviations from the mean. The
square root of the variance is known as the standard
Deviation.
Formula:
Procedure for Ungrouped Data
1. Arrange the values from the lowest to the highest or vice-versa.

2. Compute the value of the mean.

3. Find the individual absolute value for each deviation from the
mean.

4. Square each sum deviation and with the results under column Ix-x
primeI2.

5. Find the sum of the squared deviations.

6. Divide the sum in step 5 by n-1 for example data of by n for


population data.
Procedure for Grouped Data
1. Compute the value of the mean.

2. Subtract mean from each of the midpoint of each class interval.

3. Square each deviation in step 2 and will the results under Ix-x
primeI2.

4. Multiply the squared deviations in step 3 by their corresponding


frequencies.

5. Obtain the sum of the results in step 4.

6. Divide the results in step 5 by n-1 for sample data and by n for
population data.

You might also like