0% found this document useful (0 votes)
40 views57 pages

Reading On Data Collection

Here is a frequency distribution table summarizing the number of children per family data: Number of Children Tally Frequency 1 1 2 3 1 2 5 3 6 7 1 2 Total Families Surveyed = 12 This frequency distribution table organizes the raw data into categories based on the number of children per family and tallies the number of occurrences in each category. It provides a simple way to summarize and present the distribution of the variable (number of children per family).

Uploaded by

Christine R
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
40 views57 pages

Reading On Data Collection

Here is a frequency distribution table summarizing the number of children per family data: Number of Children Tally Frequency 1 1 2 3 1 2 5 3 6 7 1 2 Total Families Surveyed = 12 This frequency distribution table organizes the raw data into categories based on the number of children per family and tallies the number of occurrences in each category. It provides a simple way to summarize and present the distribution of the variable (number of children per family).

Uploaded by

Christine R
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 57

DATA MANAGEMENT

“Data is everything”
OBJECTIVES

• Discuss the importance of statistics


• Define data
• Identify different types of data
• Use a variety statistical tools to
process and manage numerical data
Statistical Concepts
STATISTICS
the branch of science that deals with the collection,
presentation, organization, analysis, and interpretation
of data

• Descriptive Statistics includes all the techniques used in


organizing, summarizing, and presenting data on hand.
• Inferential Statistics includes all the techniques used in
analyzing the sample data that will lead to generalizations
about a population from which the sample came from.
•Population is the set of all elements under
study
•Sample is a subset of the population
•A variable is a characteristic or attribute of
the elements that is observable or
measurable
•Data are the values that the variable can
assume
• Parameter = numerical summary calculated on
the population data

• Statistic = numerical summary calculated on the


sample data
Suppose we want to determine the average age of all the students in ISU.
Classification of variables
• QUALITATIVE VARIABLES
nonnumeric variables, words or codes
cannot be measured

Examples: gender
civil status
religious affiliation
Classification of variables
• QUANTITATIVE VARIABLES
numeric variables
can be measured

Examples:
height
number of students in a class
number of units enrolled in a semester
Types of Quantitative variables
• Discrete Variables
countable data, results are whole numbers
Examples: number of siblings, number of students in a class

• Continuous Variables
Can assume all values between any two specific values
Examples: weight, height, body temperature
Levels of measurement
• NOMINAL LEVEL
names, labels or categories only
numbers are used to represent an item or
characteristic;

Examples:
Gender
Civil status
Course
Levels of measurement
• ORDINAL LEVEL (Rank Data)
assign numbers or labels to observations with implied ordering

Examples:
performance rating (𝑂𝑢𝑡𝑠𝑡𝑎𝑛𝑑𝑖𝑛𝑔, 𝑉𝑒𝑟𝑦 𝑆𝑎𝑡𝑖𝑠𝑓𝑎𝑐𝑡𝑜𝑟𝑦, 𝑆𝑎𝑡𝑖𝑠𝑓𝑎𝑐𝑡𝑜𝑟𝑦, 𝑃𝑜𝑜𝑟)
satisfaction rating (very satisfied, satisfied, neutral, dissatisfied, very dissatisfied)
stage of cancer (stages I, II, III, IV)
Levels of measurement
• INTERVAL LEVEL
numbers can be ordered and has exact difference between any
two units but has no meaningful zero or starting point

Examples:
Temperature
Test scores
Levels of measurement
• RATIO LEVEL
Data at this level can be ordered, has exact difference
between units, and has a meaningful zero.
Doesn’t have negative numbers unlike interval

Examples:
weight
height
distance
Levels of Measurement

Names, labels, or categories only.


NOMINAL Data cannot be arranged in order.

Names, labels or categories.


ORDINAL
Data can be arranged in order but the differences either can’t
be found or meaningless.
RATIO
Data can be arranged in order and has a meaningful zero.

INTERVAL
Data can be arranged in order but has NO meaningful zero.
To determine whether financial status affects the
children’s access to education, Carlo surveyed 100
children in his town and collected information on their
socio-economic status, age, school type (if attending),
and grade level.
POPULATION All children living in Carlo’s town.

SAMPLE 100 children living in Carlo’s town


Socio-economic status
VARIABLES Age
Type of School
Grade level
A study was conducted at a local university to
analyze the average cumulative GWA's of
students who graduated last year.

POPULATION All students who attended the local university last year.

A group of students who graduated from the local


SAMPLE
university last year.

VARIABLE Average cumulative GWA of students who


graduated from the local university last year
A professor wishes to determine the problems or issues
the students in the university have encountered during
the implementation of the limited face-to-face
learning. She surveyed 80 randomly selected students
to obtain her needed information.
POPULATION All the students in the university
SAMPLE 80 randomly selected students

VARIABLE Problems encountered by the students during the


implementation of limited F2F
DATA COLLECTION
Data collection methods
Use of Documented Data
oIf information is already available for use, then it would be more
practical to use documented data in gathering needed information.

Primary data are data documented by a primary source. The data


collectors themselves documented this data.
Secondary data are data documented by a secondary source. An
individual/agency, other than the data collectors, documented this data.
Data collection methods
Primary data:
Philippine Statistics Authority (PSA) is a primary source of
data on population, housing, and establishments.
The University Registrar’s Office is a primary source of
student records.
Bureau of Agricultural Statistics (BAS) is a primary source of
data on agriculture and livestock in the Philippines.
Data collection methods
Secondary data :
The United Nations’ compiled data for its yearbook, which
were originally gathered by government statistical agencies of
different countries.
The documented data of a student for his thesis, which were
originally collected by the Department of Labor and
Employment.
Data collection methods
Surveys
omethod of collecting data on the variable of interest by asking
people questions
orespondents are selected objectively by employing probability
sampling procedure

Census - data came from asking all the people in the population
Sample survey - data came from asking a sample of people from a well-
defined population
Data collection methods
Surveys
Example:

Pulse Asia, a private polling firm, conducted a pre-election survey of top


presidential candidates in the 2022 elections. Its respondents were
selected 2,400 adult Filipinos who intend to vote in the 2022 election.
Satisfaction of the customers of a store
Data collection methods
Experiments
omethod of collecting data where there is direct human
intervention on the conditions that may affect the values of the
variable of interest
Different types of variables:
The explanatory variable is the factor under study.
The response variable is the observation which the researcher uses for
comparison after conducting the experiment using the factor under study.
The extraneous variables are such which the researcher believes may have
an effect on the response variable
Data collection methods
Experiments

Determine if drinking alcohol affects memory


Sleeping time of students before an exam affects their grades
in the exam
A researcher aims to determine the sensitivity and accuracy of
smoke detectors in the market
Data collection methods
Observation
omethod of collecting data on the phenomenon of interest by
recording the observations made about the phenomenon as it
actually happens

oThe observation method is useful in studying the reactions and


behavior of individuals or groups of persons/objects in a given
situation or environment as it happens.
Data collection methods
Observation
Example:

researchers often use the observation method to study the


behavior of animals in the wild
Allowing children to play with selected toys to determine their
preference/s
Behavior of customers visiting a certain store
DATA PRESENTATION
TABULAR PRESENTATION

• A systematic and logical arrangement of data in the form of rows


and columns with respect to the characteristics of the data

• Tables should be simple and easy to understand. Each row and


column must have an appropriate label.

• A table is best suited for representing individual information and


represents both quantitative and qualitative information.
Types of Tables

Frequency Distribution

• Most convenient way of organizing data

• List each category of data and the number of occurrences for


each category
TABLE FORMAT

HEADING

Box Head
Types of Tables
SIMPLE OR ONE-WAY TABLE
a table that summarizes information on a single characteristic

Year
Types of Tables

COMPOUND TABLE
An extension of a simple table in which there are more than one
variable distributed among its attributes or sub-variables
Types of Tables

Frequency Distribution Table


1)List in the first column the variable as the column header, and the
categories as the row headers
2)Tally the occurrences in the second column
3)Input in the last column the total number of occurrences per
category
Supposed the following raw data give the blood types of 48 selected students of ISUCabagan.

O O AB A A O
AB A O O B A BLOOD TYPE TALLY FREQUENCY

O A AB O O A A

B A O O B O B

AB O A B B A O

A AB A B O O AB

O O A AB AB O
A O B O O O
A social worker was tasked to determine the number of children per family in the
province of Isabela. Rather than surveying every families in each of the
towns/cities in Isabela, she only surveyed randomly selected families in Ilagan
and Tumauini. The data she collected are provided below.

5 3 7 1 5 5 2 3 3 3

1 4 8 2 5 3 3 4 1 5

7 2 9 5 8 4 2 2 5 2

4 6 3 3 2 5 8 8 4 7

2 3 2 7 1 4 3 4 9 2
Title

Column Headings

Stubs

Source note
GRAPHICAL PRESENTATION

• A graph is a very effective visual tool as it displays data at a


glance, facilitates comparison, distribution, and can reveal
trends and relationships within the data such as changes over
time, and correlation or relative share of a whole.

• It is considered an important medium of communication


because we are able to create a pictorial representation of the
numerical figures.
GRAPHICAL PRESENTATION

LINE CHART

• Useful for presenting historical data


• Effective in showing movement of a series over time
• Appropriate when comparing two or more time series data
and trends over time
• Use to organize continuous data
GRAPHICAL PRESENTATION
Figure 1. Stock Prices of Google and Amazon

E XAMP LE
GRAPHICAL PRESENTATION
Figure 2. Number of Students enrolled from 2015 to 2020

E XAMP LE
GRAPHICAL PRESENTATION

E XAMP LE
GRAPHICAL PRESENTATION

COLUMN CHART or BAR GRAPH

• Can be use to organize qualitative data and discrete data


• It is constructed by labeling each category of data on either the
horizontal or vertical axis and the frequency or relative
frequency of the category on the other axis.
GRAPHICAL PRESENTATION

E XAMP LE
GRAPHICAL PRESENTATION

E XAMP LE
GRAPHICAL PRESENTATION

HORIZONTAL BAR CHART

• Appropriate when we wish to show the distribution of


categorical data.
• Used to compare magnitudes for different categories of a
qualitative variable.
GRAPHICAL PRESENTATION

E XAMP LE
GRAPHICAL PRESENTATION

E XAMP LE
GRAPHICAL PRESENTATION
PIE CHART

• Pie charts are typically used to present the relative frequency of


qualitative data.
• Pie charts are useful for showing the division of all possible values
of a qualitative variable into its parts.
• Circle divided into several sections. Each section represents a
category of data, and its area is proportional to the frequency of
the category
GRAPHICAL PRESENTATION
Figure 9. Weekly Expenses of freshman students for their online classes

E XAMP LE
GRAPHICAL PRESENTATION

E XAMP LE
GRAPHICAL PRESENTATION

HISTOGRAM

• It is constructed by drawing rectangles for each class of data. The


height of each rectangle is the frequency or relative frequency of
the class. The width of each rectangle is the same and the
rectangles touch each other.
• It is a graph used to present quantitative data, is similar to the bar
graph.
• It is use to organize continuous data.
GRAPHICAL PRESENTATION

E XAMP LE
TEXTUAL PRESENTATION

• Data is presented in the form of text, phrases or paragraphs

• Involves enumerating important characteristics, emphasizing


significant figures and identifying important features of the data

• we must always provide additional discussion about the relevance


of the figures in our presentation and its implications
Excerpts taken from the Isabela Covid-19 Case Updates

“As of 4PM today, the Department of Health reports a total number of


COVID- 19 cases at 290,190, after 3,475 newly-confirmed cases were
added to the list of COVID-19 patients.
DOH likewise announces 400 recoveries. This brings the total number of
recoveries to 230,233.
Twenty-eight duplicates were removed from the total case count. Of
these, 19 were recovered cases.
Moreover, 13 cases previously reported as recovered were reclassified as
death (12) and active (1) cases after final validation.”
Role of Statistics in Data Analysis
• To organize the number derived from measuring a trait or a variable.

• To describe and interpret the distribution of data, relationships between variables,


hypothesis being tested or parameters being predicted or estimated.

• To help the researcher in making credible decisions based on quantitative data or


arguments.

• To cope with changes by forecasting the future based on data on hand.

• To provide a plausible foundation for building new learning or teaching theory in education.

You might also like