EDA Lesson 1
EDA Lesson 1
Data gathering
● First step in conducting statistical inquiry
● Systematic method of collecting and measuring data from different sources
information in order to provide answers to relevant questions
● Involves acquiring information published literature, surveys, questionnaires or
interviews, experimentations, documents and or examinations and other forms of
data gathering instruments.
Statistics
● -- for converting data into information
Investigator
● Person who conducts the inquiry
● The author or researcher of the study to be produced
Enumerator
● The one who helps in information collection
● Seldom called the surveyor or field surveyor
Respondent
● Whom the data or information is collected from
● The interviewee or the source of the information
Retrospective Study
● This study use the population or sample of the historical data which had been
archived over some period of time
Prospective Study
● Type of longitudinal study where researchers will follow and observe a group of
subjects over a period of time to gather information and record the development
of outcomes
A. Interview
C. Documentary Analysis
When the researcher obtains the information directly from the source.
Indirect methods of data collection involve sourcing and accessing existing data that
were not originally collected for the purpose of study. This type of data is known as
secondary data.
Ex. School enrolments; hospital admissions; records of births, deaths and marriages
III. Experiments
Used when the objective is to determine the cause and effect of a certain phenomenon
under some controlled conditions.
Survey
● Survey is a method of asking respondents some well-constructed questions
● It is an efficient way of collecting information and easy to administer wherein a
wide variety of information can be collected
● The researcher can be focused and can stick to the questions that interest him
and are necessary in his statistical inquiry or study
● Surveys can be done through face-to-face interviews or self-administered
through the use of questionnaires
Designing a Survey
Questionnaire
● is one of the common sources or method or gathering data. It can be classified
as open-ended and close-ended.
Questionnaires Design
Sampling
● Strategies which enable a researcher to pick a sub group as a basis for making
judgements about a larger group.
Steps on sampling
1. Identify the population
2. Determine the sample size
3. Select the sample
Sample size of the population
1. Slov'ns formula
n = N/1+Ne²
N= population size
n= sample size
e= margin of error
Sampling Process
● Sampling techniques are necessary when choosing respondents
● Sampling is the process of selecting units (e.g., people, organizations) from a
population of interest
● Sample must be a representative of the target population. The target population
is the entire group a researcher is interested in; the group about which the
researcher wishes to draw conclusions.
● There are two ways of selecting a sample: the non-probability sampling and
the probability sampling
Probability sampling
● Sampling techniques in which every item in a population has an equal chance of
being selected and quantifies as the sample
Types of Probability Sampling
1. Simple Random Sampling
2. Systematic Sampling
3. Stratified Random Sampling
4. Cluster Sampling
Probability sampling
● Simple random sampling
✓ Basic sampling techniques where a group of subjects (a sample) is selected for study
from a larger group (a population).
✓Each individual is chosen entirely by chance and each member of the population has
an equal chance of being included in the sample.
● Systematic sampling
✓Is a sampling technique which selects units by every kth member of a population after
arranging perhaps alphabetically or other sort.
● Stratified random sampling
✓A stratified sample is obtained by taking samples for each stratum or subgroup of a
population.
✓When a sample is to be taken from a population with several strata, the proportion of
each stratum in the sample should be the same as in the population.
● Cluster sampling
A sampling technique which occurs when one select the members of a sample in cluster
rather than in using separate individuals.
Sampling where groups, not individuals, are randomly selected.
Non-probability sampling
● Non-probability sampling is also called judgment or subjective sampling
● This method is convenient and economical, but the inferences made based on
the findings are not so reliable
● The most common types of non-probability sampling are the convenience
sampling, purposive sampling, and quota sampling.
Non-probability sampling
Design of Experiments
● Is a tool to develop an experimentation strategy that maximizes learning using
minimum resources.
● Design of Experiments is widely and extensively used by engineers and
scientists in improving existing processes through maximizing the yield and
decreasing the variability or in developing new products and processes.
● It is a technique needed to identify the "vital factors in the most efficient manner
and then directs the process to its best setting to meet the ever increasing
demand for improved quality and increased productivity.
Design of Experiments