0% found this document useful (0 votes)
48 views

Reviewer Stat Midterm

Statistics is the science of collecting, organizing, summarizing, and interpreting data. It originated from the Latin word "status" meaning "state." There are two main types of variables: qualitative variables which yield categorical responses and quantitative variables which take on numerical values. Descriptive statistics are used to organize and summarize collected data using tables, graphs and charts, while inferential statistics are used to draw conclusions from samples. Proper data collection methods include surveys, experiments and observation, and sampling techniques can be probability-based or non-probability based.

Uploaded by

Seok Jin Kim
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
48 views

Reviewer Stat Midterm

Statistics is the science of collecting, organizing, summarizing, and interpreting data. It originated from the Latin word "status" meaning "state." There are two main types of variables: qualitative variables which yield categorical responses and quantitative variables which take on numerical values. Descriptive statistics are used to organize and summarize collected data using tables, graphs and charts, while inferential statistics are used to draw conclusions from samples. Proper data collection methods include surveys, experiments and observation, and sampling techniques can be probability-based or non-probability based.

Uploaded by

Seok Jin Kim
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

INTRODUCTION TO STATISTICS -is based on the probability of a certain outcome

happening by chance.
Originated from the Latin word status which means 2 Types of VARIABLES
“state”. 1.Qualitative variables are variables that yield
The branch of science dealing with data about the categorical responses. It is a word or a code that
condition of a state or community. represents a class or category.
It is the branch of science that deals with the collection, 2.Quantitative variables take on numerical values
presentation, organization, analysis, and interpretation representing an amount or quantity.
of data.  Discrete variable is a quantitative variable that
Sir Ronald Fisher is known as the father of modern either a finite or a countable number of
statistics and experimental design. possible values.
The information referred to the definition is the data.  A continuous variable is a quantitative variable
MERIAM WEBSTER D., data are “factual information that has an infinite number of possible values
used as a basis for reasoning, discussion, or calculation”. that are not countable.
The variable is a characteristic or attribute of the Level of Measurements
elements in a collection that can assume different Weakest to Strongest
values for the different elements.
An observation is a realized value of a variable.
Data is the collection of observations.

KELVIN
NOMINAL- Identify, name, classify, or categorize objects
or events.
ORDINAL-It uses numbers in the scale to classify an
object into categories.
INTERVAL- It ranks data, and precise differences
between units of measure do exist; however, there is no
meaningful zero.
Universe is the set of all entities under study. RATIO- The system has an absolute zero.
Population is the set of all possible values of the Data collection is the process of gathering and
variable. measuring information on variables of interest.
An individual is a person or object that is a member of
the population being studied.
Sample is the subset of the universe or the population.
PURPOSE OF SAMPLES:
 Reduced Cost
 Greater Speed or Timeliness
 *Greater Efficiency and Accuracy
 *Greater Scope
 Convenience
METHODS of Collecting
 Necessity
Primary Data
 Ethical Considerations 1. Direct personal interviews
Descriptive statistics (Organize and summarize data) 2. Indirect/Questionnaire Method (Open-ended
-describe the information collected through. /Closed-ended Questionnaire)
numerical measurements, charts, graphs, 3. Focus group.
and tables. 4. Experiment
There are 3 main types of descriptive statistics: 5. Observation
1. The distribution concerns the frequency of Secondary Data
each value. 1. Published report on newspaper and
2. The central tendency concerns the averages of periodicals
the values. 2. Financial Data reported in annual reports.
3. The variability or dispersion concerns how 3. Records maintained by the institution.
spread out the values are. 4. Internal reports of the government
Inferential statistics (Draw conclusion from the data) departments
-uses methods that take results obtained from a 5. Information from official publications
sample, extends them to the population, and measures
the reliability of the result. MEASURES OF CENTRAL TENDENCY
 average or typical of the distribution. DISPERSION OR VARIATION the degree to which
 Value as where the middle of the distribution lies. numerical data tend to spread about an average value is
MEAN called the dispersion or variation of the data.

Weighted Mean Grouped data


MEAN

The median is the halfway point in a data set.

The mode of a data set is the data entry that occurs


with the greatest frequency.
Data set is No mode, if no entry is repeated.
The data set can be Bimodal (2) and multimodal (>2)
For grouped data

MIDRANGE (MR)- it gives the midpoint.

MEASURE OF POSITION

FRACTILES/QUANTILES

-they can be expressed as percentiles, deciles, or


quartiles.

For Grouped Data

Interquartile Range (IQR)

OUTLIER FORMULA *required in box plot.

Outliers are data points that are far from other data
points.
Method of collecting data
The survey is a method of collecting data on the
variable of interest by asking people questions.
Data came from asking:
-a sample is called sample survey.
-all people in the population called Census
Experiments: It is a method of collecting data where
there is a direct human intervention on the conditions
that may affect the values of the variable of interest.
Observation: It is a method of collecting data on the
phenomenon of interest by recording the observations
made about the phenomenon as it actually happens.
SAMPLING TECHNIQUES
Probability sampling is a method of selecting a sample
wherein each element in the population has a known,
nonzero chance of being included in the sample;
otherwise, it is non-probability sampling.
Methods of probability sampling are the following:
simple random sampling, stratified sampling, systematic
sampling, and cluster sampling.
Methods of nonprobability sampling are the following:
Distribution is positively skewed (both box and
haphazard or convenience sampling, judgment or
whiskers) no outliers
purposive sampling, and quota sampling.

1) Textual presentation
2) Tabular presentation

Graphical Representation is a way of analyzing


Distribution is negatively skewed (both box and numerical data. It exhibits the relation between
whiskers) data, ideas, information, and concepts in a diagram.
 There are different types of graphical
representation. Some of them are as follows:
histogram, line chart, bar chart, pie chart, and
pictograph.
GOOD CHART POSSESS
 Accurate
 Simple
 Clear
 Attractive

COLLECTION AND PRESENTATION OF DATA

Primary data is data documented by the primary


source. The data collectors themselves documented this
data.

Secondary data are data documented by the secondary Frequency Table


source. An individual/agency, other than the data Relative Class Frequency
collectors, documented this data. Frequency Distribution Table- grouped data, there is
classes.( Tally chart and Cummulative Freq distribution)
Bar Chart
Pie Chart
Histogram
Frequency Polygon
Ogive

You might also like