STS Reviewer
STS Reviewer
Overview of Statistics
• MatLab is an analytical platform and programming
language that is widely used by engineers and
Statistics scientists. As with R, the learning path is steep, and
− Statistics is the science of collecting, organizing, you will be required to create your own code at some
summarizing, and analyzing information to draw point.
conclusions or answer questions.
− It provides procedure in data collection, presentation, SAS
organization, and interpretation to have a meaningful
• SAS is a statistical analysis platform that offers
idea.
options to use either the GUI, or to create scripts for
more advanced analyses. It is a premium solution
Importance of Statistics that is widely used in business, healthcare, and
− Statistics plays a major role in many aspects of our lives. human behavior research alike.
− It is used in sports, for example, to help a general
manager decide which player might be the best fit for a GraphPad Prism
team.
• GraphPad Prism is premium software primarily used
− It is used in politics to help candidates understand how
within statistics related to biology, but offers a range
the public feels about various policies.
of capabilities that can be used across various fields.
− It is used in medicine to help determine the effectiveness
of new drugs.
− Statistical research in business enables managers to Minitab
analyze past performance, predict future business • The Minitab software offers a range of both basic
practices and lead organizations effectively. Statistics and fairly advanced statistical tools for data analysis.
can describe markets, inform advertising, set prices and
respond to changes in consumer demand. Excel
− Statistics, being quantitative tools widely used in the • Excel offers a wide variety of tools for data
areas of economics and finance, could help to shape visualization and simple statistics. It is simple to
effective monetary and fiscal policies and to develop generate summary metrics and customizable
pricing models for financial assets such as equities, graphics and figures, making it a usable tool for
bonds, currencies, and derivative securities. many who want to see the basics of their data.
SPSS Data
• SPSS (Statistical Package for the Social Sciences) is • The information referred to the definition is the data.
perhaps the most widely used statistics software
• According to the Merriam Webster dictionary, data
package within human behavior research. SPSS
are “factual information used as a basis for
offers the ability to easily compile descriptive
reasoning, discussion, or calculation”
statistics, parametric and non-parametric analyses,
as well as graphical depictions of results through the
graphical user interface (GUI). Types of Statistics
Descriptive Statistics
R It basically consists of organizing and summarizing data.
Descriptive statistics describe data through numerical
• R is a free statistical software package that is widely
summaries, tables, and graphs.
used across both human behavior research and in
other fields. While R is a very powerful software, it Examples:
also has a steep learning curve, requiring a certain 1.The average score of a volleyball player for the past 10
degree of coding. games
2.Birth rate in rural areas in the Philippines
3.Enrollment record of all colleges in BSU – TNEU Lipa ❖ Quantitative variables or numerical variables are
Campus variables that take on numerical values representing
an amount or quantity. These numerical values
Inferential Statistics should answer the question how much or how many.
− Some examples of qualitative variables are height,
It is the logical process that involves generalizing
weight, distance, salary, etc.
from a sample to the population from which the sample was
❖ Variables can also be classified into two according
selected and assessing the reliability of such generalizations.
to purpose whether experimental or mathematical.
It is also called as statistical inference or inductive statistics.
Examples:
Experimental Classification
1. A car manufacturer wishes to estimate the average lifetime
of batteries by testing a sample of 50 batteries.
2. The political views of the youth in the urban areas with ➢ Independent variables or explanatory variables are
respect to inflation rate in Asia variables controlled by the experimenter or
researcher, and expected to have an effect on the
3. A campaign manager analyzes the effect of TV ads on the
behavior of the subjects.
promotion of a presidential candidate
➢ Dependent variables or outcome variables measure
the behavior of subjects and expected to be
Basic Terminologies in Statistics influenced by the independent variable.
o Example:
❖ A population consists of all the members of the o For instance, to predict the value of
group about which you want to draw a conclusion, fertilizer on the growth of plants, the
while sample is a portion or part of the population of dependent variable is the growth of plants
interest selected for analysis. while the independent variable is the
❖ A parameter is a numerical index describing a amount of fertilizer used.
characteristic of a population while a statistic is a
numerical index describing a characteristic of a Mathematical Classification
sample.
→ Discrete variables are quantitative variables that
are either a finite number of possible values or a
Sources of Data countable number of possible values. These are
variables that are countable.
❖ Primary data are data that come from an original Some examples of these variable are number of cars, number
source, and are intended to answer a specific of siblings, etc.
research question. This can be taken by interview, → Continuous variables are quantitative variables
mail-in questionnaire, survey or experimentation. that have an infinite number of possible values that
❖ Secondary data are data taken from previously are not countable. These are variables that are no
recorded data, such as information in previously longer countable but are measurable.
conducted research, financial statements, business
Some examples of these variables are height, weight, volume,
periodicals, and government reports. It can also be
etc.
taken electronically, for instance via internet
websites, etc.
❖ A constant is a characteristic of objects, people, or Level of Measurement of Variables
events that does not vary. For example, the → Nominal Level is the first level of measurement and
temperature at which water boils (100 degree it is characterized by data that consist of names,
Celsius) is a constant. labels or categories only. Data cannot be arranged in
❖ A variable is a characteristic of objects, people, or ordering scheme. Nominal scales have no numerical
events that can take different values. It can vary in value.
quantity like weight of people, or in quality like hair Some examples of nominal level variables are
color of people. - Sex (male or female)
- Type of School (public or private)
Two Types of Variables - Eye Color (blue, green, brown).
❖ Qualitative variables or categorical variables are → Ordinal Level involves data that may be arranged
variables that yield categorical responses. These are in some order, but differences between data values
words or codes that represent class or category. either cannot be determined or meaningless. An
− Some examples of qualitative variables are eye ordinal scale not only classifies subjects but also
color, sex, occupation, student number, etc.
ranks them in terms of the degree to which they existing data that were originally collected for the
possess a characteristic of interest. purpose of the study.
Some examples of ordinal level variables are → Questions can either be:
- Highest Educational Attainment (elementary, high school, o An open-ended question is a type of
bachelor, masteral, doctoral) question that does not include response
categories. This type of question is usually
- Rank of military officer (lieutenant, captain, major,
appropriate for collecting subjective data.
colonel).
o A closed-ended question is a type of
question that includes a list of response
→ Interval Level is a measurement level that specifies categories from which the respondent will
the distances between each interval on the scale. select his answer. This type of question is
Variables of this level have no absolute zero. This usually appropriate for collecting objective
means that a value of zero does not mean the absence data.
of the quantity. 3. Focus Group – It is a group interview of
Some examples of interval level variables are approximately six to twelve people who share
- Temperature on Fahrenheit/Celsius thermometer similar characteristics or common interests. A
- IQ (e.g., high IQ vs. average IQ vs. low IQ), facilitator guides the group based on a
predetermined set of topics.
4. Experiment – It is a method of collecting data
→ Ratio Level represents the highest, most precise,
where there is direct human intervention on the
level of measurement. Variables of this level have
conditions that may affect the values of the variable
absolute zero which means that a value of zero
of interest.
means the absence of the quantity.
5. Observation – It is a method of collecting data on
Some examples of ratio level variables are the phenomenon of interest by recording the
- Height and weight observations made about the phenomenon as it
- Time actually happens. involves collecting information
- Distance and speed without asking questions.
Sample Mean There are also some cases when data set values have
the same number frequency, when this occur, the data set is
said to be no mode.
Measure of Dispersion
→ Spread of data values from the average
→ Dispersion is the difference between the actual value
and the average value.
Range
- Difference of highest and lowest value.
- (low value – lesser the variability or malapit sa mean)
Standard Deviation
- Describes the difference between data values and
mean
-
- calculated as the square root of variance.
Variance
- Squared measure of standard deviation.
Whatever you do, work at it with all your heart, as working for the Lord, not for human masters.
- Colossians 3:23