Quantitative data Analysis 2025.pptx
Quantitative data Analysis 2025.pptx
Analysis
By: Dr. Mulugeta W and Mr. Dagne Amdetsion
Portion 3
Key points
Researchers need to know how to select and use different charting and statistical
techniques
Main Concern
Typically, large data sets include errors. For example, respondents may fill
fields incorrectly or skip them accidentally. To make sure that there are no
such errors, the researcher should conduct basic data checks, check for
outliers, and edit the raw research data to identify and clear out any data
points that may hamper the accuracy of the results.
For example, an error could be fields that were left empty by respondents.
While editing the data, it is important to make sure to remove or fill all the
empty fields. (Here are 4 methods to deal with missing data.)
Step 3: Data Coding
Data Collection:
Research design (experimental, correlational, descriptive)
Sampling methods (random, stratified, convenience)
Data collection instruments (surveys, questionnaires, standardized tests)
Data Cleaning and Preparation:
Missing data handling (imputation techniques)
Outlier identification and management
Data transformation (e.g., log transformation)
Analysing quantitative data
null hypothesis
"the two groups will not differ“
alternative hypothesis
"group A will do better than group B"
"group A and B will not perform the same"
Based on the results, the assumed null hypothesis is accepted or rejected. If
the null hypothesis is rejected, it indicates that data readings are strong
and are probably not due to chance.
Null hypothesis rejected: Differences are statistically significant
Null hypothesis accepted: Differences are not statistically significant
Possible Outcomes in
Hypothesis Testing (Decision)
Correct
Reject Error
Decision
Type I Error
T-Test
The data collected must follow a continuous or ordinal scale, such as the
scores for an IQ test.
The data is collected from a randomly selected portion of the total
population
The data will result in a normal distribution of a bell-shaped curve.
Equal or homogenous variance exists when the standard variations are
equal.
Using the T-Test
Where
c = Degrees of freedom
O = Observed Value
E = Expected Value
The degrees of freedom in a statistical calculation represent the number of
variables that can vary. The degrees of freedom can be calculated to
ensure that chi-square tests are statistically valid. These tests are frequently
used to compare observed data with data expected to be obtained if a
particular hypothesis were true.
The Observed values are those you gather yourselves.
The expected values are the anticipated frequencies, based on the null
hypothesis.
Types of Chi-Square Tests
Y = a + bX + ϵ
Where:
Y – Dependent variable
X – Independent (explanatory) variable
a – Intercept
b – Slope
ϵ – Residual (error)
Regression Analysis – Multiple Linear Regression
Where:
Y – Dependent variable
X1, X2, X3 – Independent (explanatory) variables
a – Intercept
b, c, d – Slopes
ϵ – Residual (error)
Multiple Linear Regression
Multiple linear regression follows the same conditions as the simple linear
model. However, since there are several independent variables in multiple
linear analysis, there is another mandatory condition for the model:
Non-collinearity: Independent variables should show a minimum
correlation with each other. If the independent variables are highly
correlated with each other, it will be difficult to assess the true relationships
between the dependent and independent variables.