STATISTICS
Statistics plays a significant role in education in various ways. It involves collecting, analyzing,
interpreting, and presenting data related to educational processes and outcomes. Here are some
key roles of statistics in education:
Data-driven decision-making: Statistics provides educators with tools and techniques to
collect and analyze data on student performance, teacher effectiveness, curriculum
evaluation, and other educational factors. This data can inform decision-making at
various levels, from individual student interventions to school-wide policies and
curriculum development.
Evaluation and assessment: Statistics is used to develop and implement valid and
reliable assessment tools to measure student learning outcomes, evaluate the
effectiveness of educational programs, and monitor progress towards educational goals.
Statistical analysis of assessment data helps identify strengths and weaknesses in the
educational system and guides evidence-based improvements.
2. Explain the general nature of the normal curve.
The normal curve, also known as the bell curve or Gaussian distribution, is a statistical concept
that describes the distribution of a continuous random variable in a population. It is a
symmetrical, bell-shaped curve whose position and spread are fully determined by its mean and
standard deviation; its mean, median, and mode all fall at the same central point.
Bell-shaped: The normal curve is bell-shaped: its highest point (peak) is at the mean, and it
tapers off symmetrically on both sides. The tails of the curve extend indefinitely in both
directions, getting closer and closer to the horizontal axis without ever touching it.
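As a rough numerical illustration of the bell shape, the sketch below uses Python's standard-library `statistics.NormalDist` to check the symmetry of the curve and the share of the area lying within one, two, and three standard deviations of the mean. The mean of 100 and standard deviation of 15 are assumed values chosen only for illustration.

```python
from statistics import NormalDist

# Hypothetical normal curve: mean 100, standard deviation 15 (assumed values).
curve = NormalDist(mu=100, sigma=15)

# Symmetry: the curve has the same height at equal distances from the mean.
left_height = curve.pdf(85)
right_height = curve.pdf(115)

def area_within(k):
    """Proportion of the area within k standard deviations of the mean."""
    upper = curve.mean + k * curve.stdev
    lower = curve.mean - k * curve.stdev
    return curve.cdf(upper) - curve.cdf(lower)

for k in (1, 2, 3):
    print(f"within {k} SD: {area_within(k):.4f}")
```

The three printed proportions are approximately 0.6827, 0.9545, and 0.9973, whatever mean and standard deviation are chosen.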
Mean: The mean, or average, is the sum of all the data points divided by the number of data
points. It is widely used in various fields to represent the "typical" value of a dataset. However, it
can be sensitive to extreme values or outliers, and may not accurately represent the center of
the data if the distribution is not symmetric.
Median: The median is the middle value when a set of data points is arranged in ascending or
descending order. It is less sensitive to extreme values and is often used when the data is skewed
or contains outliers, as it represents the "middle" value that separates the higher and lower
halves of the dataset.
Mode: The mode is the value that appears most frequently in a dataset. It is used to represent
the most common or frequently occurring value in a dataset, and it can be used for both
categorical and numerical data.
Central tendency measures are used in various applications, such as in descriptive statistics, data
analysis, and inferential statistics, to summarize and describe the central or typical value of a
dataset.
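The three measures can be compared on a small example. The sketch below uses Python's standard-library `statistics` module on a hypothetical set of scores that includes one deliberate outlier, to show how the mean shifts while the median and mode do not.

```python
from statistics import mean, median, mode

# Hypothetical test scores; the last value is a deliberate outlier.
scores = [70, 75, 75, 80, 85, 150]

print(mean(scores))    # pulled upward by the outlier
print(median(scores))  # middle of the ordered values
print(mode(scores))    # most frequent value
```

Here the mean is about 89.2, while the median (77.5) and the mode (75) stay close to the bulk of the scores.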
b. Variability: Variability measures are used to describe the spread, dispersion, or variability of a
set of data points. They provide information about how much the data points deviate from the
central tendency measures. Some common measures of variability include:
Range: The range is the difference between the highest and lowest values in a dataset. It
provides a simple measure of the spread of data, but it can be influenced by extreme values and
may not fully represent the variability of the dataset.
Variance: Variance is a measure of how much the data points deviate from the mean. It is the
average of the squared differences between each data point and the mean. It provides a
measure of the overall variability of the dataset, with higher values indicating greater variability.
Standard Deviation: The standard deviation is the square root of the variance. It is commonly
used to describe the spread of data points around the mean, and it is widely used in statistics
and data analysis as a measure of variability. It is often preferred over the variance because it is
expressed in the same units as the original data and is more easily interpretable.
Variability measures are used in various applications, such as in data analysis, quality control,
and inferential statistics, to quantify and describe the spread or variability of a dataset.
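A minimal sketch of these measures, again using Python's `statistics` module on hypothetical scores. It assumes the "average of the squared differences" definition given above (division by n, `pvariance`) and also shows the sample version (division by n - 1, `variance`) for comparison.

```python
from statistics import pvariance, pstdev, variance

scores = [60, 70, 80, 90, 100]  # hypothetical scores

data_range = max(scores) - min(scores)  # highest minus lowest value
pop_var = pvariance(scores)             # average squared deviation from the mean
pop_sd = pstdev(scores)                 # square root of pop_var, in score units
sample_var = variance(scores)           # divides by n - 1 instead of n

print(data_range, pop_var, round(pop_sd, 2), sample_var)
```

For these scores the range is 40, the variance is 200 (250 with the n - 1 divisor), and the standard deviation is about 14.14.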
c. Correlation: Correlation measures are used to quantify the strength and direction of a
relationship between two variables. They are commonly used in statistics and data analysis to
assess the degree to which two variables change together. Some common measures of
correlation include:
Pearson's Correlation Coefficient: Pearson's correlation coefficient, often denoted as "r",
measures the linear relationship between two variables. It ranges from -1 to 1, where -1
indicates a perfect negative correlation, 1 indicates a perfect positive correlation, and 0 indicates
no linear correlation.
Correlation is used in various fields, such as statistics, social sciences, business, and healthcare,
to understand the relationship between variables and make informed decisions based on the
strength and direction of the association.
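The coefficient can be sketched directly from deviation scores. The function name `pearson_r` and the hours-and-marks data below are hypothetical and used only for illustration.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson's r computed from deviations about the two means."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / sqrt(sum_xx * sum_yy)

hours = [1, 2, 3, 4, 5]        # hypothetical hours studied
marks = [52, 55, 61, 64, 68]   # hypothetical marks

print(round(pearson_r(hours, marks), 3))   # strong positive relationship
print(pearson_r([1, 2, 3], [6, 4, 2]))     # perfectly negative linear relationship
```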
Define the problem: The first step in statistical analysis is to clearly define the problem or
research question. This involves identifying what information or insights are required, what data
needs to be collected, and what statistical methods are appropriate for the analysis.
Plan the study or experiment: In this step, the researcher develops a plan or strategy for
collecting data. This may involve determining the sample size, selecting the sampling method,
and designing the data collection process. It is important to carefully plan the study to ensure
that the data collected will be representative and reliable.
Collect the data: Once the study is planned, data collection takes place. This involves gathering
the relevant data according to the predetermined plan. Data can be collected through various
methods, such as surveys, experiments, observations, or existing sources.
Clean and preprocess the data: After data collection, the next step is to clean and preprocess
the data. This involves checking for errors, missing values, outliers, and inconsistencies in the
data, and addressing them appropriately. Data preprocessing may also involve transforming or
normalizing the data to make it suitable for analysis.
Analyze the data: In this step, various statistical techniques are applied to the data to extract
meaningful information and draw conclusions. This may involve descriptive statistics, such as
measures of central tendency and variability, as well as inferential statistics, such as hypothesis
testing, regression analysis, or analysis of variance (ANOVA), depending on the nature of the
data and research question.
Interpret the results: Once the data is analyzed, the results are interpreted in the context of
the research question or problem. This involves drawing conclusions, making inferences, and
interpreting the findings in a meaningful way. It is important to consider the limitations of the
data and the statistical methods used during interpretation.
Communicate the results: The findings of the statistical analysis are typically communicated to
others through reports, presentations, or visualizations. It is important to effectively
communicate the results to stakeholders, such as decision-makers, colleagues, or the general
public, in a clear and understandable manner.
Draw conclusions and make recommendations: Finally, conclusions are drawn from the results
and their interpretation. This may involve summarizing the main findings, identifying patterns
or trends, and recommending future actions or further research.
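The cleaning, analysis, and reporting steps above can be sketched in a few lines. The raw scores, the valid 0 to 100 range, and the choice of summary statistics are all assumptions made for illustration, not part of any particular study.

```python
from statistics import mean, pstdev

# Hypothetical raw scores (0-100 scale) with a missing value and an entry error.
raw_scores = [78, 85, None, 92, 67, 850, 74, 88]

# Clean: drop missing values and scores outside the valid 0-100 range.
clean_scores = [s for s in raw_scores if s is not None and 0 <= s <= 100]

# Analyze: descriptive statistics on the cleaned data.
summary = {
    "n": len(clean_scores),
    "mean": mean(clean_scores),
    "sd": pstdev(clean_scores),
}

# Communicate: report the result together with what was excluded.
excluded = len(raw_scores) - len(clean_scores)
print(f"n = {summary['n']}, mean = {summary['mean']:.1f}, "
      f"SD = {summary['sd']:.1f} ({excluded} records excluded)")
```

Reporting the number of excluded records alongside the statistics keeps the limitations of the data visible at the interpretation stage.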
5. Define, explain or discuss the following:
a. Kurtosis
b. Test
c. Evaluation
d. Ranking
e. Percentile
a. Kurtosis: Kurtosis is a statistical measure that describes the shape of a probability
distribution, specifically the peakedness and tailedness of a dataset relative to a normal
distribution. Measured as excess kurtosis (kurtosis minus 3, so that a normal distribution scores
0), distributions can be classified into three types:
Mesokurtic: A mesokurtic distribution has the same kurtosis as a normal distribution, so
its excess kurtosis is zero.
Leptokurtic: A leptokurtic distribution has a higher peak and thicker tails compared to a
normal distribution. Its excess kurtosis is positive, indicating heavier tails and a more
peaked distribution.
Platykurtic: A platykurtic distribution has a flatter peak and lighter tails compared to a
normal distribution. Its excess kurtosis is negative, indicating lighter tails and a flatter
distribution.
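A minimal sketch of the sign convention, assuming the moment-based definition of excess kurtosis (fourth central moment divided by the squared variance, minus 3). The function name `excess_kurtosis` and both datasets are hypothetical.

```python
def excess_kurtosis(data):
    """Moment-based excess kurtosis: m4 / m2**2 - 3 (0 for a normal curve)."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n   # second central moment
    m4 = sum((x - mean) ** 4 for x in data) / n   # fourth central moment
    return m4 / m2 ** 2 - 3

flat = [1, 2, 3, 4, 5]                      # evenly spread values
peaked = [0, 0, 0, 0, 0, 0, 0, 0, -5, 5]    # mostly central, two extreme values

print(round(excess_kurtosis(flat), 2))      # negative: platykurtic
print(round(excess_kurtosis(peaked), 2))    # positive: leptokurtic
```

The evenly spread data give about -1.3 and the data with two extreme values give 2.0.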
b. Test: In statistics, a test refers to a procedure or method used to assess a hypothesis or claim
about a population parameter based on a sample of data. Tests are used to draw inferences
and judge whether the data support a claim or hypothesis. Common hypothesis-testing
procedures include t-tests, chi-square tests, ANOVA (Analysis of Variance), and regression
analysis.
c. Evaluation: Evaluation refers to the process of assessing or determining the value, quality,
effectiveness, or performance of something. In various contexts, evaluation can be done for
different purposes, such as assessing the success of a program, measuring the performance of
an employee, evaluating the accuracy of a model or prediction, or determining the effectiveness
of a product or service. Evaluation often involves collecting data, analyzing and interpreting
results, and making judgments or recommendations based on the findings.
d. Ranking: Ranking is the process of arranging items or entities in a particular order based on a
set of criteria or attributes. Rankings are commonly used in various fields, such as sports,
business, education, and research. Ranking can be done using different methods, such as
numerical rankings, ordinal rankings, or subjective rankings. Rankings can be used to compare
and contrast items, establish priorities, make decisions, or assess performance.
(A) Frequency distribution
Class interval    f
90-94             2
85-89             2
80-84             4
75-79             8
70-74             6
65-69             11
60-64             9
55-59             7
50-54             5
45-49             0
40-44             2

(B) Scores on Test X and Test Y (N = 10)
Individual    X     Y
A             80    103
B             89    100
C             85    105
D             70    110
E             80    80
F             75    99
G             60    95
H             55    100
I             80    98
J             75    100
Using frequency distribution (A) above, answer the following questions.
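11. Compute P90 and P10 using $P_p = l + \left(\frac{pN - F}{f_p}\right) i$, where $l$ is the exact
lower limit of the interval containing $P_p$, $pN$ is the number of cases lying below $P_p$, $F$ is
the cumulative frequency below $l$, $f_p$ is the frequency within that interval, and $i$ is the
interval width. Answer: P10 = 53.1, P90 = 82.5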
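The percentile computation can be sketched as follows, using the class intervals and frequencies of distribution (A). The function name `grouped_percentile` is hypothetical, and the sketch assumes integer scores, so that the exact lower limit of each interval is 0.5 below its lowest score.

```python
def grouped_percentile(intervals, p, width):
    """Percentile from grouped data: l + ((p*N - F) / f_p) * width.

    intervals: (lowest score in interval, frequency) pairs in ascending order.
    """
    n = sum(f for _, f in intervals)
    target = p * n        # number of cases lying below the percentile point
    cumulative = 0        # F: cases below the current interval
    for lower, f in intervals:
        if f > 0 and cumulative + f >= target:
            exact_lower = lower - 0.5   # e.g. 49.5 for the interval 50-54
            return exact_lower + (target - cumulative) / f * width
        cumulative += f
    raise ValueError("p must be between 0 and 1")

# Distribution (A), listed from the lowest interval upward.
dist_a = [(40, 2), (45, 0), (50, 5), (55, 7), (60, 9), (65, 11),
          (70, 6), (75, 8), (80, 4), (85, 2), (90, 2)]

p10 = grouped_percentile(dist_a, 0.10, width=5)
p90 = grouped_percentile(dist_a, 0.90, width=5)
print(round(p10, 1), round(p90, 1))
```

With N = 56 this reproduces P10 = 53.1 (5.6 cases, reached in the 50-54 interval) and P90 = 82.5 (50.4 cases, reached in the 80-84 interval).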
12. Give the useful index of skewness using the formula $Sk = \frac{P_{90} + P_{10}}{2} - P_{50}$.
Answer: 0.0674
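13. Compute the coefficient of correlation of Test X and Test Y by the Rank Difference Method from
the data in (B) above, using $\rho = 1 - \frac{6 \sum D^2}{N(N^2 - 1)}$, where $D$ is the difference
between an individual's ranks on the two tests. Answer: 0.0909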
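A sketch of the rank-difference computation on the Test X and Test Y scores in (B). It assumes that rank 1 goes to the highest score and that tied scores share the average of the ranks they occupy; the function names are hypothetical.

```python
def average_ranks(values):
    """Rank from highest (rank 1) to lowest, averaging the ranks of ties."""
    ordered = sorted(values, reverse=True)
    ranks = []
    for v in values:
        first = ordered.index(v) + 1            # first position held by v
        count = ordered.count(v)                # number of tied scores
        ranks.append(first + (count - 1) / 2)   # mean of the tied positions
    return ranks

def rank_difference_rho(x, y):
    """Return (rho, sum of squared rank differences)."""
    n = len(x)
    sum_d2 = sum((rx - ry) ** 2
                 for rx, ry in zip(average_ranks(x), average_ranks(y)))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1)), sum_d2

test_x = [80, 89, 85, 70, 80, 75, 60, 55, 80, 75]        # individuals A-J
test_y = [103, 100, 105, 110, 80, 99, 95, 100, 98, 100]

rho, sum_d2 = rank_difference_rho(test_x, test_y)
print(sum_d2, round(rho, 4))
```

Under these tie-handling assumptions the sum of squared differences is 145.5 and the coefficient is about 0.118. The quoted 0.0909 corresponds to a sum of 150, and small differences in how tied scores are ranked are enough to change the sum and therefore the coefficient.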
14. Compute the weighted mean of the responses to determine the extent of teachers' opinion
towards the use of technological innovations in the classroom.
N:
Data:
Response     f     w
Agree        20    3
Undecided    5     2
Disagree     5     1
Total        30
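The weighted mean for this table is the sum of each frequency times its weight, divided by the total frequency. A minimal sketch using the f and w values above (the variable names are illustrative):

```python
# Frequencies (f) and weights (w) from the response table in item 14.
responses = {"Agree": (20, 3), "Undecided": (5, 2), "Disagree": (5, 1)}

total_f = sum(f for f, _ in responses.values())
weighted_mean = sum(f * w for f, w in responses.values()) / total_f

print(total_f, weighted_mean)
```

This gives (20 x 3 + 5 x 2 + 5 x 1) / 30 = 75 / 30 = 2.5, a value midway between the weights for Undecided (2) and Agree (3).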
15. Using the data in no. 14, compute the chi-square ($\chi^2$) test of significant difference.
Interpret your answer.
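One way to approach this item is a chi-square goodness-of-fit test on the three response frequencies. The sketch below assumes a null hypothesis of equal expected frequencies (30 / 3 = 10 per category) and a 0.05 level of significance; neither assumption is stated in the item itself.

```python
from math import exp

observed = [20, 5, 5]   # Agree, Undecided, Disagree
# Assumed null hypothesis: responses are spread equally across categories.
expected = [sum(observed) / len(observed)] * len(observed)

chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1

# For df = 2 the chi-square upper-tail probability is exactly exp(-x / 2).
p_value = exp(-chi_square / 2)
critical_05 = 5.991     # tabled chi-square value for df = 2, alpha = 0.05

print(chi_square, df, chi_square > critical_05, p_value < 0.05)
```

Under these assumptions the statistic is (100 + 25 + 25) / 10 = 15 with 2 degrees of freedom, which exceeds 5.991, so the observed frequencies differ significantly from an equal split, with "Agree" chosen far more often than expected.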