STATISTICS

Uploaded by

Catherine Ventic

1. Discuss the role of statistics in Education.

Statistics plays a significant role in education in various ways. It involves collecting, analyzing,
interpreting, and presenting data related to educational processes and outcomes. Here are some
key roles of statistics in education:
Data-driven decision-making: Statistics provides educators with tools and techniques to
collect and analyze data on student performance, teacher effectiveness, curriculum
evaluation, and other educational factors. This data can inform decision-making at
various levels, from individual student interventions to school-wide policies and
curriculum development.

Evaluation and assessment: Statistics is used to develop and implement valid and
reliable assessment tools to measure student learning outcomes, evaluate the
effectiveness of educational programs, and monitor progress towards educational goals.
Statistical analysis of assessment data helps identify strengths and weaknesses in the
educational system and guides evidence-based improvements.
2. Explain the general nature of the normal curve.
The normal curve, also known as the bell curve or Gaussian distribution, describes the
distribution of a continuous random variable in a population. It is a symmetrical, bell-shaped
curve that is fully characterized by two parameters: its mean and its standard deviation.

The general nature of the normal curve can be summarized as follows:


Symmetry: The normal curve is symmetric about the vertical line through the mean, so each
half of the curve is a mirror image of the other. As a result, the mean, median, and mode of
a normal distribution are all equal and located at the centre of the curve.

Bell-shaped: The normal curve is bell-shaped: it has its highest point (peak) at the mean and
tapers off symmetrically on both sides. The tails extend indefinitely in both directions,
getting closer and closer to the horizontal axis without ever touching it (the curve is
asymptotic to the axis).

In summary, the normal curve is a symmetrical, bell-shaped, unimodal distribution that is
asymptotic, and it is fully defined by its mean and standard deviation. It is a common distribution
used in statistical analysis due to its well-known properties and widespread occurrence in many
natural phenomena.
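
The properties above can be checked numerically. The following is a minimal sketch, assuming a hypothetical score scale with mean 100 and standard deviation 15 (these values are for illustration only). It evaluates the height of the curve and the area within 1, 2, and 3 standard deviations of the mean.

```python
import math

def normal_pdf(x, mean=0.0, sd=1.0):
    """Height of the normal curve at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2 * math.pi))

def normal_cdf(x, mean=0.0, sd=1.0):
    """Proportion of the area under the curve that lies to the left of x."""
    return 0.5 * (1 + math.erf((x - mean) / (sd * math.sqrt(2))))

mean, sd = 100.0, 15.0  # hypothetical scale, chosen only for illustration

# Symmetry: points equally far below and above the mean have the same height.
left = normal_pdf(mean - 10, mean, sd)
right = normal_pdf(mean + 10, mean, sd)

# Bell shape: the peak is at the mean; far tails are tiny but never zero.
peak = normal_pdf(mean, mean, sd)
far_tail = normal_pdf(mean + 6 * sd, mean, sd)

# Area within 1, 2 and 3 standard deviations of the mean.
areas = [normal_cdf(mean + k * sd, mean, sd) - normal_cdf(mean - k * sd, mean, sd)
         for k in (1, 2, 3)]

print(math.isclose(left, right), peak > left, far_tail > 0)
print([round(a, 4) for a in areas])
```

The areas come out close to the familiar 68%, 95%, and 99.7% figures, whatever mean and standard deviation are chosen.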

3. Explain the uses of the following measures.


a. Central Tendency
b. Variability
c. Correlation
a. Central Tendency: Central tendency measures are used to describe the typical or central value
of a set of data points. The most common measures of central tendency are:

Mean: The mean, or average, is the sum of all the data points divided by the number of data
points. It is widely used in various fields to represent the "typical" value of a dataset. However, it
can be sensitive to extreme values or outliers, and may not accurately represent the center of
the data if the distribution is not symmetric.

Median: The median is the middle value when a set of data points is arranged in ascending or
descending order. It is less sensitive to extreme values and is often used when the data is skewed
or contains outliers, as it represents the "middle" value that separates the higher and lower
halves of the dataset.

Mode: The mode is the value that appears most frequently in a dataset. It is used to represent
the most common or frequently occurring value in a dataset, and it can be used for both
categorical and numerical data.

Central tendency measures are used in various applications, such as in descriptive statistics, data
analysis, and inferential statistics, to summarize and describe the central or typical value of a
dataset.
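
As an illustration, the sketch below uses Python's standard `statistics` module on a small set of hypothetical test scores. It shows how a single extreme value pulls the mean upward while the median and mode barely move.

```python
import statistics

scores = [70, 72, 75, 75, 78, 80, 82]  # hypothetical test scores
with_outlier = scores + [160]          # the same scores plus one extreme value

for label, data in (("original", scores), ("with outlier", with_outlier)):
    print(label,
          "mean =", statistics.mean(data),
          "median =", statistics.median(data),
          "mode =", statistics.mode(data))
```

Here the mean rises from 76 to 86.5 when the extreme score is added, while the median only moves from 75 to 76.5 and the mode stays at 75.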

b. Variability: Variability measures are used to describe the spread, dispersion, or variability of a
set of data points. They provide information about how much the data points deviate from the
central tendency measures. Some common measures of variability include:

Range: The range is the difference between the highest and lowest values in a dataset. It
provides a simple measure of the spread of data, but it can be influenced by extreme values and
may not fully represent the variability of the dataset.

Variance: Variance is a measure of how much the data points deviate from the mean. For a
population it is the average of the squared differences between each data point and the mean;
for a sample, the sum of squared differences is divided by n - 1 instead of n. Higher values
indicate greater variability.

Standard Deviation: The standard deviation is the square root of the variance. It is commonly
used to describe the spread of data points around the mean, and it is widely used in statistics
and data analysis as a measure of variability. It is often preferred over the variance because it is
expressed in the same units as the original data and is more easily interpretable.

Variability measures are used in various applications, such as in data analysis, quality control,
and inferential statistics, to quantify and describe the spread or variability of a dataset.
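
A short sketch of these measures on hypothetical scores follows. The population variance is computed directly from its definition, and the `statistics` module supplies the sample version, which divides by n - 1.

```python
import statistics

scores = [60, 65, 70, 75, 80]  # hypothetical scores

data_range = max(scores) - min(scores)

mean = statistics.mean(scores)
# Population variance: average squared deviation from the mean.
pop_variance = sum((x - mean) ** 2 for x in scores) / len(scores)
pop_sd = pop_variance ** 0.5   # same units as the scores

# Sample variance divides by n - 1 instead of n.
sample_variance = statistics.variance(scores)

print("range:", data_range)
print("population variance:", pop_variance, "SD:", round(pop_sd, 3))
print("sample variance:", sample_variance)
```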

c. Correlation: Correlation measures are used to quantify the strength and direction of a
relationship between two variables. They are commonly used in statistics and data analysis to
assess the degree to which two variables change together. Some common measures of
correlation include:
Pearson's Correlation Coefficient: Pearson's correlation coefficient, often denoted as "r",
measures the linear relationship between two variables. It ranges from -1 to 1, where -1
indicates a perfect negative correlation, 1 indicates a perfect positive correlation, and 0 indicates
no linear correlation (a nonlinear relationship may still exist).

Spearman's Rank Correlation Coefficient: Spearman's rank correlation coefficient, often
denoted as "ρ", measures the monotonic relationship between two variables, so it can capture
relationships that consistently increase or decrease without being linear. It is based on the
ranks of the data points rather than their actual values and is used when the data is ordinal,
not normally distributed, or contains outliers.

Point-Biserial Correlation Coefficient: The point-biserial correlation coefficient measures the
association between a dichotomous variable (e.g., yes/no) and a continuous variable. It is
commonly used when one variable is binary and the other is continuous.

Correlation is used in various fields, such as statistics, social sciences, business, and healthcare,
to understand the relationship between variables and make informed decisions based on the
strength and direction of the association.
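
The three coefficients can be sketched with one helper. The data below are hypothetical, and the function names are chosen for illustration only. Spearman's coefficient is computed here as Pearson's r applied to ranks, and the point-biserial coefficient as Pearson's r with the dichotomous variable coded 0/1.

```python
import math

def pearson_r(x, y):
    """Pearson's r from deviations about the two means."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def average_ranks(values):
    """Rank from 1 (smallest); tied values share the average of their ranks."""
    ordered = sorted(values)
    return [ordered.index(v) + (ordered.count(v) + 1) / 2 for v in values]

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))

hours = [1, 2, 3, 4, 5]          # hypothetical study hours
score = [1, 8, 27, 64, 125]      # always rises with hours, but not in a straight line
passed = [0, 0, 1, 1, 1]         # dichotomous variable coded 0 = no, 1 = yes

print("Pearson r:", round(pearson_r(hours, score), 3))
print("Spearman rho:", round(spearman_rho(hours, score), 3))
print("Point-biserial r:", round(pearson_r(passed, score), 3))
```

Because the scores always rise with hours, Spearman's ρ is 1, while Pearson's r is below 1 because the relationship is not a straight line.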

4. Explain the different steps used in statistics.


Statistics is a branch of mathematics that deals with the collection, analysis, interpretation,
presentation, and organization of data. It involves several steps or stages, which are typically
followed in a systematic manner to obtain meaningful and reliable results. The general steps
used in statistics are:

Define the problem: The first step in statistical analysis is to clearly define the problem or
research question. This involves identifying what information or insights are required, what data
needs to be collected, and what statistical methods are appropriate for the analysis.

Plan the study or experiment: In this step, the researcher develops a plan or strategy for
collecting data. This may involve determining the sample size, selecting the sampling method,
and designing the data collection process. It is important to carefully plan the study to ensure
that the data collected will be representative and reliable.

Collect the data: Once the study is planned, data collection takes place. This involves gathering
the relevant data according to the predetermined plan. Data can be collected through various
methods, such as surveys, experiments, observations, or existing sources.

Clean and preprocess the data: After data collection, the next step is to clean and preprocess
the data. This involves checking for errors, missing values, outliers, and inconsistencies in the
data, and addressing them appropriately. Data preprocessing may also involve transforming or
normalizing the data to make it suitable for analysis.

Analyze the data: In this step, various statistical techniques are applied to the data to extract
meaningful information and draw conclusions. This may involve descriptive statistics, such as
measures of central tendency and variability, as well as inferential statistics, such as hypothesis
testing, regression analysis, or analysis of variance (ANOVA), depending on the nature of the
data and research question.

Interpret the results: Once the data is analyzed, the results are interpreted in the context of
the research question or problem. This involves drawing conclusions, making inferences, and
interpreting the findings in a meaningful way. It is important to consider the limitations of the
data and the statistical methods used during interpretation.

Communicate the results: The findings of the statistical analysis are typically communicated to
others through reports, presentations, or visualizations. It is important to effectively
communicate the results to stakeholders, such as decision-makers, colleagues, or the general
public, in a clear and understandable manner.

Draw conclusions and make recommendations: Finally, based on the results and
interpretation, conclusions are drawn, and recommendations are made. Conclusions may involve
summarizing the main findings, identifying patterns or trends, and making recommendations for
future actions or further research.
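
The collect, clean, analyze, and communicate steps can be sketched in a few lines. The raw entries, the valid range of 0 to 100, and the `clean` helper below are all hypothetical, chosen only to illustrate the sequence.

```python
import statistics

# Collect: hypothetical raw entries as typed into a form, some unusable.
raw = ["78", "85", "", "92", "n/a", "67", "850", "74"]

def clean(entries, low=0, high=100):
    """Keep entries that parse as numbers and fall inside the valid range."""
    kept = []
    for entry in entries:
        try:
            value = float(entry)
        except ValueError:
            continue              # missing or non-numeric entry
        if low <= value <= high:  # drop impossible scores such as 850
            kept.append(value)
    return kept

# Clean and preprocess.
scores = clean(raw)

# Analyze: simple descriptive statistics.
summary = {
    "n": len(scores),
    "mean": statistics.mean(scores),
    "sd": statistics.stdev(scores),
}

# Communicate: report the results in a readable form.
print(f"n = {summary['n']}, mean = {summary['mean']:.1f}, SD = {summary['sd']:.1f}")
```
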
5. Define, explain or discuss the following:
a. Kurtosis
b. Test
c. Evaluation
d. Ranking
e. Percentile

a. Kurtosis: Kurtosis is a statistical measure that describes the shape of a probability
distribution, specifically how peaked it is and how heavy its tails are compared with a normal
distribution. It is usually reported as excess kurtosis (kurtosis minus 3), so that a normal
distribution has a value of 0. Distributions can be classified into three types:

Mesokurtic: A mesokurtic distribution has a similar shape to a normal distribution, with
a moderate peak and moderate tails. Excess kurtosis is close to 0 in a mesokurtic
distribution.

Leptokurtic: A leptokurtic distribution has a higher peak and thicker tails compared to a
normal distribution. Excess kurtosis is positive in a leptokurtic distribution, indicating
heavier tails and a more peaked distribution.

Platykurtic: A platykurtic distribution has a flatter peak and lighter tails compared to a
normal distribution. Excess kurtosis is negative in a platykurtic distribution, indicating
lighter tails and a flatter distribution.
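
A minimal sketch of the calculation follows, using the moment-based definition (fourth central moment divided by the squared variance, minus 3). The two small datasets are hypothetical and chosen to give a clearly flat and a clearly peaked shape.

```python
def excess_kurtosis(data):
    """Fourth central moment over squared variance, minus 3 (normal curve = 0)."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n   # variance (second moment)
    m4 = sum((x - mean) ** 4 for x in data) / n   # fourth central moment
    return m4 / m2 ** 2 - 3

flat = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]     # evenly spread values
peaked = [5, 5, 5, 5, 5, 5, 5, 5, 0, 10]   # most values at the centre, two extremes

print("flat:", round(excess_kurtosis(flat), 3))      # negative: platykurtic
print("peaked:", round(excess_kurtosis(peaked), 3))  # positive: leptokurtic
```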

b. Test: In statistics, a test is a procedure used to assess a hypothesis or claim about a
population parameter based on a sample of data. Tests support inferential conclusions by
indicating whether the sample evidence is consistent with the claim. Common statistical tests
include t-tests, chi-square tests, and ANOVA (analysis of variance) F-tests; significance tests
are also applied to correlation and regression coefficients.

c. Evaluation: Evaluation refers to the process of assessing or determining the value, quality,
effectiveness, or performance of something. In various contexts, evaluation can be done for
different purposes, such as assessing the success of a program, measuring the performance of
an employee, evaluating the accuracy of a model or prediction, or determining the effectiveness
of a product or service. Evaluation often involves collecting data, analyzing and interpreting
results, and making judgments or recommendations based on the findings.

d. Ranking: Ranking is the process of arranging items or entities in a particular order based on a
set of criteria or attributes. Rankings are commonly used in various fields, such as sports,
business, education, and research. Ranking can be done using different methods, such as
numerical rankings, ordinal rankings, or subjective rankings. Rankings can be used to compare
and contrast items, establish priorities, make decisions, or assess performance.
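
As an illustration of numerical ranking, the sketch below ranks hypothetical scores so that the highest score gets rank 1 and tied scores share the average of the ranks they occupy. The function name `rank_scores` is chosen for illustration only.

```python
def rank_scores(scores):
    """Rank so the highest score gets 1; tied scores share the average rank."""
    ordered = sorted(scores, reverse=True)
    # index() gives the first position of a score; count() gives the size of the tie.
    return [ordered.index(s) + (ordered.count(s) + 1) / 2 for s in scores]

scores = [88, 92, 75, 92, 60]   # hypothetical scores
ranks = rank_scores(scores)

for score, rank in zip(scores, ranks):
    print(score, rank)
```

The two scores of 92 occupy ranks 1 and 2, so each receives rank 1.5.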

e. Percentile: A percentile is the value below which a given percentage of the observations in a
dataset fall. It is a way of expressing the relative standing or position of an observation
within a dataset. For example, the 75th percentile of a dataset is the value below which 75%
of the observations fall. Percentiles are commonly used in applications such as describing
income distributions, reporting test scores, or ranking performance. The median is the 50th
percentile, as it is the value that splits the dataset into two halves.
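
A brief sketch on hypothetical scores: `percentile_rank` (a name chosen for illustration) gives the percentage of observations below a score, and `statistics.quantiles` returns the 25th, 50th, and 75th percentiles. Different textbooks and software interpolate slightly differently, so quartile values may vary a little between methods.

```python
import statistics

def percentile_rank(data, value):
    """Percentage of observations that fall strictly below value."""
    return 100 * sum(1 for x in data if x < value) / len(data)

scores = [55, 60, 62, 68, 70, 73, 77, 81, 85, 90]  # hypothetical scores

pr = percentile_rank(scores, 81)               # 7 of the 10 scores are below 81
median = statistics.median(scores)
quartiles = statistics.quantiles(scores, n=4)  # 25th, 50th and 75th percentiles

print("percentile rank of 81:", pr)
print("median:", median)
print("quartiles:", quartiles)
```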

A. Frequency distribution of scores

Scores    f
90-94     2
85-89     2
80-84     4
75-79     8
70-74     6
65-69     11
60-64     9
55-59     7
50-54     5
45-49     0
40-44     2

B. Scores of ten pupils on Test X and Test Y (N = 10)

Pupil    Test X    Test Y
A        80        103
B        89        100
C        85        105
D        70        110
E        80        80
F        75        99
G        60        95
H        55        100
I        80        98
J        75        100

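Using the frequency distribution in (A) above, answer the following questions.

1. If $\sum$ means "the sum of", then $\sum f$ is the total number of observations: $N = \sum f = 56$.
2. The size of the class interval is $i = 5$.
3. The exact limits of the highest class or step are 89.5-94.5 (its score limits are 90-94).
4. The midpoint of the entire distribution is 67.
5. The crude mode of the entire distribution is 67, the midpoint of the modal class 65-69.
6. Compute the arithmetic mean, taking the assumed mean $AM = 67$ and $d$ as the deviation of each class from $AM$ in class-interval units:
   $M = AM + \left(\frac{\sum fd}{N}\right) i = 67 + \left(\frac{4}{56}\right)(5) = 67.357$
7. Compute the median, where $l$ is the exact lower limit of the median class, $F$ the cumulative frequency below it, and $f_m$ its frequency:
   $Mdn = l + \left(\frac{N/2 - F}{f_m}\right) i = 64.5 + \left(\frac{28 - 23}{11}\right)(5) = 66.773$
8. Compute the refined mode:
   $Mo = 3\,Mdn - 2M = 3(66.773) - 2(67.357) \approx 65.60$
9. Compute the standard deviation:
   $SD = i \sqrt{\frac{\sum fd^2}{N} - \left(\frac{\sum fd}{N}\right)^2} = 5 \sqrt{\frac{288}{56} - \left(\frac{4}{56}\right)^2} \approx 11.33$
10. Compute the quartile deviation:
   $Q = \frac{Q_3 - Q_1}{2} = \frac{75.75 - 59.5}{2} = 8.125$
11. Compute $P_{90}$ and $P_{10}$, where $p$ is the proportion of cases below the percentile and $f_p$ the frequency of the class containing it:
   $P_p = l + \left(\frac{pN - F}{f_p}\right) i$, giving $P_{10} = 53.1$ and $P_{90} = 82.5$
12. Give the useful index of skewness using the formula:
   $Sk = \frac{P_{90} + P_{10}}{2} - P_{50} = \frac{82.5 + 53.1}{2} - 66.773 \approx 1.03$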
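
The grouped-data computations in items 6 to 12 can be checked with a short script. This is a minimal sketch, assuming the frequency table in (A), an assumed mean of 67, and linear interpolation within classes for percentiles. The function and variable names are chosen for illustration only.

```python
import math

# (lowest score in class, frequency), lowest class first; copied from table (A).
classes = [(40, 2), (45, 0), (50, 5), (55, 7), (60, 9), (65, 11),
           (70, 6), (75, 8), (80, 4), (85, 2), (90, 2)]
width = 5                        # class interval i
n = sum(f for _, f in classes)   # N = sum of f

def percentile(p):
    """Score below which p percent of the cases fall (linear interpolation)."""
    target = p / 100 * n         # number of cases that must lie below the point
    below = 0                    # cumulative frequency F below the current class
    for lowest, f in classes:
        if f > 0 and below + f >= target:
            exact_lower = lowest - 0.5
            return exact_lower + (target - below) / f * width
        below += f
    raise ValueError("p must be between 0 and 100")

# Deviations d are counted in class-interval units from the assumed mean.
assumed_mean = 67
midpoints = [lowest + (width - 1) / 2 for lowest, _ in classes]
d = [(m - assumed_mean) / width for m in midpoints]
sum_fd = sum(f * di for (_, f), di in zip(classes, d))
sum_fd2 = sum(f * di ** 2 for (_, f), di in zip(classes, d))

mean = assumed_mean + sum_fd / n * width
sd = width * math.sqrt(sum_fd2 / n - (sum_fd / n) ** 2)
median = percentile(50)
mode = 3 * median - 2 * mean                   # refined mode
q = (percentile(75) - percentile(25)) / 2      # quartile deviation
p10, p90 = percentile(10), percentile(90)
sk = (p90 + p10) / 2 - median                  # percentile index of skewness

for name, value in [("N", n), ("mean", mean), ("median", median), ("mode", mode),
                    ("SD", sd), ("Q", q), ("P10", p10), ("P90", p90), ("Sk", sk)]:
    print(f"{name}: {value:.3f}")
```

Note that the standard deviation here divides by N, as in the formula for item 9; dividing by N - 1 would give a slightly larger value.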

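13. Compute the coefficient of correlation of Test X and Test Y by the Rank Difference Method from the
data in (B) above, where $D$ is the difference between a pupil's ranks on the two tests:
   $\rho = 1 - \frac{6 \sum D^2}{N(N^2 - 1)} = 1 - \frac{6(145.5)}{10(99)} \approx 0.118$ (tied scores are given the average of the ranks they occupy)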
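
The rank-difference computation can be sketched as follows, using the Test X and Test Y scores from (B). The sketch assumes that the highest score gets rank 1 and that tied scores share the average of the ranks they occupy; other tie conventions give slightly different results.

```python
test_x = [80, 89, 85, 70, 80, 75, 60, 55, 80, 75]       # pupils A to J
test_y = [103, 100, 105, 110, 80, 99, 95, 100, 98, 100]

def ranks_high_first(scores):
    """Rank so the highest score gets 1; tied scores share the average rank."""
    ordered = sorted(scores, reverse=True)
    return [ordered.index(s) + (ordered.count(s) + 1) / 2 for s in scores]

rank_x = ranks_high_first(test_x)
rank_y = ranks_high_first(test_y)

# D is each pupil's difference in rank between the two tests.
sum_d2 = sum((rx - ry) ** 2 for rx, ry in zip(rank_x, rank_y))
n = len(test_x)
rho = 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

print("sum of D squared:", sum_d2)
print("rho:", round(rho, 4))
```

A coefficient this close to zero indicates only a very weak positive relationship between the two sets of ranks.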

14. Compute the weighted mean of the responses to determine the extent of opinion of teachers towards
the use of technological innovations in the classroom.

Formula: $\bar{X} = \frac{\sum fw}{N} = \frac{75}{30} = 2.5$

Use the following weights:

Agree = 3    Undecided = 2    Disagree = 1

Data:

Response     f     w     fw
Agree        20    3     60
Undecided    5     2     10
Disagree     5     1     5
Total        30          75
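
A minimal sketch of the weighted-mean calculation, using the frequencies and weights given for this item:

```python
# response: (frequency f, weight w), as given in the data for this item
responses = {"Agree": (20, 3), "Undecided": (5, 2), "Disagree": (5, 1)}

n = sum(f for f, _ in responses.values())           # total respondents
sum_fw = sum(f * w for f, w in responses.values())  # sum of f times w
weighted_mean = sum_fw / n

print("N =", n, "sum fw =", sum_fw, "weighted mean =", weighted_mean)
```

Since the weights run from 1 to 3, the weighted mean must also lie between 1 and 3.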

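15. Using the data in no. 14, compute the $\chi^2$ test of significant difference. Interpret your answer.

Formula: $\chi^2 = \sum \frac{(O - E)^2}{E}$, where $O$ is the observed and $E$ the expected frequency. Assuming equal expected frequencies ($E = 30/3 = 10$ for each response):
   $\chi^2 = \frac{(20 - 10)^2}{10} + \frac{(5 - 10)^2}{10} + \frac{(5 - 10)^2}{10} = 15.0$
With $df = 2$, the computed value exceeds the critical value of 5.991 at the .05 level, so the responses differ significantly from an even split: the teachers lean clearly toward "Agree".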
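
The chi-square computation can be sketched as follows. The expected frequencies are an assumption: the sketch takes the null hypothesis that the 30 responses are spread evenly over the three categories. The critical value is computed from the identity that, for 2 degrees of freedom only, the chi-square critical value at level alpha equals -2 ln(alpha).

```python
import math

observed = [20, 5, 5]   # Agree, Undecided, Disagree (data of item 14)

# Assumption: equal expected frequencies under the null hypothesis.
expected = [sum(observed) / len(observed)] * len(observed)

chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1

# Valid for df = 2 only: the critical value at level alpha is -2 * ln(alpha).
alpha = 0.05
critical = -2 * math.log(alpha)

print("chi-square:", chi_square, "df:", df)
print("critical value at .05:", round(critical, 3))
print("significant:", chi_square > critical)
```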
