0% found this document useful (0 votes)
6 views

MTH 106_Tutorial Questions_1

This tutorial sheet from Sokoine University of Agriculture covers various topics in introductory statistics, including descriptive statistics, sampling techniques, and simple linear regression. It contains questions that require explanations of statistical terminology, methods of data collection, and the relevance of statistics in daily life, as well as practical exercises involving data analysis and interpretation. The document serves as a comprehensive guide for students to understand and apply statistical concepts.

Uploaded by

yusuphhanigomba7
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

MTH 106_Tutorial Questions_1

This tutorial sheet from Sokoine University of Agriculture covers various topics in introductory statistics, including descriptive statistics, sampling techniques, and simple linear regression. It contains questions that require explanations of statistical terminology, methods of data collection, and the relevance of statistics in daily life, as well as practical exercises involving data analysis and interpretation. The document serves as a comprehensive guide for students to understand and apply statistical concepts.

Uploaded by

yusuphhanigomba7
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Sokoine University of Agriculture

Department of Mathematics and Statistics


Tutorial Sheet 1
MTH 106: Introductory Statistics
Coverage for Topic 1: Descriptive statistics, Topic 2: Sampling and sampling techniques & Topic 3:
Simple linear regression and correlation
1) Explain the following terminology as they are used in statistics
a) Statistics (b) Population c) Sample d) Sampling e) A Statistic f) A parameter g) Descriptive Statistics
h) Statistical Inference i) Variable j) Data k) Quantitative data l) Qualitative data m) Primary Data n)
Secondary Data
2) Discuss the reasons for sampling.
3) Describe the probability sampling and non-probability sampling. For each category discus its advantages
and disadvantages.
4) With typical examples, explain the relevance of statistics in our daily life.
5) Discuss different methods of collecting primary data, state the advantages and disadvantages of each
method.
6) Explain the advantages of taking a sample in a survey instead of having a complete enumeration.
7) What is a stratified sampling? Describe it with concrete examples.
8) The following are the figures (in millions of USD) of Tanzanian trade with SADC for the period 1994-
1998. Discuss how the data given can be presented in a bar graph
Table: Tanzania Trade with SADC for the period 1994-1998
Year 1994 1995 1996 1997 1998
Exports 87.3 96.4 80.7 102.9 69
Imports 233.8 220.8 193.9 226.3 294.1
Source: Tanzania Revenue Authority-Customs Department.
9) What is the main purpose of data presentation? Discuss the important aspects in making data presentation.
10) With examples, discuss the four scales of measurements
11) Discus different ways of presenting statistical data graphically. For each type of the graph, state the
suitable type data and scale of measurement(s).
12) Outline the advantages of an arithmetic mean as a statistical average compared to other measures of
central tendency.
13) Discuss the Properties of a Good Measure of central Tendency and Dispersion.
14) The sample mean of five items of an observation is 4 and the variance is 5.2.If three of the items are 1, 2,
and 6 then find the other two.
15) The mean and standard deviation of a sample of size 10 were found to be 9.5 and 2.5 respectively. Later
on, an additional observation became available. This was 1.5 and was included in the original sample.
Find the mean and standard deviation of the 11 observations.
16) The mean and standard deviation of 10 observations were found to be 16.5 and 4.7. But later on it was
discovered that, the value 16 and 12 were wrongly entered instead of a 6 and 2. Find the correct value of
the mean and the standard deviation
17) Find the mean and standard deviation of the values 4, 5, 6 and 10
18) Coefficients of variation of two series are 75% and 90% and their standard deviations are 15 and 18
respectively. Find their means.
1
19) A surveyor had already identified about 2280 items from which a systematic Sampling would be made.
Given that the sampling interval was 10. Find;
(a) The sample size to be taken
(b) If the first item to be picked was the 9th in the list, what would be the last item in the list to be included in
the sample?
(c) Considering the order of the items in the list as the numerical values, find the mean and median of the
sample.
20) The first three moments about the value 4 of a variable are, 2, 9.7and –48. Find the 1st three moments
about the mean. Also compute the coefficient of skewness and comment on the nature of the distribution.
21) Compute the coefficients of skewness and kurtosis if the first four moments about the value 3 of a variable
are 1.7, 8.9, 39.5 and 211.7
22) An analysis of monthly wage paid to the workers in two firms; A and B belongs to the
same industry, gave the following results;
Statistics Firm A Firm B
No. of wage earners 986 548
Average monthly wages $52.5 $47.5
Variance of the distribution of wages 100 121
(a) Which firm pays larger amount of salary for its workers?
(b) Which firm is stable in terms of individual wages?? Justify your answer.
23) Two students were suspected to be examination cheaters and the following were random scores by the two
students in different tests of a certain subject.
Student I: 20, 70, 30, 90, 34, 99
Student II: 23, 18, 25, 24, 22, 26
Between these two students, whom one do you think is likely to be a good cheater than the other? Give
reasons for your answer.
24) The following are the scores in terms of G.P.A of 10 pre-entry female students at Sokoine University of
Agriculture in 2000/1 against their entry points (based on A-level) performance
Entry points 3 3.5 4 3.5 3.5 4 3 3.5 3 3.5
G.P.A 2.3 2.1 2.6 3.2 3.2 2.5 3.1 2.8 3.6 2.8
Plot a scatter diagram, compute the Pearson correlation coefficient and comment on the results from the
scatter plot and the correlation coefficient.
25) A group of 5 students took tests before and after training and obtained the following scores
Before X: 2 2.5 2.5 3 5
After Y: 2.5 3 3 5 5
Find the correlation coefficient r and comment on the nature of the relationship.
26) The table shows the temperature and the relative humidity at one place at regular intervals during one day:
Temp˚F, (X) 65 68 68 70 72 74 78 81 79 78 77 75
R h (%), (Y) 52 52 53 45 42 33 32 28 30 31 32 32
Temp-Temperature, Rh-Relative humidity
a) Plot a scatter diagram of the above data and comment.
b) Find the correlation coefficient and interpret.
27) A financial manager speculates about the relationship between family incomes and their allocation for
investment. The following table presents the result of the survey of 8 randomly selected families.
2
Annual income in ($) 8 12 9 24 13 37 10 20
% allocation for invest. 36 25 33 15 28 19 20 22
a) Calculate the coefficient of correlation (r) and interpret it.
b) Calculate the coefficient of determination (R2) and comment.
c) Develop a regression equation that describe this data and interpret. State whether it is a good or poor fit.
d) Estimate the percentage allocation for investment of family earning 80 annually.
28) A company keeps extensive record on its sales people on the premise that sales should increase with
experience. A random sample of eight new sales people produced the data on experience and sales
provided in the table below;
Month on job 2 4 8 12 1 5 9 7
Month sales(Tshs) 1.2 7 11.3 15 0.8 3.7 12 5.2
i) Plot a scatter diagram.
ii) Compute and interpret both the coefficient of correlation and that of determination.
iii) Set the regression line of sales on job experience.
iv) Estimate the level of sales in Tshs if the experience of the sales people is exactly 10 months.
29) What is regression line? With the help of an example illustrate how regression line helps in decision
making.
30) In trying to evaluate the effectiveness of its advertising campaign, a firm completed the following
information:
Year 1991 1992 1993 1994 1995 1996 1997 1998
Adv.Expenditure (Rs.) 12 15 15 23 24 38 42 48
Sales (lakh Rs.) 5.0 5.6 5.8 7.0 7.2 8.8 9.2 9.5
a) Compute the coefficient of correlation and that of determination.
b) Derive the regression equation of Sales on Expenditure.
c) Estimate the amount of sales if advertising expenditure is 60, Rs.
31) Explain the significance of r  1 and r  1 in correlation analysis.
32) Distinguish between correlation and regression analysis.
33) The age and blood pressure of 10 women are:
Age 56 42 36 47 49 42 60 72 63 55
Blood pressure 147 12 11 128 145 140 155 164 149 150
Find the correlation coefficient between blood pressure and age.
i. Determine the least square regression equation of blood pressure on age and state whether it is a good or
poor fit.
ii. Estimate the blood pressure of a woman whose age is 45 years.
34) The following data relate to the prices and supplies of a commodity during a period of eight years:
Price(Rs./kg) 10 12 18 16 15 19 18 17
Supply(kg) 30 35 45 44 42 48 47 46
i. Calculate the coefficient of correlation between the two series.
ii. Derive the regression of Supply on Demand and state whether it is good or poor fit.
iii. Estimate the amount supplied if the price is 100 Rs per Kg.
35) Find the regression equation from the following data:
Age of husband (X) 18 19 20 21 22 23 24 25 26 27
Age of Wife (Y) 17 17 18 18 19 19 19 20 21 22

3
Also calculate the correlation coefficient between the ages of husbands and wives.
36) In order to find the correlation coefficient, two variables X and Y from 12 pairs of observation are
n n n n
considered, the following calculations were made:  X i  30 ,
i 1
Yi  5 ,
i 1
 X i2  670 ,
i 1
Y
i 1
i
2
 285 ,

X Y
i 1
i i  334 , On subsequent verification it was found that the pair (X=11,Y=4) was copied wrongly, the

correct value being X=10,Y=14. Find the correct value of r.


37) Obtain the equations of the two lines of regression for the following data:
X 43 44 46 40 44 42 45 42 38 40 42 57
Y 29 31 19 18 19 27 27 29 41 30 26 10
Hence, obtain the correlation coefficient between X and Y, for what values of X, Y=49?
38) A computer, while calculating the correlation coefficient between two variables X and Y, obtained the
n n n n n
following constants; n  30 , X
i 1
i
2
 600 ,  X i  120 ,
i 1
Y
i 1
i  90 ,  Yi 2  250 ,
i 1
X Y
i 1
i i  356 .It was,

however, later discovered at the time of checking that it had copied down two pairs of observations as
X Y
8 10
10 7
While the correct values were:
X Y
8 12
10 8
Obtain the correct value of the correlation coefficient between X and Y.

You might also like