AI Regression Questions
AI Regression Questions
your answer.
5. A straight line is fit for a given bivariate sample data of size 1000. If the mean of the
residue (e) is found to be 36 and the variance of the residue if found to be 324, write
7. Define Regression of Y on X.
8. If the covariance of two variables X and Y, from a sample of size 100, is found to be
36 from a sample, and their variances are 25 and 81 respectively, what is their
correlation coefficient?
9. If a random variable X has a mean of X , then what is the mean of the Random
Variable Y = X - X ?
10. A regression equation y=m*x+c is fit to a sample data set. A statistic MSR/MSE is
11. A model was fit for a data with two independent variables X 1 and X2 and one
ANOVA was performed on the model and the following results were obtained.
Source df SS MS F p
Regressio 1350.75 675.378
2
n 7 4 23.4581 1.29E-
489.443 28.7907 6 05
Residual 17
1 7
Total 19 1840.2
Explain each value in the ANOVA table and write your conclusion on the goodness of
13. In a linear regression of a bivariate data y=ax+b, which of the following statements
are true.
y x1 x2
7 2 1
12 3 2
5 1 1
16 2 4
23 4 5
20 7 2
13 2 3
6 3 0
5 1 1
12 0 4
15. A regression equation y=m*x+c is fit to a sample data set. A statistic c/SE c is
17. If the correlation coefficient between two variables in a sample of 10 is 0.58, what is
19. Under what conditions do you say two variables are independent?
20. If X and Y are continuous Random Variables, prove that E[X+Y] = E[X]+E[Y]
21. If the covariance of two variables X and Y, from a sample of size 100, is found to be
36 from a sample, and their variances are 25 and 81 respectively, what is their
correlation coefficient?
22. If a random variable X has a mean of X , then what is the mean of the Random
Variable Y = X - X ?
23. The following table has the data of the total costs and the number of units produced
by a company.
Total Cost Y 25 11 34 23 32
Units Produced X 5 2 8 4 6
25. What is the statistic used to find the significant independent variables in case of
multiple regression?
28. If a random variable X has a mean of X̅ , then what is the mean of the Random
Variable Y = X -X̅ ?
30. The following table gives a sample representing the total costs and the number of
X Y
2 14
5 55
8 130
9 175
13 350
A quadratic model is fit for this sample data and the regression equation was found
to be 𝑦 = 2𝑥 2 + 5 Find the F value for this model and also estimate the p value at
95% confidence
32. If X and Y are continuous Random Variables, prove that E[X+Y] = E[X]+E[Y]
33. If the covariance of two variables X and Y, from a sample of size 100, is found to be
36 from a sample, and their variances are 25 and 81 respectively, what is their
34. A straight line is fit for a given bivariate sample data of size 1000. If the mean of the
residue (ε) is found to be 36 and the variance of the residue if found to be 324, write
parameter estimation.
x y
1 8
5 17
7 25
13 41
15 51
16 52
18 57
20 65
36. If r is the mean square error, express r in terms of bias and variance.
37. 36 observations were made of two variables. Based on this sample, the correlation
38. Find mean and standard deviation of the price of banana if the at various locations of
39. The standard deviation of two variables, based on 19 observations, is 5.34 and 5.4.
The covariance between these two variables, based on the same observations, is -
40. Three models were proposed describing the relation between two variables. During
the analysis of variance, the F statistic was found to be 0.58, 0.83 and 0.60 for these
d. X decreases as Y increases
43. What is the statistic used to find the significant independent variables in case of
multiple regression?
44. In a linear regression of a bivariate data y=ax+b, which of the following statements
are true.
45. If x is defined as
∑ xi then x is an unbiased estimate of the population mean. True
i=1
,
n
or False?
47. In a bivariate data, write the equations for determining the value of the regression
48. 100 observations were made of two variables. Based on this sample, the correlation
a. mutually exclusive,
b. Independent.
50. The scatter plot of a set of observed values is shown below. Write your observation
9000
8000
7000
6000
5000
4000
3000
2000
1000
0
0 5 10 15 20 25
51. A set of observed values of pulp production in metric tons and world pulp price in
52. If a random variable X has a mean of X , then what is the mean of the Random
Variable Y = X - X ?
53. Given a sample of bivariate data, list the steps to be followed to build a prediction
model.
54. The following table has the data of the total costs and the number of units produced
by a company.
Total Cost Y 25 11 34 23 32
Units Produced 5 2 8 4 6
X
a. Calculate the Correlation Coefficient rxy. Is it significant?