Results

The document contains analysis of data on students' socioeconomic status, type of program enrolled in, and science scores. A contingency table shows relationships between socioeconomic status and program. Chi-square tests show an association. Summary statistics of science scores are presented, along with a histogram, boxplot, and tests of the mean. Regression analysis identifies reading, writing and math as significant predictors of science scores. Diagnostic checks find that residuals are normally distributed.

Uploaded by

macasaquit17

Group 5

2023-12-18

3 Results

Contingency Table

Table 1: Socio-economic status of student’s family vs Type of Program

          academic  general  vocational  Sum

high            22        2           2   26
low             10       11           5   26
middle          22       11          15   48
Sum             54       24          22  100

The first column lists the socio-economic status of the student’s family, and the first row lists the type
of program. The interior cells give the number of students in each combination of status and program. The
last column and the last row give the row and column totals.

Chi-square test

           TestStatistic  DegreesOfFreedom     PValue

X-squared       17.18055                 4  0.0017829

Interpretation: Since the p-value (0.0018) is less than 0.05, the null hypothesis of independence is rejected,
suggesting that there is an association between the socio-economic status of the student’s family and the
type of program. The two variables are not independent of each other.
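
The statistic in the chi-square table can be reproduced from the counts in Table 1. The sketch below is an illustration, not the report's own code; the object names (`observed`, `expected`, and so on) are assumptions introduced here.

```r
# Observed counts from Table 1 (rows: SES, columns: type of program).
observed <- matrix(
  c(22,  2,  2,
    10, 11,  5,
    22, 11, 15),
  nrow = 3, byrow = TRUE,
  dimnames = list(ses  = c("high", "low", "middle"),
                  prog = c("academic", "general", "vocational"))
)

# Expected counts under independence: row total * column total / grand total.
expected <- outer(rowSums(observed), colSums(observed)) / sum(observed)

# Pearson chi-square statistic, degrees of freedom, and upper-tail p-value.
x_squared <- sum((observed - expected)^2 / expected)
df        <- (nrow(observed) - 1) * (ncol(observed) - 1)
p_value   <- pchisq(x_squared, df, lower.tail = FALSE)

print(c(X_squared = x_squared, df = df, p_value = p_value))

# The built-in test should agree with the hand calculation.
builtin <- chisq.test(observed)
print(builtin)
```

Computing the expected counts explicitly also makes it easy to check that every cell is above 5, the usual condition for relying on the chi-square approximation.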

Summary Statistics

Table 2: Summary Statistics for Science

x
nbr.val 100
nbr.null 0
nbr.na 0
min 29
max 69
range 40
sum 5242
median 53
mean 52
SE.mean 1
CI.mean.0.95 2
var 85
std.dev 9
coef.var 0
skewness 0
skew.2SE -1
kurtosis -1
kurt.2SE -1
normtest.W 1
normtest.p 0

The summary statistics give a mean science score of 52. The range, the gap between the highest score (69)
and the lowest score (29), is 40, and the median is 53. The skewness is approximately 0, indicating that the
data are fairly symmetrical. The kurtosis is about -1, indicating a light-tailed (platykurtic) distribution.
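
Several entries of Table 2 follow from the others, which gives a quick consistency check. The sketch below uses only the rounded values printed in the table, so its results are approximate; the variable names are assumptions introduced for illustration.

```r
# Values as printed in Table 2 (the table rounds them to whole numbers).
n        <- 100
total    <- 5242
minimum  <- 29
maximum  <- 69
variance <- 85   # rounded, so everything derived from it is approximate

score_range <- maximum - minimum                   # range
mean_score  <- total / n                           # mean
std_dev     <- sqrt(variance)                      # standard deviation
se_mean     <- std_dev / sqrt(n)                   # standard error of the mean
ci_half     <- qt(0.975, df = n - 1) * se_mean     # half-width of the 95% CI
coef_var    <- std_dev / mean_score                # coefficient of variation

print(round(c(range = score_range, mean = mean_score, std.dev = std_dev,
              SE.mean = se_mean, CI.mean.0.95 = ci_half,
              coef.var = coef_var), 2))
```

Rounded to whole numbers, these agree with the table: a range of 40, a standard error of 1, a confidence half-width of 2, and a coefficient of variation of 0.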

Histogram

Figure 1 Histogram of Science

[Figure: Histogram and density curve for science; x-axis: Science (30 to 70), y-axis: Density (0.00 to 0.06)]

Interpretation: The Histogram shows a fairly symmetrical distribution for science with a mean score of
52. The density curve aids with the interpretation of the distribution.

Test and boxplot

One Sample z-test

data: User input summarized values for x

z = 0, p-value = 0.5
alternative hypothesis: true mean is greater than 52
95 percent confidence interval:
51.47985 Inf
sample estimates:
mean of x
52

[Figure: Boxplot of science; axis from 30 to 70]

Interpretation: Since the p-value (0.5) is greater than 0.05, we do not reject the null hypothesis. Therefore,
there is not sufficient evidence to conclude that the true mean is greater than 52.
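
The z statistic and p-value above can be traced by hand. The report does not print the standard deviation that was supplied to the test, so this sketch recovers the standard error from the reported lower confidence bound; that step, and the variable names, are assumptions made for illustration.

```r
# Values taken from the One Sample z-test output.
x_bar       <- 52         # sample estimate of the mean
mu0         <- 52         # hypothesised mean
lower_bound <- 51.47985   # lower limit of the one-sided 95% interval

# For a one-sided 95% interval: lower = x_bar - qnorm(0.95) * SE.
se <- (x_bar - lower_bound) / qnorm(0.95)

# Test statistic and upper-tail p-value (alternative: mean greater than mu0).
z       <- (x_bar - mu0) / se
p_value <- pnorm(z, lower.tail = FALSE)

print(c(se = se, z = z, p_value = p_value))
```

Because the sample estimate equals the hypothesised mean, z is exactly 0 and the one-sided p-value is exactly 0.5, whatever the standard error is.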

One-way ANOVA

One-way analysis of means

data: science and prog

F = 2.3554, num df = 2, denom df = 97, p-value = 0.1003

Interpretation: Since the p-value (0.1003) is greater than 0.05, the null hypothesis is not rejected. There
is not sufficient evidence of a difference in mean science scores across programs.
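
The p-value in the one-way output follows directly from the F statistic and its degrees of freedom. A minimal check, with variable names chosen here for illustration:

```r
# Values from the one-way analysis of means output.
f_stat <- 2.3554
df1    <- 2    # numerator degrees of freedom (3 programs - 1)
df2    <- 97   # denominator degrees of freedom (100 students - 3 programs)

# Upper-tail probability of the F distribution.
p_value <- pf(f_stat, df1, df2, lower.tail = FALSE)

alpha       <- 0.05
reject_null <- p_value < alpha

print(c(p_value = p_value, reject_null = reject_null))
```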

Regression Analysis

Call:
lm(formula = science ~ ., data = hsb)

Residuals:
Min 1Q Median 3Q Max
-13.3612 -2.5502 -0.0756 3.0082 11.3224

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) -1.21537 5.94419 -0.204 0.838474
sexmale 2.55652 1.28261 1.993 0.049409 *
raceasian -0.43339 3.10646 -0.140 0.889372
racehispanic -2.53456 2.47415 -1.024 0.308512
racewhite 2.34978 1.80717 1.300 0.196988
seslow -0.69219 2.01909 -0.343 0.732570
sesmiddle -1.13038 1.59743 -0.708 0.481092
schtyppublic 1.25630 1.56326 0.804 0.423823
proggeneral 4.02147 1.63033 2.467 0.015622 *
progvocational 4.10986 1.89177 2.172 0.032570 *
read 0.22523 0.08334 2.703 0.008290 **
write 0.37379 0.09783 3.821 0.000251 ***
math 0.31383 0.09408 3.336 0.001257 **
socst 0.01062 0.07893 0.135 0.893269
---
Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

Residual standard error: 5.738 on 86 degrees of freedom

Multiple R-squared: 0.6647, Adjusted R-squared: 0.614
F-statistic: 13.11 on 13 and 86 DF, p-value: 2.209e-15

Analysis of Variance Table

Response: science
Df Sum Sq Mean Sq F value Pr(>F)
sex 1 1.80 1.80 0.0546 0.815864
race 3 1497.15 499.05 15.1574 5.309e-08 ***
ses 2 382.06 191.03 5.8020 0.004328 **
schtyp 1 6.26 6.26 0.1902 0.663826
prog 2 327.43 163.72 4.9724 0.009047 **
read 1 2074.58 2074.58 63.0101 7.063e-12 ***
write 1 956.71 956.71 29.0578 6.064e-07 ***
math 1 366.25 366.25 11.1240 0.001259 **
socst 1 0.60 0.60 0.0181 0.893269
Residuals 86 2831.51 32.92
---
Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

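Interpretation: The multiple R-squared is 0.6647, so the model explains about 66.47% of the variation in
science scores (adjusted R-squared 0.614). The model is statistically significant since the p-value of the
overall F-test (2.209e-15) is less than 0.05. In the coefficient table, read, write, and math are significant
predictors of science scores, along with sexmale, proggeneral, and progvocational; all of these have p-values
below 0.05. The race, ses, schtyp, and socst coefficients have p-values greater than 0.05. In the sequential
ANOVA table, sex, schtyp, and socst have p-values greater than 0.05, while race, ses, prog, read, write, and
math have p-values below 0.05.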
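
The fit statistics printed under the coefficient table can be rebuilt from the sums of squares in the ANOVA table. The sketch below does that arithmetic; the variable names are assumptions introduced here, and the inputs are the rounded values from the output.

```r
# Sequential sums of squares and degrees of freedom from the ANOVA table.
ss_terms <- c(sex = 1.80, race = 1497.15, ses = 382.06, schtyp = 6.26,
              prog = 327.43, read = 2074.58, write = 956.71,
              math = 366.25, socst = 0.60)
df_terms <- c(sex = 1, race = 3, ses = 2, schtyp = 1, prog = 2,
              read = 1, write = 1, math = 1, socst = 1)
ss_resid <- 2831.51
df_resid <- 86

ss_model <- sum(ss_terms)
df_model <- sum(df_terms)
ss_total <- ss_model + ss_resid
df_total <- df_model + df_resid

r_squared     <- ss_model / ss_total
adj_r_squared <- 1 - (ss_resid / df_resid) / (ss_total / df_total)
f_stat        <- (ss_model / df_model) / (ss_resid / df_resid)
p_value       <- pf(f_stat, df_model, df_resid, lower.tail = FALSE)
resid_se      <- sqrt(ss_resid / df_resid)

print(round(c(R2 = r_squared, adjR2 = adj_r_squared,
              F = f_stat, RSE = resid_se), 4))
print(p_value)
```

These reproduce the printed multiple R-squared (0.6647), adjusted R-squared (0.614), F-statistic (13.11 on 13 and 86 DF), and residual standard error (5.738).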

Best Subsets Regression
-------------------------------------------------------------
Model Index Predictors
-------------------------------------------------------------
1 read
2 read math
3 race write math
4 prog read write math
5 race prog read write math
6 sex race prog read write math
7 sex race schtyp prog read write math
8 sex race ses schtyp prog read write math
9 sex race ses schtyp prog read write math socst
-------------------------------------------------------------

Subsets Regression Summary

--------------------------------------------------------------------------------------------------------
Model R-Square Adj. R-Square Pred R-Square C(p) AIC SBIC SBC MSEP
--------------------------------------------------------------------------------------------------------
1 0.4233 0.4174 0.4003 51.9057 678.3502 393.0571 686.1657 4969.1385
2 0.5189 0.5090 0.486 29.3781 662.2180 377.2216 672.6387 4188.2686
3 0.5689 0.5460 0.5082 18.5566 657.2454 368.6756 675.4816 3792.5282
4 0.6134 0.5928 0.5627 9.1628 646.3690 360.6656 664.6052 3437.8664
5 0.6436 0.6123 0.5706 3.4087 644.2267 355.5492 670.2784 3203.1142
6 0.6591 0.6250 0.5827 1.4346 641.7817 353.9647 670.4386 3097.1569
7 0.6623 0.6244 0.5825 2.6052 642.8287 355.4678 674.0907 3101.4908
8 0.6646 0.6184 0.5647 4.0181 646.1485 357.2337 682.6208 3114.6929
9 0.6647 0.6140 0.5568 6.0000 648.1274 359.5425 687.2050 3149.0264
--------------------------------------------------------------------------------------------------------
AIC: Akaike Information Criteria
SBIC: Sawa’s Bayesian Information Criteria
SBC: Schwarz Bayesian Criteria
MSEP: Estimated error of prediction, assuming multivariate normality
FPE: Final Prediction Error
HSP: Hocking’s Sp
APC: Amemiya Prediction Criteria

Diagnostic Checking

Normality

Exact two-sample Kolmogorov-Smirnov test

data: residuals(mmodel) and pnorm(mean = 0, sd = 1, 79)

D = 0.58, p-value = 0.8515
alternative hypothesis: two-sided

Interpretation: Since the p-value (0.8515) is greater than 0.05, we do not reject the null hypothesis. Therefore,
we find no evidence against the residuals/errors being normally distributed.
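
The call shown in the output passes the value of `pnorm(...)` as a second sample, which appears to be why the test is reported as a two-sample test. A one-sample check against the normal distribution is usually written with the name of the distribution function instead. The sketch below shows that form on simulated values, since the actual residuals are not included in the report; `res` is a hypothetical stand-in for `residuals(mmodel)`, and the standard deviation 5.738 is the residual standard error from the regression output.

```r
set.seed(123)

# Hypothetical stand-in for residuals(mmodel): simulated, NOT the report's data.
res <- rnorm(100, mean = 0, sd = 5.738)

# One-sample Kolmogorov-Smirnov test against N(0, 5.738^2).
ks_result <- ks.test(res, "pnorm", mean = 0, sd = 5.738)

# Shapiro-Wilk test as a second, commonly used normality check.
sw_result <- shapiro.test(res)

print(ks_result)
print(sw_result)
```

With the real residuals in place of `res`, a p-value above 0.05 from either test would mean no evidence against normality.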
Homoscedasticity

studentized Breusch-Pagan test

data: mmodel
BP = 23.364, df = 13, p-value = 0.0375

Goldfeld-Quandt test

data: mmodel
GQ = 0.53089, df1 = 36, df2 = 36, p-value = 0.9693
alternative hypothesis: variance increases from segment 1 to 2

Interpretation: The studentized Breusch-Pagan test gives a p-value (0.0375) that is less than 0.05, so the
null hypothesis that the error variance is the same across all values of the independent variables is rejected,
which suggests that heteroscedasticity is present. The Goldfeld-Quandt test, which checks whether the variance
of the residuals increases from the first segment to the second, gives a p-value (0.9693) greater than 0.05,
so it finds no evidence of increasing variance. The two tests disagree, so homoscedasticity cannot be taken
as established for this model.
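
Both p-values can be recomputed from the printed statistics: the Breusch-Pagan statistic is referred to a chi-square distribution and the Goldfeld-Quandt statistic to an F distribution. A short check, with variable names chosen here for illustration:

```r
# Studentized Breusch-Pagan test: BP statistic on 13 degrees of freedom.
bp_stat <- 23.364
bp_df   <- 13
p_bp    <- pchisq(bp_stat, bp_df, lower.tail = FALSE)

# Goldfeld-Quandt test: F statistic on (36, 36) degrees of freedom,
# upper tail because the alternative is that the variance increases.
gq_stat <- 0.53089
p_gq    <- pf(gq_stat, 36, 36, lower.tail = FALSE)

alpha <- 0.05
print(c(p_bp = p_bp, reject_bp = p_bp < alpha))
print(c(p_gq = p_gq, reject_gq = p_gq < alpha))
```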
Multicollinearity

GVIF Df GVIF^(1/(2*Df))
sex 1.241147 1 1.114068
race 1.425362 3 1.060851
ses 1.829725 2 1.163045
schtyp 1.142305 1 1.068787
prog 1.991833 2 1.187991
read 2.206004 1 1.485262
write 2.817441 1 1.678523
math 2.102655 1 1.450053
socst 2.214519 1 1.488126

Interpretation: None of the (generalized) variance inflation factors is greater than 5. Therefore, we conclude
that there is no multicollinearity issue in this model.
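
The last column of the table is the GVIF raised to the power 1/(2*Df), which puts terms with several degrees of freedom (race, ses, prog) on a scale comparable to single-coefficient terms. The sketch below recomputes that column; squaring it before comparing against the cut-off of 5 is one common convention and is an assumption here, not something the report states.

```r
# GVIF and degrees of freedom as printed in the table.
gvif <- c(sex = 1.241147, race = 1.425362, ses = 1.829725,
          schtyp = 1.142305, prog = 1.991833, read = 2.206004,
          write = 2.817441, math = 2.102655, socst = 2.214519)
df   <- c(sex = 1, race = 3, ses = 2, schtyp = 1, prog = 2,
          read = 1, write = 1, math = 1, socst = 1)

# Adjusted value: GVIF^(1 / (2 * Df)).
adjusted <- gvif^(1 / (2 * df))

# Third column of the table, for comparison.
reported <- c(1.114068, 1.060851, 1.163045, 1.068787, 1.187991,
              1.485262, 1.678523, 1.450053, 1.488126)

print(round(cbind(GVIF = gvif, Df = df, adjusted = adjusted), 6))

# Largest squared adjusted value, comparable to an ordinary VIF.
print(max(adjusted^2))
```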
Independence

Durbin-Watson test

data: mmodel
DW = 1.9923, p-value = 0.5011
alternative hypothesis: true autocorrelation is greater than 0

Interpretation: Since the p-value (0.5011) is greater than 0.05, we do not reject the null hypothesis.
Therefore, there is no evidence that the errors are autocorrelated.
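
A Durbin-Watson statistic close to 2 corresponds to a first-order autocorrelation close to 0, through the usual approximation DW ≈ 2(1 - r). A one-line illustration with the reported value (the name `rho_hat` is introduced here):

```r
# Durbin-Watson statistic from the output.
dw <- 1.9923

# Approximate lag-1 autocorrelation of the residuals: DW is roughly 2 * (1 - r).
rho_hat <- 1 - dw / 2

print(rho_hat)
```

The implied autocorrelation is about 0.004, which is in line with not rejecting the null hypothesis.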

[Figure: four-panel diagnostic plots (Residuals vs Fitted, Q-Q Residuals, Scale-Location, Residuals vs
Leverage with Cook's distance contours); observations 63, 108, and 113 are labelled]

Diagnostic Plots

Logistic Regression

chr [1:100] "female" "male" "female" "male" "male" "female" "male" "male" ...

Call:
glm(formula = sex ~ ., family = binomial, data = hsb)

Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) 3.217629 2.415980 1.332 0.18292
raceasian -0.009584 1.396595 -0.007 0.99452
racehispanic -0.421459 1.033880 -0.408 0.68353
racewhite 0.043967 0.753661 0.058 0.95348
seslow -1.408884 0.902394 -1.561 0.11846
sesmiddle -0.080766 0.645067 -0.125 0.90036
schtyppublic -0.168951 0.604326 -0.280 0.77981
proggeneral 0.966928 0.690848 1.400 0.16163
progvocational 0.032704 0.757363 0.043 0.96556
read -0.017867 0.034153 -0.523 0.60087
write -0.157941 0.048976 -3.225 0.00126 **
math 0.031223 0.042956 0.727 0.46732
socst -0.011069 0.034275 -0.323 0.74673
science 0.099075 0.046535 2.129 0.03325 *
---
Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1

(Dispersion parameter for binomial family taken to be 1)

Null deviance: 137.99 on 99 degrees of freedom

Residual deviance: 110.24 on 86 degrees of freedom
AIC: 138.24

Number of Fisher Scoring iterations: 5

Interpretation: The output shows a logistic regression for the sample data where sex is the response
variable and the rest are predictors. Only write (p = 0.00126) and science (p = 0.03325) have p-values less
than 0.05, which makes these variables significant in the model. The p-values for seslow (0.11846) and
proggeneral (0.16163) are greater than 0.05.
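
The two significant coefficients are easier to read as odds ratios, and the deviances give an overall test of the model. The sketch below works only from the printed output. It assumes that female is the reference level of sex (as the `sexmale` term in the linear model suggests), so the odds are those of being male; the variable names are introduced for illustration.

```r
# Significant coefficients from the logistic regression output.
coefs <- c(write = -0.157941, science = 0.099075)

# Odds ratio per one-point increase in each score, other predictors held fixed.
odds_ratios <- exp(coefs)

# Likelihood-ratio test of the full model against the intercept-only model.
null_dev  <- 137.99
resid_dev <- 110.24
df_diff   <- 99 - 86
lr_stat   <- null_dev - resid_dev
p_lr      <- pchisq(lr_stat, df_diff, lower.tail = FALSE)

# AIC for a binary response: residual deviance + 2 * number of coefficients.
n_params <- 14   # intercept plus 13 slopes
aic      <- resid_dev + 2 * n_params

print(round(odds_ratios, 4))
print(c(LR = lr_stat, df = df_diff, p = p_lr))
print(aic)
```

Under that assumption, each additional point in write multiplies the odds of being male by about 0.85, and each additional point in science multiplies them by about 1.10. The AIC computed this way matches the printed value of 138.24.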

Diagnostic Checking for logistic regression

[Figure: Smoothed scatter plots of each continuous predictor (math, read, science, socst, write) against the
logit; x-axis: logit (about -2.5 to 5.0), y-axis: predictor.value]

Interpretation: The smoothed scatter plots show that not all of the continuous predictors are linearly
related to the logit. Some of them might therefore need a transformation.

[Figure: Cook's distance by observation number for glm(sex ~ .); x-axis: Obs. number (0 to 100), y-axis:
Cook's distance (0.00 to 0.20); labelled observations: 188, 84, 63]

Interpretation: This plot shows the Cook’s distance value for each observation in the model. The observations
with the largest values are the ones labelled in the plot: 188, 84, and 63.
