0% found this document useful (0 votes)

13 views

2015 No Memo Test 3

Memo

Uploaded by

alutakaunda

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

2015 No Memo Test 3

Memo

Uploaded by

alutakaunda

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

UNIVERSITY OF CAPE TOWN

STATISTICAL SCIENCES DEPARTMENT

STA3022F: RESEARCH AND SURVEY STATISTICS

CLASS TEST 2

06 MAY 2015

TIME: 1 ½ hours Total Marks: 50

Answer ALL questions. (4 pages – 4 questions)
Marks are allocated for intermediate calculations.

QUESTION 1 [5 marks]
(a) What is test-retest reliability? (1)
(b) What is internal consistency reliability? (1)
(c) How do you measure internal consistency? Provide three formula’s or explanations, not just
the names of the methods. (3)

QUESTION 2 [16 marks]

(a) In the painters data set in the R package MASS the subjective assessment, on a 0 to 20 integer
scale, of 54 classical painters is given. The painters were assessed on four characteristics:
composition, drawing, colour and expression. Calculate the Euclidean distance between the
following two samples:
> painters[1:2,]
Composition Drawing Colour Expression
Da Udine 10 8 16 3
Da Vinci 15 16 4 14
(3)
(b) Why is there no need to scale the data set before calculating the Euclidean distance? (1)
(c) Define 𝑠𝑡𝑟𝑒𝑠𝑠 and explain how it is used. (5)
(d) Explain step by step how to perform hierarchical clustering with the centroid method. (7)

QUESTION 3 [17 marks]

The current study aims to identify what factors make some people believe that they are lucky and others
believe that they are unlucky. The study is based on a survey of 62 STA3022F students who answered the
following questions in an online questionnaire (possible responses for categorical variables are given in
brackets).

1. Do you consider yourself to be a lucky person? (Yes/No)

2. What is your age?

1
3. What is your gender? (1 = Male; 0 = Female)
4. Have you ever won a competition before? (1 = Yes; 0 = No)
5. How many economic courses have you completed?

A discriminant analysis model has been constructed with the aim of identify which, if any, of the four
independent variables are able to distinguish between the two groups (groups labelled as “Yes”, and “No”).
Questions:

a) Write down the discriminant function. (2)

b) Can the discriminant model able to significantly discriminate between the two groups? Provide
statistical evidence at the 5% level to support your answer. Clearly state all null and alternate
hypotheses. (4)

c) Use the cut-off value rule to classify Respondent 4. Clearly indicate the classification rule. Is this a
correct classification? (5.5)

d) Compare the overall hit rate with two chance criteria and use these comparisons to evaluate the
overall quality of the discriminant model (4)

e) Evaluate whether the discriminant model is better at predicting some groups than others. (Hint:
Calculate the correct classification rate for each group) (1.5)

Data for the first 15 respondents

ID Q1 Q2 Q3 Q4 Q5
1 Yes 21 Female No 3
2 Yes 21 Male Yes 3
3 No 21 Female No 3
4 Yes 20 Male No 2
5 No 20 Male No 2
6 Yes 20 Female No 2
7 No 21 Male Yes 2
8 No 21 Female No 3
9 No 19 Male No 2
10 Yes 21 Female Yes 4
11 Yes 21 Male Yes 2
12 No 20 Male No 2
13 No 20 Male Yes 2
14 No 20 Male No 2
15 Yes 20 Female Yes 2

> fit <- lda(Q1 ~ Q2+Q3d+Q4d+Q5,data=luck, method="moment")

> fit
Call:
lda(Q1 ~ Q2+Q3d+Q4d+Q5,data=luck,method = "moment")

Prior probabilities of groups:

Yes No
0.4888889 0.5111111

Group means:
Q2 Q3d Q4d Q5
Yes 20.75 0.428 0.2857143 3.20000
No 20.20 0.750 0.3636364 2.52273

2
Coefficients of linear discriminants:
LD1
Constant 0.254
Q2 -2.948
Q3d 0.085
Q4d 1.383
Q5 -0.011

Classification Table
Predicted Groups
yes no Total
Observed yes 28 6 34
Groups no 4 24 28

Total 32 30 62

> centroidYes
[1] -1.0242

> centroidNo
[1] 1.0974

QUESTION 4 [12 marks]

In a 2001 paper titled “Variable precision rough set theory and data discretisation: an application to corporate
failure prediction”, Beynon and Peel use a number of financial performance ratios to build a model that is
able to discriminate between firms in the UK that fail and those that do not fail. Data of 60 randomly chosen
firms was collected on the following set of financial variables.

SALES Sales in 1000's of pounds

ROCS profit before tax/capital employed
FFTL funds flow/total liabilities
GEAR (current liabilities + long-term debt)/total assets
CLTA current liabilities/total assets
CACL current assets/current liabilities
QACL (current assets - stock)/current liabilities
WCTA (current assets - current liabilities)/total assets
AGE number of years company has been operating
CHAUD coded 1 is company changed auditor in previous 3 years, 0 otherwise
BIG6 coded 1 if the company is audited by a big 6 auditor, 0 otherwise
FAIL coded 1 if company failed, 0 otherwise

Refer to the attached Classification tree and answer the following questions.

Questions:

a) Define a set of decision rules indicating the circumstances under which firms can be predicting as
failing or not failing. (3)

b) Which group would Firm 2 be classified to? Is this a correct classification? (2)

3
c) Calculate the diversity index for node 1 (Root Node) and comment why CALC variable is chosen as
a splitting variable? (2)

d) Briefly explain the differences between the Bonsai and Pruning techniques. (1)

e) Construct the classification matrix. (4)

Data for the first 5 firms only

Firm SALES ROCS FFTL GEAR CLTA CACL QACL WCTA AGE CHAUD BIG6 FAIL
1 6762 7.54 0.15 0.62 0.62 1.55 0.74 0.34 74 0 0 0
2 16149 -1.07 0.03 1.22 1.22 0.62 0.32 -0.46 29 0 1 0
3 8086 15.20 0.62 0.33 0.33 2.36 1.75 0.45 51 0 1 0
4 7646 31.22 0.63 0.52 0.48 1.64 1.49 0.31 25 0 0 0
5 36067 10.96 0.35 0.38 0.38 1.59 1.16 0.22 33 0 1 0

1 NotFail

29/31

CACL<=1.1694 CACL>1.1694

2 Fail 3 NotFail

23/8 7/22

ROCS<=4.4486
ROCS>4.4486 CLTA<=0.70635 CLTA>0.70635

4 Fail 5 NotFail 6 NotFail 7 Fail

22/4 1/4 5/22 2/0

WCTA<= - 0.3326 WCTA> - 0.3326 SALES<=3091.5

SALES>3091.5

8 NotFail 9 Fail 10 Fail 11 NotFail

1/2 21/2 2/0 3/22

𝑛 𝐷𝐼 +𝑛 𝐷𝐼 = 𝐷𝐼 − 𝑊𝐴𝐷𝐼
=
𝑛 +𝑛

(𝑛 − 1 − 𝑝)𝑛 𝑛 𝑛 𝑍̅ + 𝑛 𝑍̅
= 𝑑 =
𝑝(𝑛 − 2)(𝑛 + 𝑛 ) 𝑛 +𝑛

𝐹, , . = 2.557
𝐹, , . = 2.513
=1− 𝜌

Topic 7 - Discriminant and Cluster Analysis
No ratings yet
Topic 7 - Discriminant and Cluster Analysis
56 pages
STA3022 Test2 Solutions
No ratings yet
STA3022 Test2 Solutions
7 pages
STA3022Test2 2018
No ratings yet
STA3022Test2 2018
7 pages
Empirical Data Analysis in Accounting and Finance
No ratings yet
Empirical Data Analysis in Accounting and Finance
37 pages
PGP End Term NOV 2019 15-11-2019 - Soln
100% (1)
PGP End Term NOV 2019 15-11-2019 - Soln
18 pages
STA3022Test2 2023 v2
No ratings yet
STA3022Test2 2023 v2
6 pages
OPS 5003 End-Term Question Paper
No ratings yet
OPS 5003 End-Term Question Paper
7 pages
Section: - This Is An Open-Book and Open-Note Test. However, Sharing of Material Is NOT Permitted
No ratings yet
Section: - This Is An Open-Book and Open-Note Test. However, Sharing of Material Is NOT Permitted
9 pages
Classification Models
No ratings yet
Classification Models
95 pages
SDS Solution1
No ratings yet
SDS Solution1
26 pages
Business Statistics Level 3/series 4 2008 (3009)
100% (1)
Business Statistics Level 3/series 4 2008 (3009)
19 pages
ISL Answers
No ratings yet
ISL Answers
19 pages
Discriminant Function Analysis
No ratings yet
Discriminant Function Analysis
9 pages
Test 1 Review A
No ratings yet
Test 1 Review A
7 pages
Discriminant & Logit Analysis Using SAS Enterprise Guide
No ratings yet
Discriminant & Logit Analysis Using SAS Enterprise Guide
53 pages
Datascience Interview
100% (1)
Datascience Interview
31 pages
Solution Manual for Business Statistics 1st Edition by Donnelly ISBN 0132145391 9780132145398 pdf download
100% (5)
Solution Manual for Business Statistics 1st Edition by Donnelly ISBN 0132145391 9780132145398 pdf download
46 pages
ISLR solutions——Classification
No ratings yet
ISLR solutions——Classification
20 pages
Hypothesis Test - Variance - Section B
No ratings yet
Hypothesis Test - Variance - Section B
40 pages
FRA Milestone 1
No ratings yet
FRA Milestone 1
33 pages
Soal UAS Statu Genap 2019 2020 ENGLISH 1
No ratings yet
Soal UAS Statu Genap 2019 2020 ENGLISH 1
9 pages
MR Asssignment Group 4
No ratings yet
MR Asssignment Group 4
40 pages
Practical Question 2023
No ratings yet
Practical Question 2023
5 pages
339 - DADMB End Term
No ratings yet
339 - DADMB End Term
3 pages
HW5_solution_Fall_2024
No ratings yet
HW5_solution_Fall_2024
18 pages
Discriminant Analysis For Risk Classification and Prediction
No ratings yet
Discriminant Analysis For Risk Classification and Prediction
23 pages
Ant Analysis
No ratings yet
Ant Analysis
31 pages
Download the 2025 version of Solution Manual for Business Statistics 1st Edition by Donnelly ISBN 0132145391 9780132145398 (PDF) with all chapters
100% (16)
Download the 2025 version of Solution Manual for Business Statistics 1st Edition by Donnelly ISBN 0132145391 9780132145398 (PDF) with all chapters
49 pages
LDA 01 Linear Discriminant Analysis
No ratings yet
LDA 01 Linear Discriminant Analysis
65 pages
Rekapitulacija NIR - Sve
No ratings yet
Rekapitulacija NIR - Sve
23 pages
A Review of Basic Statistical Concepts: Answers To Odd Numbered Problems 1
No ratings yet
A Review of Basic Statistical Concepts: Answers To Odd Numbered Problems 1
32 pages
Topic 6 - FE, RE and tests
No ratings yet
Topic 6 - FE, RE and tests
46 pages
PA summary sheet
No ratings yet
PA summary sheet
9 pages
ADS ia 2
No ratings yet
ADS ia 2
9 pages
BDMDM Final Paper P16052 Dhruv
No ratings yet
BDMDM Final Paper P16052 Dhruv
5 pages
Basicof Stats
No ratings yet
Basicof Stats
7 pages
Chapter 1. Elements in Predictive Analytics
No ratings yet
Chapter 1. Elements in Predictive Analytics
66 pages
Salary 17.143 +1.589 (No of Years) + 3.643 (Gender) - 0.89 (Gen No of Years) + E Significance
No ratings yet
Salary 17.143 +1.589 (No of Years) + 3.643 (Gender) - 0.89 (Gen No of Years) + E Significance
4 pages
Save Your Answers in This Document and Submit Through Blackboard
No ratings yet
Save Your Answers in This Document and Submit Through Blackboard
7 pages
Rss Grad Diploma Module5 Solutions Specimen B PDF
No ratings yet
Rss Grad Diploma Module5 Solutions Specimen B PDF
15 pages
Statistical Inferences Solved Paper
No ratings yet
Statistical Inferences Solved Paper
7 pages
PUT Solution
No ratings yet
PUT Solution
12 pages
Discriminant Analysis: Plot of Y X. Symbol Is Value of GROUP
No ratings yet
Discriminant Analysis: Plot of Y X. Symbol Is Value of GROUP
8 pages
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
No ratings yet
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
2 pages
IV_AI-DS_AD3491_FDSA_Unit5
No ratings yet
IV_AI-DS_AD3491_FDSA_Unit5
39 pages
A Review of Basic Statistical Concepts: Answers To Problems and Cases 1
No ratings yet
A Review of Basic Statistical Concepts: Answers To Problems and Cases 1
94 pages
Assignment 2
No ratings yet
Assignment 2
22 pages
25 Question Paper
No ratings yet
25 Question Paper
4 pages
Mid Semester Regular-DM
No ratings yet
Mid Semester Regular-DM
3 pages
Practical 7 Classification Revision Questions
No ratings yet
Practical 7 Classification Revision Questions
8 pages
Context PDF
No ratings yet
Context PDF
31 pages
Data Science Cheatsheet 2.0: Statistics Model Evaluation Logistic Regression
No ratings yet
Data Science Cheatsheet 2.0: Statistics Model Evaluation Logistic Regression
4 pages
Itae 002 Test 1 2
0% (1)
Itae 002 Test 1 2
5 pages
TYCS Practical
No ratings yet
TYCS Practical
26 pages
Interview questions companie
No ratings yet
Interview questions companie
72 pages
Master ACT Math Prep: Maths, #1
From Everand
Master ACT Math Prep: Maths, #1
Subbalakshmi Devaki
No ratings yet
Master SAT Prep Maths: Maths, #1
From Everand
Master SAT Prep Maths: Maths, #1
Subbalakshmi Devaki
No ratings yet
100 Puzzles to Learn Data Warehousing
From Everand
100 Puzzles to Learn Data Warehousing
Cristian Scutaru
No ratings yet
AP Statistics Flashcards, Fifth Edition: Up-to-Date Practice
From Everand
AP Statistics Flashcards, Fifth Edition: Up-to-Date Practice
Barron's Educational Series
No ratings yet
SSC CGL Preparatory Guide -Mathematics (Part 2)
From Everand
SSC CGL Preparatory Guide -Mathematics (Part 2)
Dr. DK Sukhani
4/5 (1)
Probability For Finance
No ratings yet
Probability For Finance
115 pages
Frequencies: Frequencies Variables Usia /piechart Percent /order Analysis
No ratings yet
Frequencies: Frequencies Variables Usia /piechart Percent /order Analysis
37 pages
Course Project
No ratings yet
Course Project
2 pages
THE IMPACT OF INTERNET BANKING SERVICE QUALITY ON Customer Satisfaction
50% (2)
THE IMPACT OF INTERNET BANKING SERVICE QUALITY ON Customer Satisfaction
21 pages
Statistics Final Revision
No ratings yet
Statistics Final Revision
16 pages
Practicefinalsolutions
No ratings yet
Practicefinalsolutions
7 pages
The Process of Research in Psychology 3rd Edition Dawn M. Mcbride 2024 scribd download
100% (8)
The Process of Research in Psychology 3rd Edition Dawn M. Mcbride 2024 scribd download
85 pages
Anova Excel
No ratings yet
Anova Excel
27 pages
Analitik Dan Visualisasi Data - Pengenalan Data Analitik Dan Visualisasi
No ratings yet
Analitik Dan Visualisasi Data - Pengenalan Data Analitik Dan Visualisasi
18 pages
MGMT E-104: Quantitative Methods For Economics and Finance: Course Overview
No ratings yet
MGMT E-104: Quantitative Methods For Economics and Finance: Course Overview
7 pages
Cheat Sheet PSM
No ratings yet
Cheat Sheet PSM
3 pages
Data Mining Processes
No ratings yet
Data Mining Processes
14 pages
Foot Patrol System CHAP13
No ratings yet
Foot Patrol System CHAP13
27 pages
STAT609 SP23 LCN Unit3
No ratings yet
STAT609 SP23 LCN Unit3
46 pages
SPSS Worksheet 2 One-Way ANOVA
No ratings yet
SPSS Worksheet 2 One-Way ANOVA
6 pages
3is Quiz1 Kian
No ratings yet
3is Quiz1 Kian
1 page
(eBook PDF) Introduction to Statistical Investigations by Nathan Tintle 2024 scribd download
100% (6)
(eBook PDF) Introduction to Statistical Investigations by Nathan Tintle 2024 scribd download
45 pages
(Hunter & Brewer, 2015)
No ratings yet
(Hunter & Brewer, 2015)
36 pages
Reseaech Methods
No ratings yet
Reseaech Methods
27 pages
Multiple Choice Questions: Region North East South East South West North West 100,000
No ratings yet
Multiple Choice Questions: Region North East South East South West North West 100,000
10 pages
(Contributions To Statistics) Prof. Vladimir V. Anisimov (Auth.), Alessandra Giovagnoli, Anthony C. Atkinson, Bernard Torsney, Caterina May (Eds.) - mODa 9 - Advances in Model-Oriented Design and Anal
No ratings yet
(Contributions To Statistics) Prof. Vladimir V. Anisimov (Auth.), Alessandra Giovagnoli, Anthony C. Atkinson, Bernard Torsney, Caterina May (Eds.) - mODa 9 - Advances in Model-Oriented Design and Anal
263 pages
Annex 2 To OMCL GL Evaluation and Reporting of Results Evaluation of Results From Quantitative Testing - PAPHOMCL (21) 02R3
No ratings yet
Annex 2 To OMCL GL Evaluation and Reporting of Results Evaluation of Results From Quantitative Testing - PAPHOMCL (21) 02R3
9 pages
Comparison of Quantitative and Qualitative Research Traditions Epistemological Theoretical and Methodological Differences
No ratings yet
Comparison of Quantitative and Qualitative Research Traditions Epistemological Theoretical and Methodological Differences
18 pages
Research Empowers Us With Knowledge. 5
No ratings yet
Research Empowers Us With Knowledge. 5
20 pages
Local Calibration of The Mechanistic-Empirical Pavement Design Guide
No ratings yet
Local Calibration of The Mechanistic-Empirical Pavement Design Guide
77 pages
Structural Equation Modeling (Sem) & R-Software Lecture: Sri Kasnelly, S.E, M.M, Ciqar Arranged By: Group 4 (9&10) Sara Afifah Ananfsiyah (20.23.951)
No ratings yet
Structural Equation Modeling (Sem) & R-Software Lecture: Sri Kasnelly, S.E, M.M, Ciqar Arranged By: Group 4 (9&10) Sara Afifah Ananfsiyah (20.23.951)
24 pages
Checklist For Evaluating A Research Report
No ratings yet
Checklist For Evaluating A Research Report
2 pages
ML Questions
No ratings yet
ML Questions
56 pages
FORM_TWO__SCHEME_term_2
No ratings yet
FORM_TWO__SCHEME_term_2
10 pages
STAT 479: Machine Learning Lecture Notes: Sebastian Raschka Department of Statistics University of Wisconsin-Madison
No ratings yet
STAT 479: Machine Learning Lecture Notes: Sebastian Raschka Department of Statistics University of Wisconsin-Madison
16 pages

2015 No Memo Test 3

Uploaded by

2015 No Memo Test 3

Uploaded by

UNIVERSITY OF CAPE TOWN

STATISTICAL SCIENCES DEPARTMENT

TIME: 1 ½ hours Total Marks: 50

QUESTION 2 [16 marks]

QUESTION 3 [17 marks]

1. Do you consider yourself to be a lucky person? (Yes/No)

a) Write down the discriminant function. (2)

Data for the first 15 respondents

> fit <- lda(Q1 ~ Q2+Q3d+Q4d+Q5,data=luck, method="moment")

Prior probabilities of groups:

QUESTION 4 [12 marks]

SALES Sales in 1000's of pounds

e) Construct the classification matrix. (4)

Data for the first 5 firms only

4 Fail 5 NotFail 6 NotFail 7 Fail

22/4 1/4 5/22 2/0

WCTA<= - 0.3326 WCTA> - 0.3326 SALES<=3091.5

8 NotFail 9 Fail 10 Fail 11 NotFail

1/2 21/2 2/0 3/22

You might also like