Instructions

Uploaded by

loluise127

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views

Instructions

Uploaded by

loluise127

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

[Print: May 19, 2022]

[]

Statistics Project Directions

Along with this set of directions you will find a .csv file containing three columns of numbers.
The name of the file is one of CUcccxx.csv, Gcccxx.csv, Bcccxx.csv. The first letter(s) of the
file name indicates the type of distribution contained in the first column of the table (CU means
continuous uniform, G means gamma, and B means binomial). The characters ccc indicate the
presence of some number of other characters and the digits xx (or in some cases only x) are an
identification number for the file.

Column A. When the file is opened with Excel (or any other spreadsheet that can read .csv
files) the first column should contain these items:
Distribution Name
Distribution Variance
<a blank cell>
data point 1
data point 2
...
data point n

Column A Tasks.
A.1 For the type of distribution presented in the first column, plot a histogram (normalized to
unit area) of the data.
A.2 Estimate the mean and the variance for the column A distribution. For comparison, the
exact variance is given in the cell labeled (here) Distribution Variance.
A.3 The distribution found in the first column depends on two parameters as follows:
Continuous Uniform a and b (the endpoints of the interval)
Gamma Distribution α and β
Binomial Distribution n and p
Estimate the two parameters associated with the distribution given in Column A and deter-
mine a 96% confidence interval around each parameter.
A.4 How large a data set is needed to get 96% confidence intervals of width 0.01 or smaller around
the two parameters? (Assume X̄ and S 2 do not change significantly with N when N is large.)
A.5 Plot a graph of the density function for the distribution in column A using the estimated
parameter values determined in part A.3. Compare this graph to your normalized histogram.

Column B. The second column is similar to the first and looks like this:
Normal
<a blank cell>
<a blank cell>
data point 1
data point 2
...
data point 10000

This column contains 10,000 values from some Normal distribution.

1
[Print: May 19, 2022]
[]

Column B Tasks.
B.1 From the B column select two non-overlapping chunks1 of consecutive data points. The first
chunk should contain a large number of data points. The second chunk should contain exactly
25 data points. Estimate the mean and variance of this normal distribution using your first
(large) chunk of data.
B.2 Compute 98% confidence intervals around each of the parameters µ and σ based on the large
chunk of data.
B.3 Test the claim µ ≥ 4 using the second (small) chunk of data in a significance test. State the
null and research hypotheses. Describe the location of the critical region. Give the P-value
and the Z, T, or χ2 -stat as appropriate. Find the region where the power of the test exceeds
0.95.

Column C. The third column is similar to the second column and looks like this:
?????
<a blank cell>
<a blank cell>
data point 1
data point 2
...
data point 1000

This column contains random values taken from a mystery distribution. The distribution is known
to be either gamma or normal. Read section 15.2 in the textbook concerning a significance test
called The Goodness of Fit test or consult the Statistical Goodies handout.

Column C Tasks.
C.1 Create a raw histogram of your data. On this basis make a guess at the distribution.
C.2 Assume the distribution is normal, estimate the parameters that it would have. Use this
information to perform a Goodness of Fit test on the data under the assumption that the
data is normal. State the research and null hypotheses. Describe the location of the critical
region. Give the P-value and the Z, T, or χ2 -stat as appropriate. State your conclusions
using 0.10 as the rejection level.
C.3 Assume the distribution is gamma, estimate the parameters that it would have. Use this
information to perform a Goodness of Fit test on the data under the assumption that the
data is gamma. State the research and null hypotheses. Describe the location of the critical
region. Give the P-value and the Z, T, or χ2 -stat as appropriate. State your conclusions
using 0.10 as the rejection/acceptance level.
C.4 If some P-values from steps 2 and 3 are above 90%, assume that the data set with the largest
of these P-values represents the distribution of the data. This is your candidate distribution.
If all of the P-values from steps 2 and 3 are below 90%, locate the analysis that had the largest
P-value and perform a goodness of fit hypothesis test on that data to resolve the question of
its distribution. <continued on next page>
1
a technical term. . . don’t try to understand this!

2
[Print: May 19, 2022]
[]

If you cannot get a positive resolution from either the significance or the hypothesis test,
report your findings and go on the step (C.5)– the distribution you found with the best
goodness of fit P-value will stand in for your candidate distribution.
C.5 Based on the Goodness of Fit test results, name the mystery distribution. Does this agree
with your initial guess? Plot a graph of the actual density and compare it with your Part
C.1 histogram normalized to unit area.

The Work.

Divide up the responsibility for solving these problems among the team members. In your report,
explain who was responsible for which aspects of the work. In the end, it is important for everyone
to understand how the problems were solved — this will make the task of preparing for the final
exam easier.

Many spreadsheets contain packages that can do a lot of these calculations for you. Avoid using
these since a computer of any kind is unavailable for the final exam – it is best to learn how to
perform these computations from the basic ideas. You may use spreadsheet functions, such as
NORMINV, NORMDIST, CHIINV, TINV, AVERAGE, STDEV, (to name a few), in lieu of the tables in the
back of the book. Make sure you know what these functions are reporting. Do not use Excel data
analysis functions to determine histograms, confidence intervals, hypothesis/significance tests, and
so on. You should construct histograms, confidence intervals, perform tests of hypothesis from
scratch.

The Report.

Write up (in a few pages) who performed the analysis and how the data was analyzed. Describe
any estimators, equations, theorems, etc. used in performing the confidence interval estimates.
Explain what you are doing for a significance test and interpret the results. Include any remarks
about the data (or the results) that you feel are pertinent.

When you are ready to submit, print a paper copy of your report, check the copy one last time
for errors, and submit.

It isn’t necessary (or desired) to turn in pages and pages of spreadsheet computations. The results
and your interpretations of them must suffice. Use graphs, charts, and summary tables as needed
to support your work. Above all, make this report readable! . . . omit needless words, be succinct!

Mathematics in The Modern World
No ratings yet
Mathematics in The Modern World
75 pages
Wolter Introduction To Variance Estimation PDF PDF Expert
No ratings yet
Wolter Introduction To Variance Estimation PDF PDF Expert
2 pages
Assignment 1
No ratings yet
Assignment 1
11 pages
Data Management Tutorials
No ratings yet
Data Management Tutorials
56 pages
Workbook.data distributions
No ratings yet
Workbook.data distributions
24 pages
Ial Maths s1 Review Exercise 1
No ratings yet
Ial Maths s1 Review Exercise 1
15 pages
Ch04le 1
No ratings yet
Ch04le 1
59 pages
Grade 12 Data Management MDM4U Final_Questions
No ratings yet
Grade 12 Data Management MDM4U Final_Questions
16 pages
41-47 Introductory Biostatistics Notes | Osmosis
No ratings yet
41-47 Introductory Biostatistics Notes | Osmosis
136 pages
1 - Practice Exercise 1 Data Descriptives
No ratings yet
1 - Practice Exercise 1 Data Descriptives
10 pages
Final Exam Review: Test Scores Frequency
100% (1)
Final Exam Review: Test Scores Frequency
10 pages
Exam 1
No ratings yet
Exam 1
5 pages
RP Notes Unit 4 - Distribution Fucntions
No ratings yet
RP Notes Unit 4 - Distribution Fucntions
5 pages
W12SGFE
No ratings yet
W12SGFE
3 pages
Statistics
No ratings yet
Statistics
6 pages
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
No ratings yet
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
33 pages
S1 Revision Worksheet November 2020: Chapter 2, 3, 4, 5
No ratings yet
S1 Revision Worksheet November 2020: Chapter 2, 3, 4, 5
4 pages
AP Stat Spring Pacing
No ratings yet
AP Stat Spring Pacing
4 pages
Statistics S1 Summary: X X X S
No ratings yet
Statistics S1 Summary: X X X S
3 pages
Lecture 2_Descriptive Statistics
No ratings yet
Lecture 2_Descriptive Statistics
53 pages
Project The Normal Distribution Activity
No ratings yet
Project The Normal Distribution Activity
17 pages
Homework Topic 1&2.: Plus 20
No ratings yet
Homework Topic 1&2.: Plus 20
11 pages
BSChem-Statistics in Chemical Analysis PDF
No ratings yet
BSChem-Statistics in Chemical Analysis PDF
6 pages
Answer Key - Exercise 6
No ratings yet
Answer Key - Exercise 6
5 pages
RM Assignment 2
No ratings yet
RM Assignment 2
9 pages
Biostatistics 140127003954 Phpapp02
No ratings yet
Biostatistics 140127003954 Phpapp02
47 pages
Worksheet Chapter 1-9
No ratings yet
Worksheet Chapter 1-9
7 pages
Community MCQ
50% (2)
Community MCQ
271 pages
Normal Distribution
No ratings yet
Normal Distribution
15 pages
01_statistics_lesson
No ratings yet
01_statistics_lesson
35 pages
Cmda2005 Review
No ratings yet
Cmda2005 Review
65 pages
1696603448-MDM4U - Unit 2 Statistical Analysis
No ratings yet
1696603448-MDM4U - Unit 2 Statistical Analysis
9 pages
Activity
No ratings yet
Activity
11 pages
OCR MEI S1 Summary Sheets
No ratings yet
OCR MEI S1 Summary Sheets
9 pages
Measures of Variability and Position
No ratings yet
Measures of Variability and Position
34 pages
AP Stats Cheat Sheet FINAL
No ratings yet
AP Stats Cheat Sheet FINAL
8 pages
VCTest 1 BF09 Ans
No ratings yet
VCTest 1 BF09 Ans
9 pages
QBM 101 Business Statistics: Department of Business Studies Faculty of Business, Economics & Accounting HE LP University
No ratings yet
QBM 101 Business Statistics: Department of Business Studies Faculty of Business, Economics & Accounting HE LP University
62 pages
KTE Pudu Jaya 2022 Q
No ratings yet
KTE Pudu Jaya 2022 Q
3 pages
Dissertation Boxplot
100% (2)
Dissertation Boxplot
8 pages
Cambridge O Level: STATISTICS 4040/22
No ratings yet
Cambridge O Level: STATISTICS 4040/22
16 pages
STAT 250 Practice Problem Solutions
100% (1)
STAT 250 Practice Problem Solutions
5 pages
Statistical Analysis in Excel by Golden MCpherson
No ratings yet
Statistical Analysis in Excel by Golden MCpherson
315 pages
Statistics 311 Learning Objectives
No ratings yet
Statistics 311 Learning Objectives
7 pages
260 Proj
No ratings yet
260 Proj
3 pages
Unit 1 Assignment SKELETON R spr18
No ratings yet
Unit 1 Assignment SKELETON R spr18
23 pages
KEY FOR Notes Statistics MINI NOTES KEY
No ratings yet
KEY FOR Notes Statistics MINI NOTES KEY
5 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
Inbound 588667172330667162
No ratings yet
Inbound 588667172330667162
30 pages
ALY6010 - Project 3 Document - Electronic Keno - v1 PDF
No ratings yet
ALY6010 - Project 3 Document - Electronic Keno - v1 PDF
6 pages
Test 1 Review A
No ratings yet
Test 1 Review A
7 pages
STATS 10 Assignment 1
No ratings yet
STATS 10 Assignment 1
7 pages
1
No ratings yet
1
47 pages
IGNOU Assignment
0% (1)
IGNOU Assignment
9 pages
Mostly Harmless Statistics
No ratings yet
Mostly Harmless Statistics
506 pages
CS215 Autumn 2024-1
No ratings yet
CS215 Autumn 2024-1
6 pages
C15 Statistics TI84
No ratings yet
C15 Statistics TI84
178 pages
Chapter 3 Univariate Data Worksheet Package Student Spaces
No ratings yet
Chapter 3 Univariate Data Worksheet Package Student Spaces
24 pages
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Student Solutions Manual for Mathematics for Economics, fourth edition
From Everand
Student Solutions Manual for Mathematics for Economics, fourth edition
Michael Hoy
No ratings yet
Chapter 3 - Forecasting - EXCEL TEMPLATES
No ratings yet
Chapter 3 - Forecasting - EXCEL TEMPLATES
14 pages
QM 8 Panel Regression, Random Effects
No ratings yet
QM 8 Panel Regression, Random Effects
39 pages
SPSS Assignment
0% (1)
SPSS Assignment
7 pages
QTT Project Crease Stifness Problem
No ratings yet
QTT Project Crease Stifness Problem
38 pages
Harrell's Concordance Index R
No ratings yet
Harrell's Concordance Index R
13 pages
Basic Econometrics
No ratings yet
Basic Econometrics
2 pages
Bayesian Data Analysis
No ratings yet
Bayesian Data Analysis
14 pages
IRT in Mplus: 1 ICC Curves
No ratings yet
IRT in Mplus: 1 ICC Curves
8 pages
Performance Standard-Based Assessment Statistics and Probability 11.4.1
No ratings yet
Performance Standard-Based Assessment Statistics and Probability 11.4.1
2 pages
Business Statistics Operations Research
No ratings yet
Business Statistics Operations Research
8 pages
Exercise 1 (Exam December 2012)
No ratings yet
Exercise 1 (Exam December 2012)
30 pages
MMS Business Statistics
No ratings yet
MMS Business Statistics
158 pages
Statistik Chapter 9
No ratings yet
Statistik Chapter 9
2 pages
Data Analysis Using SPSS - Evsu PDF
No ratings yet
Data Analysis Using SPSS - Evsu PDF
201 pages
CASE STUDY - North-South Airlines
No ratings yet
CASE STUDY - North-South Airlines
15 pages
استخدام أحد نماذج بوكس-جينكنز للتنبؤ بأعداد الطالبات في المرحلة الأساسية في محافظة أبين PDF
No ratings yet
استخدام أحد نماذج بوكس-جينكنز للتنبؤ بأعداد الطالبات في المرحلة الأساسية في محافظة أبين PDF
2 pages
Anova
No ratings yet
Anova
56 pages
Group 2 - Chapter 3 - Multiple Regression Analysis Estimation
No ratings yet
Group 2 - Chapter 3 - Multiple Regression Analysis Estimation
13 pages
509-Article Text-2135-1-10-20220804
No ratings yet
509-Article Text-2135-1-10-20220804
14 pages
Introduction to Probability and Statistics 3rd Edition Mendenhall Solutions Manualpdf download
100% (4)
Introduction to Probability and Statistics 3rd Edition Mendenhall Solutions Manualpdf download
60 pages
Nihms 173355
No ratings yet
Nihms 173355
24 pages
Solution Manual for Managing, Controlling, and Improving Quality, 1st Edition, Douglas C. Montgomery, Cheryl L. Jennings Michele E. Pfund - Download Now To Experience The Complete Book
100% (6)
Solution Manual for Managing, Controlling, and Improving Quality, 1st Edition, Douglas C. Montgomery, Cheryl L. Jennings Michele E. Pfund - Download Now To Experience The Complete Book
31 pages
Exam All Questions
No ratings yet
Exam All Questions
566 pages
ML 1
No ratings yet
ML 1
51 pages
Analysis Analysis of Variance One Way Anova
No ratings yet
Analysis Analysis of Variance One Way Anova
3 pages
Random Motors Project Submission: Name
No ratings yet
Random Motors Project Submission: Name
10 pages
2503.04941v2
No ratings yet
2503.04941v2
85 pages
Research 5
No ratings yet
Research 5
41 pages
STAT Q4 Week 5 Enhanced.v1
No ratings yet
STAT Q4 Week 5 Enhanced.v1
11 pages

Instructions

Uploaded by

Instructions

Uploaded by

[Print: May 19, 2022]

Statistics Project Directions

This column contains 10,000 values from some Normal distribution.

You might also like