0% found this document useful (0 votes)

8 views

Stat 4

Uploaded by

Christian Lerrick

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Stat 4

Uploaded by

Christian Lerrick

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Worksheet 2 - Basic statistics

Basic statistics references

z Fowler et al. (1998) -Chpts 1, 2, 3, 4, 5, 6, 9, 10, 11, 12, & 16 (16.1, 16.2, 16.3, 16.9,16.11-16.14)
z Holmes et al. (2006) - Chpt 4 & Sections 7.1-7.3 & 8.1-8.3
z Quinn & Keough (2002) - Chpt 1, 2, 3 & 4

Question 1 - Population parameters

The little spotted kiwi (Apteryx owenii) is a very rare flightless bird that is extinct on mainland New Zealand and
survives as 1000 individuals on Kapiti Island. In order to monitor the population, researches in the recovery team
systematically captured all of the individuals in the population over a two week period. Each individual was
weighed, banded, assessed and released. The file *.csv lists the weights of each individual male little spotted kiwi
in the population.

Format of kiwi.csv data files

Band Weight
64955 1.749
65318 2.551
64612 1.768
64393 2.327
64092 2.127
... ...

Band Unique bird identification band number

Weight Weight (grams) of the individual male
birds

Open the kiwi data file.

Generate a frequency histogram of male kiwi weights. This distribution represents the population (all possible
observations) of male kiwi weights. Note that this is the statistical population and not a biological population -
obviously a biological population entirely lacking in females would not last long!

Q1-1.Describe the shape of the distribution?

Since we have the weights of all male kiwi in the population, is possible to calculate population parameters (such
as population mean and standard deviation) directly!

Q1-2. What is the mean (a location measure) and standard deviation (a measure of spread) of the
population?

a. Mean
b. SD

Assuming, the population is normally distributed, it is possible to calculate the probability that a randomly
recaptured male kiwi will weigh greater than a particular value, less than a particular value, or weigh between a
range of weights. This probability is just the area under a particular region of a normal distribution and can be
calculated using the normal probabilities.

Q1-3. Assuming that the population is normally distributed, what is the probability of recapturing a
male little spotted kiwi that weighs greater than 2.9 kg?

For data sets with large numbers of observations, the distribution of observations can be examined via a
histogram - as demonstrated above. However, histograms are only meaningful for summarizing large data sets.
For smaller data sets other exploratory tools (such as boxplots) are necessary. To appreciate the relationship
between boxplots and the underlying distribution of data, construct a boxplot of male kiwi weights.

Question 2 - Samples as estimates of populations

Here is a modified example from Quinn and Keough (2002). Lovett et al. (2000) studied the chemistry of forested
watersheds in the Catskill Mountains in New York State. They had 38 sites and recorded the concentrations of
ten chemical variables, averaged over three years. We will look at two of these variables, dissolved organic
carbon (DOC) and hydrogen ions (H).

Format of lovett.csv data files

STREAM DOC H
Hunter 180.4 0.48
West Kill 108.8 0.24
Mill 104.7 0.47
Kelly Hollow 84.5 0.23
Pigeon 82.4 0.37

STREAM Name of the site (stream) from which

observations were collected
DOC Dissolved oxygen concentration
(mmol.L-1)
H Hydrogen concentration (mmol.L-1)

Open the lovett data file.

Q2-1. What is the purpose of sampling?

Before continuing, make sure you are clear on what the observations, variables and populations are.

Construct a boxplot of dissolved organic carbon (DOC) from the sample observations.

Q2-2. How would you describe the boxplot?

Q2-3. Are there any outliers? (Y or N)

Provided the data were collected without bias (ideally random) and with adequate replication, the sample should
reflect the entire population. Therefore sample statistics should be good estimates of the population
parameters.

Q2-4. Calculate the sample mean

The mean of a sample is considered to be a location characteristic of the sample. Along with the mean, it is often
desirable to characterize the spread of data in a sample - that is to determine how variable the sample is.

Q2-5. Calculate the sample standard deviation

For most purposes, the sample itself is of little interest - it is purely used to estimate the population. Therefore it is
necessary to be able to estimate how well the sample mean estimates the true population mean. The Standard
error (SE) of the mean is a measure of the precision of the mean.

Q2-6. Calculate the standard error of the mean

Following on from the idea of precision of the mean, is the concept of confidence intervals, by which an interval
is calculated that we are 95% confident will contain the true population mean.

Q2-7. Calculate the 95% confidence interval of the mean +/-

Construct a boxplot of hydrogen concentration (H) from the sample observations

Q2-8. How would you describe the boxplot?

Many statistical analyses assume that the population from which the sample was collected is normally distributed.
However, biological data is not always normally distributed. To normalize the data, try transforming to logs.

Q2-9. Does the transformation successfully normalize these data? (Y or N)

Earlier we identified the presence of an outlier in the DOC variable. To investigate the impact of this outlier on a
range of summary statistics, calculate the following measures of location (mean and median) and spread
(standard deviation and interquartile range) for DOC, with and without the outlying observation and complete
the table below.

Summary Statistic DOC Modified DOC

Mean

Median

Variance

Standard deviation

Inter-quartile range
Q2-10. Which measures of location and spread are most robust to inclusion and exclusion of a single
unusual observation?

Question 3 - Exploratory data analysis

Sánchez-Piñero & Polis (2000) studied the effects of seabirds on tenebrionid beetles on islands in the Gulf of
California. These beetles are the dominant consumers on these islands and it was envisaged that seabirds
leaving guano and carrion would increase beetle productivity. They had a sample of 25 islands and recorded the
beetle density, the type of bird colony (roosting, breeding, no birds), % cover of guano and % plant cover of
annuals and perennials.

Format of sanchez.csv data files

COLTYPE BEETLE96 GUANO PLANT96
.. .. .. ..
.. .. .. ..
.. .. .. ..

COLTYPE Type of bird colony (N = no birds, R

= roosting, B = breeding
BEETLE96 Abundance of beetles (number per
carrion trap) in 1996
GUANO % cover of guano on island in 1995
and 1996
PLANT96 % cover of total plants (annual and
perennial) on island in 1996

Open the sanchez data file.

Q3-1.For percentage plant cover, Calculate the following summary statistics separately for each
colony type and complete the table below.

Summary No Roosting Breeding

Statistic colonies colony colony
Mean

Variance

Standard deviation

Coefficient of variation

a. Which colony type has the greatest variance? (N, R or B)

b. Which is the most variable when corrected for the mean? (N, R or B)

Normality
Before proceeding, make sure you are familiar with the significance of normally distributed sample data and thus
why it is necessary to examine the distribution of sample data as part of routine exploratory data analysis
(EDA) prior to any formal data analysis.

Q3-2. Construct a boxplot for total 1996 beetle abundance for each colony type separately.

a. Are there any outliers identified? (Y or N)

b. Describe the shape of each distribution.

c. Now transform the response variable to logs and redraw the boxplots, does this
change (improve?) the shape of the distributions? (Y or N)

Linearity
Often it is necessary to examine the nature of the relationship or association between variables as part of
routine exploratory data analysis (EDA) prior to any formal data analysis. The nature of relationships/associations
between continuous data is explored using scatterplots.

Q3-3. Construct a scatterplot for beetle abundance against total 1996 plant cover.

a. Is there any evidence of non-linearity? (Y or N)

b. Note, that the boxplots also enable us to explore the normality of both variables
(populations). Is there any evidence of non-normality? (Y or N)

Sánchez-Piñero & Polis (2000) measured a number of continuous variables (% cover of guano, % cover or plants
and abundance of beetles. Therefore, they might be interested in exploring the relationships between each of
these variables. That is, the relationship between guano and plants, guano and beetles, and beetles and plants.
While it is possible to create separate scatterplots for each pair (in this case three separate scatterplots), a
scatterplot matrix is usually more informative and efficient.

Q3-4. Construct a scatterplot matrix or SPLOM for % of guano, % of plant cover and beetle
abundance. Are there any obvious relationships?

Homogeneity of variance
Many statistical hypothesis tests assume that populations are equally varied. For hypothesis tests that compare
populations (such as t-tests - see Question 4), it is important that one of the populations is not substantially more
or less variable than the other population(s). Thus, such tests assume homogeneity of variance.

Q3-5. Construct an examine boxplots of beetle abundance for each of the three colony types.

a. Firstly, is there any evidence of non-normality? (Y or N)

b. Try square-root transforming (preferred over log transformation when applying to count
data, since log(0) is not legal) the beetle variable (function is sqrt) and using this
transformed variable to reconstruct the boxplots. Note that it may be necessary to
perform a forth-root transformation (which performing the square-root transformation
twice) in order to normalize this highly skewed data. This can be done using the
expression to compute as sqrt(sqrt(BEETLE96)). If this successfully normalizes the
data, focus on whether there is any evidence that the populations are equally varied. Is
there any evidence that the assumption of homogeneity of variance is violated? (Y or
N)

c. Try calculating the variance or standard deviation of beetle abundance for each
colony type separately (remember to use the transformed data, as the raw data was
obviously non-normal and non-normality often results in unequal variances). Do these
values provide any evidence for unequally varied populations? (Y or N)

d. The primary concern of the equal variance assumption is that there should not be a
relationship between population mean and variance. Use the sample statistics to plot
mean against variance for the transformed beetle abundance data. Any evidence of a
relationship between mean and variance? (Y or N)

Question 4 - Hypothesis testing

Furness & Bryant (1996) studied the energy budgets of breeding northern fulmars (Fulmarus glacialis) in
Shetland. As part of their study, they recorded the body mass and metabolic rate of eight male and six female
fulmars.

Format of furness.csv data files

SEX METRATE BODYMASS
MALE 2950 875
FEMALE 1956 765
MALE 2308 780
MALE 2135 790
MALE 1945 788

SEX Sex of breeding northern fulmars

(Fulmarus glacialis)
METRATE Metabolic rate (hJ/day)
BODYMASS Body mass (g)

Open the furness data file.

Q4-1. The researchers were interested in testing whether there is a difference in the metabolic rate of
male and female breeding northern fulmars. In light of this, list the following:

a. The biological hypotheses of interest

b. The biological null hypotheses

c. The statistical null hypotheses (H0)

The appropriate statistical test for testing the null hypothesis that the means of two independent populations are
equal is a t-test

Before proceeding, make sure you understand what is meant by normality and equal variance as well as the
principles of hypothesis testing using a t-test.

Q4-2. For the null hypothesis test of interest (that the mean population metabolic rate of males and
females were the same), calculate the Degrees of freedom

Q4-3. Calculate the critical t-values for the following null hypotheses (&alpha = 0.05)

a. The metabolic rate of males is higher than that females (one-tailed test)

b. The metabolic rate of males is the same as that of females (two-tailed test)

Since most hypothesis tests follow the same basic procedure, confirm that you understand the basic steps of
hypothesis tests.

Q4-4.In the table below, list the assumptions of a t-test along with how violations of each assumption
are diagnosed and/or the risks of violations are minimized.

Assumption Diagnostic/Risk Minimization

II.

III.

So, we wish to investigate whether or not male and female fulmars have the same metabolic rates, and that we
intend to use a t-test to test the null hypothesis that the population mean metabolic rate of males is equal to the
population mean metabolic rate of females. Having identified the important assumptions of a t-test, use the
samples to evaluate whether the assumptions are likely to be violated and thus whether a t-test is likely to be
reliability.

4.5 Is there any evidence that;

a. The assumption of normality has been violated?

b. The assumption of homogeneity of variance has been violated?

Q4-6. Perform a t-test to examine the effect of sex on the mass of fulmars using either (which ever is
most appropriate) a pooled variance t-test (for when population variances are very similar) or
separate variance t-test (for when the variance of one population is likely to be up to 2.5 times greater
or less than the other population). Ensure that you are familiar with the output of a t-test.
a. What is the t-value? (Excluding the sign. The sign will depend on whether you
compared males to females or females to males, and thus only indicates which group
had the higher mean).

b. What is the df (degrees of freedom).

c. What is the p value.

Q4-7. Write the results out as though you were writing a research paper/thesis. For example (select
the phrase that applies and fill in gaps with your results):
The mean metabolic rate of male fulmars was (choose correct option)
(choose correct option) (t = , df = ,P =
)
the mean metabolic rate of female fulmars.

Q4-8.Construct a bar graph showing the mean metabolic rate of male and female fulmars and an
indication of the precision of the means with error bars.

Question 5 - Paired data

Here is a modified example from Quinn and Keough (2002). Elgar et al. (1996) studied the effect of lighting on
the web structure or an orb-spinning spider. They set up wooden frames with two different light regimes
(controlled by black or white mosquito netting), light and dim. A total of 17 orb spiders were allowed to spin their
webs in both a light frame and a dim frame, with six days `rest' between trials for each spider, and the vertical
and horizontal diameter of each web was measured. Whether each spider was allocated to a light or dim frame
first was randomized. The H0's were that each of the two variables (vertical diameter and horizontal diameter of
the orb web) were the same in dim and light conditions. Elgar et al. (1996) correctly treated these as paired
comparisons because the same spider spun her web in a light frame and a dark frame.

Format of elgar.csv data files

PAIR VERTDIM HORIZDIM VERTLIGH HORIZLIGH
.. .. .. .. ..
.. .. .. .. ..
.. .. .. .. ..

PAIR Name given to each pair of webs spun by a particular spider

VERTDIM The vertical dimension or height (mm) of webs spun in dim
conditions
HORIZDIM The horizontal dimension or width (mm) of webs spun in dim
conditions
VERTLIGH The vertical dimension or height (mm) of webs spun in light
conditions
HORIZLIGH The horizontal dimension or width (mm) of webs spun in light
conditions
Note:for paired t-tests, categories appear as column labels rather than entries in a categorical variable. Compare
the structure of the elgar data (paired t-test) set with that of the furness (standard t-test) data set.

Open the elgar data file.

Q5-1. What is an appropriate statistical test for testing an hypothesis about the difference in
dimensions of webs spun in light versus dark conditions? Explain why?
Q5-2. The actual H0 is that the mean of the differences between the pairs (light and dim for each
spider) equals zero. Use a paired t-test to test the H0 that the mean of the differences in vertical
diameter and separately, in horizontal diameter of the web between the pairs (light and dim for each
spider) equal zero.

Q5-3. Write the results out as though you were writing a research paper/thesis. For example (select
the phrase that applies and fill in gaps with your results):
The mean vertical diameter of spider webs in dim conditions was (choose correct option)
(choose correct option) (t = , df = ,P =
)
the vertical dimensions in light conditions.
The mean horizontal diameter of spider webs in dim conditions was (choose correct option)
(choose correct option) (t = , df = ,P =
)
the horizontal dimensions in light conditions.

Question 6 - Non-parametric tests

We will now revisit the data set of Furness & Bryant (1996) that was used in Question 4 to investigate the effects
of gender on the metabolic rates of breeding northern fulmars (Fulmarus glacialis). Furness & Bryant (1996) also
recorded the body mass of the eight male and six female fulmars they captured.

Since the males and female fulmars were all independent of one another, a t-test would be appropriate to test the
null hypothesis of no difference in mean body weight of male and female fulmars.

Q6-1. Are the assumptions underlying this test met? (Y or N) Hint: check the relative sizes of the two
sample variances and the distribution of body weight for each sex.

When the distributional assumptions are violated, parametric tests are unreliable. Under these circumstances,
non-parametric tests can be very useful.

Q6-2. The Wilcoxon-Mann-Whitney test is described as a non-parametric test for comparing two
groups.

a. What null hypothesis does this test actually evaluate?

b. What are the underlying assumptions of a Wilcoxon-Mann-Whitney test?

Q6-3. If the assumptions are met, test the null hypothesis of no difference in body weight between
male and female fulmars using a Wilcoxon test. Based on this outcome, what are your conclusions?

a. Statistical:
b. Biological (include trend):

Q6-4.Construct a bar graph showing the mean mass of male and female fulmars and an indication of
the precision of the means with error bars.

Welcome to the end of Worksheet 2

The Analysis of Biological Data Practice Problem Answers
40% (5)
The Analysis of Biological Data Practice Problem Answers
46 pages
Whitlock and Schluter-The Analysis of Biological Data Solutions Manual (2008) PDF
62% (13)
Whitlock and Schluter-The Analysis of Biological Data Solutions Manual (2008) PDF
44 pages
Tenko Raykov, George A. Marcoulides-Basic Statistics - An Introduction With R-Rowman & Littlefield Publishers (2012) PDF
No ratings yet
Tenko Raykov, George A. Marcoulides-Basic Statistics - An Introduction With R-Rowman & Littlefield Publishers (2012) PDF
345 pages
Fundamentals of Biostatistics 8th Edition by Rosner ISBN 130526892X Solution Manual
100% (51)
Fundamentals of Biostatistics 8th Edition by Rosner ISBN 130526892X Solution Manual
19 pages
Lab 1. The Nature of Data
No ratings yet
Lab 1. The Nature of Data
15 pages
Exploratory Data Analysis: 2.1 Objectives
No ratings yet
Exploratory Data Analysis: 2.1 Objectives
23 pages
Entendimiento de Datos Cientificos
No ratings yet
Entendimiento de Datos Cientificos
6 pages
Understanding The Structure of Scientific Data: LC - GC Europe Online Supplement
No ratings yet
Understanding The Structure of Scientific Data: LC - GC Europe Online Supplement
22 pages
Use of Statistics by Scientist
No ratings yet
Use of Statistics by Scientist
22 pages
CHE331 L08 Descriptive Stats
No ratings yet
CHE331 L08 Descriptive Stats
31 pages
Montagna Using SAS to Manage Biological Species Data and Calculate Diversity Indices
No ratings yet
Montagna Using SAS to Manage Biological Species Data and Calculate Diversity Indices
5 pages
Young, L. J., & Young, J. H. (1998) - Statistical Ecology.
No ratings yet
Young, L. J., & Young, J. H. (1998) - Statistical Ecology.
581 pages
Introduction To Probabilty
No ratings yet
Introduction To Probabilty
212 pages
Introduction To Statistics and Data Analysis
No ratings yet
Introduction To Statistics and Data Analysis
26 pages
Biol4121_PopulationOf LeafSizeatTSUwetland
No ratings yet
Biol4121_PopulationOf LeafSizeatTSUwetland
8 pages
mishel6
No ratings yet
mishel6
2 pages
Lecture Notes Ma12003 PDF
100% (1)
Lecture Notes Ma12003 PDF
105 pages
Data Mining - R Assignment: Konstantinos Stavrou (70134) 11/11/2012
No ratings yet
Data Mining - R Assignment: Konstantinos Stavrou (70134) 11/11/2012
13 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Word File for Prob and Stats
No ratings yet
Word File for Prob and Stats
25 pages
Measure of Relativity Position Normal Distribution Correlation
No ratings yet
Measure of Relativity Position Normal Distribution Correlation
22 pages
Measure of Relativity Position Normal Distribution Correlation
No ratings yet
Measure of Relativity Position Normal Distribution Correlation
22 pages
Cmda2005 Review
No ratings yet
Cmda2005 Review
65 pages
Complete Course 2021
No ratings yet
Complete Course 2021
61 pages
Notes 3
No ratings yet
Notes 3
19 pages
1347 s13 QP 2
No ratings yet
1347 s13 QP 2
8 pages
ML R Experiment1
No ratings yet
ML R Experiment1
10 pages
Methods Available For The Analysis of Data From Dominant Molecular Markers
No ratings yet
Methods Available For The Analysis of Data From Dominant Molecular Markers
6 pages
8 Probability Distributions: 8.1 R As A Set of Statistical Tables
No ratings yet
8 Probability Distributions: 8.1 R As A Set of Statistical Tables
6 pages
Engdat
No ratings yet
Engdat
3 pages
Ae Solution
100% (1)
Ae Solution
130 pages
1984 IBP 17 Chapter 8 Statistics
No ratings yet
1984 IBP 17 Chapter 8 Statistics
71 pages
X X X X X X: Data Presentation and Interpretation
No ratings yet
X X X X X X: Data Presentation and Interpretation
89 pages
Word File for Prob and Stats (2)
No ratings yet
Word File for Prob and Stats (2)
22 pages
IndEco 2024-Lab 01
No ratings yet
IndEco 2024-Lab 01
7 pages
ANOVA Models
No ratings yet
ANOVA Models
44 pages
Class Notes
No ratings yet
Class Notes
147 pages
Alevelsb sm1 Ex1mix
No ratings yet
Alevelsb sm1 Ex1mix
3 pages
Assignment 1,2,3
No ratings yet
Assignment 1,2,3
3 pages
Stats and Math for 9700 Bio p5 (1)
No ratings yet
Stats and Math for 9700 Bio p5 (1)
8 pages
Week 8
No ratings yet
Week 8
13 pages
Bio 160 - Exercise No 1 - Statistics
No ratings yet
Bio 160 - Exercise No 1 - Statistics
9 pages
Lab3Instructions_Knitr
No ratings yet
Lab3Instructions_Knitr
5 pages
Statical Chapman
100% (1)
Statical Chapman
385 pages
ESci 117 Answers To LAs in Modules 2
100% (1)
ESci 117 Answers To LAs in Modules 2
21 pages
Lesson 1 - A Review of Statistics
No ratings yet
Lesson 1 - A Review of Statistics
11 pages
To compute descriptive statistics , draw QQ plots, stem and leaf plot and box plot
No ratings yet
To compute descriptive statistics , draw QQ plots, stem and leaf plot and box plot
11 pages
Chapter 1 Exercise
No ratings yet
Chapter 1 Exercise
4 pages
Assignment 3 - Topic 5
No ratings yet
Assignment 3 - Topic 5
2 pages
Programming Python Statistics
No ratings yet
Programming Python Statistics
7 pages
Problems
No ratings yet
Problems
22 pages
Applied Environmental Measurement Techniques: Statistics Exploratory Data Analysis
No ratings yet
Applied Environmental Measurement Techniques: Statistics Exploratory Data Analysis
17 pages
Statistics and Biology
No ratings yet
Statistics and Biology
8 pages
Assignment
No ratings yet
Assignment
11 pages
Rocky Mountain Mammals: A Handbook of Mammals of Rocky Mountain National Park and Vicinity, Third Edition
From Everand
Rocky Mountain Mammals: A Handbook of Mammals of Rocky Mountain National Park and Vicinity, Third Edition
David M. Armstrong
No ratings yet
RSPB Handbook of British Birds: Fifth edition
From Everand
RSPB Handbook of British Birds: Fifth edition
Peter Holden
No ratings yet
Life History of the Kangaroo Rat
From Everand
Life History of the Kangaroo Rat
Walter P. (Walter Penn) Taylor
No ratings yet
Key to the identification and ecology of Cyclopoida (Crustacea, Copepoda) of North America (north of Mexico)
From Everand
Key to the identification and ecology of Cyclopoida (Crustacea, Copepoda) of North America (north of Mexico)
Leszek Bledzki
No ratings yet
Introduced Dung Beetles in Australia: A Pocket Field Guide
From Everand
Introduced Dung Beetles in Australia: A Pocket Field Guide
Penny Edwards
No ratings yet
Morphological Variation in a Population of the Snake, Tantilla gracilis Baird and Girard
From Everand
Morphological Variation in a Population of the Snake, Tantilla gracilis Baird and Girard
Laurence M. Hardy
No ratings yet
WS 5
No ratings yet
WS 5
6 pages
WS 2
No ratings yet
WS 2
77 pages
WS 3
100% (1)
WS 3
12 pages
Stat 3
No ratings yet
Stat 3
14 pages
SD 4
No ratings yet
SD 4
17 pages
PDF Grade 3 Mathematics Answer Sheet Subject Code M 3 - Compress
No ratings yet
PDF Grade 3 Mathematics Answer Sheet Subject Code M 3 - Compress
9 pages
Physics Material For PAT
No ratings yet
Physics Material For PAT
14 pages
2019 Vanda Global Answers
No ratings yet
2019 Vanda Global Answers
2 pages
DAY 3 - Arithmetic Sequences Worksheet
No ratings yet
DAY 3 - Arithmetic Sequences Worksheet
3 pages
Confidence Interval For Median Based On Sign Test
100% (1)
Confidence Interval For Median Based On Sign Test
32 pages
One Between-Subjects Factor: Pairwise Comparisons
No ratings yet
One Between-Subjects Factor: Pairwise Comparisons
20 pages
Immediate Download Engineering Statistics: An Introduction Edward B. Magrab Ebooks 2024
100% (5)
Immediate Download Engineering Statistics: An Introduction Edward B. Magrab Ebooks 2024
49 pages
Research Article: Bootstrapping Nonparametric Prediction Intervals For Conditional Value-at-Risk With Heteroscedasticity
No ratings yet
Research Article: Bootstrapping Nonparametric Prediction Intervals For Conditional Value-at-Risk With Heteroscedasticity
7 pages
ML VN Unit1 1
No ratings yet
ML VN Unit1 1
27 pages
Wayspire AI Course
No ratings yet
Wayspire AI Course
4 pages
STAT212 093 Old-Exam First-Major Solved
No ratings yet
STAT212 093 Old-Exam First-Major Solved
4 pages
B. Com. Semester-II Business Mathematics and Statistics (Code: 52411202)
No ratings yet
B. Com. Semester-II Business Mathematics and Statistics (Code: 52411202)
3 pages
NOTA INTERPRETING TEST SCORES & NORMS
No ratings yet
NOTA INTERPRETING TEST SCORES & NORMS
8 pages
12 Anova
No ratings yet
12 Anova
43 pages
Sampling and Its Types
No ratings yet
Sampling and Its Types
7 pages
The Types of Variables
No ratings yet
The Types of Variables
2 pages
GMM Stata
No ratings yet
GMM Stata
27 pages
Chi-Squared Test For Nominal (Categorical) Data: Yellow
No ratings yet
Chi-Squared Test For Nominal (Categorical) Data: Yellow
7 pages
Hypothesis Testing: Erwin L. Medina
0% (2)
Hypothesis Testing: Erwin L. Medina
8 pages
13_dind
No ratings yet
13_dind
58 pages
Handbook Of Survival Analysis 1st Edition John P Klein Hans C Van Houwelingen download
No ratings yet
Handbook Of Survival Analysis 1st Edition John P Klein Hans C Van Houwelingen download
90 pages
Bharathidasan University-Statistics-QP-Nov-2010
No ratings yet
Bharathidasan University-Statistics-QP-Nov-2010
3 pages
SPSS Tutorial
100% (1)
SPSS Tutorial
19 pages
NDE 6310 Assignment 1
No ratings yet
NDE 6310 Assignment 1
1 page
(eBook PDF) Statistics for Business Economics 13th Edition by Davidinstant download
100% (2)
(eBook PDF) Statistics for Business Economics 13th Edition by Davidinstant download
43 pages
AE114 Topic 1 Module 1
No ratings yet
AE114 Topic 1 Module 1
7 pages
Chapter 1
No ratings yet
Chapter 1
44 pages
SHS Final Exam (Statistics)
No ratings yet
SHS Final Exam (Statistics)
3 pages
Sampling Lab
No ratings yet
Sampling Lab
5 pages
Econ 320
No ratings yet
Econ 320
4 pages
1 Normality PDF
No ratings yet
1 Normality PDF
5 pages
(Ebook) Research Methods for the Behavioral Sciences by Gregory J. Privitera ISBN 2018037364 - Download the ebook now for instant access to all chapters
100% (2)
(Ebook) Research Methods for the Behavioral Sciences by Gregory J. Privitera ISBN 2018037364 - Download the ebook now for instant access to all chapters
79 pages
Mathematics: Unit Statistics 1B
No ratings yet
Mathematics: Unit Statistics 1B
20 pages
A Quick Guide To Quantitative Research in The Social Sciences
100% (1)
A Quick Guide To Quantitative Research in The Social Sciences
27 pages

Stat 4

Uploaded by

Stat 4

Uploaded by

Worksheet 2 - Basic statistics

Basic statistics references

Question 1 - Population parameters

Format of kiwi.csv data files

Band Unique bird identification band number

Open the kiwi data file.

Q1-1.Describe the shape of the distribution?

Question 2 - Samples as estimates of populations

Format of lovett.csv data files

STREAM Name of the site (stream) from which

Open the lovett data file.

Q2-1. What is the purpose of sampling?

Q2-2. How would you describe the boxplot?

Q2-4. Calculate the sample mean

Q2-5. Calculate the sample standard deviation

Q2-6. Calculate the standard error of the mean

Q2-7. Calculate the 95% confidence interval of the mean +/-

Construct a boxplot of hydrogen concentration (H) from the sample observations

Q2-8. How would you describe the boxplot?

Q2-9. Does the transformation successfully normalize these data? (Y or N)

Summary Statistic DOC Modified DOC

Question 3 - Exploratory data analysis

Format of sanchez.csv data files

COLTYPE Type of bird colony (N = no birds, R

Open the sanchez data file.

Summary No Roosting Breeding

a. Which colony type has the greatest variance? (N, R or B)

a. Are there any outliers identified? (Y or N)

b. Describe the shape of each distribution.

a. Is there any evidence of non-linearity? (Y or N)

a. Firstly, is there any evidence of non-normality? (Y or N)

Question 4 - Hypothesis testing

Format of furness.csv data files

SEX Sex of breeding northern fulmars

Open the furness data file.

a. The biological hypotheses of interest

b. The biological null hypotheses

Assumption Diagnostic/Risk Minimization

4.5 Is there any evidence that;

a. The assumption of normality has been violated?

b. The assumption of homogeneity of variance has been violated?

b. What is the df (degrees of freedom).

c. What is the p value.

Question 5 - Paired data

Format of elgar.csv data files

PAIR Name given to each pair of webs spun by a particular spider

Open the elgar data file.

Question 6 - Non-parametric tests

a. What null hypothesis does this test actually evaluate?

b. What are the underlying assumptions of a Wilcoxon-Mann-Whitney test?

Welcome to the end of Worksheet 2

You might also like