Download Analysis of Binary Data, Second Edition David Roxbee Cox ebook All Chapters PDF
Download Analysis of Binary Data, Second Edition David Roxbee Cox ebook All Chapters PDF
com
https://textbookfull.com/product/analysis-of-
binary-data-second-edition-david-roxbee-cox/
https://textbookfull.com/product/analysis-of-survival-data-first-
edition-cox/
textbookfull.com
https://textbookfull.com/product/visual-six-sigma-making-data-
analysis-lean-ian-cox/
textbookfull.com
https://textbookfull.com/product/biota-grow-2c-gather-2c-cook-loucas/
textbookfull.com
https://textbookfull.com/product/newton-the-alchemist-science-enigma-
and-the-quest-for-natures-secret-fire-william-r-newman/
textbookfull.com
Writing an Interactive Story 1st Edition Pierre Lacombe
https://textbookfull.com/product/writing-an-interactive-story-1st-
edition-pierre-lacombe/
textbookfull.com
https://textbookfull.com/product/marieb-human-anatomy-physiology-12th-
edition-katja-hoehn/
textbookfull.com
https://textbookfull.com/product/neuroimmune-diseases-from-cells-to-
the-living-brain-hiroshi-mitoma/
textbookfull.com
https://textbookfull.com/product/sensing-the-nation-s-law-stefan-
huygebaert/
textbookfull.com
Using think-aloud interviews and cognitive labs in
educational research 1st Edition Leighton
https://textbookfull.com/product/using-think-aloud-interviews-and-
cognitive-labs-in-educational-research-1st-edition-leighton/
textbookfull.com
M O N O G R A P H S ON
STATISTICS A N D A P P L I E D P R O B A B I L I T Y
General Editors
D.R. Cox, D.V. Hinkley, D. Rubin and B.W. Silverman
1 Stochastic Population Models in Ecology and Epidemiology
M.S. Bartlett (1960)
2 Queues D.R. Cox and W.L. Smith (1961)
3 Monte Carlo Methods J.M. Hammersley and D.C. Handscomb (1964)
4 The Statistical Analysis of Series of Events D.R. Cox and
P.A.W. Lewis (1966)
5 Population Genetics W.J. Ewens (1969)
6 Probability, Statistics and Time M.S. Bartlett (1975)
7 Statistical Inference S.D. Silvey (1975)
8 The Analysis of Contingency Tables B.S. Everitt (1977)
9 Multivariate Analysis in Behavioural Research A.E. Maxwell (1977)
10 Stochastic Abundance Models S. Engen (1978)
11 Some Basic Theory for Statistical Inference E.J.G. Pitman (1978)
12 Point Processes D.R. Cox and V. Isham (1980)
13 Identification of Outliers D.M. Hawkins (1980)
14 Optimal Design S.D. Silvey (1980)
15 Finite Mixture Distributions B.S. Everitt and D.J. Hand (1981)
16 Classification A.D. Gordon (1981)
17 Distribution-free Statistical Methods 7.5. Maritz (1981)
18 Residuals and Influence in Regression R.D. Cook and S. Weisberg (1982)
19 Applications of Queueing Theory G.F. Newell (1982)
20 Risk Theory, 3rd edition R.E. Beard, T. Pentikainen and
E. Pesonen (1984)
21 Analysis of Survival Data D.R. Cox and D. Oakes (1984)
22 An Introduction to Latent Variable Models B.S. Everitt (1984)
23 Bandit Problems D.A. Berry and B. Fristedt (1985)
24 Stochastic Modelling and Control M.H.A. Davis and R. Vinter (1985)
25 The Statistical Analysis of Compositional Data 7. Aitchison (1986)
26 Density Estimation for Statistical and Data Analysis
B.W. Silverman (1986)
27 Regression Analysis with Applications G.B. Wetherill (1986)
28 Sequential Methods in Statistics, 3rd edition G.B. Wetherill (1986)
29 Tensor Methods in Statistics P. McCullagh (1987)
30 Transformation and Weighting in Regression R J. Carroll and
D. Ruppert (1988)
31 Asymptotic Techniques for use in Statistics O.E. Barndoff-Nielson
and D.R. Cox (1989)
32 Analysis of Binary Data, 2nd edition D.R. Cox and E.J. Snell (1989)
33 Analysis of Infectious Disease Data N.G. Becker (1989)
34 Design and Analysis of Cross-Over Trials B. Jones and
M.G. Kenward (1989)
35 Empirical Bayes Method, 2nd edition J.S. Maritz and T. tw in (1989)
36 Symmetric Multivariate and Related Distributions K.-T. Fang,
S. Kotz and K. Ng (1989)
37 Generalized Linear Models, 2nd edition P. McCullagh and
J.A. Nelder (1989)
38 Cyclic Designs y.v4. John (1987)
39 Analog Estimation Methods in Econometrics C.F. Manski (1988)
40 Subset Selection in Regression A.J. Miller (1990)
41 Analysis of Repeated Measures M. Crowder and D.J. Hand (1990)
42 Statistical Reasoning with Imprecise Probabilities P. Walley (1990)
43 Generalized Additive Models T.J. Hastie and R.J. Tibshirani (1990)
44 Inspection Errors for Attributes in Quality Control
N.L. Johnson, S. Kotz and X Ww (1991)
45 The Analysis of Contingency Tables, 2nd edition B.S. Everitt (1992)
(Full details concerning this series are available from the Publishers.)
Analysis of
Binary Data
SECOND EDITION
D.R. COX
Nuffield College
Oxford
and
E.J. SNELL
Department of Mathematics
Imperial College London
References 215
This monograph concerns the analysis of binary (or quantal) data, i.e.
data in which an observation takes one of two possible forms, e.g.
success or failure. The central problem is to study how the probability
of success depends on explanatory variables and groupings of the
material.
Many particular methods, especially significance tests, have been
proposed for such problems and one of the main themes of the
monograph is that these methods are unified by considering models
in which the logistic transform of the probability of success is a linear
combination of unknown parameters. These linear logistic models
play here much the same role as do normal-theory linear models in
the analysis of continuously distributed data.
Some knowledge of the theory of statistics is assumed. I have
written primarily for statisticians, but I hope also that scientists and
technologists interested in applying statistical methods will, by
concentrating on the examples, find something useful here.
I am very grateful to Dr Agnes M. Herzberg and Dr P.A.W. Lewis
for extremely helpful comments. I acknowledge also the help of Mrs
Jane Gentleman who programmed some of the calculations.
D.R. Cox
London
April 1969
Visit https://textbookfull.com
now to explore a rich
collection of eBooks, textbook
and enjoy exciting offers!
Preface to second edition
We have added new material partly to amplify matters dealt with only
very cryptically in the first edition and partly to describe some of the
more recent developments, for example on regression diagnostics. In
addition the contents of the first edition have been rearranged; for
example, the method of least squares with empirically estimated
weights is now of much less importance than it used to be because
computational developments have, for many problems, brought
maximum likelihood fitting within the painless grasp of most users of
statistical analysis. By giving some prominence to examples we have
aimed to make the book accessible to a range of readers. One of the
Appendices summarizes the theoretical background.
When the first edition was written it was feasible to give a relatively
complete annotated bibliography of work on the analysis of binary
data. The number of papers on this topic is now so large and is
increasing so rapidly that no such bibliography has been attempted.
Instead, in the Bibliographic Notes at the end of the each chapter we
have aimed to give just a few key references for further reading and for
details omitted from the main text.
We are grateful to Professor N. Wermuth, Mainz, for thoughtful
comments on a portion of the manuscript and to Professor J.K.
Lindsey, Liège, for advice over ordinal data and to Professor P.
McCullagh.
The first author is grateful to Science and Engineering Research
Council for a Senior Research Fellowship held at Department of
Mathematics, Imperial College, London.
D. R. Cox
E. J. Snell
London
July 1988
CH APTER 1
1.1 Introduction
Suppose that on each individual we have an observation that takes
one of two possible forms. The following are examples:
1. an electronic component may be defective, or may be
nondefective;
2. a test animal may die from a specified dose of a poison, or may
survive;
3. a subject may give the correct reply in an experimental situation,
or may give a wrong reply;
4. a test specimen may fracture when struck with a standardized
blow, or may not;
and so on. If for the ith individual we can represent this observation,
or response, by a random variable, Y;, we may without loss of
generality code the two possible values of 7; by 1 and 0 and write
E{Yi) = prob(y; = 1) = 9i, prob(7i = 0) = 1 - 0,, ( 1. 1)
say. It is often convenient to call 7^ = 1 a ‘success’ and 7^ = 0 a ‘failure’.
It is reasonable to call such observations binary; an older term is
quantal.
We assume that such binary observations are available on n
individuals, usually assumed to be independent. The problem is to
develop good methods of analysis for assessing any dependence of 0,
on explanatory variables representing, for example, groupings of the
individuals or quantitative explanatory variables.
We have followed the usual terminology and have distinguished
between (a) response variables and (b) explanatory variables, the
variables of the second type being used to explain or predict variation
in variables of the first type.
Sometimes a binary response variable arises by condensing a more
complex response. Thus a component may be classed as defective
2 BI NARY R E S P O N S E V AR I ABL E S
1.2 Examples
It is convenient to begin with a few simple specific examples
illustrating the problems to be considered.
Total n Q n ^ — R q — Ri R q -{-Ri no + Wj
Controls 32 11 43 0.256
Lung cancer
patients 60 3 63 0.048
Total 92 14 106
No. of
No of control No. not experimental No. not
Day, s babies, Mq.s crying, Tq., babies, ^ crying,
1 8 3 1 1
2 6 2 1 1
3 5 1 1 1
4 6 1 1 0
5 5 4 1 1
6 9 4 1 1
7 8 5 1 1
8 8 4 1 1
9 5 3 1 1
10 9 8 1 0
11 6 5 1 1
12 9 8 1 1
13 8 5 1 1
14 5 4 1 1
15 6 4 1 1
16 8 7 1 1
17 6 4 1 0
18 8 5 1 1
1.2 EXAMPLES 5
in each group.
The object of the analysis is to assess the effect of the treatment on
the probability of success. The tentative basis for the analysis is that
there is in some sense a constant treatment effect throughout the
experiment, even though there may be some systematic variation
from day to day. The experiment has the form of a randomized block
design, in fact a matched pair design, but the binary nature of the
response and the varying numbers of individuals in the groups
complicate the analysis.
For the reasons indicated after Example 1.1 it would not in general
be a sound method of analysis to pool the data over days, thus
forming a single 2 x 2 contingency table with entries ZR q.s»
One simple, if approximate, method of analysis that is not distorted
by systematic differences between groups is to calculate for the sth
group the difference in the proportions of successes, i.e.
0 ,s
( 1.2)
«0,/
This is an unbiased estimate for the sth set of the difference between
the probabilities of success. When (1.2) is averaged over all the sets, an
unbiased estimate of the mean difference between groups results. A
difficulty of this analysis is that the quantities (1.2) have in general
different precisions for different s.
Co 0 2 30 0.067
2co 1 8 30 0.267
4co 2 '15 30 0.500
8t'o 3 23 30 0.767
16co 4 27 30 0.900
Visit https://textbookfull.com
now to explore a rich
collection of eBooks, textbook
and enjoy exciting offers!
BI N AR Y R E S P O N S E V AR I A BL E S
Propn
Failures Successes Total successes
Group 0 «0 -7 Ro Ro no Rq/^O
Group g ^9 R jn ,
Propn of
Non-carriers Carriers Total carriers x
Tonsils present.
not enlarged 497 19 516 0.0368 - 1
Tonsils enlarged
+ 560 29 589 0.0492 0
Tonsils enlarged
269 24 293 0.0819 1
10 BI N AR Y R E S P O N S E V AR I A BL E S
Updated editions will replace the previous one—the old editions will
be renamed.
1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside the
United States, check the laws of your country in addition to the terms
of this agreement before downloading, copying, displaying,
performing, distributing or creating derivative works based on this
work or any other Project Gutenberg™ work. The Foundation makes
no representations concerning the copyright status of any work in
any country other than the United States.
• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”
• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.
1.F.
1.F.4. Except for the limited right of replacement or refund set forth in
paragraph 1.F.3, this work is provided to you ‘AS-IS’, WITH NO
OTHER WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR ANY PURPOSE.
Please check the Project Gutenberg web pages for current donation
methods and addresses. Donations are accepted in a number of
other ways including checks, online payments and credit card
donations. To donate, please visit: www.gutenberg.org/donate.
Most people start at our website which has the main PG search
facility: www.gutenberg.org.