Introduction To Hierarchical Bayesian Modeling For Ecological Data E Parent download
Introduction To Hierarchical Bayesian Modeling For Ecological Data E Parent download
https://ebookbell.com/product/introduction-to-hierarchical-
bayesian-modeling-for-ecological-data-e-parent-4347918
https://ebookbell.com/product/introduction-to-modern-analysis-2nd-
edition-2nd-kantorovitz-44870612
https://ebookbell.com/product/introduction-to-the-speechmaking-
process-15th-edition-diana-k-leonard-raymond-s-ross-44874488
https://ebookbell.com/product/introduction-to-construction-
management-2nd-edition-2nd-fred-sherratt-44899008
https://ebookbell.com/product/introduction-to-analysis-with-complex-
numbers-irena-swanson-44912170
Introduction To Quantitative Methods In Business With Applications
Using Microsoft Office Excel 1st Edition Bharat Kolluri
https://ebookbell.com/product/introduction-to-quantitative-methods-in-
business-with-applications-using-microsoft-office-excel-1st-edition-
bharat-kolluri-44915766
https://ebookbell.com/product/introduction-to-biostatistical-
applications-in-health-research-with-microsoft-office-excel-and-r-2nd-
edition-robert-p-hirsch-44915830
https://ebookbell.com/product/introduction-to-strategies-for-organic-
synthesis-2nd-edition-laurie-s-starkey-44915846
https://ebookbell.com/product/introduction-to-hydrogen-technology-2nd-
edition-k-s-v-santhanam-44916142
https://ebookbell.com/product/introduction-to-behavioral-research-
methods-7th-edition-7th-mark-r-leary-44975468
Statistics
Introduction to
Introduction to Hierarchical
Making statistical modeling and inference more accessible to
ecologists and related scientists, Introduction to Hierarchical
Bayesian Modeling for Ecological Data gives readers a flexible and
effective framework to learn about complex ecological processes
Hierarchical
from various sources of data. It also helps readers get started on
building their own statistical models. Bayesian Modeling
for Ecological Data
The text begins with simple models that progressively become
more complex and realistic through explanatory covariates and
intermediate hidden states variables. When fitting the models to
data, the authors gradually present the concepts and techniques
of the Bayesian paradigm from a practical point of view using real
case studies. They emphasize how hierarchical Bayesian modeling
supports multidimensional models involving complex interactions
between parameters and latent variables. Data sets, exercises, and
R and WinBUGS codes are available on the authors’ website.
This book shows how Bayesian statistical modeling provides an
intuitive way to organize data, test ideas, investigate competing
hypotheses, and assess degrees of confidence of predictions. It
also illustrates how conditional reasoning can dismantle a complex
reality into more understandable pieces. As conditional reasoning
is intimately linked with Bayesian thinking, considering hierarchical
Parent • Rivot
models within the Bayesian setting offers a unified and coherent
framework for modeling, estimation, and prediction.
Éric Parent
C9195
Étienne Rivot
Published Titles
introduction to
HierarcHical
Bayesian Modeling
for ecological data
Éric Parent
Étienne rivot
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
This book contains information obtained from authentic and highly regarded sources. Reasonable efforts
have been made to publish reliable data and information, but the author and publisher cannot assume
responsibility for the validity of all materials or the consequences of their use. The authors and publishers
have attempted to trace the copyright holders of all material reproduced in this publication and apologize to
copyright holders if permission to publish in this form has not been obtained. If any copyright material has
not been acknowledged please write and let us know so we may rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmit-
ted, or utilized in any form by any electronic, mechanical, or other means, now known or hereafter invented,
including photocopying, microfilming, and recording, or in any information storage or retrieval system,
without written permission from the publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright.
com (http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood
Drive, Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organization that provides licenses and
registration for a variety of users. For organizations that have been granted a photocopy license by the CCC,
a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used
only for identification and explanation without intent to infringe.
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
Foreword xix
v
vi
Bibliography 375
Index 403
List of Figures
xiii
xiv List of Figures
10.19 Posterior pdfs for the full AI+ successive removal model 253
10.20 Predictive pdf of Salmon juvenile density versus AI . . 254
xvii
xviii List of Tables
Statistics? Not for us! Surprisingly, this is still what can sometimes be
heard when discussing statistics with scientists from applied disciplines.
How is it that such people disregard the science of doubt? No doubt that
practitioners possess a deep understanding of phenomenological behav-
ior of the system under study. They also have an acute feeling of its
variability and its complexity. They have elaborated a mental image of
the phenomenon, with operational shortcuts and permanent reformula-
tions. Dare we say: the seed for a model is already set in their minds.
At the same time, they never blindly trust model outputs and agree
that statistics is necessary to depict things that happened in the past
and to make projections about what may occur in the future. However,
they also often complain that the basic statistical procedures are too
simple and not flexible enough to address real problems they have to
struggle with, and one has to acknowledge that instruction manuals for
more advanced statistical tools are rarely funny readings! The present
book might not be exactly a funny reading, but we hope that it could
contribute to make the first steps of statistical modeling and inference
more accessible, and tear-free, to a wide community of practitioners.
In Ecology, interestingly, complex issues have often been the hotbed
of development of innovative statistical methodology. Statistical Ecol-
ogy, a branch of ecological sciences, has proved incredibly productive
in the last thirty years. However, temptation remains to apply ready-
to-use statistical recipes, and frustration can be heavy when the data
and the problem at hand do not fit exactly into the mold of the clas-
sical statistical toolboxes. But, to sacrifice the model realism for the
sake of statistical tractability, or to defer to the sole mathematician the
inference of the more complex model formulation, is that the unavoid-
able dilemma? In both cases, the analyst is deprived from the exciting
intellectual adventures of model making!
This book was written because we believe that Hierarchical Bayesian
Modeling is like a golden key to be given to scientists in Ecology. This
avenue of thought should highly contribute to free their creativity in
designing statistical models of their own.
xix
xx Foreword
with Jacques Bernier and Lucien Duckstein for useful discussions and
careful proofreading.
Jérôme Guitton patiently helped us to master enough html program-
ming language to implement our website hbm-for-ecology.org. This web-
site gives many of the datasets, R and WinBUGS or OpenBUGS codes
that we used to derive inference and figures in the book.
1
This page intentionally left blank
Chapter 1
Bayesian hierarchical models in
statistical ecology
Summary
The Salmon life cycle and the biomass production model exemplify
the key idea of this book: Hierarchical Bayesian Modeling (HBM) is a
Directed Acyclic Graph (DAG) modeling technique with the capacity to
cope with high-dimensional complex models typically needed for ecolog-
ical inferences and predictions ([64]; [65]; [67]; [75]; [313]).
HBM works through conditional decomposition of high-dimension
problems into a series of probabilistically linked simpler substructures.
HBM enables to exploit diverse sources of information to derive infer-
ences from large numbers of latent variables and parameters that de-
scribe complex relationships while keeping as close as possible to the
basic phenomena.
Based on these two motivating case studies from fisheries sciences,
we detail the three basic layers of hierarchical statistical models ([75];
[76]; [312]; [313]):
3
4 Introduction to Hierarchical Bayesian Modeling for Ecological Data
Here, the first two steps have been isolated for the purpose of clarity.
But this is somewhat artificial as they are practically embedded within
an iterative process in which the confrontation of the proposed model to
the available data (Step 2) may lead to reconsider the hypotheses that
were made in Step 1.
The relative importance of each of the three steps may vary depend-
ing upon the modeling objectives. Step 1 will receive primary attention
for fundamental ecological studies. If the management purpose is of pri-
mary interest, more modeling effort will be devoted to Step 3.
Throughout this book, we will exemplify how the Bayesian setting
enables one to combine these three steps (model design, model fitting
and predictive simulations) within a single unified and rigorous statistical
framework. A deeper discussion on the Bayesian setting is postponed to
Section 1.3.2.
1. Model errors
As any model is a functional simplification of a real process, it thus
provides an imperfect and a potentially misleading representation
of how the real system works. Model errors (also referred to as
structural errors) reflect our ignorance about the very nature of
the system. Model error may of course yield major consequences if
the model is used for forecasting (in particular when extrapolating
beyond the range of the data that have been collected). The role
of hypotheses is to restrict the number of plausible models. By
convention, a zero probability is put on the models that do not
belong to the set delimited by the hypotheses. The model error
term is to be quantified only for the members of this remaining
set.
2. Process stochasticity
Models with deterministic process are useful to learn from the in-
teraction of complex processes. However, when confronting models
with data, models must account for unpredictable variability or
stochasticity that cannot be explained by deterministic processes.
Process stochasticity can take different forms of which the most
common are demographic stochasticity (due to random differences
between individuals or statistical groups of individuals with the
Bayesian hierarchical models in statistical ecology 9
Bt
g(Bt ) = r · Bt · (1 − ) (1.2)
K
A Log-Normal random noise term is generally added to capture the
biological variability due to (unpredictable) environmental varia-
tions:
Bt+1 = (Bt + g(Bt ) − ct ) · et+1
(
iid
(1.3)
εt+1 ∼ N ormal(0, σ 2 )
with the ∼ sign meaning that t+1 , the logarithm of the perturba-
tion, is a normally distributed N (0, σ 2 ) random term standing for
the environmental noise. Mathematics for the Normal Distribution
will be developed in Chapters 3 and 6.
2. Step 2. Learning from observations.
Let t = 1, ..., n denote the epochs for which observations are
available. Available data in that case typically consist of a series
of observed catches c1 , ..., cn and of abundance indices i1 , ..., in ,
the latter are often assumed proportional to the current biomass
it = q · Bt , ∀t ∈ {1, ..., n}.
Abundance indices can be of different nature. They can be derived
from commercial data such as commercial catches per unit of fish-
ing effort, or from a scientific survey such as scientific trawling or
acoustic surveys. The abundance indices it=1:n are often assumed
proportional to the current biomass it = q · Bt , ∀t ∈ {1, ..., n} with
12 Introduction to Hierarchical Bayesian Modeling for Ecological Data
with ω1,t and ω2,t normally distributed N (0, τ1,o 2 ) and N (0, τ2,o 2 )
random terms describing the uncertainty in the observed catches
and abundance indices due to measurement and sampling error
(observation error).
Fisheries scientists would typically rely on the available knowledge
(expertise and past data) and observation (data at hand) to pro-
vide answers to the following questions of interest: i ) Is the logistic
growth function appropriate or does another form of production
function fit the observed data better? ii ) What are the credible
values for the parameters (r, K) and the associated uncertainty?
Can the growth rate r and the carrying capacity K be elicited from
some probabilistic prior judgmental expertise? iii ) What are the
credible values for the historical trajectory of the biomass level,
say B1 , ..., Bn and what is the level of the Biomass depletion over
the time series B B1 ?
n
r·K
CM SY =
4 (1.5)
B K
M SY =
2
What are credible values for the management reference points
CM SY and BM SY and their associated uncertainty? Do they match
Bayesian hierarchical models in statistical ecology 13
FIGURE 1.4: Simplified Atlantic salmon life cycle model with only one
sea-age class. Spawners Sp1 return and spawn one year before spawners
Sp2 of the same cohort.
Spawning occurs in the river in late fall and early winter and most
spawners die shortly after their first reproduction. Eggs hatch in the
gravel and alevins emerge during the next spring. After emergence,
young-of-the-year salmon (denoted 0+ in Fig. 1.4) spend one or sev-
eral years in the river before undergoing smoltification in the spring. At
this time, they migrate downstream to the sea as smolt. Adults return to
their home river for spawning after one or several years at sea. In France,
a major portion of the juveniles become smolts at one year of river age
(1+ Smolt, denoted Sm1 in Fig. 1.4), in the spring following their emer-
gence as young-of-the-year salmon, leaving behind the smaller juveniles
(P arr1) to spend an additional year in the river before seaward migra-
tion (2+ Smolt, denoted Sm2 in Fig. 1.4). Fish from the two smolt age
classes return as spawners after one or two years at sea. Other possible
life histories (early maturation in freshwater before seaward migration,
Bayesian hierarchical models in statistical ecology 15
Wt = Spt · pf · f ec (1.6)
iid
(1.7)
εt ∼ N (0, σ 2 )
1+ Parrs will survive (with a survival rate γP arr1 ) and will mi-
grate as 2+ Smolts. Demographic stochasticity in the survival
of resident parrs is also modelled using a Binomial distribu-
tion:
Let
[V1 | V2 ]
denotes the probability of event V1 given the event V2 has occurred.
It states that, given V1 , the issue V2 is uncertain and described by
probabilistic bets. The bracket notation will be used indifferently for
probability distributions of discrete variables and for probability density
functions (pdf) of continuous variables.
Finally, one will merely find the various elementary conditional con-
figurations illustrated in Fig. 1.5 as the small pieces of interaction issued
from such decomposition (Eq. (1.15)) of the joint distribution.
Constants and forcing conditions introduced into the analysis as de-
terministic covariates have no probabilistic status (conversely to the ran-
dom variables V1 , ..., Vn ) and should not in principle be considered in
Bayesian hierarchical models in statistical ecology 21
V1 V2
V3
V4
FIGURE 1.6: A simple example of DAG with four variables and one
covariate. V1 and V2 have no parents and are considered as param-
eters with independent unconditional distributions [V1 ] and [V2 ]. The
full joint distribution has been designed according to the factorization
[V1 , V2 , V3 , V4 |X] = [V1 ] × [V2 ] × [V3 |V1 , V2 ; X] × [V4 |V3 ].
a) b) c)
α β σ γ0+ θSm1
Parr1t+2
d) e)
γSm
γParr1
Sm1t+2 Sp1t+3
θ θ Parameters
Latent
Z variables
Y Y Observables
Vp+1 , ..., Vn ) and the other nonobserved quantities V1 , ..., Vp =(θ, Z) that
are unknown. Among these unknown quantities, we further make the dis-
tinction between the parameters (suppose for convenient notation that
the k first variables V1 , ..., Vk are nodes without parent) that are entry
points to the graph and are denoted θ and the p − k inner ones (the re-
maining Vk+1 , ..., Vp ) that are denoted Z. Parameters θ are often called
state of nature while latent variables Z are commonly named state of
the system. Such a three-layers structure as depicted by Fig. 1.8 is the
core of Bayesian Hierarchical Modeling (HBM).
Keeping in mind the groupings θ = (V1 , ..., Vk ), Z = (Vk+1 , ..., Vp ),
Y = (Vp+1 , ..., Vn ), and assuming that parameters have an unconditional
prior probability distribution [θ], the full joint distribution of the HBM
may be factorized as in Eq. (1.17):
denoted πSm , πSp , and π0+ , and considered constant for all years t:
C0+,t+1 ∼ Binomial(0+t+1 , π0+ )
CSm1,t+2 ∼ Binomial(Sm1t+2 , πSm )
CSm2,t+3 ∼ Binomial(Sm1t+3 , πSm ) (1.19)
CSp1,t+3 ∼ Binomial(Sp1t+3 , πSp )
CSp2,t+4 ∼ Binomial(Sp2t+4 , πSp )
FIGURE 1.9: DAG describing the A. salmon life cycle model with
Binomial observation process. Parameters (θ following our notations)
appear in the upper layer. The middle layer describes the dynamics of
hidden states ([Z|θ]). The observation model ([Y |Z, θ]) is presented in
the bottom layer.
ratios, Ricker parameters, etc.). The latent states Z (the other nodes in
between parameters and observables) would be mainly the number of in-
dividuals in the possible stages for all time indices, and the observations
y will be the catches:
C. LA CHRONIQUE LÉGENDAIRE
L’ÉGYPTE ARCHAÏQUE
I. PALÉOLITHIQUE
II. PRÉDYNASTIQUE
A. MONUMENTS
Silex
Les couches sédimentaires qui bordent la vallée du Nil sont
extrêmement riches en rognons de silex, qui atteignent parfois de
très grandes dimensions; sur les plateaux, le sol est couvert de
galets de silex, d’agate et de cornaline. Naturellement la qualité de
la pierre varie suivant les endroits, mais partout elle se prête à la
taille et les premiers habitants du pays avaient sous la main, d’un
bout à l’autre du pays, la matière première de laquelle ils pouvaient
tirer leurs armes et leurs outils. C’est vers le nord de l’Egypte, au
Fayoum en particulier, que le silex est le moins abondant, mais les
cailloux du diluvium peuvent le remplacer, et les indigènes en ont
tiré un très bon parti.
Villages
Tombea
ux
Ces nécropoles d’un type tout spécial ont très longtemps passé
inaperçues et elles semblent en effet, au premier abord, fort difficiles
à reconnaître. C’est avec le jour frisant du soir ou du matin qu’on
peut le mieux distinguer ces groupes de dépressions très légères, à
peine perceptibles en plein soleil, qui sont à la surface plus ou moins
inégale du terrain le seul indice extérieur des tombeaux archaïques.
Les sépultures sont de simples fosses creusées dans les bancs de
cailloux roulés qui s’étendent au pied de la montagne et qui forment
un terrain suffisamment consistant pour qu’il ne fût pas nécessaire
de soutenir, au moyen d’un mur ou d’un enduit, les bords de
l’excavation: leur forme générale est irrégulière, à peu près ovale ou
même presque ronde, et leur profondeur d’un mètre à deux au plus,
tandis que l’ouverture dépasse à peine un mètre cinquante dans sa
plus grande dimension. A côté de celles-là il en existait de plus
grandes, à peu près rectangulaires et atteignant jusqu’à quatre
mètres sur deux, sans que la profondeur en soit augmentée. Après
l’ensevelissement, les grandes comme les petites fosses étaient
simplement comblées avec du sable et des galets et se confondaient
avec le terrain environnant; il n’y a jamais la moindre superstructure,
pas même une pierre tombale.
Les dimensions des petites tombes, qui sont de beaucoup les plus
nombreuses, ne permettaient pas d’y déposer le mort étendu tout de
son long, comme on le fit plus tard pour les momies aux époques
historiques; les coutumes funéraires étaient en effet très différentes
et nous pouvons distinguer deux stages, deux modes
d’ensevelissement qui semblent correspondre à deux périodes. Dans
les plus anciennes sépultures, le mort est couché sur le côté gauche,
dans la position dite embryonnaire ou assise, c’est-à-dire avec les
membres repliés de manière que les mains se trouvent devant la
figure, les genoux à la hauteur de la poitrine et les pieds près du
bassin. Etant donnée l’orientation des tombeaux, qui du reste n’est
pas partout rigoureusement exacte, la tête est généralement au sud
la face tournée vers l’ouest.
Fig. 32. Tombeau prédynastique
(d’après J. de Morgan. Recherches sur les origines de l’Egypte, II, fig.
468).
ebookbell.com