
Exponential families

Peter D. Hoff
September 26, 2013

Much of this content comes from Lehmann and Casella [1998] section 1.5.

Contents
1 The canonical exponential family

2 Basic results

1 The canonical exponential family

Construction of an exponential family of densities


Exponential families are classes of probability measures constructed from

1. a dominating measure $\nu$, and
2. a statistic $t(X)$.

Let

- $(\mathcal{X}, \mathcal{A})$ be a measurable space,
- $\nu$ be a measure on $\mathcal{A}$,
- $t : \mathcal{X} \to \mathbb{R}^s$.

For $\eta \in \mathbb{R}^s$, define the measure
$$\nu_\eta(A) = \int_A e^{\eta^T t(x)} \, \nu(dx), \quad A \in \mathcal{A},$$
and let
$$A(\eta) = \log \nu_\eta(\mathcal{X}) = \log \int e^{\eta^T t(x)} \, \nu(dx).$$

If $A(\eta) < \infty$, we can define a probability measure $P_\eta$ on $(\mathcal{X}, \mathcal{A})$ via its density w.r.t. $\nu$:
$$p(x|\eta) = e^{\eta^T t(x) - A(\eta)}, \quad x \in \mathcal{X},$$
$$P_\eta(A) = \int_A p(x|\eta) \, \nu(dx).$$

Note that

- $P_\eta(\mathcal{X}) = 1$ by construction, and so $(\mathcal{X}, \mathcal{A}, P_\eta)$ is a probability space.
- $P_\eta$ is absolutely continuous w.r.t. $\nu$, with Radon-Nikodym density $p(x|\eta)$.
- We can construct such a density for each $\eta \in \mathbb{R}^s$ for which $\int e^{\eta^T t(x)} \, \nu(dx)$ is finite.
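For example (a Poisson family): take $\mathcal{X} = \{0, 1, 2, \ldots\}$, $t(x) = x$, and $\nu$ the measure with $\nu(\{x\}) = 1/x!$. Then
$$A(\eta) = \log \sum_{x=0}^{\infty} \frac{e^{\eta x}}{x!} = e^{\eta} < \infty \quad \text{for every } \eta \in \mathbb{R},$$
and the density w.r.t. $\nu$ is $p(x|\eta) = \exp(\eta x - e^{\eta})$, i.e. the Poisson$(\lambda)$ distribution with $\lambda = e^{\eta}$ and mass function $e^{-\lambda}\lambda^x/x!$ (the $1/x!$ having been absorbed into $\nu$).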
Definition 1 (canonical exponential family). Let

- $(\mathcal{X}, \mathcal{A}, \nu)$ be a measure space,
- $t : \mathcal{X} \to \mathbb{R}^s$ be an $s$-dimensional statistic that does not satisfy any linear constraints,
- $A(\eta) = \log \int e^{\eta^T t(x)} \, \nu(dx)$.

A collection of densities given by
$$\{p(x|\eta) = \exp(\eta^T t(x) - A(\eta)) : \eta \in H\}, \quad \text{where } H = \{\eta : A(\eta) < \infty\},$$
is called an $s$-dimensional exponential family.

Notes:

- The set $H = \{\eta : A(\eta) < \infty\}$ is called the natural parameter space.
- Each density $p(x|\eta)$ defines a measure $P_\eta$ via $P_\eta(A) = \int_A p(x|\eta) \, \nu(dx)$.
- We say that the measures $\{P_\eta : \eta \in H\}$ have a common dominating measure $\nu$.

Minimal, full and curved exponential families


"Doesn't satisfy a linear constraint" means
$$\nexists \, a \in \mathbb{R}^s : a \neq 0, \ a^T t(x) = c \ \ \forall x \in \mathcal{X}.$$
Some authors do not include this "no linear constraints" requirement for the statistic $t$. If $t$ does satisfy a linear constraint, the natural parameter space includes points that correspond to the same density and probability distribution. As a result, the parameter will be non-identifiable (in the natural parameter space):

Definition 2. A model $\mathcal{P} = \{p(x|\eta) : \eta \in H\}$ for $(\mathcal{X}, \mathcal{A})$ is non-identifiable if there exist $\eta_1, \eta_2 \in H$ such that $\eta_1 \neq \eta_2$ but $P(A|\eta_1) = P(A|\eta_2)$ for all $A \in \mathcal{A}$.

Exercise: Show that if $t$ satisfies a linear constraint and $H$ is the parameter space, then the exponential family model is non-identifiable.

Most authors refer to an exponential family model (EFM) in which $t$ does not satisfy a linear constraint as a minimal parametrization. Since a non-minimal representation can always be made minimal, and the recommendation is always to do so, it seems simplest just to require minimality in the definition.

Definition 3 (full rank). If the parameter space for an exponential family contains an $s$-dimensional open set, then the family is called full rank.

An exponential family that is not full rank is generally called a curved exponential family, as typically the parameter space is a curve in $\mathbb{R}^s$ of dimension less than $s$.
Examples
Often an exponential family model is parameterized as
$$\mathcal{P} = \{p(x|\theta) = h(x) \exp\{\eta(\theta)^T t(x) - B(\theta)\} : \theta \in \Theta\}.$$
This is done

- if the parameter $\theta$ is more interpretable than $\eta$, or
- so that the dominating measure can be something simple.
Example (normal model):
The univariate normal model on $(\mathbb{R}, \mathcal{B}(\mathbb{R}))$ can be represented with the class of densities $\{p(x|\theta, \sigma^2) : \theta \in \mathbb{R}, \sigma^2 \in \mathbb{R}^+\}$ w.r.t. Lebesgue measure, where
$$p(x|\theta, \sigma^2) = (2\pi\sigma^2)^{-1/2} \exp(-(x - \theta)^2/[2\sigma^2])$$
$$= (2\pi)^{-1/2} \exp\Big(-x^2 \tfrac{1}{2\sigma^2} + x \tfrac{\theta}{\sigma^2} - \tfrac{\theta^2}{2\sigma^2} - \tfrac{1}{2}\log\sigma^2\Big).$$
This is the same model as $p(x|\eta) = (2\pi)^{-1/2} \exp(\eta^T t(x) - A(\eta))$, where
$$t(x) = \begin{pmatrix} x \\ x^2 \end{pmatrix}, \qquad \eta(\theta, \sigma^2) = \begin{pmatrix} \theta/\sigma^2 \\ -1/(2\sigma^2) \end{pmatrix}, \qquad A(\eta) = (\theta^2/\sigma^2 + \log\sigma^2)/2.$$
To reparameterize back, note that $\theta = -\eta_1/(2\eta_2)$ and $\sigma^2 = -1/(2\eta_2)$.
What is the natural parameter space? Does it correspond to $(\theta, \sigma^2) \in \mathbb{R} \times \mathbb{R}^+$? Recall,
$$H = \Big\{(\eta_1, \eta_2) : \int e^{\eta_1 x + \eta_2 x^2} \, dx < \infty\Big\}.$$
Convince yourself that $H = \mathbb{R} \times \mathbb{R}^-$, which gives $(\theta, \sigma^2) \in \mathbb{R} \times \mathbb{R}^+$.
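To see this, complete the square in the exponent: for $\eta_2 < 0$,
$$\int e^{\eta_1 x + \eta_2 x^2} \, dx = e^{-\eta_1^2/(4\eta_2)} \int \exp\!\Big(\eta_2 \big(x + \tfrac{\eta_1}{2\eta_2}\big)^2\Big) dx = e^{-\eta_1^2/(4\eta_2)} \sqrt{\pi/(-\eta_2)} < \infty,$$
while for $\eta_2 \geq 0$ the integral diverges. Including the $(2\pi)^{-1/2}$ factor from the dominating measure, this also gives the explicit formula $A(\eta) = -\eta_1^2/(4\eta_2) - \tfrac{1}{2}\log(-2\eta_2)$, which agrees with the expression for $A$ above after substituting $\eta_1 = \theta/\sigma^2$ and $\eta_2 = -1/(2\sigma^2)$.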


The exponential family model defined by $t(x) = (x, x^2)$ and $\eta \in H$ is the normal model. The normal model with $(\theta, \sigma^2) \in \mathbb{R} \times \mathbb{R}^+$ is a two-dimensional full rank exponential family.
Example (a curved normal model):
Consider the normal model having the following mean-variance relationship:
$$X \sim \text{normal}(\theta, \theta^2), \quad \theta \in \mathbb{R}.$$
Let $\mathcal{P} = \{p(x|\theta, \sigma^2) : \theta \in \mathbb{R}, \sigma^2 = \theta^2\}$, where $p(x|\theta, \sigma^2)$ are the normal densities given above. The densities in this model can be written
$$p(x|\theta) = (2\pi\theta^2)^{-1/2} \exp(-(x - \theta)^2/[2\theta^2])$$
$$\propto_x \exp(-(x^2 - 2x\theta + \theta^2)/[2\theta^2])$$
$$= \exp(x/\theta - x^2/[2\theta^2] - 1/2)$$
$$\propto_x \exp(x/\theta - x^2/[2\theta^2])$$
$$\equiv \exp(\eta_1 t_1(x) + \eta_2 t_2(x)).$$
Since $t(x) = (x, x^2)$ doesn't satisfy a linear constraint, this is a two-dimensional exponential family. The natural parameter space corresponding to $t(x)$ is $\mathbb{R} \times \mathbb{R}^-$. Our reduced parameter space is $\{\eta(\theta) = (1/\theta, -1/[2\theta^2])\}$, a one-dimensional curve in two-dimensional space. Draw a picture.

This family is a two-dimensional exponential family (in minimal form). It is not a full rank exponential family.
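Concretely, for $\theta \neq 0$ the natural parameters satisfy
$$\eta_1 = 1/\theta, \qquad \eta_2 = -1/(2\theta^2) = -\eta_1^2/2,$$
so the reduced parameter space is the parabola $\{(\eta_1, -\eta_1^2/2) : \eta_1 \neq 0\}$ inside $\mathbb{R} \times \mathbb{R}^-$, which contains no two-dimensional open set.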
Example (multinomial model):
Let $X \sim \text{multinomial}(n, \theta)$, for which
$$\Theta = \{\theta \in \mathbb{R}^p_+ : \textstyle\sum_j \theta_j = 1\} \quad \text{and} \quad \mathcal{X} = \{x \in \{0, 1, \ldots, n\}^p : \textstyle\sum_j x_j = n\}.$$
The density of $P_\theta$ w.r.t. counting measure on $\mathcal{X}$ is
$$p(x|\theta) = \binom{n}{x} \theta_1^{x_1} \cdots \theta_p^{x_p}.$$
We can rewrite this in canonical exponential form as
$$p(x|\eta) = \exp(x_1 \eta_1 + \cdots + x_p \eta_p),$$
where $\eta_j = \log \theta_j$ and the dominating measure is
$$\nu(\{x\}) = \binom{n}{x}, \quad x \in \mathcal{X},$$
i.e. the multinomial coefficient has been absorbed into the dominating measure.
The parameter space for this model is $\tilde{H} = \{\eta \in \mathbb{R}^p : \sum_j e^{\eta_j} = 1\}$, which is a $(p-1)$-dimensional curve in $\mathbb{R}^p$.

Is the multinomial model a $p$-dimensional curved exponential family? Note that $1^T t(x) = n$ for all $x \in \mathcal{X}$, so this family

- doesn't satisfy our definition, or, if you prefer,
- is not in minimal form.
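To see the resulting non-identifiability concretely, note that with $t(x) = x$ the natural parameter space is all of $\mathbb{R}^p$ (the sum below is over the finite set $\mathcal{X}$), and by the multinomial theorem
$$A(\eta) = \log \sum_{x \in \mathcal{X}} \binom{n}{x} e^{\eta^T x} = n \log \sum_{j=1}^{p} e^{\eta_j}.$$
Hence for any $c \in \mathbb{R}$, $A(\eta + c1) = A(\eta) + cn$ and
$$p(x|\eta + c1) = \exp\big(\eta^T x + cn - A(\eta) - cn\big) = p(x|\eta),$$
so distinct natural parameters give the same distribution.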

Consider the usual parameterization again, but now express the model in terms of $t(x) = (x_1, \ldots, x_{p-1})$:
$$p(x|\theta) = \binom{n}{x} \theta_1^{x_1} \cdots \theta_{p-1}^{x_{p-1}} \theta_p^{\,n - \sum_{j=1}^{p-1} x_j} = \binom{n}{x} \theta_p^{\,n} \prod_{j=1}^{p-1} \Big(\frac{\theta_j}{\theta_p}\Big)^{x_j} = \binom{n}{x} \exp(\eta_1 x_1 + \cdots + \eta_{p-1} x_{p-1} - A(\eta)),$$
where $\eta_j = \log(\theta_j/\theta_p)$ and $A(\eta)$ can be computed as follows:
$$\theta_j = \theta_p e^{\eta_j}$$
$$1 - \theta_p = \theta_p \sum_{j=1}^{p-1} e^{\eta_j}$$
$$\theta_p = \frac{1}{1 + \sum_{j=1}^{p-1} e^{\eta_j}}$$
$$A(\eta) = -n \log \theta_p = n \log\Big(1 + \sum_{j=1}^{p-1} e^{\eta_j}\Big).$$

Thus the multinomial model is a $(p-1)$-dimensional exponential family generated by the statistic $t(x) = (x_1, \ldots, x_{p-1})$.

Does the parameter space correspond to $H$? Here
$$H = \Big\{\eta \in \mathbb{R}^{p-1} : \sum_{x \in \mathcal{X}} \exp\{\eta_1 x_1 + \cdots + \eta_{p-1} x_{p-1}\} < \infty\Big\} = \mathbb{R}^{p-1},$$
since the sum is over the finite set $\mathcal{X}$. This of course contains a $(p-1)$-dimensional open rectangle, and so the multinomial model is a full rank $(p-1)$-dimensional exponential family.

2 Basic results

Convexity of H:
The largest EFM based on a statistic $t(x)$ is the one based on the natural parameter space: for any other parameter space $\tilde{H} \subseteq H$, we have $\{p(x|\eta) : \eta \in \tilde{H}\} \subseteq \{p(x|\eta) : \eta \in H\}$.

The natural parameter space is usually (but not always) open, making this fullest family also full rank. It is always the case that $H$ is convex, and that $A(\eta)$ is convex on $H$.

Theorem 1. The natural parameter space $H$ for densities of the form $p(x|\eta) = \exp(\eta^T t(x) - A(\eta))$ is convex, and $A(\eta)$ is convex on $H$.
Proof. Recall Hölder's inequality: for $a \in [0, 1]$, $b = 1 - a$, and nonnegative functions $f$ and $g$,
$$\int fg \leq \Big(\int f^{1/a}\Big)^a \Big(\int g^{1/b}\Big)^b.$$
Now let $\eta_1, \eta_2 \in H$ and apply the inequality:
$$e^{A(a\eta_1 + b\eta_2)} = \int \exp((a\eta_1 + b\eta_2)^T t(x)) \, \nu(dx) = \int e^{a\eta_1^T t} e^{b\eta_2^T t} \, \nu(dx)$$
$$\leq \Big(\int e^{\eta_1^T t} \, \nu(dx)\Big)^a \Big(\int e^{\eta_2^T t} \, \nu(dx)\Big)^b = e^{aA(\eta_1) + bA(\eta_2)} < \infty,$$
and so $a\eta_1 + b\eta_2 \in H$. Taking logs gives $A(a\eta_1 + b\eta_2) \leq aA(\eta_1) + bA(\eta_2)$, so $A(\eta)$ is convex.

Continuity, integration and differentiation


The following theorem is useful in a variety of contexts:

Theorem 2 (LC 5.8). For any integrable function $f$, the expected value function $E[f|\eta]$,
$$E[f|\eta] = \int f(x) \exp(\eta^T t(x) - A(\eta)) \, \nu(dx),$$
is, at any $\eta$ in the interior of $H$,

1. continuous as a function of $\eta$,
2. differentiable w.r.t. $\eta$ to all orders,
3. with derivatives that can be obtained by differentiating the integrand.

The first item is used in two key results in estimation and testing:

- In estimation, the theorem implies that risk functions for exponential family models are continuous. This will help us characterize all admissible estimators for such models.
- In testing, the theorem implies that the power function of any test is continuous. This will help us characterize unbiased testing procedures.

An important application of the theorem is the calculation of moments of $t$.
By definition, $e^{A(\eta)} = \int e^{\eta^T t} \, \nu(dx)$. Taking derivatives w.r.t. $\eta$ gives
$$\frac{d}{d\eta} e^{A(\eta)} = \frac{d}{d\eta} \int e^{\eta^T t} \, \nu(dx)$$
$$A'(\eta) \, e^{A(\eta)} = \int t \, e^{\eta^T t} \, \nu(dx)$$
$$A'(\eta) = \int t \, e^{\eta^T t - A(\eta)} \, \nu(dx) = E[t(X)|\eta].$$
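For example, writing the normal family's normalizing function in terms of $\eta$ as $A(\eta) = -\eta_1^2/(4\eta_2) - \tfrac{1}{2}\log(-2\eta_2)$,
$$\frac{\partial A}{\partial \eta_1} = -\frac{\eta_1}{2\eta_2} = \theta = E[X|\eta],$$
and in the $(p-1)$-dimensional multinomial family, $A(\eta) = n \log(1 + \sum_k e^{\eta_k})$ gives
$$\frac{\partial A}{\partial \eta_j} = \frac{n e^{\eta_j}}{1 + \sum_k e^{\eta_k}} = n\theta_j = E[x_j|\eta].$$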

More generally,

Theorem 3 (Barndorff-Nielsen (1978), thm 8.1). Let $\mathcal{P} = \{p(x|\eta) = \exp(\eta^T t - A(\eta)) : \eta \in H\}$ be an exponential family and $\eta \in \text{int}\, H$. Then
$$\frac{\partial^{k_1 + \cdots + k_s}}{\partial \eta_1^{k_1} \cdots \partial \eta_s^{k_s}} \, e^{A(\eta)} = \int t_1^{k_1}(x) \cdots t_s^{k_s}(x) \, e^{\eta^T t(x)} \, \nu(dx)$$
for all $k_1, \ldots, k_s \geq 0$.
This result helps us with the moment generating function.

Moment generating function:
$$M_t(u_1, \ldots, u_s) = E[e^{u^T t}|\eta]$$
$$= \int e^{(\eta + u)^T t - A(\eta)} \, \nu(dx)$$
$$= e^{A(\eta + u) - A(\eta)} \int e^{(\eta + u)^T t - A(\eta + u)} \, \nu(dx)$$
$$= e^{A(\eta + u) - A(\eta)}.$$
This works as long as $\eta$ is in the interior of $H$ and $u$ is small enough so that $\eta + u \in H$. From this, we can use the above theorem to show
$$\frac{\partial^{k_1 + \cdots + k_s}}{\partial u_1^{k_1} \cdots \partial u_s^{k_s}} M_t(u) \Big|_{u=0} = E[t_1^{k_1} \cdots t_s^{k_s}|\eta].$$
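In particular, $A(\eta + u) - A(\eta)$ is the cumulant generating function of $t(X)$, so second derivatives of $A$ give covariances:
$$\frac{\partial^2 A(\eta)}{\partial \eta_j \, \partial \eta_k} = \text{Cov}[t_j(X), t_k(X) \,|\, \eta].$$
For the normal family above, for instance, $\partial^2 A/\partial \eta_1^2 = -1/(2\eta_2) = \sigma^2 = \text{Var}[X|\eta]$.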

References
E. L. Lehmann and George Casella. Theory of point estimation. Springer Texts in Statistics.
Springer-Verlag, New York, second edition, 1998. ISBN 0-387-98502-6.
