Bayesian Statistics
4.3 Conjugate pairs and the exponential family
For continuous Bayesian models, a very general form of conjugacy is known. We’ll restrict here to continuous models with one unknown parameter. We need to introduce the exponential family of
distributions. This term does not refer to the exponential distribution, but to a much wider class. We say that a real valued random variable is from the exponential family of distributions with parameter
if it has a p.d.f. in the form
Here and are arbitrary functions, with the restriction . Many of the families of distributions that you are familiar with can be fitted into this mould, including the normal, exponential,
gamma, chi-squared, and beta distributions.
Take a Bayesian model with model family given by (4.3), where
both and take values in . That is,
for . We’ll focus on the version of this model that takes independent items of data, in which case
The prior distribution that provides a conjugate pair to this model is given by
where and are parameters Note that this distribution is a specialized version of the form in (4.3), and that the function must be the same as in (4.5). Theorem 3.1.2
allows us to compute the posterior density
Therefore the Bayesian update in this case, from the parameters in (4.6) to (4.7) is that
-
Taking , and obtains the in (4.4). Making the same choice for , along with
and , obtains that distribution in (4.6). Putting these choices for and into (4.7) and applying Lemma 1.2.5 gives that the posterior is , as we already knew from Lemma 4.2.1.
-
There is also a version of this framework for multiple parameters, in which
and are row vectors and is a column vector, and multiplication of these quantities is done via the dot product. We won’t write down the details of that case here. There is also a version that
applies to discrete Bayesian models, but we won’t detail that either.