last updated: January 23, 2025

Bayesian Statistics

\CustomizeMathJax

1.7 Families with random parameters

In this section we are interested to take a model family (Mθ)θΠ and treat the parameter θ as a random variable, which will be denoted by a capital letter Θ. We think of first sampling the value of Θ and then (using whatever value we obtain) taking a sample X from MΘ. The resulting distribution is sometimes known as a compound or mixture distribution. We will have a detailed discussion of how this idea becomes useful in Section 2.1. For now let us note that it increases the range of models that we have available.

To make sense of the idea, let us state it more precisely. We want random variables X and Θ such that X|{Θ=θ}=dMθ. In this section we show that a pair (X,Θ) with this property is given by the distribution

(1.13)P[XA,ΘB]=BP[MθA]fΘ(θ)dθ.

where (Mθ) is a family of distributions with range RRn, as defined in Section 1.3, and fΘ is a probability density function with range ΠRd. This is a type of random variable you may not have seen before. We will shortly show that the Θ part is a continuous random variable, but the X part might be discrete or continuous, depending on (Mθ).

Our notation strongly suggests that we expect fΘ to be the (marginal) probability density function of Θ, and we can confirm this by setting A=Rn, in which case equation (1.13) becomes P[ΘB]=BfΘ(θ). We can also find the marginal distribution of X, by setting B=Rd, giving

P[XA]=P[XA,ΘRd]=RdP[MθA]fΘ(θ)dθ,

but that formula doesn’t really explain what is going on here. The relationship that we are interested in comes from the following lemma.

  • Lemma 1.7.1 Let (Mθ) and (X,Θ) have distribution given by (1.13). Suppose that fΘ(θ)>0 and that tfΘ(t) is continuous at t=θ. Then X|{Θ=θ}=dMθ.

Proof: () We give a sketch proof to illustrate the idea, in similar style to Lemma 1.6.1. From Lemma 1.5.1, for ARd we have

P[X|{|Θθ|ϵ}A]=P[|Θθ|ϵ,XA]P[|Θθ|ϵ]=θϵθ+ϵP[MtA]fΘ(t)dtθϵθ+ϵfΘ(t)dt

The second equality follows from (1.13) with B=[θϵ,θ+ϵ], for the numerator, and from the fact that fΘ is the p.d.f. of Θ, for the denominator. Using continuity, from the statement of the lemma and from Assumption 1.3.2, for |θt|ϵ we can approximate fΘ(t)fΘ(θ) and P[MtA]P[MθA]. This gives

P[X|{|Θθ|ϵA]θϵθ+ϵP[MθA]fΘ(θ)dtθϵθ+ϵfΘ(θ)dt=2ϵfΘ(θ)P[MθA]2ϵfΘ(θ)=P[MθA].

Letting ϵ0 we have P[X|{|Θθ|ϵA]P[MθA], so by Definition 1.2.1 and (1.9) we have that X|{Θ=θ} is well defined and X|{Θ=θ}=dMθ.   ∎