Inference from a data set and sequential use of Bayes formula

Next: Multidimensional case Up: Inferring numerical values of Previous: Poisson model

Inference from a data set and sequential use of Bayes formula

In the elementary examples shown above, the inference has been done from a single data point

. If we have a set of observations (data), indicated by ${\mbox{\boldmath$d$}}$ , we just need to insert in the Bayes formula the likelihood $p({\mbox{\boldmath$d$}}\,\vert\,\theta ,I)$ , where this expression indicates a multi-dimensional joint pdf.

Note that we could think of inferring $\theta$ on the basis of each newly observed datum . After the one observation:

$\begin{displaymath} p(\theta \,\vert\,d_1,I) \propto p(d_1 \,\vert\,\theta,I) \, p(\theta \,\vert\, I) \end{displaymath}$

(45)

and after the second:

$\displaystyle p(\theta \,\vert\,d_1,d_2,I)$	$\textstyle \propto$	$\displaystyle p(d_2 \,\vert\,\theta,d_1,I) \, p(\theta \,\vert\,d_1,I)$	(46)
	$\textstyle \propto$	$\displaystyle p(d_2 \,\vert\,\theta,d_1,I) \, p(d_1 \,\vert\,\theta,I) \, p(\theta \,\vert\, I)\,.$	(47)

We have written Eq. (47) in a way that the dependence between observables can be accommodated. From the product rule in Tab. 1, we can rewrite Eq. (47) as

$\displaystyle p(\theta \,\vert\,d_1,d_2,I)$

$\textstyle \propto$

$\displaystyle p(d_1,d_2 \,\vert\,\theta,I) \,p(\theta \,\vert\, I)\,.$

(48)

Comparing this equation with (47) we see that the sequential inference gives exactly the same result of a single inference that properly takes into account all available information. This is an important result of the Bayesian approach.

The extension to many variables is straightforward, obtaining

$\displaystyle p({\mbox{\boldmath$\theta$}}\,\vert\,{\mbox{\boldmath$d$}}, I)$

$\textstyle \propto$

$\displaystyle p({\mbox{\boldmath$d$}}\,\vert\,{\mbox{\boldmath$\theta$}},I) \, p({\mbox{\boldmath$\theta$}}\,\vert\,I) \,.$

(49)

Furthermore, when the

are independent, we get for the likelihood

$\displaystyle p({\mbox{\boldmath$d$}} \,\vert\,{\mbox{\boldmath$\theta$}},I)$	$\textstyle =$	$\displaystyle \prod_i p(d_i \,\vert\,{\mbox{\boldmath$\theta$}},I)$	(50)
$\displaystyle {\cal L}({\mbox{\boldmath$\theta$}}; {\mbox{\boldmath$d$}})$	$\textstyle =$	$\displaystyle \prod_i {\cal L}({\mbox{\boldmath$\theta$}}; d_i) \, ,$	(51)

that is, the combined likelihood is given by the product of the individual likelihoods.

Next: Multidimensional case Up: Inferring numerical values of Previous: Poisson model

Giulio D'Agostini 2003-05-13