Which generator?

Imagine two (pseudo-) random number generators: $ H_1$, Gaussian with mean 0 and standard deviation 1, and $ H_2$, also Gaussian, but with mean 0.4 and standard deviation 2 (see figure 9).

A program chooses at random, with equal probability, $ H_1$ or $ H_2$; then the chosen generator produces a number that, rounded to the 7th decimal digit, is $ x_E=0.3986964$. The question is: from which random generator does $ x_E$ come?

At this point the problem is rather easy to solve, provided we know the probability of each generator to yield $ x_E$. These probabilities are

\begin{eqnarray*}
P(x_E\,\vert\,H_1,I) & = & 3.68\times 10^{-8} \hspace{1.0cm} (\mbox{1 in $\approx$ 27 million}) \\
P(x_E\,\vert\,H_2,I) & = & 1.99\times 10^{-8} \hspace{1.0cm} (\mbox{1 in $\approx$ 50 million})\,,
\end{eqnarray*}
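These numbers can be reproduced by multiplying each Gaussian density at $ x_E$ by the width $ 10^{-7}$ of the rounding interval. A minimal sketch in Python (the helper name `gauss_pdf` is mine, not from the text):

```python
import math

def gauss_pdf(x, mu, sigma):
    """Gaussian probability density function."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

x_E = 0.3986964
dx = 1e-7  # width of the rounding interval (7th decimal digit)

p1 = gauss_pdf(x_E, 0.0, 1.0) * dx  # P(x_E | H1) ~ 3.68e-8
p2 = gauss_pdf(x_E, 0.4, 2.0) * dx  # P(x_E | H2) ~ 1.99e-8
print(p1, p2)
```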

from which we can calculate the Bayes factor and the weight of evidence:

\begin{eqnarray*}
\tilde O_{1,2}(x_E,I) & = & 1.85
\hspace{1.0cm} \Rightarrow \hspace{1.0cm} \Delta\mbox{JL}_{1,2}(x_E,I) = +0.27\,.
\end{eqnarray*}
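Note that the $ 10^{-7}$ rounding width cancels in the ratio, so the Bayes factor is simply the ratio of the two densities, and $ \Delta$JL its base-10 logarithm. A quick numerical check (helper name assumed, as above):

```python
import math

def gauss_pdf(x, mu, sigma):
    """Gaussian probability density function."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

x_E = 0.3986964
# The rounding width cancels in the ratio of the two probabilities:
bayes_factor = gauss_pdf(x_E, 0.0, 1.0) / gauss_pdf(x_E, 0.4, 2.0)
delta_JL = math.log10(bayes_factor)  # weight of evidence in judgement leanings

print(round(bayes_factor, 2), round(delta_JL, 2))
```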

Therefore the observation of $ x_E$ provides slight evidence in favor of $ H_1$, regardless of the fact that this generator has very little probability of yielding $ x_E$: it has very little probability of yielding any particular number.

Figure: Which random number generator has produced $ x_E$? Which hypothesis do the points indicated by `$ \times $' favor?

What matters when comparing hypotheses is never, stated in general terms, the absolute probability $ P(E\,\vert\,H_i,I)$. In particular, it makes no sense to say that `` $ P(H_i\,\vert\,E,I)$ is small because $ P(E\,\vert\,H_i,I)$ is small''. As a consequence, from a consistent probabilistic point of view, it makes no sense to test a single, isolated hypothesis using `funny arguments', such as how far $ x_E$ lies from the peak of $ f(x\,\vert\,H_i)$, or how large the area below $ f(x\,\vert\,H_i)$ is from $ x=x_E$ to infinity. In particular, if two models give exactly the same probability of producing an observation, like the two points indicated by `$ \times $' in fig. 9, the evidence provided by that observation is absolutely irrelevant [ $ \Delta $JL$ _{1,2}($`$ \times $'$ )=0$].
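The positions of the two equal-probability points can be found by equating the two densities, which, after taking logarithms, reduces to a quadratic equation. A sketch of the calculation (the explicit quadratic and the helper `gauss_pdf` are my own working, not given in the text):

```python
import math

def gauss_pdf(x, mu, sigma):
    """Gaussian probability density function."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

# Setting f(x|H1) = f(x|H2) and taking logs gives the quadratic
#   3 x^2 + 0.8 x - (0.16 + 8 ln 2) = 0
a, b, c = 3.0, 0.8, -(0.16 + 8 * math.log(2))
disc = math.sqrt(b * b - 4 * a * c)
x_lo, x_hi = (-b - disc) / (2 * a), (-b + disc) / (2 * a)

# At the crossing points the Bayes factor is exactly 1 (Delta JL = 0)
for x in (x_lo, x_hi):
    print(x, gauss_pdf(x, 0.0, 1.0) / gauss_pdf(x, 0.4, 2.0))
```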

To become a bit more familiar with the weight of evidence in favor of either hypothesis provided by different observations, the following table, reporting the Bayes factors and JL's due to the integers between $ -6$ and $ +6$, might be useful.

\begin{tabular}{rll}
\hline
$x_E$ & $\tilde O_{1,2}(x_E)$ & $\Delta$JL$_{1,2}(x_E)$ \\
\hline
$-6$ & $5.1\times 10^{-6}$ & $-5.3$ \\
$-5$ & $2.9\times 10^{-4}$ & $-3.5$ \\
$-4$ & $7.5\times 10^{-3}$ & $-2.1$ \\
$-3$ & $9.4\times 10^{-2}$ & $-1.0$ \\
$-2$ & $0.56$ & $-0.3$ \\
$-1$ & $1.5$ & $0.2$ \\
$0$ & $2.0$ & $0.3$ \\
$1$ & $1.3$ & $0.1$ \\
$2$ & $0.37$ & $-0.4$ \\
$3$ & $5.2\times 10^{-2}$ & $-1.3$ \\
$4$ & $3.4\times 10^{-3}$ & $-2.5$ \\
$5$ & $1.0\times 10^{-4}$ & $-4.0$ \\
$6$ & $1.5\times 10^{-6}$ & $-5.8$ \\
\hline
\end{tabular}
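The table entries can be reproduced in a few lines; as before, the helper `gauss_pdf` is an assumed name, not from the text:

```python
import math

def gauss_pdf(x, mu, sigma):
    """Gaussian probability density function."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

table = {}
for x in range(-6, 7):
    o12 = gauss_pdf(x, 0.0, 1.0) / gauss_pdf(x, 0.4, 2.0)  # Bayes factor O_12
    table[x] = (o12, math.log10(o12))                       # (O_12, Delta JL)

for x, (o12, jl) in table.items():
    print(f"{x:+d}  {o12:10.2e}  {jl:+5.1f}")
```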
As we see from this table, and as we better understand from figure 9, numbers that are large in absolute value favor $ H_2$, and very large ones favor it strongly. Instead, the numbers lying in the interval defined by the two points marked in the figure by a cross provide evidence in favor of $ H_1$. However, while individual pieces of evidence in favor of $ H_1$ can only be weak (the maximum of $ \Delta $JL is about 0.3, reached around $ x=0$; at $ x=-0.13$, to be precise, $ \Delta $JL reaches 0.313), those in favor of the alternative hypothesis can sometimes be very large. It follows that one gets convinced of $ H_2$ more easily than of $ H_1$.

We can check this with a little simulation. We choose a model, draw 50 random numbers and analyze the data as if we did not know which generator produced them, considering $ H_1$ and $ H_2$ equally likely a priori. We expect that, as the extractions go on, the pieces of evidence accumulate until we possibly reach a level of practical certainty. Obviously, the individual pieces of evidence do not all provide the same $ \Delta $JL, and even the sign can fluctuate, although we expect more positive contributions if the points are generated by $ H_1$, and the other way around if they come from $ H_2$. Therefore, as a function of the number of extractions, the accumulated weight of evidence follows a kind of asymmetric random walk (imagine the JL indicator fluctuating as the simulated experiment goes on, but drifting on average in one direction).
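A minimal sketch of such a simulation might look as follows (the function and helper names are my own; the text gives no code):

```python
import math
import random

def gauss_pdf(x, mu, sigma):
    """Gaussian probability density function."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def jl_trajectory(true_gen, n=50, rng=random):
    """Accumulated Delta JL_{1,2} after each of n extractions from the chosen generator."""
    mu, sigma = (0.0, 1.0) if true_gen == 1 else (0.4, 2.0)
    jl, path = 0.0, []
    for _ in range(n):
        x = rng.gauss(mu, sigma)
        jl += math.log10(gauss_pdf(x, 0.0, 1.0) / gauss_pdf(x, 0.4, 2.0))
        path.append(jl)
    return path

random.seed(1)
path = jl_trajectory(true_gen=1)
print(path[-1])  # typically drifts toward positive values when H1 is the true generator
```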

Figure: Combined weights of evidence in simulated experiments. The upper (blue) combined JL sequences have been obtained from generator $ H_1$, as can be recognized from the fact that they tend to large positive values as the number of extractions increases. The lower ones were generated by $ H_2$.

Figure 10 shows 200 inferential stories, half per generator. We see that, in general, we become practically sure of the model after a couple of dozen extractions. But there are also cases in which we need to wait longer before we can feel sufficiently confident about one hypothesis.

It is interesting to remark that the leaning in favor of each hypothesis grows, on average, linearly with the number of extractions. That is, a little piece of evidence, on average positive for $ H_1$ and negative for $ H_2$, is added after each extraction. However, around the average trend there is a large variety of individual inferential histories. They all start at $ \Delta $JL$ =0$ for $ n=0$, but in practice no two `trajectories' are identical. All together they form a kind of `fuzzy band', whose `effective width' also grows with the number of extractions, but not linearly: the width grows as the square root of $ n$. This is the reason why, as $ n$ increases, the bands tend to move away from the line JL$ =0$. Nevertheless, individual trajectories can exhibit very `irregular' behaviors, as we can also see in figure 10.
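The linear growth of the average leaning and the $ \sqrt{n}$ growth of the band width can be checked numerically. The following sketch (names and sample sizes are my own choices) compares many $ H_1$ trajectories at $ n$ and $ n/4$ extractions: the mean should scale by a factor of about 4, the spread by a factor of about 2:

```python
import math
import random
import statistics

def gauss_pdf(x, mu, sigma):
    """Gaussian probability density function."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def one_step_jl(mu, sigma):
    """Delta JL_{1,2} contributed by a single extraction from N(mu, sigma)."""
    x = random.gauss(mu, sigma)
    return math.log10(gauss_pdf(x, 0.0, 1.0) / gauss_pdf(x, 0.4, 2.0))

random.seed(7)
n_traj, n_steps = 500, 100
finals = []
for _ in range(n_traj):
    steps = [one_step_jl(0.0, 1.0) for _ in range(n_steps)]  # H1 is the true generator
    finals.append((sum(steps[:n_steps // 4]), sum(steps)))   # JL at n/4 and at n

quarters = [q for q, f in finals]
fulls = [f for q, f in finals]
mean_q, mean_f = statistics.mean(quarters), statistics.mean(fulls)
sd_q, sd_f = statistics.stdev(quarters), statistics.stdev(fulls)

print(mean_f / mean_q)  # close to 4: the mean grows linearly with n
print(sd_f / sd_q)      # close to 2: the spread grows as sqrt(n)
```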

Giulio D'Agostini 2010-09-30