Taking into account the uncertainty on $\pi_1$ and $\pi_2$

As we have seen in Sec. [*], the way to take into account all possible values of $\pi_1$ and $\pi_2$, using the rules of probability theory, consists in evaluating the following integral
$\displaystyle f(n_P\,\vert\,n_s,p_s)$ $\displaystyle =$ $\displaystyle \int_0^1\!\!\int_0^1\!f(n_P\,\vert\,n_s,p_s,\pi_1,\pi_2)\cdot
f(\pi_1)\cdot f(\pi_2)\,$d$\displaystyle \pi_1$d$\displaystyle \pi_2\,.$ (48)

Before tacking the problem of how to evaluate this integral, a very important remark on how we are going to model the uncertainty about $\pi_1$ and $\pi_2$ is in order. Going back to the practical issue of evaluating the integral, we use again Monte Carlo methods, employing e.g. the R script provided in Appendix B.2, for the case of $n_s=10000$ and $p_s=0.1$. The result, shown in the bottom plot of Fig. [*],
Figure: Probabilistic prediction of the numbers of positives, based on a hypothetical test on 10000 individuals, exactly 1000 of them being infected. In the upper plot we use $\pi_1=0.978$ and $\pi_2=0.115$. In the lower plot we take into account their possible variability (see text). The over-imposed curve shows a Gaussian with average 2013 and standard deviation 200, values obtained by the approximated Eqs. ([*]) and ([*]). (The top histogram is repeated, with enlarged horizontal scale, in Fig. [*].)
\begin{figure}\begin{center}
\centering {\epsfig{file=PredictionPositive.eps,cl...
...width=0.9\linewidth}}
\mbox{}\vspace{-0.4cm}\mbox{}
\end{center}
\end{figure}
is quite impressive, compared to the top one, in which the precise values $\pi_1=0.978$ and $\pi_2=0.115$ were used. The mean of the distribution is unchanged, as more or less expected (see Sec. [*]), but its standard deviation, which quantifies the uncertainty of the prediction, increases by more than a factor six. We have then good reasons to expect a similar effect when we will be interested in the `reverse' problem, that is inferring the number of infectees in the sample from the resulting number of positives. Going into details, we see that the expected number of positives is essentially the same of Sec. [*] (the reduction from 2060 to 2013 is simply due to the new reference values for $\pi_1$ and $\pi_2$ we are using starting from Sec. [*]). But this number is now accounted by an uncertainty, which rises to about 10% of its value, when the uncertainties about $\pi_1$ and $\pi_2$ are also taken into account.



Subsections