A more systematic study of the quality of the inference is shown
in Tab.
,
Table:
Proportion
of infected in a population, inferred from
the number
of positives in a sample of
individuals. The three blocks
of the table corresponds to the assumptions summarized by
and
,
,
.
|
which reports the
inferred value of
, summarized by the expected value and its standard
deviation evaluated by sampling, as a function
of the sample size and the number of positives in the sample.
The three blocks of the
table correspond to our typical hypotheses on the knowledge of
sensitivity and specificity, and summarized, from top to bottom,
by
,
and
,
corresponding then to the cases shown, in the same order, in
Figs.
-
(we have added an extra column with the numbers of
positives yielding
).
We see that, from columns 2 to 6, we get
ranging from
to
at steps of
, with
standard uncertainty varying with
and
(and therefore with
the fraction of positives
) in agreement with what
we have learned in Sec.
, studying the
predictive distributions (note the difference between resolution
power, used there, and standard uncertainty, used here).
We note that, instead, the results of the first column is
“not around zero, as expected” (naively).
The reason is very simple and it is illustrated in
Fig.
for the case of
.
Figure:
Inference of
from
and
.
 |
It is true that, if there were no infected in the population,
then we would expect
(with a standard uncertainty of 220),
but the distribution of
provided by the inference cannot have
a mean value zero, simply because negative values of
are
impossible.51Obviously the smaller is the number of positives in the sample
and more peaked is the distribution of
close to 0. But what happens
if, for
,
is much smaller of 1150?
This interesting case will be the subject of the next subsection.