Our results are in excellent agreement with the published
ones, if the latter are properly interpreted. In fact, it turns out that
they coincide with the modes of the respective
pdf
,
although we maintain that if a single number has to be provided,
especially to the media, it should be
the mean of the distribution.
This has, in fact, the meaning of the probability
that a vaccinated person will be shielded by the virus,
taking into account the unavoidable uncertainty on
, fully described by
. And this
is what really matters to define the efficacy of a vaccine.
Willing however to reduce the result of our analysis to a single
number to be compared
with the released ones, we get respectively
and with reasonable rounding, 93%, 94%, 94%, 86% and 60% for
Moderna-1, Moderna-2, Pfizer,
AstraZeneca (LDSD) and AstraZeneca (SDSD),
versus 94.5%, 94.1%, 95.0%, 90.0% and 62.1%
of Tab.
.
Therefore, as far as these numbers are concerned, there is
then a substantial agreement of the outcome of
our analysis with the published results, simply because
when a probability distribution is unimodal and
rather symmetrical then mode and mean tend to coincide.
Therefore, with respect to the main results,
our contribution to this point is mainly methodological.
The probability theory based result is, instead,
at odds with Moderna 100% claimed efficacy against severe disease,
for which a more sound 92% should be quoted.
In order to summarize more effectively the probability distribution
of with just a couple of numbers,
our preference goes to mean and standard deviation,
although we also report the bounds of the
central 95% credible interval.
This interval is, once more, in excellent
agreement not only with the Pfizer result, which
has also published an interval having exactly the same meaning,
but also with the uncertainty intervals of the other companies,
although they provide confidence intervals,
which, strictly speaking, do not have the same meaning
of the credible intervals. This is not a surprise to us.
We are in fact aware that in many practical cases not only
frequentistic point estimates are equivalent to the mode of the posterior
distribution of the model parameter, if a uniform prior was
used in a Bayesian analysis based on the same data,
but also `95% confidence intervals'
tend to be, numerically, equal to the 95% probable
intervals.17
This takes us to the question of the priors. As just reminded,
a uniform
prior over has been used in our analysis.
But, clearly, not because we believe that the efficacy of
a vaccine that has reached the Phase-3 trial
has the same chance to be close to zero or to one.
Instead, a flat prior can be considered a convenient practical choice,
if the inference is dominated by the data, as it
is often the case.
Moreover, the advantage of a uniform prior in
parametric inference is that the effect of an informative prior
reflecting the opinion of experts
can be taken into account at a later time.
This `posterior use of priors' might sound
paradoxical, but it is important to remind that in Bayesian
inference `prior' does not indicate time order but rather `based on
the status of knowledge without taking into account the new piece of information'
provided by the data entering the specific analysis.
Having, in fact, prior and likelihood symmetric and peer roles
in the Bayes' rule,
an expert can use her prior to `reshape' the posterior pdf
resulting from data analysis, if a flat prior was used,
without having to ask to repeat the analysis (see Ref. [17]
for details).
This reshaping becomes particularly simple if the prior is modeled
by a convenient, rather flexible probability distribution such as
the Beta. In fact, as we have seen, the pdf of starting from
a flat prior tends to resemble a Beta. The same is then true if also
the prior is modeled by a Beta (this is related to the well known fact
of the Beta being the conjugate prior of a binomial distribution,
even though our model is not just a simple binomial). These observations
are particularly interesting because they lead to the
expressions (
) - (
), which,
together with Eqs. (
) - (
),
allow to take easily into account the expert priors.
In fact, if the priors are
rather vague,
and
appearing in
Eqs. (
) - (
) are quite
small (although larger than one, since
and
are à priori reasonably ruled out) and, in particular,
smaller that
and
.18If, instead, the expert has
a strong opinion about the possible values of
, then
and
will play a role in her posterior,
and in that of her community, if its members trust
her.19
Coming back to the way to summarize
,
our preference goes to its mean and standard deviation.
The mean because, as reminded above, has the meaning of efficacy
for vaccine treated people not having been involved in the trial,
if all possible values of
are taken into account.
The standard deviation because it is mostly convenient,
together with the mean, to make use of the result
of the inference in further considerations and
in `propagation of uncertainties',
thanks to general probability rules.
We have just reminded the utility of mean and standard deviation in order to
re-obtain
, under the hypothesis that it is
almost a Beta distribution, making use of
Eqs. (
) - (
).
The application related to `propagation of uncertainties'
that we have seen in the paper has to do with predicting
the number of individual that will get infected in a group
that it is going to be vaccinated.
This is a problem in probabilistic forecasting and the
number of interest is uncertain for several reasons.
There is, unavoidably, the uncertainty deriving from the inherent
binomial distribution, having assumed an assault probability
in the new population.
But also the uncertainties on the values of
and
play a role, that can even be dominant
with respect to the `statistical' effect of the binomial.
Now, the probability distribution of the number of
vaccinated infectees can be evaluated extending our
basic Bayesian network, as we have done here.
But we have also stressed the importance of having
approximated expressions, based on linearization,
for its expected value and standard deviation.
And such expressions, thus obtained considering
and
as `systematic' [14], depend then on their
mean and standard deviation. For
example the contribution to
due to the uncertain
, and then
to be added `in quadrature' to the other sources of uncertainty,
is given by
E
.
This gives at a glance the contribution to the global uncertainty
without having to run a Monte Carlo.20
Finally, a comment on how to possible reduce
is in order. In fact, the relative uncertainty on
depends on the small number of vaccinated infectees.
This suggests that the quality of its `measurement'
could be improved, keeping constant the total numbers of
individuals entering the trial,
if the size of the placebo group is reduced.
We have checked by simulation that reducing it by 2/3,
thus having about a factor of five between the two groups,
is expected to be reduced by about 20%.
Not much indeed, but this different sharing of individuals
in the two groups would have the advantage
of increasing the chance of detecting side effects of the vaccine,
basically at the same cost.