A number of confounding factors affect the results, which we
investigate here. For reference, the results before adjustments are shown in
Figure 2.
Figure 2. Deaths per million for countries using widespread early
HCQ versus those that do not,
before adjustments.
Age.
The COVID-19 IFR varies around four
orders of magnitude depending on age. Since the proportion of older adults
varies significantly between countries, this is likely to have a significant
effect on the results
[Leffler]. We approximate the relative risk
based on age using the infection fatality rates provided in
[Verity],
and shown in
Figure 3. Due to the distribution, simple adjustment based
on the median age, the proportion of people over 65, or similar may not be
very accurate. We obtained age demographics from
[United Nations] which
provides a breakdown within 5 year age groups. Using the 9 age groups provided
by
[Verity], we computed an age adjustment factor for each country to
normalize the observed deaths to the predicted number of deaths if the
country's age distribution matched that of the country with the oldest
population. The age distributions and computed age factors are provided in
Appendix 1. These adjustments are relatively significant as in
[Leffler].
Gender.
Risk differs significantly based
on gender
[Gebhard], so we also normalized for this in a similar
fashion. Data is from
[United Nations], and using the hazard ratio of 1.78 from
[Williamson] the resulting adjustment factors are shown in
Appendix 1. These adjustments are relatively minor as in
[Leffler]. After adjusting for age and gender we obtain the results in
Figure 4. Adusted mean treatment and control deaths become
267.8 per million and 889.8 per million,
relative risk 0.30.
Figure 4. Deaths per million for countries with widespread early
HCQ versus those that do not, after adjustment for differences in
demographics.
Early isolation and masks.
Many countries
have taken an isolation approach, isolating themself from the world quickly
and aggresively preventing any spread. With a very small and unknown fraction
of the population infected, we can not easily analyze these countries. Many of
these countries have also not taken a strong position on HCQ use. Mask usage
was analyzed in
[Leffler], which found 29 countries that widely and
quickly adopted masks, as shown in
Appendix 12. These countries in
general took swift action with interventions and travel restrictions in order
to prevent spread and have significantly lower spread of the virus to date. We
excluded countries on this list, this excluded South Korea, Czech Republic,
Indonesia, and Venezuela, which were provisionally identified as countries
using early HCQ. This favors the control group. If we do not exclude these
countries, the treatment group adjusted mean deaths is
258.5 per million, and the relative risk decreases to
0.29.
Population health.
Health conditions
such as diabetes, obesity, and hypertension significantly increase the risk of
death with COVID-19
[Gao, Williamson]. This could affect the
results because the prevalence of these conditions differs between countries.
These conditions often occur together, for example
[Iglay] found the
most common comorbid conditions for diabetes were hypertension (82%) and
obesity (78%), which makes combined country-level adjustment difficult,
however we can first analyze the conditions individually. We examined the
relationship of the diabetes, obesity, and hypertension levels with the
adjusted deaths per million for the countries in our study, with data from
[International Diabetes Federation],
[CIA], and
[Mills] respectively.
Appendix 2,
Appendix 3, and
Appendix 4 show scatter plots, and the data can be found in
Appendix 1. There was no significant correlation for diabetes,
r2 = 0.12, obesity,
r2 = 0.07, or hypertension,
r2 = 0.08.
Based on this we do not expect adjustments to significantly affect
the results. We re-ran the analysis adjusting for each of these factors
individually (HR estimates: diabetes 1.63
[Williamson],
obesity 1.4
[Williamson], hypertension
2.12
[Gao (B)]), which resulted in a relative risk of
0.313, 0.309,
0.316 respectively for diabetes, obesity, and
hypertension. We also examined life expectancy with data from
[Our World in Data (B)].
Appendix 5 shows a
scatter plot and the data can be found in
Appendix 1. The
correlation,
r2 = 0.00, is
relatively low, and is in the direction of higher life expectancy resulting in
higher deaths. Therefore we do not find evidence that country-level
differences in health have a significant effect on the results.
Testing.
Countries with more
widespread testing could potentially be more successful in minimizing deaths.
We examined the relationship of testing per capita with adjusted deaths, with
data from
[Our World in Data (C)].
Appendix 11
shows a scatter plot, and the data can be found in
Appendix 1.
The correlation
r2 =
0.01, is very low and is also in
the opposite direction of the expected potential correlation (we find that
more testing is correlated with higher deaths). Therefore differences in
testing do not appear to significantly affect the results.
BCG vaccine.
Research suggests that the BCG
vaccine may provide some protection against COVID-19
[Escobar]. A
correlation was shown between a country's BCG vaccine use and mortality,
although causation has not been established
[de Freitas e Silva, Escobar, Hegarty, Sharquie], and more recent analysis found
the correlation was no longer significant
[Lindestam Arlehamn]. We
examined the correlation between the adjusted deaths and the mean BCG vaccine
coverage as defined by
[Escobar].
Appendix 7 shows the scatter
plot for the BCG vaccine coverage and the adjusted deaths per million and the
data is shown in
Appendix 1. The correlation
r2
= 0.04 is low. Excluding countries with a BCG
vaccine coverage below 50 (5 countries) reduces the
correlation,
r2 = 0.01.
Re-running the analysis in this case results in a relative risk of
0.23, i.e., the treatment group has
77.1% lower chance of death. Therefore we do not find
evidence that differences in BCG vaccine use significantly affect the results.
Co-administered treatments.
Several
theories exist for why HCQ is effective
[Andreani, Brufsky, Clementi, de Wilde, Derendorf, Devaux, Fantini, Grassin-Delyle, Hoffmann, Hu, Keyaerts, Kono, Liu, Pagliano, Savarino, Savarino (B), Scherrmann, Sheaff, Vincent, Wang, Wang (B)],
some of which involve co-administration of other medication or suppplements.
Most commonly used are zinc
[Derwand, Shittu] and Azithromycin (AZ)
[Guérin].
In vitro experiments report a synergistic effect of HCQ
and AZ on antiviral activity
[Andreani] at concentrations obtained in
the human lung, and
in vivo results are consistent with this
[Gautret]. Zinc reduces SARS-CoV RNA-dependent RNA polymerase
activity
in vitro [te Velthuis], however it is difficult to obtain
significant intracellular concentrations with zinc alone
[Maret].
Combining it with a zinc ionophore such as HCQ increases cellular uptake,
making it more likely to achieve effective intracellular concentrations
[Xue]. Zinc deficiency varies and inclusion of zinc may be more or
less important based on an individual's existing zinc level. Zinc consumption
varies widely based on diet
[NIH].
To the extent that the co-administration of zinc, Azithromycin,
or other medication or supplements is important, we may underestimate the
effectiveness of HCQ because not all countries and locations are using the
optimal combination.
Treatment regimen.
There are
differences in treatment regimens between and within countries. Details of
timing, determination of risk, and dosages differ. Because not all locations
are using the optimal regimen, this may reduce the effect observed.
Adherence.
Some people in the control
group obtained the treatment. This may reduce the effect observed.
Seasonality.
Seasonality could affect
results, although
[Jamil] show there is currently little evidence for
a large temperature dependence. We also note that the pandemic already covers
more than one season and over time is likely to cover all seasons.
Accuracy of death statistics.
The
accuracy of reported death statistics varies across and within countries.
Excess death statistics may be used in the future if they become available for
more countries, however it may be difficult to separate deaths due to COVID-19
and changes to other causes of death related to interventions.
Degree of spread.
The virus has spread
throughout countries at different rates, due to differences in the initial
number of infected persons arriving at the country, differences in treatments,
population dynamics, cultural differences, and interventions including masks,
social distancing, lockdowns, quarantine, and border restrictions. This factor
is likely to be significant but will decline over time. Since it is unlikely
that the virus will be eliminated soon, we expect that increasingly similar
percentages of people will have been exposed over time, and we will update
this analysis periodically to reflect the latest data. While interventions can
temporarily slow the spread of the virus, it is unlikely that high
intervention levels can be sustained indefinitely. Some countries, such as New
Zealand, have highly contained the virus to date, essentially by quickly
isolating themselves from the world with travel restrictions and strictly
enforced quarantine rules. These countries may avoid significant spread while
they remain isolated, however all of the countries in the treatment and
control groups here have experienced significant spread of the virus.
We tested the effect of interventions using the average
intervention stringency index
[University of Oxford] over the period analyzed,
as provided by
[Our World in Data (E), Our World in Data (F)].
Appendix 9 shows
a scatter plot, the correlation
r2 =
0.00, suggesting that the differences in
non-medical interventions have a relatively minor affect on the results at
present.
The treatment group countries generally show significantly
slower growth in mortality which may be due to treatment, interventions,
differences in culture, or the initial degree of infections arriving into the
country. Over time we expect that increasingly similar percentages of people
will have been exposed, since it is unlikely that the virus will be eliminated
soon.
To account for future spread, we created an estimate of the
future adjusted deaths per million for each country, 90 days
in the future, based on a second degree polynomial fit according to the most
recent 30 days, enforcing the requirement that deaths do not
decrease, and using an assumption of a progressively decreasing maximum
increase over time.
Figure 5 shows the results, which predicts a
future relative risk of 0.30, i.e., the
treatment group has 69.9% lower chance of
death.
Figure 5. Demographic adjusted deaths per
million for countries using widespread early HCQ versus those that do not,
with an extended prediction for the following 90
days.
Introduction.
CQ and HCQ are
4-aminoquinoline synthetic alternatives to quinine, a naturally occurring
compound found in cinchona bark, which has long been used for respiratory
infections and other conditions
[Burrows]. The cost of HCQ is around
$0.28 per dose according to
[Centers for Medicare and Medicaid Services]. CQ, HCQ, and quinine are on the
World Health Organization's list of essential medicines, the safest and most
effective medicines needed in a health system
[World Health Organization].
Theory, in vitro, and ex vivo results.
Several
in vitro studies
[Andreani, Clementi, de Wilde, Hoffmann, Keyaerts, Kono, Liu, Savarino, Sheaff, Vincent, Wang, Wang (B)] show that CQ
inhibits related viruses and SARS-CoV-2, supported by several related theory
articles
[Brufsky, Derendorf, Devaux, Fantini, Hu, Pagliano, Savarino (B), Scherrmann]. Theories for the mechanism of action
include HCQ/CQ protonation and accumulation in the endosome which prevents the
acidification required for genome release
[Fitch]; acting as an
ionophoric agent that transports zinc ions into infected cells, where they
inhibit viral RNA replicase enzyme
[Xue]; dampening excess immune
responses thereby mitigating the hyperactive immune response sometimes
associated with COVID-19
[Schrezenmeier]; and inhibiting oxidative
phosphorylation in mitochondria, likely by sequestering protons needed to
drive ATP synthase
[Sheaff].
[Savarino (B, 2003)] reviews the antiviral effects of CQ, noting that CQ inhibits the replication of several viruses including members of the flaviviruses, retroviruses, and coronaviruses. They note that CQ has immunomodulatory effects, suppressing the production/release of tumour necrosis factor α and interleukin 6, which mediate the inflammatory complications of several viral diseases;
[Keyaerts (2004)] show that the IC50 of CQ for inhibition of SARS-CoV
in vitro approximates the plasma concentrations of CQ reached during treatment of acute malaria, suggesting that CQ may be considered for immediate use in the prevention and treatment of SARS-CoV;
[Vincent (2005)] show that CQ has strong antiviral effects on SARS CoV infection when cells are treated either before or after exposure, suggesting prophylactic and treatment use, and describing three mechanisms by which the drug could work;
[Savarino (2006)] in an update to their 2003 paper discuss the
in vitro confirmation of CQ inhibiting SARS replication via two studies, and note that CQ affects an early stage of SARS replication;
[Kono (2008)] showed that CQ inhibits viral replication of HCoV-229E at concentrations lower than in clinical usage;
[de Wilde (2014)] show that CQ inhibits SARS-CoV, MERS-CoV, and HCoV-229E-GFP replication in the low-micromolar range;
[Wang (B, 2/4/20)] showed that CQ (EC50 = 1.13 μM; CC50 > 100 μM, SI > 88.50) potently blocked virus infection at low-micromolar concentration and showed high selectivity
in vitro;
[Devaux (3/12/20)] discusses mechanisms of CQ interference with the SARS-CoV-2 replication cycle;
[Liu (3/18/20)] show that HCQ is effective
in vitro and less toxic than CQ. They note that in addition to the direct antiviral activity, HCQ is a safe and successful anti-inflammatory agent that has been used extensively in autoimmune diseases and can significantly decrease the production of cytokines and, in particular, pro-inflammatory factors. Therefore, in COVID-19 patients, HCQ may also contribute to attenuating the inflammatory response. They note that based on the selectivity index, careful design of clinical trials is important to achieve efficient and safe control of the infection;
[Hu (3/23/20)] note that CQ is known in nanomedicine research for the investigation of nanoparticle uptake in cells, and may have potential for the treatment of COVID-19;
[Pagliano (3/24/20)] note that CQ and HCQ inhibit replication at early stages of infection, that no similar effect is reported for other drugs which are only able to interfere after cell infection, and that there is a large volume of existing data on safety;
[Clementi (3/31/20)] show a greater inhibition for combined pre and post-exposure treatment with Vero E6 and Caco-2 cells;
[Fantini (4/3/20)] ;
[Brufsky (4/15/20)] present a theory on HCQ effectiveness with COVID-19, wherein HCQ blocks the polarization of macrophages to an M1 inflammatory subtype and is predicted to interfere with glycosylation of a number of proteins involved in the humoral immune response, possibly including the macrophage FcR gamma IgG receptor and other immunomodulatory proteins, potentially through inhibition of UDP‐N‐acetylglucosamine 2‐epimerase. In combination with potential other immunomodulatory effects, this could blunt the progression of COVID‐19 pneumonia all to way up to ARDS;
[Andreani (4/25/20)] show that HCQ and AZ have a synergistic effect
in vitro on SARS-CoV-2 at concentrations compatible with that obtained in the human lung;
[Derendorf (5/7/20)] discuss pharmacokinetic properties of HCQ+AZ as a potential underlying mechanism of the observed antiviral effects;
[Grassin-Delyle (5/8/20)] use human lung parenchymal explants, showing that CQ concentration clinically achievable in the lung (100 µM) inhibited the lipopolysaccharide-induced release of TNF-ɑ (by 76%), IL-6 (by 68%), CCL2 (by 72%), and CCL3 (by 67%). In addition to antiviral activity, CQ may also mitigate the cytokine storm associated with severe pneumonia caused by coronaviruses;
[Scherrmann (6/12/20)] propose a new mechanism supporting the synergistic interaction between HCQ+AZ;
[Sheaff (8/2/20)] present a new theory on SARS-CoV-2 infection and why HCQ/CQ provides benefits, which also potentially explains the observed relationships with smoking, diabetes, obesity, age, and treatment delay, and confirms the importance of accurate dosing. Metabolic analysis revealed HCQ/CQ inhibit oxidative phosphorylation in mitochondria (likely by sequestering protons needed to drive ATP synthase), inhibiting infection and/or slowing replication; and
[Wang (9/2/20)] show that CQ and HCQ both inhibit the entrance of 2019-nCoV into cells by blocking the binding of the virus with ACE2.
[Hoffmann] perform an
in vitro study of CQ and
HCQ inhibition of SARS-CoV-2 into Vero (kidney), Vero-TMPRSS2, and Calu-3
(derived from human lung carcinoma) cells. They suggest a lack of
effectiveness, but there appears to be three unsupported steps made to reach
the conclusions in this paper. Firstly, authors conclude that CQ does not
block infection of Calu-3 when the results show statistically
significant inhibition at higher concentrations. Second, authors go from
analysis of one specific type of pulmonary adenocarcinoma cells that resemble
serous gland cells,
in vitro, into a general claim of no inhibition in
lung cells. Thirdly, they disregard existing theories of CQ/HCQ effectiveness
to conclude a general lack of effectiveness.
Calu-3 is one of many cell lines derived from human lung
carcinomas
[Shen]. Calu-3 cells resemble serous gland cells (they do
not express 15-lipoxygenase, an enzyme specifically localized to the surface
epithelium, but they do express secretory component, secretory leukocyte
protease inhibitor, lysozyme, and lactoferrin, all markers of serous gland
cells).
[Shen] note that the absence of systemic inflammation,
circulatory factors, and other paracrine systemic influences is a potential
limitation of the isolated cell system.
[Hoffmann] Fig. 1b @100uM shows CQ results in a ~4.5
fold decrease (note a log scale is used) in extracellular virus, p=0.05, after
24 hours (estimated from the graph). We note that the paper marks this as not
significant because the value is 0.517, however the p value is unlikely to be
accurate to this level. Additionally authors use Dunnett's test while other
tests may have higher power
[Sauder]. We further note that the 95%
significance level is just a convention and results do not magically go from
non-significant at p=0.051 to significant at p=0.049. Results only apply to 24
hours later and we expect further decrease over time. Fig. 1a shows a ~45-50%
entry inhibition @100uM for HCQ/CQ (p=0.0005/0.0045), ~10-30% @10uM
(p=0.13/0.99). Inhibition is significantly better with Vero cells.
There are several theories on how HCQ may help with COVID-19,
and we note that authors do not consider one of the most common theories where
HCQ functions as a zinc ionophore, facilitating significant intracellular
concentrations of zinc. Zinc is known to inhibit SARS-CoV RNA-dependent RNA
polymerase activity, and is widely thought to be important for effectiveness
with SARS-CoV-2
[Shittu].
Animal in vivo studies.
[Keyaerts (B, 2009)] showed that CQ inhibits HCoV-OC43 replication in HRT-18 cells in a mouse study. Lethal HCoV-OC43 infection in newborn C57BL/6 mice was treated with CQ acquired transplacentally or via maternal milk, with the highest survival rate (98.6%) found when mother mice were treated daily with a concentration of 15 mg of CQ per kg of body weight. Survival rates declined in a dose-dependent manner, with 88% survival when treated with 5 mg/kg CQ and 13% survival when treated with 1 mg/kg CQ. They conclude that CQ can be highly effective against HCoV-OC43 infection in newborn mice and may be considered as a future drug against HCoVs;
[Yan (2012)] show that CQ can efficiently ameliorate acute lung injury and dramatically improve the survival rate in mice infected with live avian influenza A H5N1 virus; and
[Maisonnasse (5/6/20)] study treatment in monkeys. They report no effect, however the data has several signs of effectiveness despite the very small sample sizes and 100% recovery of all treated and control monkeys. The final day lung lesion data shows 63% of control monkeys have lesions, while only 26% of treated monkeys do, p=0.095 (missing data for 7 monkeys is predicted based on the day 5 results and the trend of comparable monkeys). After one week, 74% of treated monkeys have recovered with <= 4 log10 copies/mL viral load, compared to 38% of control monkeys, p=0.095. 38% of control monkeys also have a higher peak viral load than 100% of the 23 treated monkeys post-treatment. The group with the lowest peak viral load is the PrEP group. All animals in this study were infected with the same initial viral load, whereas real-world infections vary in the initial viral load, and lower inital viral loads allow greater time to mount an immune response.
Human in vivo studies.
We found
154 studies related to the human
in vivo use of HCQ
for treating COVID-19
[Abd-Elsalam, Abella, Ahmad, Alamdari, Alberici, Almazrou, An, Aparisi, Arleo, Arshad, Ashinyo, Ashraf, Ayerbe, Barbosa, BaŞaran, Berenguer, Bernaola, Bhattacharya, Borba, Boulware, Bousquet, Carlucci, Carlucci (B), Catteau, Cavalcanti, Chamieh, Chatterjee, Chen, Chen (B), Chen (C), Chen (D), Choi, Coll, Cravedi, D'Arminio Monforte, Dabbous, Davido, de la Iglesia, Derwand (B), Derwand (C), Desbois, Di Castelnuovo, Dubee, Dubernet, Elbazidi, Esper, Faíco-Filho, Ferreira, Ferri, Fonseca, Fontana, Fried, Furtado, Gao (B), Gautret, Gautret (B), Geleris, Gendelman, Gentry, Giacomelli, Goenka, Goldman, Goldman (B), Gonzalez, Grau-Pujol, Guisado-Vasco, Gupta, Guérin, Heberto, Heras, Hong, Huang, Huang (B), Huang (C), Huh, Ip, Ip (B), Izoulet, Jiang, Kamran, Kelly, Khan, Khurana, Kim, Kirenga, Komissarov, Konig, Lagier, Lammers, Lano, Laplana, Lauriola, Lecronier, Lee, Lopes, Lopez, Luo, Ly, Lyngbakken, López, Macias, Magagnoli, Mahévas, Martinez-Lopez, McGrail, Membrillo de Novales, Meo, Mikami, Million, Mitchell, Mitjà, Mitjà (B), Molina, Nachega, Núñez-Gil, Okour, Otea, Paccoud, Peters, Pinato, Pirnay, Podder, Rajasingham, RECOVERY, Rentsch, Rivera, Roomi, Rosenberg, Saleemi, Sbidian, Self, Serrano, Sheshah, Shoaibi, Singh, Skipper, Solh, SOLIDARITY, Soto-Becerra, Sulaiman, Synolaki, Sánchez-Álvarez, Tang, Tehrani, Trullàs, Ulrich, Wang (C), Xia, Xue (B), Yu, Yu (B), Zhong, Zhong (B), Ñamendys-Silva]. 86
of these present positive results (of varying degrees and confidence)
[Ahmad, Alamdari, Alberici, Almazrou, Aparisi, Arleo, Arshad, Ashinyo, Ayerbe, Berenguer, Bernaola, Bhattacharya, Boulware, Bousquet, Carlucci, Carlucci (B), Catteau, Chamieh, Chatterjee, Chen (B), Chen (C), Coll, D'Arminio Monforte, Davido, Derwand (B), Derwand (C), Di Castelnuovo, Dubernet, Elbazidi, Esper, Ferreira, Ferri, Fonseca, Fontana, Gao (B), Gautret (B), Goenka, Gonzalez, Guisado-Vasco, Guérin, Heberto, Heras, Hong, Huang, Huang (B), Huang (C), Ip, Izoulet, Jiang, Khan, Khurana, Kim, Lagier, Lammers, Lauriola, Lee, Lopes, Ly, López, Martinez-Lopez, Membrillo de Novales, Meo, Mikami, Million, Mitchell, Nachega, Núñez-Gil, Okour, Otea, Pinato, Pirnay, Sbidian, Serrano, Sheshah, Shoaibi, Sulaiman, Synolaki, Sánchez-Álvarez, Tehrani, Xia, Xue (B), Yu, Yu (B), Zhong, Zhong (B), Ñamendys-Silva], 30 present negative
results (also of varying degrees and confidence)
[An, Barbosa, Borba, Cavalcanti, Chen (D), Choi, Cravedi, Giacomelli, Gupta, Ip (B), Kelly, Komissarov, Lecronier, Magagnoli, Mahévas, Molina, Peters, RECOVERY, Rentsch, Rivera, Roomi, Rosenberg, Saleemi, Self, Singh, Solh, SOLIDARITY, Soto-Becerra, Tang, Ulrich], while the remainder are either inconclusive
or were retracted.
Table 1 shows a distribution of studies based
on treatment time.
Late treatment studies.
Most studies focus on late treatment with hospitalized
patients, and the results are very mixed. We found 55
of the studies reported positive effectiveness, while
29 reported negative effectiveness, both with varying
degrees of effect and confidence. We do not consider the late treatment
studies further here since we are concerned with early treatment, other than
to note that these studies suggest HCQ may potentially be beneficial in a
hospital setting if used very quickly and with patients that have not reached
a more advanced stage of the disease; and it may be of limited or negative
value with later stage disease. Three studies consider higher dosages than
typically used
[Borba, RECOVERY, SOLIDARITY], and the results suggest
that these dosages in late stage patients may be harmful.
Pre-Exposure Prophylaxis (PrEP) studies.
We found 22 PrEP studies
[Abella, Arleo, Bhattacharya, Chatterjee, de la Iglesia, Desbois, Ferreira, Ferri, Gendelman, Gentry, Goenka, Grau-Pujol, Huang, Huh, Khurana, Konig, Laplana, Macias, Mitchell, Rajasingham, Rentsch, Zhong].
Several studies analyze HCQ usage by systemic autoimmune
disease patients. SLE, RA, and other autoimmune conditions are associated with
significantly increased susceptibility to and incidence of infections
[Bouza, Bultink, Herrinton, Iliopoulos, Kim (B), Li, Listing]. For
COVID-19 specifically, research confirms that the risk for systemic autoimmune
disease patients is much higher,
[Ferri] show OR 4.42, p<0.001, which
is the observed real-world risk, taking into account factors such as these
patients potentially being more careful to avoid exposure.
[Arleo] perform a retrospective analysis of hospitalized rheumatic disease patients showing 50% lower mortality for patients on HCQ;
[Goenka] study SARS-CoV-2-IgG antibodies in 1122 health care workers in India, finding 87% lower positives for adequate HCQ prophylaxis, 1.3% HCQ versus 12.3% for no HCQ prophylaxis;
[Abella] report on a very small early-terminated underpowered PrEP RCT with 64/61 HCQ/control patients and only 8 infections, HCQ infection rate 6.3% versus control 6.6%, RR 0.95 [0.25 - 3.64]. There was no hospitalization or death, no significant difference in QTc, no severe adverse events, no cardiac events (e.g., syncope and arrhythmias) observed. Medication adherence was 81%. Therapeutic levels of HCQ may not have been reached by the time of the infection in the first week. 2 infections were reported to be after discontinuation of the medication, but the authors do not specify which arm these were in. Hypothetically, if these were both in the HCQ arm, the resulting RR for treatment would be much lower;
[Gentry] perform a retrospective analysis of patients with rheumatologic conditions showing zero mortality with HCQ, odds ratio OR 0.0, p=0.10. 0 of 10,703 COVID-19 deaths for HCQ patients versus 7 of 21,406 for control patients. COVID-19 cases OR 0.79, p=0.27. There are several significant differences in the propensity matched patients that could affect results, e.g., 20.9% SLE versus 24.7%;
[Rajasingham] show HCQ COVID-19 case HR 0.73, p = 0.12 with a PrEP RCT. This trial was halted after 47% enrollment, p < 0.05 will be reached at ~75% enrollment if similar results continue. HR 0.66/0.68 for full medication adherence (1x/2x dosing). Efficacy for first responders was higher, OR 0.32, p = 0.01. First responders had a much higher incidence, allowing greater power, and reducing the effect of confounders such as misdiagnosis of other conditions or survey issues. Performance is similar to placebo for the first 3 weeks. The effect may be greater with a dosage regimen that achieves therapeutic levels faster. ~40% of participants suspected they might have had COVID-19 before the trial, the effect in people without prior COVID-19 may be higher. Authors note that the trial was underpowered, investigation into more frequent dosing may be warranted, and there was insufficient dosing with no participants achieving more than the
in vitro EC50;
[Grau-Pujol] performed a small PrEP RCT showing that PrEP with HCQ is safe at the dosage used. No deaths, hospitalizations, or serious adverse events occurred. With only one case (in the placebo arm), efficacy was not evaluated;
[Rentsch] perform an observational database study of RA/SLE patients in the UK, HCQ HR 1.03 [0.80-1.33] after adjustments. 70 patients with HCQ prescriptions died. One major problem is that there is no knowlege of medication adherence for these 70 - for example, it is possible that they were all part of the expected percentage of patients that did not take the medication as prescribed, invalidating the result. Both confirmed and suspected deaths were included. It is not clear why the authors did not report the result for only confirmed cases. Other limitations include confounding by use of bDMARDs, confounding by severity of rheumatological disease, and incorrectly classified deaths;
[Laplana] survey 319 autoimmune disease patients taking CQ/HCQ finding 5.3% COVID-19 incidence, compared to a control group from the general population (matched on age, sex, and region, but not adjusted for autoimmune disease), with 3.4% incidence. It not clear why authors did not compare with autoimmune patients not on CQ/HCQ. If we adjust for the different baseline risk, the result would become RR 0.36, p<0.001, suggesting a substantial benefit for HCQ/CQ treatment;
[de la Iglesia] analyze autoimmune disease patients on HCQ, compared to a control group from the general population (matched on age and sex, but not adjusted for autoimmune disease), showing non-significant differences between groups. If we adjust for the different baseline risk, the mortality result becomes RR 0.35, p=0.23, suggesting a substantial benefit for HCQ treatment;
[Ferri] analyze 1641 autoimmune systemic disease (ASD) patients showing csDMARD (HCQ etc.) RR 0.37, p=0.015. csDMARDs include HCQ, CQ, and several other drugs, so the effect of HCQ/CQ alone could be higher. This study also confirms that the risk of COVID-19 for ASD patients in general is much higher, OR 4.42, p<0.001, which is the real-world risk, accounting for factors such as ASD patients potentially being more careful to avoid exposure;
[Khurana] presents a study of hospital health care workers showing HCQ prophylaxis reduces COVID-19 significantly, OR 0.30, p=0.02. 94 positive health care workers with a matched sample of 87 testing negative. The actual benefit of HCQ may be larger because the severity of symptoms are not considered;
[Desbois] retrospectively analyze 199 sarcoidosis patients, showing HCQ RR 0.83, p=1.0;
[Zhong] analyzed 6,228 patients with autoimmune rheumatic diseases with 55 COVID positive members of families exposed to COVID-19, showing that patients on HCQ had a lower risk of COVID-19 than those on other disease-modifying anti-rheumatic drugs with OR 0.09 (0.01–0.94), p=0.044;
[Ferreira] analyze 26,815 patients showing that chronic HCQ treatment (77 patients) provides protection against COVID-19, odds ratio 0.51 (0.37-0.70);
[Huang] analyze 1255 COVID-19 patients in Wuhan Tongji Hospital finding 0.61% with systemic autoimmune diseases, much lower than authors expected (3%–10%). Authors hypothesise that protective factors, such as CQ/HCQ use, reduce hospitalization;
[Bhattacharya] shows PrEP HCQ reduced cases from 38% to 7% with 106 people;
[Chatterjee] shows PrEP HCQ of 4+ doses was associated with a significant decline in the odds of getting infected, along with a dose-response relationship, based on 378 treatment and 373 control cases;
[Konig] analyzed 80 SLE patients diagnosed with COVID-19, showing the frequency of hospitalisation did not differ significantly between individuals using an antimalarial versus non-users. Authors suggest that the dosage used may be too low to reach therapeutic levels;
[Mitchell] analyze COVID-19 amongst 2.4B people, showing a wide counterintuitive disparity between well-developed and less-developed countries, with more affluent countries about one hundred times more likely to be infected and die due to COVID-19. They find the effect is most apparent when comparing to countries with the highest rates of endemic malaria. Since travelers to malaria-endemic countries are likely to be taking antimalarial prophylaxis, authors find the data highly probative for the hypothesis that prophylactic antimalarial use by incoming visitors markedly attenuates a country’s COVID-19 fatality rate. While authors do not adjust for age differences, those adjustments can only account for a small fraction of the observed difference;
[Huh] perform a database analysis of many drugs and COVID-19 cases, with 23 cases taking HCQ, and 251 control patients not taking HCQ, showing OR 1.07, p=0.77, and in multivariable analysis OR 1.48, p=0.086. Patients taking HCQ are most likely taking it for systemic autoimmune diseases where the risk of COVID-19 is much higher. Adjusting the multivariable analysis result for the difference in baseline risk of systemic autoimmune patients results in RR 0.34. Details of the multivarible analysis are not provided for assessment, but the analysis may be significantly affected by overfitting and/or multicollinearity. We note that many results in this study differ from other research, for example proton pump inhibitors show OR 0.47, p<0.001 whereas PPIs are classified as "no expected benefit" and other research suggests they increase risk;
[Gendelman] presents a small study of rheumatic disease/autoimmune disorder patients showing no significant difference without adjusting for baseline risk. Adjusting for the difference in baseline risk using the result in Ferri et al. shows substantial benefit for HCQ, RR 0.211, but with only 3 HCQ cases the result is inconclusive; and
[Macias] analyzes incidence among patients with rheumatic disease, however with only 3 confirmed cases, and not adjusting for significant differences between groups and the expected infection rates based on patient conditions, we consider this study inconclusive.
[Boulware] reports a lack of efficacy due to
statistical significance not being reached, however multiple secondary
analyses show statistically significant and positive results. Due to this
difference, we provide a detailed explanation. The paper shows a 17% reduction
in cases, p=0.35 due to the small sample size - we can say this is
inconclusive, but not negative (it is more likely to be positive than
negative). Authors initially believed 3 days post exposure was the maximum
enrollment delay of interest, however there was a mid-trial modification
extending this to allow an addional day delay. With the original trial
specification, they show a 30% reduction in cases for treatment, p=0.13. If
the trial was not ended early, and if the observed trend continued, p=0.05
would have been reached at ~840 patients total (the original trial
specification was 1,242 patients).
In the supplementary appendix, we can see that COVID-19 cases
are reduced by [49%, 29%, 16%] respectively when taken within ~[70, 94, 118]
hours of exposure (including shipping delay), as shown in
Figure 6.
A priori the most important cases to consider are the treatment
delay-response relationship and the shortest delay to treatment (~70 hours on
average in this case). The shortest delay to treatment is significant @94%
versus all placebo. By simulation, assuming that cases occur randomly
according to the observed frequency, we found the probability that the results
follow the observed beneficial delay-reponse relationship is 0.2%
[CovidAnalysis]. Since we have performed 2 tests, conservative
Bonferroni adjustment
[Jafari] gives us
p = 0.004. The efficacy
of treatment has also been shown in multiple other secondary analyses
[Luco, Watanabe, Wiseman].
A priori we expect an effective treatment here to be
more effective when administered sooner
[Cohen]. Extrapolating the
treatment delay-response trend suggests 93% reduction in cases for immediate
treatment, of course we have little confidence in this prediction, however it
would be consistent with the data and can not be ruled out.
The effectiveness found is even more notable considering the
limitations of the study. Treatment was relatively late, with enrollment up to
4 days after exposure, and an unspecified shipping delay. While the paper does
not provide shipping details, the study protocol gives some information. While
not clear, it indicates no shipping on the weekends and a possible 12pm cutoff
for same day dispensing and mailing, from which we estimate the treatment
delay as ~70 to 140 hours after exposure on average for the 1-4 days since
enrollment specified in the paper (we will update this when authors respond to
our request for details). There was only 75% medication adherence, including
16% who did not take the medication at all, so the actual effectiveness is
likely to be higher. The study relies on Internet surveys, and false surveys
were received (identified by 555 numbers), suggesting there could be
additional unidentified false entries.
The accompanying editorial to this paper also notes that in a
small-animal model of SARS-CoV-2
[Sheahan], prevention of infection or
more severe disease was observed only when the antiviral agent was given
before or shortly after exposure
[Cohen]. Research also shows that the
placebo used in the US (folate) may be protective for COVID-19
[Acosta-Elias]. More details on this analysis can be found in
[CovidAnalysis].
[Mitjà] perform a highly delayed PEP treatment study
which suggests efficacy but lacks statistical significance due to the small
number of cases. Death rates reduced from 0.6% to 0.4%, RR 0.71, not
statistically significant due to low incidence (8 control cases, 5 treatment
cases).
Enrollment was up to 7 days after exposure and the treatment
delay in this study is unclear, without details of the exposure event timing
or medication dispensing. They appear to identify index cases based on the
date of a positive test for a contact, which is likely to be much later than
the actual exposure time. Due to quarantine at the time and likely
cohabitation of a majority of the contacts, it is likely that the actual
exposure time was significantly earlier. 13.1% of patients already tested
positive at baseline, which is consistent with the actual exposure time being
significantly earlier. Nasopharyngeal viral load analysis is subject to test
unreliability and temporo-spatial differences in viral shedding
[Wang (D)]. PCR testing has a very high false-negative rate in early
stages (e.g., 100% on day 1, 67% on day 4, and 20% on day 8
[Kucirka],
hence it is likely that a much higher percentage were infected at an unknown
time before enrollment.
Given the enrollment delay, PCR test delay, and PCR false
negative rate at early stages, the treatment delay in general for this study
was very long and could be over 2 weeks.
This study focuses on the existence of symptoms or PCR-positive
results, however severity of symptoms is more important. Research has shown
HCQ concentrations may be much higher in the lung compared to plasma
[Browning], which may help minimize the occurrence of severe cases and
death. The outcome analyzed here may not be highly relevant to the goal of
reducing mortality. For positive symptomatic cases, they find RR=0.89,
favoring treatment but not statistically significant. The RR for non-PCR
positive at baseline is 0.74, which is consistent with earlier treatment being
more effective. A greater effect is seen for nursing home residents, RR=0.49,
possibly because the exposure events are identified faster in this context,
versus home exposure where testing of the source may be more delayed. There is
a treatment-delay response relationship consistent with an effective
treatment.
The paper does not mention zinc. Zinc deficiency in Spain has
been reported at 83%
[Olza], this may significantly reduce
effectiveness to the extent that zinc is important for the success of HCQ
treatment.
The definition of COVID-19 symptoms is very broad - just
existence of a headache alone or muscle pain alone was considered COVID-19.
There was an overall very low incidence of confirmed COVID-19 (138 cases
across both arms). There were no serious adverse events that were adjudicated
as being treatment related. Authors exclude those with symptoms in the
previous two weeks, however, those with symptoms up to several months before
may still test PCR-positive even though there may be no viable virus. There
appears to be inaccurate data in the paper. Table 2, secondary outcomes,
control, hospital/vital records shows that 8 of 1042 is 9.7%.
In summary, this study appears positive in the context of very
delayed treatment and the small number of cases.
Early treatment studies.
We found 26 early treatment studies
[Ahmad, Ashraf, Chen (B), Derwand (B), Derwand (C), Elbazidi, Esper, Fonseca, Gautret, Gautret (B), Guérin, Heras, Hong, Huang (C), Ip, Izoulet, Kirenga, Lagier, Ly, Meo, Million, Mitjà (B), Otea, Pirnay, Skipper, Sulaiman] which all show some degree of
effectiveness.
[Fonseca] show 64% lower hospitalization with HCQ. Retrospective 717 patients in Brazil with early treatment, adjusted OR 0.32, p=0.00081, for HCQ versus no medication, and OR 0.45, p=0.0065, for HCQ vs. anything else;
[Derwand (B)] ;
[Derwand (C)] performs a retrospective analysis of 518 patients (141 treated, 377 control) showing that early treatment with HCQ+AZ+Z results in 84% lower hospitalization and 80% lower death - hospitalization OR 0.16 (p<0.001), death OR 0.2 (p=0.16);
[Sulaiman] perform a prospective analysis of 5,541 patients in Saudi Arabia showing adjusted HCQ mortality OR 0.36, p = 0.012, and adjusted HCQ hospitalization OR 0.57, p < 0.001;
[Kirenga] retrospectively analyze 56 patients in Uganda, 29 HCQ and 27 control, showing 25.6% faster recovering with HCQ, 6.4 vs. 8.6 days (p = 0.20). There was no ICU admission, mechanical ventilation, or death;
[Heras] perform a retrospective analysis of 100 confirmed COVID-19 elderly nursing home patients, median age 85, showing HCQ+AZ mortality 11.4% versus control 61.9%, RR 0.18, p<0.001. Details of differences between groups are not provided, and no adjustments are made. Authors indicate treatment was early but do not specify the treatment delay;
[Elbazidi] analyze US states and countries. For countries they find a significant correlation between the dates of decisions to adopt/decline HCQ, and corresponding trend changes in CFR. For US states they find a significant correlation between CFR and the level of acceptance of HCQ;
[Ip] perform a retrospective analysis of 1,274 outpatients, finding a 47% reduction in hospitalization with HCQ with propensity matching, HCQ OR 0.53 [0.29-0.95]. Sensitivity analyses revealed similar associations. Adverse events were not increased (2% QTc prolongation events, 0% arrhythmias);
[Ly] perform a retrospective analysis of retirement homes with 1690 elderly residents (226 infected, 116 treated, mean age 83), showing HCQ+AZ >= 3 days resulted in 41% lower mortality (15.5% vs. 26.4%), OR = 0.37, p=0.02. Detection via mass screening also showed significant improvements (16.9% vs. 40.6%, OR = 0.20, p=0.001), suggesting that earlier detection and treatment is more successful;
[Hong] showed that HCQ 1-4 days from diagnosis was the only protective factor against prolonged viral shedding found, OR 0.111, p=0.001. 57.1% viral clearance with 1-4 days delay vs. 22.9% for 5+ days delayed treatment. Authors report that early administration of HCQ significantly ameliorates inflammatory cytokine secretion and that COVID-19 patients should be administrated HCQ as soon as possible. 42 patients with HCQ 1-4 days from diagnosis, 48 with HCQ 5+ days from diagnosis;
[Lagier] analyzed 3,737 patients showing that early treatment leads to significantly better clinical outcome and faster viral load reduction with matched sample mortality HR 0.41 p=0.048;
[Chen (B)] showed significantly faster clinical recovery and shorter time to RNA negative (from 7.0 days to 2.0 days (HCQ), p=0.01 with 67 mild/moderate cases;
[Otea] showed HCQ+AZ appears to reduce serious complications and death with 80 patients;
[Pirnay] analyze 68 very high risk nursing home residents, median age 86, using HCQ+AZ early treatment within 2.5 days onset, showing significantly less mortality than other nursing homes in France and the same as the median death for the same period in 2019/2018;
[Guérin] performed a small retrospective study with 88 patients and found mean recovery time reduced from 26 days to 9 days with HCQ+AZ, p<0.0001 or to 13 days with AZ, including a case control analysis with matched patients;
[Ahmad] treated 54 patients in long term care facilities with 6% death, compared to 22% using a naive indirect comparison;
[Million] showed HCQ+AZ is safe and results in a low fatality rate with a retrospective analysis of 1,061 patients;
[Ashraf] concluded that HCQ improved clinical outcome with a small limited trial of 100 patients in Iran;
[Izoulet] compares the dynamics of daily deaths in the 10 days following the 3rd death in countries using and not using [H]CQ. They show dramatically lower death in [H]CQ countries, but do not attempt to account for other differences between the countries;
[Esper] analyzed 636 patients showing HCQ+AZ reduced hospitalization 79% when used within 7 days (65% overall);
[Gautret (B)] presented a pilot study suggesting improvement with HCQ+AZ and recommending further study;
[Huang (C)] analyzed 22 patients with all CQ patients discharged by day 14 versus 50% of Lopinavir/Rotinavir patients, and all CQ patient's pneumonia improved, versus 75% of Lopinavir/Rotinavir patients.; and
[Gautret] in an early and small trial with significant limitations, showed that HCQ was associated with viral load reduction and that this was enhanced with AZ.
[Gautret] also performed an early and small trial, showing that HCQ was associated with viral load reduction and that this was enhanced with AZ, however this study has significant limitations
[Machiels, Rosendaal]. In addition,
[Risch] presents an updated meta analysis that includes several studies that are currently unpublished. 7 new studies of high-risk outpatients are reported, for a total of 12 studies, all showing significant benefit.
[Mitjà (B)] present a randomized trial of 293 low-risk
patients with no deaths, no serious adverse events, and no statistically
significant improvements. There was a 25% reduction in hospitalization and 16%
reduction in the median time to symptom resolution for HCQ, without
statistical significance due to small samples. However, this paper has
inconsistent data - some of the values reported in Table 2 and the abstract
correspond to 12 control hospitalizations, while others correspond to 11
control hospitalizations, hence we are unsure of other data reported here.
This paper also does not specify the treatment delay, reporting only an
enrollment delay of up to 120 hours post symptoms, plus an additional
unspecified delay where medication was provided to patients at the first home
visit. They do not break down results by treatment delay. Undetectable viral
load was changed to 3 log10 copies/mL potentially partially masking
effectiveness. For viral load with nasopharyngeal swabs, we note that viral
activity in the lung may be especially important for COVID-19, and that HCQ
concentration in the lung may be significantly higher (for example, about 30
times blood concentration in
[Chhonker]). Nasopharyngeal viral load
analysis is subject to test unreliability and temporo-spatial differences in
viral shedding
[Wang (D)]. Viral detection by PCR does not equate to
viable virus
[Academy of Medicine]. PCR testing does not distinguish
between live virus and fragments of dead virus cells, which may take months to
clear
[Bo-gyung].
[Skipper] present an RCT with Internet surveys of 423
patients. As with the companion PEP study, we find the results significantly
more positive than typically reported. They show ~70 to 140 hour delayed
treatment with HCQ reduced combined hospitalization/death by 50%, p=0.29 (5
HCQ cases, 10 control cases), and reduced hospitalization by 60%, p=0.17.
There was one hospitalized control death and one non-hospitalized HCQ death.
It is unclear why there was a non-hospitalized death, external factors such as
lack of standard care may be involved. Excluding that case results in one
control death and zero HCQ deaths (not statistically significant but noted as
reducing mortality is the most important outcome). Details for the
hospitalizations and deaths such as medication adherence and treatment delay
may be informative but are not provided.
The paper states the end point was changed from
hospitalization/death to symptom severity because they would have required
6,000 participants. However, if the observed trend continued, they would hit
95% significance on the reduction in hospitalization at ~725 patients, and 95%
on the reduction in combined hospitalization/death at ~1,145 patients, both of
which are less than the original plan of 1,242 patients. We hope this trial
can be continued for statistical significance.
As with the companion PEP trial, treatment in this trial was
relatively late, with an unspecified shipping delay, which we estimate as ~70
to 140 hours after symptoms for enrollment days 1 to 4. We note there is no
overlap with the more typical delays used such as 0 - 36 hours for
oseltamivir.
The paper compares 0 - 36 hour delayed treatment with
oseltamivir (influenza) and ~70 to 140 hour delayed treatment with HCQ
(COVID-19), noting that oseltamivir seemed more effective. However, a more
comparable study is
[McLean] who showed that 48 - 119 hour delayed
treatment with oseltamivir has no effect. This suggests that HCQ is more
effective than oseltamivir, and that HCQ may still have significant effect for
some amount of delay beyond the delay where oseltamivir is effective.
6 people were included that enrolled with >4d symptoms,
although they do not match the study inclusion criteria. This reduces observed
effectiveness. Patients in this study are relatively young and most of them
recover without assistance. This reduces the room for a treatment to make
improvements. The maximum improvement of an effective treatment would be
expected before all patients approach recovery. For symptoms, authors focus on
the end result where most have recovered, but it is more informative to
examine the curve and the point of maximum effectiveness. Authors did not
collect data for every day but they do have interim results for days 3, 5, 10.
The results are consistent with an effective treatment and show a
statistically significant improvement, p = 0.05, at day 10 (other unreported
days might show increased effectiveness). Results also show a larger treatment
effect for those >50, not statistically significant due to the small sample,
but noted as COVID-19 risk dramatically increases with age.
As with the companion PEP trial, this study relies on Internet
surveys. Known fake surveys were submitted to the PEP trial and there could be
an unknown number of undetected fake surveys in both trials. Research shows
the placebo used in the US may be protective for COVID-19
[Acosta-Elias] so the true effectiveness of HCQ could be higher than
observed. Medication adherence was only 77% also making the true effect of
treatment likely to be higher. Authors note that the results are not
generalizable to the COVID high-risk population.