The novel coronavirus that emerged in Wuhan, China, at the end of 2019, severe acute respiratory syndrome–coronavirus 2 (SARS-CoV-2), quickly spread to all Chinese provinces and, as of 1 March 2020, to 58 other countries (
1,
2). Efforts to contain the virus are ongoing; however, given the many uncertainties regarding pathogen transmissibility and virulence, the effectiveness of these efforts is unknown.
The fraction of undocumented but infectious cases is a critical epidemiological characteristic that modulates the pandemic potential of an emergent respiratory virus (
3–
6). These undocumented infections often go unrecognized owing to mild, limited, or lack of symptoms and thus, depending on their contagiousness and numbers, can expose a far greater portion of the population to the virus than would otherwise occur. Here, to assess the full epidemic potential of SARS-CoV-2, we use a model-inference framework to estimate the contagiousness and proportion of undocumented infections in China during the weeks before and after the shutdown of travel in and out of Wuhan.
We developed a mathematical model that simulates the spatiotemporal dynamics of infections among 375 Chinese cities (see supplementary materials). In the model, we divided infections into two classes: (i) documented infected individuals with symptoms severe enough to be confirmed, i.e., observed infections; and (ii) undocumented infected individuals. These two classes of infection have separate rates of transmission: β, the transmission rate due to documented infected individuals; and μβ, the transmission rate due to undocumented individuals, which is β reduced by a factor μ.
Spatial spread of SARS-CoV-2 across cities is captured by the daily number of people traveling from city
j to city
i and a multiplicative factor. Specifically, daily numbers of travelers between 375 Chinese cities during the Spring Festival period (“Chunyun”) were derived from human mobility data collected by the Tencent location-based service during the 2018 Chunyun period (1 February–12 March 2018) (
7). Chunyun is a period of 40 days—15 days before and 25 days after the Lunar New Year—during which there are high rates of travel within China. To estimate human mobility during the 2020 Chunyun period, which began 10 January, we aligned the 2018 Tencent data on the basis of relative timing to the Spring Festival. For example, we used mobility data from 1 February 2018 to represent human movement on 10 January 2020, as these days were similarly distant from the Lunar New Year. During the 2018 Chunyun, 1.73 billion travel events were captured in the Tencent data, whereas 2.97 billion trips were reported by the Ministry of Transport of the People’s Republic of China (
7). To compensate for underreporting and reconcile these two numbers, a travel multiplicative factor, θ, which is greater than 1, is included (see supplementary materials).
To infer SARS-CoV-2 transmission dynamics during the early stage of the outbreak, we simulated observations during 10–23 January 2020 (i.e., the period before the initiation of travel restrictions) (fig. S1) using an iterated filter-ensemble adjustment Kalman filter framework (
8–
10). With this combined model-inference system, we estimated the trajectories of four model state variables (
Si,
Ei,
, and
: the susceptible, exposed, documented infected, and undocumented infected subpopulations in city
i, respectively) for each of the 375 cities, while simultaneously inferring six model parameters (
Z,
D, μ, β, α, and θ: the average latency period, the average duration of infection, the transmission reduction factor for undocumented infections, the transmission rate for documented infections, the fraction of documented infections, and the travel multiplicative factor, respectively).
Details of model initialization, including the initial seeding of exposed and undocumented infections, are provided in the supplementary materials. To account for delays in infection confirmation, we also defined a time-to-event observation model using a gamma distribution (see supplementary materials). Specifically, for each new case in group
, a reporting delay td (in days) was generated from a gamma distribution with a mean value of Td. In fitting both synthetic and the observed outbreaks, we performed simulations with the model-inference system using different fixed values of Td (6 days ≤ Td ≤ 10 days) and different maximum seeding, Seedmax (1500 ≤ Seedmax ≤ 2500) (see supplementary materials) (fig. S2). The best-fitting model-inference posterior was identified by log likelihood.
Validation of the model-inference framework
We first tested the model-inference framework versus alternate model forms and using synthetic outbreaks generated by the model in free simulation. These tests verified the ability of the model-inference framework to accurately estimate all six target model parameters simultaneously (see supplementary methods and figs. S3 to S14). The system could identify a variety of parameter combinations and distinguish outbreaks generated with high α and low μ from those generated with low α and high μ. This parameter identifiability is facilitated by the assimilation of observed case data from multiple (375) cities into the model-inference system and the incorporation of human movement into the mathematical model structure (see supplementary methods and figs. S15 and S16).
Epidemiological characteristics during 10–23 January 2020
We next applied the model-inference framework to the observed outbreak before the travel restrictions imposed on 23 January 2020—a total of 801 documented cases throughout China, as reported by 8 February (
1).
Figure 1, A to C, shows simulations of reported cases generated using the best-fitting model parameter estimates. The distribution of these stochastic simulations captures the range of observed cases well. In addition, the best-fitting model captures the spread of infections with the novel coronavirus disease 2019 (COVID-19) to other cities in China (fig. S17). Our median estimate of the effective reproductive number,
Re—equivalent to the basic reproductive number,
R0, at the beginning of the epidemic—is 2.38 [95% credible interval (CI): 2.03−2.77], indicating that COVID-19 has a high capacity for sustained transmission (
Table 1 and
Fig. 1D). This finding aligns with other recent estimates of the reproductive number for this time period (
6,
11–
15). In addition, the median estimates for the latency and infectious periods are ~3.69 and 3.47 days, respectively. We also find that, during 10–23 January, only 14% (95% CI: 10–18%) of total infections in China were reported. This estimate reveals a very high rate of undocumented infections: 86%. This finding is independently corroborated by the infection rate among foreign nationals evacuated from Wuhan (see supplementary materials). These undocumented infections are estimated to have been half as contagious per individual as reported infections (μ = 0.55; 95% CI: 0.46–0.62). Other model fittings made using alternate values of
Td and
Seedmax or different distributional assumptions produced similar parameter estimates (figs. S18 to S22), as did estimations made using an alternate model structure with separate average infectious periods for undocumented and documented infections (see supplementary methods, table S1). Further sensitivity testing indicated that α and μ are uniquely identifiable given the model structure and abundance of observations used (see supplementary methods and
Fig. 1, E and F). In particular,
Fig. 1F shows that the highest log-likelihood fittings are centered in the 95% CI estimates for α and μ and drop off with distance from the best-fitting solution (α = 0.14 and μ = 0.55).
Using the best-fitting model (
Table 1 and
Fig. 1), we estimated 13,118 (95% CI: 2974–23,435) new COVID-19 infections (documented and undocumented combined) during 10–23 January in Wuhan city. Further, 86.2% (95% CI: 81.5–89.8%) of all infections originated from undocumented cases. Nationwide, the number of infections during 10–23 January was 16,829 (95% CI: 3797–30,271), with 86.2% (95% CI: 81.6–89.8%) originating from undocumented cases. To further examine the impact of contagious, undocumented COVID-19 infections on overall transmission and reported case counts, we generated a set of hypothetical outbreaks using the best-fitting parameter estimates but with μ = 0, i.e., the undocumented infections are no longer contagious (
Fig. 2). We find that without transmission from undocumented cases, reported infections during 10–23 January are reduced by 78.8% across all of China and by 66.1% in Wuhan. Further, there are fewer cities with more than 10 cumulative documented cases: only one city with more than 10 documented cases versus the 10 observed by 23 January (
Fig. 2C). This finding indicates that contagious, undocumented infections facilitated the geographic spread of SARS-CoV-2 within China.
Epidemiological characteristics after 23 January 2020
We also modeled the transmission of COVID-19 in China after 23 January, when greater control measures were effected. These control measures included travel restrictions imposed between major cities and Wuhan, self-quarantine and contact precautions advocated by the government, and more available rapid testing for infection confirmation (
11,
12). These measures, along with changes in medical care–seeking behavior due to increased awareness of the virus and increased personal protective behavior (e.g., wearing of face masks, social distancing, self-isolation when sick), likely altered the epidemiological characteristics of the outbreak after 23 January. To quantify these differences, we reestimated the system parameters using the model-inference framework and city-level daily cases reported between 24 January and 8 February. Given that intercity mobility was restricted after 23 January, we tested two altered travel scenarios: (i) scenario 1: a 98% reduction of travel in and out of Wuhan and an 80% reduction in travel between all other cities, as indicated by changes in the Baidu mobility index (
16) (table S2); and (ii) scenario 2: a complete stoppage of intercity travel (i.e., θ to 0) (see supplementary methods for more details).
The results of inference for the 24 January–8 February period are presented in
Table 2, figs. S23 to S26, and table S3. As control measures have continually shifted, we present estimates for both 24 January–3 February (period 1) and 24 January–8 February (period 2). For both periods, the best-fitting model for scenario 1 had a reduced reporting delay,
Td, of 6 days (versus 9 days before 23 January), consistent with more rapid confirmation of infections. Estimates of both the latency and infectious periods were similar to those made for 10–23 January; however, α, β, and
Re all shifted considerably. The transmission rate of documented cases, β, dropped to 0.52 (95% CI: 0.42–0.72) during period 1 and to 0.35 (95% CI: 0.28–0.45) during period 2, less than half the estimated transmission rate prior to travel restrictions (
Table 2). The fraction of all infections that were documented, α, was estimated to be 0.65 (95% CI: 0.60–0.69), i.e., 65% of infections were documented during period 1, up from 14% before travel restrictions, and remained nearly the same for period 2. The reproductive number was 1.34 (95% CI: 1.10–1.67) during period 1 and 0.98 (95% CI: 0.83–1.16) during period 2, down from 2.38 prior to travel restrictions. While the estimate for the relative transmission rate, μ, is lower than before 23 January, the contagiousness of undocumented infections, represented by μβ, was substantially reduced, possibly reflecting that only very mild, less contagious infections remain undocumented or that individual protective behavior and contact precautions have proven effective. Similar parameter estimates are derived under scenario 2 (no travel at all) (table S3). These inference results for both periods 1 and 2 should be interpreted with caution, as care-seeking behavior and control measures were continually in flux at these times.
Outlook
Overall, our findings indicate that a large proportion of COVID-19 infections were undocumented prior to the implementation of travel restrictions and other heightened control measures in China on 23 January and that a large proportion of the total force of infection was mediated through these undocumented infections (
Table 1). This high proportion of undocumented infections, many of which were likely not severely symptomatic, appears to have facilitated the rapid spread of the virus throughout China. Indeed, suppression of the infectiousness of these undocumented cases in model simulations reduces the total number of documented cases and the overall spread of SARS-CoV-2 (
Fig. 2). In addition, the best-fitting model has a reporting delay of 9 days from initial infectiousness to confirmation; in contrast, line-list data for the same 10–23 January period indicates an average 6.6-day delay from initial manifestation of symptoms to confirmation (
17). This discrepancy suggests that presymptomatic shedding may be typical among documented infections. The relative timing of onset and peak of viremia and shedding versus onset and peak of symptoms has been shown to potentially affect outbreak control success (
18).
Our findings also indicate that a radical increase in the identification and isolation of currently undocumented infections would be needed to fully control SARS-CoV-2. Increased news coverage and awareness of the virus in the general population have likely already prompted increased rates of seeking medical care for respiratory symptoms. In addition, awareness among health care providers and public health officials and the availability of viral identification assays suggest that capacity for identifying previously missed infections has increased. Further, general population and government response efforts have increased the use of face masks, restricted travel, delayed school reopening, and isolated suspected persons, all of which could additionally slow the spread of SARS-CoV-2.
Combined, these measures are expected to increase reporting rates, reduce the proportion of undocumented infections, and decrease the growth and spread of infection. Indeed, estimation of the epidemiological characteristics of the outbreak after 23 January in China indicates that government control efforts and population awareness have reduced the rate of virus spread (i.e., lower β, μβ, Re), increased the reporting rate, and lessened the burden on already overextended health care systems.
The situation on the ground in China is changing day to day. New travel restrictions and control measures are being imposed on populations in different cities, and these rapidly varying effects make certain estimation of the epidemiological characteristics for the outbreak difficult. Further, reporting inaccuracies and changing care-seeking behavior add another level of uncertainty to our estimations. Although the data and findings presented here indicate that travel restrictions and control measures have reduced SARS-CoV-2 transmission considerably, whether these controls are sufficient for reducing Re below 1 for the length of time needed to eliminate the disease locally and prevent a rebound outbreak once control measures are relaxed is unclear. Moreover, similar control measures and travel restrictions would have to be implemented outside China to prevent reintroduction of the virus.
The results for 10–23 January 2020 delineate the characteristics of SARS-CoV-2 moving through a developed country, China, without major restrictions or control. These findings provide a baseline assessment of the fraction of undocumented infections and their relative infectiousness for such an environment. However, differences in control activity, viral surveillance and testing, and case definition and reporting would likely affect rates of infection documentation. Thus, the key findings, that 86% of infections went undocumented and that, per person, these undocumented infections were 55% as contagious as documented infections, could shift in other countries with different control, surveillance, and reporting practices.
Our findings underscore the seriousness of SARS-CoV-2. The 2009 H1N1 pandemic influenza virus also caused many mild cases, quickly spread globally, and eventually became endemic. Presently, there are four endemic coronavirus strains circulating in human populations (229E, HKU1, NL63, and OC43). If the novel coronavirus follows the pattern of 2009 H1N1 pandemic influenza, it will also spread globally and become a fifth endemic coronavirus within the human population.
Acknowledgments
Funding: This work was supported by U.S. National Institutes of Health (NIH) grants GM110748 and AI145883. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences, the National Institute of Allergy and Infectious Diseases, or the NIH.
Author contributions: R.L., S.P., B.C., W.Y., and J.S. conceived of the study. R.L., B.C., Y.S., and T.Z. curated the data. S.P. performed the analysis. R.L., S.P., W.Y., and J.S. wrote the first draft of the manuscript. B.C., Y.S., and T.Z. reviewed and edited the manuscript.
Competing interests: J.S. and Columbia University disclose partial ownership of SK Analytics. J.S. also reports receiving consulting fees from Merck and BNI. All other authors declare no competing interests.
Data and materials availability: All code and data are available in the supplementary materials and posted online at
https://github.com/SenPei-CU/COVID-19 and (
19). This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. To view a copy of this license, visit
https://creativecommons.org/licenses/by/4.0/. This license does not apply to figures/photos/artwork or other content included in the article that is credited to a third party; obtain authorization from the rights holder before using such material.
Travel restrictions are effective to control SARS-CoV2.
Dear editor:
The recent coronavirus pneumonia that emerged in Wuhan (SARS-CoV2) at the end of 2019 quickly spread to all Chinese provinces, as of 27 Mar 2020, confirmed cases was reported has reached 600000 worldwide according to statistical data, which attracted global attention (https://www.who.int/) (1).
Therefore, I read the recent article by Li et al with interest (1). As the authors have made a strong comment as "We estimate 86% of all infections were undocumented (95% CI: [82%–90%]) prior to 23 January 2020 travel restrictions " (1). However, it was reported that "initial testing was focused mainly on travelers from Wuhan, potentially biasing estimates of travel related infections upwards " recently (2).
In their manuscript, the authors stated that "Efforts to contain the virus are ongoing; however, given the many uncertainties regarding pathogen transmissibility and virulence, the effectiveness of these efforts is unknown" (1). However, China has almost contained the spread of SARS-CoV2 through controlling the source of infection, cutting off the route of transmission, and protecting vulnerable groups including quarantining Wuhan City on January 23 (2, 3). It was also reported that "The combination of interventions implemented in China were clearly successful in mitigating spread and reducing local transmission of COVID-19"(2).
There are lots of overstatements in this manuscript (1). For example, when discussing the seriousness and pandemic potential of SARS-CoV2, the authors are biased. As the authors stated that "These findings explain the rapid geographic spread of SARS-CoV2 and indicate containment of this virus will be particularly challenging" (1), however, intensive control measures, including travel restrictions, have been implemented to limit the spread of COVID-19 in China (2).
This work was supported by grants from the National Undergraduate Training Program for Innovation and Entrepreneurship (no. 201811646025 to YW), the Student Research and Innovation Program of Ningbo University (no. 2017SRIP1918, no. 2018SRIP2507 and no. 2019SRIP1902 to YW). The author has nothing to disclose.
References:
1. R. Li, S. Pei, B. Chen, Y. Song, T. Zhang, W. Yang, J. Shaman. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus(SARS-CoV2). Science. 2020; doi:10.1126/science.abb3221.
2. M.U.J. Kraemer, C.H. Yang, B. Gutierrez, C.H. Wu, B. Klein, D. M. Pigott, Open COVID-19 Data Working Group, et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science. 2020; doi:10.1126/science.abb4218.
3. W.J. Guan, Z.Y. Ni, Y. Hu, W.H. Liang, C.Q. Ou, J.X. He, L. Liu, H. Shan, C.L. Lei, D.S.C. Hui, B. Du, L.J. Li, G. Zeng, K.Y. Yuen, R.C. Chen, C.L. Tang, T. Wang, P.Y. Chen, J. Xiang, S.Y.Li, J.L. Wang, Z.J. Liang, Y.X. Peng, L. Wei, Y. Liu, Y.H. Hu, P. Peng, J.M. Wang, J.Y. Liu, Z. Chen, G. Li, Z.J. Zheng, S.Q. Qiu, J. Luo, C.J. Ye, S.Y. Zhu, N.S. Zhong; China Medical Treatment Expert Group for Covid-19. Clinical Characteristics of Coronavirus Disease 2019 in China. N Engl J Med. 2020 Feb 28. doi: 10.1056/NEJMoa2002032.
Yezhao Wang 1,2*
1Zhejiang Province Key Laboratory of Pathophysiology, Ningbo University School of Medicine, Ningbo, 315211, China.
2Department of Biochemistry and Molecular Biology, Ningbo University School of Medicine, Ningbo, 315211, China.
*Correspondence to Yezhao Wang
Department of Biochemistry and Molecular Biology, Ningbo University School of Medicine, Ningbo, 315211, China.
E-mail: [email protected].
RE: Some thoughts on the analysis of the novel coronavirus epidemics by a dynamic model of infectious disease
Undocumented patients can spread the novel coronavirus. However, the exact number cannot be directly counted. Therefore, a mathematical model is needed to estimate the number and assess the impact of these patients on the epidemic. A paper published in Science titled "Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV2)" used a dynamic model to provide a methodological reference for solving this problem.
In this study, the authors estimated that the median basic reproductive number (R0) at the beginning of the epidemic was 2.38. I questioned this number for three reasons.
First, the authors presumed that the epidemic emerged on November 1, 2019, and that the cumulative infections reached approximately 2000 on January 10, 2020. On this basis, the authors estimated that the initial undocumented infections (I0u) that plays a key role in the research results were randomly selected from 0 to seedmax (2000). However, the authors did not provide strict evidence or reasons for the above hypothesis.
Second, the hypothesis that every patient in I0u is sick and has a full infection period is also flawed. The actual situation is that some patients just get sick, while others will get better; therefore the period during which they are infecting other people is shorter than that proposed by the author. This will cause the average number of susceptible persons infected by a patient in I0u to be less than the assumption. Therefore, the calculated R0 is less than the real value once the model has achieved the best goodness-of-fit.
Finally, patients in I0u are not independent; that is, there is an infector-infected relationship between them, which corresponds to the repeated transmission by some infectors and will lead to the underestimation of R0.
Therefore, I suggest that this paper underestimates the R0 due to the flaws in the research methodology.
What's the value of these findings in controlling the outbreak of SARS-CoV2?
Dear Théo Valette,
Please understand that there were about 10 million people in Wuhan and Wuhan had a floating population of nearly 5 million before 23 January 2020, a full grasp of dissemination of Wuhan before 23 January 2020 was impossible and further it is more important to contain spread than to know if every person has the condition. When you know there were 86% of all infections were undocumented (95% CI: [82%–90%]) prior to 23 January 2020 travel restrictions, could you find them in 10 million people? The indiscriminate testing without medical expert advice on whether an individual should be tested would only cause panic and not contain the spread. In a word, China will control the spread of SARS-CoV2 completely in the near future!
Yezhao Wang, 1,2*
1Zhejiang Province Key Laboratory of Pathophysiology, Ningbo University School of Medicine, Ningbo, 315211, China.
2Department of Biochemistry and Molecular Biology, Ningbo University School of Medicine, Ningbo, 315211, China.
*Correspondence to Yezhao Wang M.B.B.S
Department of Biochemistry and Molecular Biology, Ningbo University School of Medicine, Ningbo, 315211, China.
E-mail: [email protected].
RE: On the article and on some eLetters
Dear everyone,
To Mr Hawk,
Let's not forget that the fraction of reported infections in China is estimated at 0.14 for the period 10/01 - 23/01 and that it rises to 0.65 and 0.69 for the periods 24/01 - 03/02 and 24/01 - 08/02, respectively. So the paper is more about what happens during the time period before travel restrictions, extensive testing and other measures taken to prevent the spread of the virus. This situation probably has already been reproduced in a number of other countries now and it explains in part why the virus spreads so fast before the gouvernments are able to realize it. And this situation, in the future, might reproduce itself if travel restrictions and other measures are lifted too soon.
To Mr Wang,
Let's not forget that the paper was written the 15/02 at a time when we didn't know for sure if the measures taken in China would be effective.
I would also like to signal a minor mistake in the Materials and Methods. Denominator of the last righthand side term of the fourth equation is wrong and so it is on the U12 Poisson distribution but I checked the code and this error whas not reproduced in it so it in no way impacts the results of the paper.
Implication for Lethality
Given the suggestion from this article that suggests almost 10 fold higher asymptomatic / undocumented COVID-19 infection, the implication for lethality being correspondingly 10x lower is potential reason for hope.
Further commentary on this aspect would be appreciated.
Thank you,
The effectiveness of efforts to control SARS-CoV2 is known.
Dear editor:
The recent coronavirus pneumonia that emerged in Wuhan (SARS-CoV2) at the end of 2019 quickly spread to all Chinese provinces, as of 16 Mar 2020, confirmed cases was reported has reached 160000 worldwide according to statistical data, which attracted global attention (https://www.who.int/) (1). Therefore, I read the recent article by Li et al with great interest (1). In their manuscript, the authors stated that "Efforts to contain the virus are ongoing; however, given the many uncertainties regarding pathogen transmissibility and virulence, the effectiveness of these efforts is unknown" (1). However, China has almost contained the spread of SARS-CoV2 through controlling the source of infection, cutting off the route of transmission, and protecting vulnerable groups (2). Can the authors now conclude that "efforts to contain the virus are ongoing; however, given the many uncertainties regarding pathogen transmissibility and virulence, the effectiveness of these efforts is unknown" (1)?
This work was supported by grants from the National Undergraduate Training Program for Innovation and Entrepreneurship (no. 201811646025), the Student Research and Innovation Program of Ningbo University (no. 2017SRIP1918, no. 2018SRIP2507 and no. 2019SRIP1902 ). The author has nothing to disclose.
References:
1. Li R, Pei S, Chen B, et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV2). Science. 2020; doi:10.1126/science.abb3221
2. Guan WJ, Ni ZY, Hu Y, et al. Clinical Characteristics of Coronavirus Disease 2019 in China. N Engl J Med. doi: 10.1056/NEJMoa2002032
Yezhao Wang 1,2*
1Zhejiang Province Key Laboratory of Pathophysiology, Ningbo University School of Medicine, Ningbo, 315211, China.
2Department of Biochemistry and Molecular Biology, Ningbo University School of Medicine, Ningbo, 315211, China.
*Correspondence to Yezhao Wang
Department of Biochemistry and Molecular Biology, Ningbo University School of Medicine, Ningbo, 315211, China.
E-mail: [email protected].
RE:
These findings also suggest unless every person on earth is tested, many times over, a full grasp of dissemination will not be known (asymptomatic carriers) and further it is more important to control spread than to know if every person has the condition. In addition, depending on the test type, one will not know if the person has the condition, developing the condition or had the condition. The indiscriminate testing without medical expert advice on whether an individual should be tested would only cause panic and not improve the spread. There is of course, the false positive and negative issues that it would create. Adhering to social distancing, hand hygiene and common sense will go a long way in stealth asymptomatic transmission.