The novel coronavirus disease 2019 (COVID-19) epidemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) began in Wuhan City, China, in December 2019 and quickly spread globally, with 2,063,161 cases reported in 185 countries or regions as of 16 April 2020 (
1). A total of 82,692 cases of COVID-19, including 4632 deaths, have been reported in mainland China, including 50,333 cases in Wuhan City and 628 cases in Shanghai City (
2). The epidemic in Wuhan and in the rest of China subsided after implementation of strict containment measures and movement restrictions, with recent cases originating from travel (
3). However, key questions remain about the age profile of susceptibility to infection, how social distancing alters age-specific contact patterns, and how these factors interact to affect transmission. These questions are relevant to the choice of control policies for governments and policy-makers around the world. In this study, we evaluate changes in mixing patterns linked to social distancing by collecting contact data in the midst of the epidemic in Wuhan and Shanghai. We also estimate age differences in susceptibility to infection based on contact-tracing data gathered by the Hunan Provincial Center for Disease Control and Prevention (CDC), China. Based on these empirical data, we developed a mathematical disease transmission model to disentangle how transmission is affected by age differences in the biology of COVID-19 infection and altered mixing patterns owing to social distancing. Additionally, we project the impact of social distancing and school closure on COVID-19 transmission.
To estimate changes in age-mixing patterns associated with COVID-19 interventions, we performed contact surveys in two cities: Wuhan, the epicenter of the outbreak, and Shanghai, one of the largest and most densely populated cities in southeast China. Shanghai experienced extensive importation of COVID-19 cases from Wuhan as well as local transmission (
4). The surveys were conducted from 1 February 2020 to 10 February 2020, as transmission of COVID-19 peaked across China and stringent interventions were put in place. Participants in Wuhan were asked to complete a questionnaire describing their contact behavior (
5,
6) on two different days: (i) a regular weekday between 24 December 2019 and 30 December 2019, before the COVID-19 outbreak was officially recognized by the Wuhan Municipal Health Commission (used as baseline); and (ii) the day before the interview (outbreak period). Participants in Shanghai were asked to complete the same questionnaire used for Wuhan but only report contacts for the outbreak period. For the baseline period in Shanghai, we relied on a survey conducted in 2017–2018 that followed the same design (
7). In these surveys, a contact was defined as either a two-way conversation involving three or more words in the physical presence of another person or a direct physical contact (e.g., a handshake). Details are given in the supplementary materials (SM, sections 1 and 2).
We analyzed a total of 1245 contacts reported by 636 study participants in Wuhan and 1296 contacts reported by 557 participants in Shanghai. In Wuhan, the average daily number of contacts per participant was significantly reduced, from 14.6 for the baseline period (mean contacts weighted by age structure: 14.0) to 2.0 for the outbreak period (mean contacts weighted by age structure: 1.9) (
p < 0.001). The reduction in contacts was significant for all stratifications by sex, age group, type of profession, and household size (
Table 1). A larger reduction was observed in Shanghai, where the average daily number of contacts decreased from 18.8 (mean contacts weighted by age structure: 19.8) to 2.3 (mean contacts weighted by age structure: 2.1). Although an average individual in Shanghai reported more contacts than one in Wuhan on a regular weekday, this difference essentially disappeared during the COVID-19 outbreak period. A similar decrease in the number of contacts was found in the United Kingdom during the COVID-19 lockdown period (
8).
The typical features of age-mixing patterns (
6,
7) emerge in Wuhan and Shanghai when we consider the baseline period (
Fig. 1, A and D). These features can be illustrated in the form of age-stratified contact matrices (provided as ready-to-use tables in the SM, section 3.6), where each cell represents the average number of contacts that an individual has with other individuals, stratified by age groups. The bottom left corner of the matrix, corresponding to contacts between school-age children, is where the largest number of contacts is recorded. The contribution of contacts in the workplace is visible in the central part of the matrix, and the three diagonals (from bottom left to top right) represent contacts between household members. By contrast, for the outbreak period when strict social distancing policies were in place, many of the above-mentioned features disappear, essentially leaving the sole contribution of household mixing (
Fig. 1, B and E). In particular, assortative contacts between school-age individuals are fully removed, as illustrated by differencing baseline and outbreak matrices (
Fig. 1, C and F). Overall, contacts during the outbreak mostly occurred at home with household members (94.1% in Wuhan and 78.5% in Shanghai). Thus, the outbreak contact matrix nearly coincides with the within-household contact matrix in both study sites, and the pattern of assortativity by age observed for regular days almost entirely disappears (SM, section 3.6). These findings are consistent with trends in within-city mobility data, which indicate an 86.9% drop in Wuhan and 74.5% drop in Shanghai between early January and early February (see SM, section 4). Such a large decrease in internal mobility is consistent with most of the contacts occurring in the household during the outbreak period. Of note, the strict social distancing measures implemented in Wuhan and Shanghai did not entirely zero out contacts in the workplace, because essential workers continued to perform their activities (as observed in our data; see SM, section 3.5).
The estimated mixing patterns are based on self-reported contacts that can thus be affected by various biases. In particular, reported contacts for the baseline period in Wuhan may be prone to recall bias because contacts were assessed retrospectively. Further, because of the retrospective nature of the baseline survey in Wuhan, we were unable to account for the lower number of contacts during weekends. The more complete data from Shanghai did not suffer recall bias and allowed us to weight contacts for weekdays and weekends; sensitivity analyses suggest that this has little impact on results (SM, section 8.3). Another possible bias is that survey participants may have felt pressure to minimize reported contacts that occurred during the outbreak, given that social distancing was in place and strictly enforced by the government, even if the anonymity and confidentiality of the survey were emphasized. However, results are robust to inflating reported contacts outside of the home severalfold, suggesting that these compliance and social acceptability biases linked to the outbreak period do not affect our main findings (SM, section 8.2). Another caveat is that in parallel to population-level social distancing measures, case-based interventions were implemented and could have affected contacts, including rapid isolation of confirmed and suspected cases and quarantine of close contacts for 14 days. However, only a small portion of the population in the two study sites was affected by contact tracing and quarantine, thus having little to no effect on average contact patterns in the general population.
Next, to understand the interplay between social distancing interventions, changes in human mixing patterns, and outbreak dynamics, we need to consider potential age differences in susceptibility to infection. This is currently a topic of debate, because little information on the age profile of asymptomatic cases is available (
9,
10). To this aim, we analyzed COVID-19 contact-tracing information gleaned from detailed epidemiological field investigations conducted by the Hunan CDC (SM, section 5). Briefly, all close contacts of COVID-19 cases reported in Hunan province were placed under medical observation for 14 days and were tested using real-time reverse transcription polymerase chain reaction (RT-PCR). Those who tested positive were considered as SARS-CoV-2 infections. We estimated the odds ratios (ORs) for a contact of a certain age group to be infected, relative to a reference age group. We performed generalized linear mixed model regression to account for clustering and potential correlation structure of contacts exposed to the same index case (e.g., in the household). We included the age group and gender of a contact, type of contact, and whether the contact traveled to Hubei or Wuhan as regression covariates (SM, section 5). We found that susceptibility to SARS-CoV-2 infection increased with age. Young individuals (aged 0 to 14 years) had a lower risk of infection than individuals aged 15 to 64 years {OR = 0.34 [95% confidence interval (CI): 0.24 to 0.49],
p < 0.0001}. By contrast, older individuals aged 65 years and older had a higher risk of infection than adults aged 15 to 64 years [OR = 1.47 (95% CI: 1.12 to 1.92),
p = 0.005]. These findings are in contrast with a previous study in Shenzhen, where susceptibility to infection did not change with age (
9).
Next, we explore how our data can inform control strategies for COVID-19. A key parameter regulating the dynamics of an epidemic is the basic reproduction number (
R0), which corresponds to the average number of secondary cases generated by an index case in a fully susceptible population. We estimated the impact of interventions on
R0, relying on our age-specific estimates of susceptibility to infection and contact patterns before and during interventions. We used the next-generation matrix approach to quantify changes in
R0 (
11) (SM, section 6). Additionally, to illustrate the impact of age-mixing patterns on the dynamics of the epidemic, we developed a simple SIR model of SARS-CoV-2 transmission (SM, section 6). In the model, the population is divided into three epidemiological categories: susceptible, infectious, and removed (either recovered or deceased individuals), stratified by 14 age groups. Susceptible individuals can become infectious after contact with an infectious individual according to the estimated age-specific susceptibility to infection. The rate at which contacts occur is determined by the estimated mixing patterns of each age group. The mean time interval between two consecutive generations of cases was taken to be 5.1 days, assuming it aligns with the mean of the serial interval reported by Zhang
et al. (
3).
In the early phases of COVID-19 spread in Wuhan, before interventions were put in place,
R0 values were estimated to range between 2.0 and 3.5 (
12–
18). In this analysis, we extended this range from 1 to 4 for the baseline period (i.e., before interventions). We find that the considerable changes of mixing patterns observed in Wuhan and Shanghai during the social distancing period led to a drastic decrease in
R0 (
Fig. 2). When we consider contact matrices representing the outbreak period, keeping the same baseline disease transmissibility as in the preintervention period, the reproductive number drops well below the epidemic threshold in Wuhan (
Fig. 2A) and Shanghai (
Fig. 2B). This finding is robust to relaxing assumptions about age differences in susceptibility to infection; the epidemic is still well controlled if SARS-CoV-2 infection is assumed to be equally likely in all age groups (
Fig. 2, A and B). We also performed sensitivity analyses regarding possible recall and compliance biases of self-reported contacts as well as the definition of contact (i.e., considering only contacts lasting more than 5 min). The results are consistent with those reported here (SM, section 8).
In an uncontrolled epidemic (without intervention measures, travel restrictions, or spontaneous behavioral responses of the population) and for
R0 in the range of 2 to 3, we estimate the mean infection attack rate to be in the range 53 to 92% after a year of SARS-CoV-2 circulation, with slight variation between Wuhan (
Fig. 2C) and Shanghai (
Fig. 2D). These estimates should be considered as an upper bound of the infection attack rate because they are based on a compartmental model that does not account for high clustering of contacts (e.g., repeated contacts among household members). If we consider a scenario in which social distancing measures are implemented early on, as the new virus emerges, the estimated
R0 remains under the epidemic threshold and thus the epidemic cannot take off in either location. Furthermore, we estimate that the magnitude of interventions implemented in Wuhan and Shanghai would have been enough to block transmission for an
R0 before the interventions of up to ~6 in Wuhan and ~7.8 in Shanghai.
Next, we use the model to estimate the impact of preemptive mass school closure. We considered two different contact pattern scenarios, based on data from Shanghai: contacts estimated during vacation periods (
7) and contacts estimated during regular weekdays, after all contacts occurring in school settings have been removed (
7). Both scenarios represent a simplification of a school closure strategy. Indeed, school closures in response to the COVID-19 pandemic in China have entailed interruption of all educational on-site services. However, mixing patterns measured during school vacations indicate that a fraction of children still attend additional educational activities, as is typical in Chinese cities. On the other hand, when removing all contacts in the school setting, we do not consider potential trickle-down effects on the mixing patterns of other age groups; for instance, parents may need to leave work to take care of school-age children. Our modeling approach indicates that limiting contact patterns to those observed during vacations would interrupt transmission for baseline
R0 up to 1.5 (
Fig. 3, A and C). Removing all school contacts would do the same for baseline
R0 up to 1.2. If we apply these interventions to a COVID-19 scenario, assuming a baseline
R0 of 2 to 3.5, we can achieve a noticeable decrease in infection attack rate and peak incidence and a delay in the epidemic, but transmission is not interrupted (
Fig. 3, B and D). For instance, for a baseline
R0 of 2.5 and assuming a vacation mixing pattern, the mean peak daily incidence is reduced by about 64%. In the corresponding scenario where school contacts are removed, we estimate a reduction of about 42%. Overall, school-based closure policies are not sufficient to entirely prevent a COVID-19 outbreak, but they can affect disease dynamics and hence hospital surge capacity. It is important to stress that individuals aged 5 to 19 years in Shanghai represent 9.5% of the population (
19), markedly lower than the mean in China [16.8% (
19)] and other countries [including Western countries; e.g., 19.7% in the United States (
20)].
The results of this study should be considered in light of the following limitations. In our simulation model, we estimated the effect of social distancing alone; combining social distancing with other interventions would have a synergistic effect to even further reduce transmission. It is likely that population-wide social distancing, case-based strategies, and decontamination efforts all contributed to achieve control in Wuhan and Shanghai, and their effect is difficult to separate out in retrospective observational studies. Our estimates of age differences in susceptibility to infection are based on active testing of 7375 contacts of 136 confirmed index cases. These data suffer from the usual difficulties inherent to the reconstruction of epidemiological links and detection of index cases. Contact data are useful, but seroepidemiology studies will be essential to fully resolve population susceptibility profiles to SARS-CoV-2 infection and disease. Although the age patterns of contacts were similar in the two study locations during the COVID-19 outbreak period, these patterns may not be fully representative of other locations in China and abroad, where social distancing measures may differ. Because reliable estimates of the contribution of asymptomatic SARS-CoV-2 infections to transmission are still lacking, we did not explicitly model differences between symptomatic and asymptomatic individuals. We considered a serial interval of 5.1 days (
3), based on a prior estimate from China, at a time when case-based and contact-tracing intervention measures were in place, which tends to shorten the interval between successive cases. However, this choice does not affect the estimated changes in reproduction number between the baseline and outbreak periods. Modeling results may underestimate the effect of social distancing interventions because our results concentrate on the number of contacts and ignore the type of social interactions (e.g., increased distance between individuals while in contact or use of a face mask), which may have changed owing to increased awareness of the population (
21,
22). Finally, it is worth noting that our school closure simulations are not meant to formulate a full intervention strategy, which would require identification of epidemic triggers to initiate closures and evaluation of different durations of intervention (
6). Nonetheless, our modeling exercise provides an indication of the possible impact of a nationwide preemptive strategy on the infection attack rate and peak incidence. To generalize these findings to other contexts, location-specific age-mixing patterns and population structures should be considered. Perhaps most importantly, strict lockdown strategies of the kind implemented in Wuhan, Shanghai, and other regions of the world are extremely disruptive economically and mentally, and more targeted approaches to block transmission are preferable in the long run. We do not necessarily endorse blunt lockdown policies here; we merely describe their impact on COVID-19 transmission based on the Chinese experience.
Our study provides evidence that the interventions put in place in Wuhan and Shanghai, and the resulting changes in human behavior, drastically decreased daily contacts, essentially reducing them to household interactions. This led to a dramatic reduction of SARS-CoV-2 transmission. As lockdown measures are put in place in other locations, human mixing patterns in the outbreak period could be captured by data on within-household contacts, which are available for several countries around the world (
5–
7,
23–
25). Moving forward, it will be particularly important to design targeted strategies for long-term control of COVID-19, including school- and work-based control strategies, along with large-scale testing and contact tracing (
26–
28). Research should concentrate on refining age-specific estimates of susceptibility to infection, disease, and infectiousness, which are instrumental to evaluating the impact of these strategies.
Acknowledgments
The authors acknowledge B. J. Cowling from the University of Hong Kong and C. L. Gilbert from the Johns Hopkins University for their helpful comments on the manuscript and N. Samay for her assistance in preparing the figures. This article does not necessarily represent the views of the NIH or the U.S. government.
Funding: H.Y. acknowledges financial support from the National Science Fund for Distinguished Young Scholars (no. 81525023), the Key Emergency Project of Shanghai Science and Technology Committee (no. 20411950100), and the National Science and Technology Major Project of China (nos. 2018ZX10201001-010, 2018ZX10713001-007, and 2017ZX10103009-005). The funder of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report. S.M. and M.A. acknowledge financial support from the European Commission H2020 MOOD project.
Author contributions: M.A. and H.Y. are joint senior authors. H.Y. and M.A. designed the experiments. J.Z., Y.L., S.Z., and Q.W. collected data. J.Z., M.L., Y.W., W.W., Y.L., Q.W., and M.A. analyzed the data. J.Z., M.L., S.M., C.V., A.V., M.A., and H.Y. interpreted the results. J.Z., M.L., C.V., M.A., and H.Y. wrote the manuscript. A.V. edited the manuscript.
Competing interests: A.V. has received funding from Metabiota Inc. H.Y. has received research funding from Sanofi Pasteur, GlaxoSmithKline, Yichang HEC Changjiang Pharmaceutical Company, and Shanghai Roche Pharmaceutical Company.
Ethics statement: Ethics approval was obtained from the institutional review board of the School of Public Health, Fudan University (IRB#2020-01-0801). Verbal informed consent was obtained from all subjects (from a parent or guardian if the participant was under 18 years of age).
Data and materials availability: All data and code are available in the main text or the supplementary materials or at Zenodo (
29). This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. To view a copy of this license, visit
https://creativecommons.org/licenses/by/4.0/. This license does not apply to figures/photos/artwork or other content included in the article that is credited to a third party; obtain authorization from the rights holder before using such material.
RE: Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China
The press has been widely reporting a key finding based on Zhang et al. (Science 10.1126/science.abb8001(2020)) that children were less likely to become infected with COVID-19. In the Zhang et al. article, the children, ages 0-14 years, reported by the Hunan CDC to be infected in the Hunan province was 6.2% as compared to 8.6% among those 15-64 years and 16.3% among those 65 years and above. Zhang et al. obtained significant values of p <0.0001 for the comparison of children (0-14 years) to the reference group (15-64 years) and p =0.005 for the comparison of the older group (> 65 years) to the reference group.
It does not seem plausible that a comparison between 6.2% to 8.6% would be much more significant than a comparison between 16.3% to 8.6%. We repeated the statistical comparisons using a chi-square test (with Yates correction for a 2 x 2 table), to obtain p = 0.033 (which is not significant based on two-tailed α = .025 after Bonferroni correction for 2 multiple tests) for the children vs. reference group and a highly significant p<0.0001 for the comparison between the 15-64 group and the >65 group. Zhang et al. used generalized linear mixed regression with several covariates, which we could not perform without having their data. However, it is improbable that their p values would appear so different and in the opposite direction from ours.
Our statistical finding that children's infection rate is not significantly lower than that of adults, is consistent with another paper which Zhang et al. cite (Bi et al. 2020 https://doi.org/10.1101/2020.03.3120049023) for Shenzhen, China. Bi et al., like Zhang et al., used regression analysis but with smaller, more equal age (years) brackets (0-9, 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, and >70) than the three used by Zhang et al. which lumped together everyone from ages 15 to 64 in their middle bracket. By including those between 60 and 64 in their middle bracket, Zhang et al. likely inflated the infection rate in that group, as those in the 60-69 age bracket in the Bi et al. paper had the highest infection rate at 15.4%. More detailed information on the infection rate in children should be forthcoming from a large NIH funded US study (HEROS) that is getting underway.
Allan Geliebter, Distinguished Professor, Touro College & University System; New York Medical College
Barbara Rumain, Associate Professor, Touro College & University System; New York Medical College
Moshe Schneiderman, SUNY Downstate College of Medicine
Corresponding author email: [email protected]
References
1. J. Zhang et al., Science 10.1126/science.abb8001 (2020)
2. Q. Bi, Y. Wu, S. Mei, C. Ye, X. Zou, Z. Zhang, X. Liu, L. Wei, S. A. Truelove, T. Zhang, W. Gao, C. Cheng, X. Tang, X. Wu, Y. Wu, B. Sun, S. Huang, Y. Sun, J. Zhang, T. Ma, J. Lessler, T. Feng. "Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study." The Lancet Infectious Diseases (2020), doi: 10.1016/s1473-3099(20)30287-5.