INTRODUCTION
Coronaviruses (CoVs) infect a wide variety of animals, including humans, causing respiratory, enteric, hepatic, and neurological diseases of various severities. On the basis of genotypic and serological characterization, CoVs were traditionally classified into three distinct groups (
1,
2). Recently, the Coronavirus Study Group of the International Committee on Taxonomy of Viruses (ICTV) has revised the nomenclature and taxonomy to reclassify the three CoV groups into three genera,
Alphacoronavirus,
Betacoronavirus, and
Gammacoronavirus (
3). Novel CoVs, which represent a novel genus,
Deltacoronavirus, have also been identified (
4–6). As a result of the ability to use a variety of host receptors and evolve rapidly through mutation and recombination, CoVs are able to adapt to new hosts and ecological niches, causing a wide spectrum of diseases (
2,
7–12).
The severe acute respiratory syndrome (SARS) epidemic and identification of SARS-CoV-like viruses from palm civets and horseshoe bats in China have boosted interest in the discovery of novel CoVs in both humans and animals (
13–20). It is now known that CoVs from all four genera can be found in mammals. Historically, alphacoronaviruses (αCoVs) and betacoronaviruses (βCoVs) have been found in mammals, while gammacoronaviruses (γCoVs) have been found in birds. However, recent findings also suggested the presence of γCoVs in mammals (
5,
21,
22). Although deltacoronaviruses (δCoVs) are also mainly found in birds, potential mammalian δCoVs have been reported (
4,
23). In particular, a δCoV closely related to sparrow CoV HKU17, porcine CoV HKU15, has been identified in pigs, which suggested bird-to-mammal transmission (
4). On the basis of current findings, a model for CoV evolution was proposed, where bat CoVs are likely the gene source of
Alphacoronavirus and
Betacoronavirus and avian CoVs are the gene source of
Gammacoronavirus and
Deltacoronavirus (
4). However, one notable exception to this model is
Betacoronavirus lineage A.
The genus
Betacoronavirus consists of four lineages, A to D. While human coronavirus (HCoV) OC43 and HCoV HKU1 belong to
Betacoronavirus lineage A (
20,
24–27), SARS-CoV belongs to
Betacoronavirus lineage B and the recently emerged Middle East respiratory syndrome coronavirus (MERS-CoV) belongs to
Betacoronavirus lineage C. No human CoV has yet been identified from
Betacoronavirus lineage D. On the other hand, besides
Alphacoronavirus, diverse bat CoVs have been found in
Betacoronavirus lineage B (e.g., SARS-related
Rhinolophus bat CoVs), lineage C (e.g.,
Tylonycteris bat CoV HKU4 and
Pipistrellus bat CoV HKU5), and lineage D (e.g.,
Rousettus bat CoV HKU9) (
8,
14,
15,
28–37), supporting the suggestion that bat CoVs are likely the ancestral origin of other mammalian CoVs in these lineages. However, no bat CoVs belonging to
Betacoronavirus lineage A have yet been identified, despite the numerous surveillance studies on bat CoVs conducted in various countries over the years (
38). Therefore, the ancestral origin of the mammalian lineage A βCoVs, such as HCoV OC43 and HCoV HKU1, remains obscure.
While HCoV OC43 is likely to have originated from zoonotic transmission, sharing a common ancestor with bovine coronavirus (BCoV) that dates back to 1890 (
27,
30,
39), closely related CoVs belonging to the same species,
Betacoronavirus 1, have also been found in various mammals, including pigs, horses, dogs, waterbucks, sable antelope, deer, giraffes, alpaca, and dromedary camels, suggesting a common ancestor in mammals with subsequent frequent interspecies transmission (
40–47). Although no zoonotic origin of HCoV HKU1 has been identified, the virus is most closely related to mouse hepatitis virus (MHV) and rat coronavirus (RCoV), which together are now classified as murine coronavirus (
3,
20,
42). We therefore hypothesize that rodent CoVs are the ancestral origin of
Betacoronavirus lineage A. In this study, we tested samples from various rodent species in Hong Kong and southern China for the presence of lineage A βCoVs. A novel CoV, China
Rattus coronavirus (ChRCoV) HKU24, was discovered from Norway rats in Guangzhou, China. Complete genome analysis showed that ChRCoV HKU24 represents a novel species within
Betacoronavirus lineage A but possesses features that resemble those of both
Betacoronavirus 1 and murine coronavirus. A high seroprevalence was also demonstrated among Norway rats from Guangzhou using Western blot analysis against ChRCoV HKU24 recombinant N protein and spike polypeptide. The present results suggest that ChRCoV HKU24 likely represents the murine origin of
Betacoronavirus 1 and provides insights into the ancestor of
Betacoronavirus lineage A.
DISCUSSION
We discovered a novel lineage A βCoV, ChRCoV HKU24, from Norway rats in southern China.
Betacoronavirus lineage A comprises the traditional group 2 CoVs, including members of murine coronavirus and
Betacoronavirus 1, HCoV HKU1, and RbCoV HKU14. ChRCoV HKU24 possessed <90% amino acid sequence identities to the amino acid sequences of all other lineage A βCoVs in five of the seven conserved replicase domains used for CoV species demarcation by ICTV (
3), supporting the suggestion that ChRCoV HKU24 belongs to a separate species. The genome of ChRCoV HKU24 also possesses features distinct from those of other lineage A βCoVs, including a unique putative nsp1/nsp2 cleavage site and a unique putative cleavage site in S protein. Phylogenetically, its position at the root of
Betacoronavirus 1, being distinct from the positions of murine coronavirus and HCoV HKU1, suggests that ChRCoV HKU24 may represent the murine ancestor for
Betacoronavirus 1, after branching off from the common ancestor of murine coronavirus and HCoV HKU1. Interestingly, the genome of ChRCoV HKU24 possessed features that resemble those of the genomes of both
Betacoronavirus 1 and murine coronavirus. It is more similar to
Betacoronavirus 1 than murine coronavirus by the higher sequence identities in most predicted proteins, including NS2a, NS5, and S. On the other hand, it is more similar to murine coronavirus than to
Betacoronavirus 1 in terms of its G+C content, the presence of a single NS4, and the absence of a TRS upstream of the E gene. Therefore, it is most likely that ChRCoV has evolved from the ancestor of murine coronavirus to infect other mammals, resulting in the generation of
Betacoronavirus 1 with the acquisition of a TRS for the E gene. The tMRCAs of ChRCoV HKU24, members of
Betacoronavirus 1, and RbCoV HKU14 were estimated to be 1402 (HPDs, 918.05 to 1749.91) and 1337 (HPDs, 724.59 to 1776.78) using complete RdRp and HE gene analysis, respectively, suggesting that interspecies transmission from rodents to other mammals occurred at least several centuries ago before the emergence of HCoV OC43 in humans in about the 1890s.
Western blot assays based on recombinant ChRCoV HKU24 N protein and spike polypeptide showed a high seroprevalence of ChRCoV HKU24 infection among Norway rats from Guangzhou. We evaluated the cross-reactivities of both N protein and spike polypeptide assays using sera from infections caused by other lineage A βCoVs, HCoV OC43 in humans and RbCoV HKU14 in rabbits, as well as SARS-CoV, a lineage B βCoV. Cross-reacting antibodies against N proteins were observed, a finding which is in line with previous findings on cross-reactivity between N proteins of different βCoVs (
49,
57). In contrast, no cross-reactivities between spike polypeptides were detected, supporting the specificity of CoV spike polypeptide-based assays and their ability to rectify cross-reactivities (
57,
58). Using the present assays, 60 of 74 Norway rats from Guangzhou were positive for antibodies against the ChRCoV HKU24 N protein, and among these rats, 21 were positive for antibodies against the ChRCoV HKU24 spike polypeptide, indicating that these 21 rats had previously been infected with ChRCoV HKU24. Interestingly, the three Norway rats positive for ChRCoV HKU24 in their alimentary tract samples were positive for antibodies against ChRCoV HKU24 N protein but negative for antibodies against ChRCoV HKU24 spike polypeptide. This is likely due to a delay in mounting neutralizing antibodies against spike protein during acute infection in these three rats, where antibodies against N protein may arise earlier as a result of the high abundance and antigenicity of CoV N proteins, or the response to the N protein may be a result of cross-reactions to N proteins from other βCoVs. The finding is also in keeping with previous findings on SARS-related
Rhinolophus bat CoV, in which a negative correlation between the viral load and neutralizing antibody titer was observed (
14). Besides Norway rats, antibodies against ChRCoV HKU24 N protein and spike polypeptide were also detected in two oriental house rats from Guangzhou, although antibodies against spike polypeptide were relatively weak. This suggests possible cross-species infection with ChRCoV HKU24 or cross-reactivity from a very close lineage A βCoV. Four black rats and 15 Norway rats in Hong Kong were also positive for antibodies against the ChRCoV HKU24 N protein but not the spike polypeptide. This suggests a possible past infection by another βCoV(s) and cross-reactivity between the N protein(s) of that βCoV(s) and the N protein of ChRCoV HKU24. More studies with diverse rodent species from China and other countries are required to determine the natural reservoir and host range of ChRCoV HKU24 and other murine lineage A βCoVs.
The present results extend our knowledge on the evolutionary origin of CoVs. While birds are important sources for γCoVs and δCoVs, bats host diverse αCoVs and βCoVs that may be the ancestral origins of various mammalian CoVs, including human CoVs. For human αCoVs, both HCoV NL63 and HCoV 229E likely originated from bat CoVs. HCoV NL63 has been shown to share a common ancestry with αCoVs from the North American tricolored bat, with the most recent common ancestor between these viruses occurring from approximately 563 to 822 years ago (
75). Moreover, immortalized lung cell lines derived from this bat species allowed replication of HCoV NL63, supporting potential zoonotic-reverse zoonotic transmission cycles between bats and humans. HCoV 229E also shared a common ancestor with diverse αCoVs from leaf-nosed bats in Ghana, with the most recent common ancestor dating to 1686 to 1800 (
76). However, no complete genomes are available for the putative bat ancestors of HCoV NL63 and HCoV 229E. For human βCoVs, SARS-CoV and MERS-CoV are also known to share common ancestors with bat CoVs. Soon after the SARS epidemic, horseshoe bats in China were found to be the reservoir for SARS-CoV-like viruses, which were postulated to have jumped from bats to civets and, later, humans (
8,
14,
15). A recent study also reported the isolation of a SARS-like bat CoV in Vero E6 cells and the ability of this bat virus to use the angiotensin-converting enzyme 2 (ACE2) from humans, civets, and Chinese horseshoe bats for cell entry (
77). MERS-CoV belongs to
Betacoronavirus lineage C, which was known to consist of only two bat viruses,
Tylonycteris bat CoV HKU4 and
Pipistrellus bat CoV HKU5, before the MERS epidemic (
35–37). This has led to the speculation that bats may be the zoonotic origin of MERS-CoV. However, recent evidence supports dromedary camels as the immediate source of human MERS-CoV (
78–80). Nevertheless, a conspecific virus from a South African
Neoromicia capensis bat has been found to share 85% nucleotide sequence identity to the nucleotide sequence of the MERS-CoV genome, suggesting the acquisition of MERS-CoV by camels from bats in sub-Saharan Africa, from where camels on the Arabian peninsula are imported (
81). In contrast, there has been no evidence that bats are the origin of human lineage A βCoVs, such as HCoV OC43 and HCoV HKU1. HCoV OC43, being closely related to BCoV, is believed to have emerged relatively recently from bovine-to-human transmission in about 1890 (
27,
30,
39). Both viruses belonged to the promiscuous CoV species
Betacoronavirus 1, which consists of many closely related mammalian CoVs, implying a low threshold for cross-mammalian species transmission and a complex evolutionary history among these viruses (
40–47,
49). However, the ancestral origin of members of
Betacoronavirus 1 remains elusive. As for HCoV HKU1, no recent zoonotic ancestor has yet been identified, although the virus is most closely related to members of murine coronaviruses (
20,
42). Although rodents constitute approximately 40% of all mammalian species, murine coronavirus has been the only CoV species known to exist in rodents. This is in contrast to the large diversity of CoVs found in bats, which make up another 20% of all species of mammals (
6,
33,
36). The present results suggest that rodents may be an important reservoir for lineage A βCoVs and may harbor other ancestral viruses of
Betacoronavirus 1 and HCoV HKU1 (
Fig. 6). Nevertheless, many mysteries about the evolution of lineage A βCoVs remain unresolved, such as the origin of their HE proteins. For example, both toroviruses and influenza C viruses can be found in bovine and porcine samples. Further studies are required to determine if the HE proteins of potential rodent CoV ancestors of
Betacoronavirus lineage A may have been acquired from cattle or pigs.
The potential pathogenicity and tissue tropism of ChRCoV HKU24 remain to be determined. While CoVs are associated with a wide spectrum of diseases in animals, some CoVs, especially those from bats, were detected in apparently healthy individuals without obvious signs of disease (
8,
14,
15,
31,
33). The detection of ChRCoV HKU24 in the alimentary tract samples of Norway rats suggested a possible enteric tropism. However, the three positive rats did not show obvious signs of disease. MHV, the prototype CoV most extensively studied before the SARS epidemic, can cause a variety of neurological, hepatic, gastrointestinal, and respiratory diseases in mice, depending on the strain tropism and route of inoculation. The virus, originally isolated from a mouse with spontaneous encephalomyelitis, causes disseminated encephalomyelitis with extensive destruction of myelin and focal necrosis of the liver in experimentally infected mice (
82–84). Strain MHV A59 is primarily hepatotropic, while strain MHV JHM is neurotropic. Enterotropic strains can spread quickly as a result of the high level of excretion in feces and cause significant environmental contamination in animal houses. Respiratory tract-tropic or polytropic strains, although uncommon, are the strains that commonly contaminate cell lines. As for RCoV, it causes diseases primarily in the respiratory tract, with strain sialodacryoadenitis virus (SDAV) being more associated with upper respiratory tract, salivary and lacrimal gland, and eye infections and strain RCoV Parker causing pneumonia in experimentally infected rats (
85,
86). Further investigations are required to study the tissue tropism and pathogenicity of ChRCoV HKU24 in Norway rats and other potential rodent reservoirs.
Elucidating the receptor of ChRCoV HKU24 will be important to understand the mechanism of host adaptation and interspecies transmission from rodents to other mammals. The higher sequence identity to
Betacoronavirus 1 than to murine coronavirus of the S protein and NTD of ChRCoV HKU24 is in line with the findings for other regions of the genome. Homology modeling showed that the conformation of the sugar-binding loop in the BCoV NTD is conserved in the ChRCoV HKU24 NTD. Moreover, 3 of the 4 critical sugar-binding residues in BCoV but only 2 of the 14 contact residues at the MHV NTD/mCEACAM1a interface are conserved in ChRCoV HKU24. While it remains to be ascertained if ChRCoV HKU24 may utilize sugar or CEACAM1 as a receptor, its predicted NTD appears to resemble that of BCoV more than that of MHV. On the basis of the presence of a β-sandwich fold in the NTDs of MHV and BCoV, it has been proposed that CoV NTDs may have originated from a host galectin with sugar-binding functions but evolved new structural features in MHV for binding to CEACMA1 (
10,
87). If rodents are indeed the host origin for
Betacoronavirus lineage A, including
Betacoronavirus 1, it would be interesting to study the sugar-binding activity of NTDs of different rodent βCoVs to understand their evolutionary history. Although some lineage A βCoVs, such as
Betacoronavirus 1 and MHV, can replicate in cell lines such as BSC-1 and HRT-18, attempts to isolate ChRCoV HKU24 from the three positive samples were unsuccessful. Future studies to isolate the virus from more rodent samples will allow characterization of its receptor usage and pathogenicity.
ACKNOWLEDGMENTS
We thank the following for facilitation of and assistance with sample collection: Wing-Man Ko, Secretary for the Food and Health Bureau; Vivian Lau, Kwok-Hau Sin, and M. C. Yuen of FEHD; Alan C. K. Wong, Siu-Fai Leung, Thomas Hon-Chung Sit, Howard Kai-Hay Wong, Chung-Tong Shek, and Joseph W. K. So of AFCD. We are grateful for the generous support of Carol Yu, Richard Yu, Hui Hoy, and Hui Ming in the genomic sequencing platform.
The views expressed in this paper are those of the authors only and do not represent the opinion of FEHD, AFCD, or the government of the HKSAR.
This work is partly supported by a Research Grant Council grant, University Grant Council; the Committee for Research and Conference Grants, the Strategic Research Theme Fund, and the University Development Fund, The University of Hong Kong; the Health and Medical Research Fund of the Food and Health Bureau of HKSAR; and the Consultancy Service for Enhancing Laboratory Surveillance of Emerging Infectious Disease for the HKSAR Department of Health.