Dynamically evolving novel overlapping gene as a factor in the SARS-CoV-2 pandemic

Elife. 2020 Oct 1;9:e59633. doi: 10.7554/eLife.59633.

Abstract

Understanding the emergence of novel viruses requires an accurate and comprehensive annotation of their genomes. Overlapping genes (OLGs) are common in viruses and have been associated with pandemics but are still widely overlooked. We identify and characterize ORF3d, a novel OLG in SARS-CoV-2 that is also present in Guangxi pangolin-CoVs but not other closely related pangolin-CoVs or bat-CoVs. We then document evidence of ORF3d translation, characterize its protein sequence, and conduct an evolutionary analysis at three levels: between taxa (21 members of Severe acute respiratory syndrome-related coronavirus), between human hosts (3978 SARS-CoV-2 consensus sequences), and within human hosts (401 deeply sequenced SARS-CoV-2 samples). ORF3d has been independently identified and shown to elicit a strong antibody response in COVID-19 patients. However, it has been misclassified as the unrelated gene ORF3b, leading to confusion. Our results liken ORF3d to other accessory genes in emerging viruses and highlight the importance of OLGs.
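As a toy illustration of what an overlapping gene is (not the method used in the study, which the abstract does not describe), the sketch below scans the three forward reading frames of a nucleotide string for ATG-to-stop open reading frames and reports pairs in different frames whose coordinates intersect. The function names `find_orfs` and `overlapping_pairs`, the `min_codons` threshold, and the example sequence are all hypothetical and chosen only for illustration; real annotation would also consider the reverse strand, alternative start codons, and evidence of translation.

```python
from itertools import combinations

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end, frame) for each ATG-to-stop ORF on the forward strand.

    start is inclusive; end is exclusive and includes the stop codon.
    min_codons counts codons before the stop (hypothetical toy threshold).
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Walk codon by codon within this reading frame.
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs


def overlapping_pairs(orfs):
    """Return pairs of ORFs in different frames whose coordinates intersect."""
    return [
        (a, b)
        for a, b in combinations(orfs, 2)
        if a[2] != b[2] and a[0] < b[1] and b[0] < a[1]
    ]


if __name__ == "__main__":
    # Made-up sequence: a frame-0 ORF with a shorter frame-1 ORF nested inside it.
    toy = "ATGAATGCCCTGACCTAA"
    for pair in overlapping_pairs(find_orfs(toy)):
        print(pair)
```

In this made-up sequence the same nucleotides are read in two frames, which is the defining property of an OLG: a single mutation in the shared region can change both encoded proteins at once.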

Keywords: ORF3d; SARS-CoV-2; evolutionary biology; genome annotation; infectious disease; microbiology; natural selection; overlapping genes; pandemic; virus.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Antibodies, Viral / immunology
  • Antibody Specificity
  • Antigens, Viral / biosynthesis
  • Antigens, Viral / genetics
  • Antigens, Viral / immunology
  • Betacoronavirus / genetics*
  • Betacoronavirus / pathogenicity
  • Betacoronavirus / physiology
  • COVID-19
  • China / epidemiology
  • Chiroptera / virology
  • Coronavirus / genetics
  • Coronavirus Infections / epidemiology
  • Coronavirus Infections / virology*
  • Epitopes / genetics
  • Epitopes / immunology
  • Europe / epidemiology
  • Eutheria / virology
  • Evolution, Molecular*
  • Gene Expression Regulation, Viral
  • Genes, Overlapping*
  • Genes, Viral*
  • Genetic Variation
  • Haplotypes / genetics
  • Host Specificity / genetics*
  • Humans
  • Models, Molecular
  • Mutation
  • Open Reading Frames / genetics*
  • Pandemics*
  • Phylogeny
  • Pneumonia, Viral / epidemiology
  • Pneumonia, Viral / virology*
  • Protein Biosynthesis
  • Protein Conformation
  • RNA, Viral / genetics
  • SARS-CoV-2
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Viral Proteins / genetics*
  • Viral Proteins / immunology

Substances

  • Antibodies, Viral
  • Antigens, Viral
  • Epitopes
  • ORF3d protein, SARS-CoV2 virus
  • RNA, Viral
  • Viral Proteins

Associated data

  • GEO/GSE149973