Complete nucleotide sequence of SV40 DNA

W Fiers; R Contreras; G Haegemann; R Rogiers; A Van de Voorde; H Van Heuverswyn; J Van Herreweghe; G Volckaert; M Ysebaert

doi:10.1038/273113a0

Complete nucleotide sequence of SV40 DNA

Nature. 1978 May 11;273(5658):113-20. doi: 10.1038/273113a0.

Authors

W Fiers, R Contreras, G Haegemann, R Rogiers, A Van de Voorde, H Van Heuverswyn, J Van Herreweghe, G Volckaert, M Ysebaert

PMID: 205802
DOI: 10.1038/273113a0

Abstract

The determination of the total 5,224 base-pair DNA sequence of the virus SV40 has enabled us to locate precisely the known genes on the genome. At least 15.2% of the genome is presumably not translated into polypeptides. Particular points of interest revealed by the complete sequence are the initiation of the early t and T antigens at the same position and the fact that the T antigen is coded by two non-contiguous regions of the genome; the T antigen mRNA is spliced in the coding region. In the late region the gene for the major protein VP1 overlaps those for proteins VP2 and VP3 over 122 nucleotides but is read in a different frame. The almost complete amino acid sequences of the two early proteins as well as those of the late proteins have been deduced from the nucleotide sequence. The mRNAs for the latter three proteins are presumably spliced out of a common primary RNA transcript. The use of degenerate codons is decidedly non-random, but is similar for the early and late regions. Codons of the type NUC, NCG and CGN are absent or very rare.

MeSH terms

Antigens, Neoplasm / genetics
Antigens, Viral / genetics
Base Sequence
Codon
DNA Replication
DNA, Viral*
Genes
Genes, Viral*
RNA, Viral / genetics
Simian virus 40 / genetics*
Transcription, Genetic
Viral Proteins / genetics
Virus Replication

Substances

Antigens, Neoplasm
Antigens, Viral
Codon
DNA, Viral
RNA, Viral
Viral Proteins

Associated data

GENBANK/J02400
GENBANK/J02402
GENBANK/J02403
GENBANK/J02406
GENBANK/J02407
GENBANK/J02408
GENBANK/J02409
GENBANK/J02410
GENBANK/V01380