Advertisement
No access
Research Article

A Method to Identify Protein Sequences That Fold into a Known Three-Dimensional Structure

Science
12 Jul 1991
Vol 253, Issue 5016
pp. 164-170

Abstract

The inverse protein folding problem, the problem of finding which amino acid sequences fold into a known three-dimensional (3D) structure, can be effectively attacked by finding sequences that are most compatible with the environments of the residues in the 3D structure. The environments are described by: (i) the area of the residue buried in the protein and inaccessible to solvent; (ii) the fraction of side-chain area that is covered by polar atoms (O and N); and (iii) the local secondary structure. Examples of this 3D profile method are presented for four families of proteins: the globins, cyclic AMP (adenosine 3′,5′-monophosphate) receptor-like proteins, the periplasmic binding proteins, and the actins. This method is able to detect the structural similarity of the actins and 70- kilodalton heat shock proteins, even though these protein families share no detectable sequence similarity.

Get full access to this article

View all available purchase options and get full access to this article.

References

ALBER, T, CONTRIBUTIONS OF HYDROGEN-BONDS OF THR-157 TO THE THERMODYNAMIC STABILITY OF PHAGE-T4 LYSOZYME, NATURE 330: 41 (1987).
ALBER, T, REPLACEMENTS OF PRO86 IN PHAGE-T4 LYSOZYME EXTEND AN ALPHA-HELIX BUT DO NOT ALTER PROTEIN STABILITY, SCIENCE 239: 631 (1988).
BASHFORD, D, DETERMINANTS OF A PROTEIN FOLD - UNIQUE FEATURES OF THE GLOBIN AMINO-ACID-SEQUENCES, JOURNAL OF MOLECULAR BIOLOGY 196: 199 (1987).
BERNSTEIN, F.C., PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES, JOURNAL OF MOLECULAR BIOLOGY 112: 535 (1977).
BOWIE, J.U., IDENTIFYING DETERMINANTS OF FOLDING AND ACTIVITY FOR A PROTEIN OF UNKNOWN STRUCTURE, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 86: 2152 (1989).
BOWIE, J.U., DECIPHERING THE MESSAGE IN PROTEIN SEQUENCES - TOLERANCE TO AMINO-ACID SUBSTITUTIONS, SCIENCE 247: 1306 (1990).
CHOTHIA, C, THE EVOLUTION OF PROTEIN STRUCTURES, COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 52: 399 (1987).
CHOU, P. Y., Prediction of the Secondary Structure of Proteins from their Amino Acid Sequence, ADVANCES IN ENZYMOLOGY AND RELATED AREAS OF MOLECULAR BIOLOGY 47: 45 (1978).
Dayhoff, M. O., Atlas of Protein Sequence and Structure 5 3: 353 (1979).
Dayhoff, M. O., Atlas of Protein Sequence and Structure (1968).
DEVEREUX, J, A COMPREHENSIVE SET OF SEQUENCE-ANALYSIS PROGRAMS FOR THE VAX, NUCLEIC ACIDS RESEARCH 12: 387 (1984).
Doolittle, R. F., Methods in Enzymology 183 (1990).
Doolittle, R. F., Of Urfs and Ofis: A Primer on How to Analyze Derived Amino Acid Sequences (1986).
DORIT, R.L., HOW BIG IS THE UNIVERSE OF EXONS, SCIENCE 250: 1377 (1990).
Drexler, K. E., Proceedings of the National Academy of Sciences of the United States of America 78: 5275 (1981).
EISENBERG, D, INTERPRETATION OF PROTEIN FOLDING AND BINDING WITH ATOMIC SOLVATION PARAMETERS, CHEMICA SCRIPTA 29: 217 (1989).
EISENBERG, D, SOLVATION ENERGY IN PROTEIN FOLDING AND BINDING, NATURE 319: 199 (1986).
Fano, R. M., Transmission of Information (1961).
FITCH, W.M., JOURNAL OF MOLECULAR BIOLOGY 16: 1 (1966).
FLAHERTY, K.M., 3-DIMENSIONAL STRUCTURE OF THE ATPASE FRAGMENT OF A 70K HEAT-SHOCK COGNATE PROTEIN, NATURE 346: 623 (1990).
GEORGE, D.G., THE PROTEIN IDENTIFICATION RESOURCE (PIR), NUCLEIC ACIDS RESEARCH 14: 11 (1986).
GILBERT, W, THE EXON THEORY OF GENES, COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 52: 901 (1987).
GO, M, PROTEIN ARCHITECTURE AND THE ORIGIN OF INTRONS, COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 52: 915 (1987).
GRIBSKOV, M, PROFILE ANALYSIS, METHODS IN ENZYMOLOGY 183: 146 (1990).
GRIBSKOV, M, PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 84: 4355 (1987).
KABSCH, W, ATOMIC-STRUCTURE OF THE ACTIN - DNASE-I COMPLEX, NATURE 347: 37 (1990).
KATZ, B.A., THE CRYSTALLOGRAPHICALLY DETERMINED STRUCTURES OF ATYPICAL STRAINED DISULFIDES ENGINEERED INTO SUBTILISIN, JOURNAL OF BIOLOGICAL CHEMISTRY 261: 15480 (1986).
LEE, B, INTERPRETATION OF PROTEIN STRUCTURES - ESTIMATION OF STATIC ACCESSIBILITY, JOURNAL OF MOLECULAR BIOLOGY 55: 379 (1971).
LESK, A.M., HOW DIFFERENT AMINO-ACID-SEQUENCES DETERMINE SIMILAR PROTEIN STRUCTURES - STRUCTURE AND EVOLUTIONARY DYNAMICS OF THE GLOBINS, JOURNAL OF MOLECULAR BIOLOGY 136: 225 (1980).
LESK, A.M., EVOLUTION OF PROTEINS FORMED BY BETA-SHEETS .2. THE CORE OF THE IMMUNOGLOBULIN DOMAINS, JOURNAL OF MOLECULAR BIOLOGY 160: 325 (1982).
LEVITT, M, COMPUTER-SIMULATION OF PROTEIN FOLDING, NATURE 253: 694 (1975).
LEVITT, M, STRUCTURAL PATTERNS IN GLOBULAR PROTEINS, NATURE 261: 552 (1976).
LIM, W.A., ALTERNATIVE PACKING ARRANGEMENTS IN THE HYDROPHOBIC CORE OF LAMBDA-REPRESSOR, NATURE 339: 31 (1989).
LUTHY, R, SECONDARY STRUCTURE-BASED PROFILES - USE OF STRUCTURE-CONSERVING SCORING TABLES IN SEARCHING PROTEIN-SEQUENCE DATABASES FOR STRUCTURAL SIMILARITIES, PROTEINS-STRUCTURE FUNCTION AND GENETICS 10: 229 (1991).
MCLACHLAN, A.D., JOURNAL OF MOLECULAR BIOLOGY 61: 409 (1971).
MULLERHILL, B, SEQUENCE HOMOLOGY BETWEEN LAC AND GAL REPRESSORS AND 3 SUGAR-BINDING PERIPLASMIC PROTEINS, NATURE 302: 163 (1983).
NAHARRO, G, GENE-PRODUCT OF V-FGR ONC - HYBRID PROTEIN CONTAINING A PORTION OF ACTIN AND A TYROSINE-SPECIFIC PROTEIN-KINASE, SCIENCE 223: 63 (1984).
NEEDLEMAN, S.B., A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS, JOURNAL OF MOLECULAR BIOLOGY 48: 443 (1970).
NEMETHY, G, Q REV BIOPHYS 10: 239 (1977).
PABO, C, MOLECULAR TECHNOLOGY - DESIGNING PROTEINS AND PEPTIDES, NATURE 301: 200 (1983).
PONDER, J.W., TERTIARY TEMPLATES FOR PROTEINS - USE OF PACKING CRITERIA IN THE ENUMERATION OF ALLOWED SEQUENCES FOR DIFFERENT STRUCTURAL CLASSES, JOURNAL OF MOLECULAR BIOLOGY 193: 775 (1987).
REIDHAAROLSON, J.F., COMBINATORIAL CASSETTE MUTAGENESIS AS A PROBE OF THE INFORMATIONAL CONTENT OF PROTEIN SEQUENCES, SCIENCE 241: 53 (1988).
RICHARDSON, J.S., ADVANCES IN PROTEIN CHEMISTRY 34: 167 (1981).
RICHMOND, T.J., PACKING OF ALPHA-HELICES - GEOMETRICAL CONSTRAINTS AND CONTACT AREAS, JOURNAL OF MOLECULAR BIOLOGY 119: 537 (1978).
SALI, A, DEFINITION OF GENERAL TOPOLOGICAL EQUIVALENCE IN PROTEIN STRUCTURES - A PROCEDURE INVOLVING COMPARISON OF PROPERTIES AND RELATIONSHIPS THROUGH SIMULATED ANNEALING AND DYNAMIC-PROGRAMMING, JOURNAL OF MOLECULAR BIOLOGY 212: 403 (1990).
SAMS, C.F., PREDICTED STRUCTURE OF THE SUGAR-BINDING SITE OF THE LAC REPRESSOR, NATURE 310: 429 (1984).
SANDBERG, W.S., INFLUENCE OF INTERIOR PACKING AND HYDROPHOBICITY ON THE STABILITY OF A PROTEIN, SCIENCE 245: 54 (1989).
SANDER, C, DATABASE OF HOMOLOGY-DERIVED PROTEIN STRUCTURES AND THE STRUCTURAL MEANING OF SEQUENCE ALIGNMENT, PROTEINS-STRUCTURE FUNCTION AND GENETICS 9: 56 (1991).
SMITH, T.F., ADV APPL MATH 2: 482 (1981).
SPIRO, S, FNR AND ITS ROLE IN OXYGEN-REGULATED GENE-EXPRESSION IN ESCHERICHIA-COLI, FEMS MICROBIOLOGY REVIEWS 75: 399 (1990).
TAKANO, T, STRUCTURE OF MYOGLOBIN REFINED AT 2.0 A RESOLUTION .1. CRYSTALLOGRAPHIC REFINEMENT OF METMYOGLOBIN FROM SPERM WHALE, JOURNAL OF MOLECULAR BIOLOGY 110: 537 (1977).
TAYLOR, W.R., JOURNAL OF MOLECULAR BIOLOGY 188: 233 (1986).
TAYLOR, W.R., PROTEIN-STRUCTURE ALIGNMENT, JOURNAL OF MOLECULAR BIOLOGY 208: 1 (1989).
VYAS, N.K., COMPARISON OF THE PERIPLASMIC RECEPTORS FOR L-ARABINOSE, D-GLUCOSE D-GALACTOSE, AND D-RIBOSE - STRUCTURAL AND FUNCTIONAL SIMILARITY, JOURNAL OF BIOLOGICAL CHEMISTRY 266: 5226 (1991).
WEAVER, L.H., HIGH-RESOLUTION STRUCTURE OF THE TEMPERATURE-SENSITIVE MUTANT OF PHAGE LYSOZYME, ARG 96-]HIS, BIOCHEMISTRY 28: 3793 (1989).
WEBER, I.T., PREDICTED STRUCTURES OF CAMP BINDING DOMAINS OF TYPE-I AND TYPE-II REGULATORY SUBUNITS OF CAMP-DEPENDENT PROTEIN-KINASE, BIOCHEMISTRY 26: 343 (1987).
WEBER, I.T., PREDICTED STRUCTURES OF THE CGMP BINDING DOMAINS OF THE CGMP-DEPENDENT PROTEIN-KINASE - A KEY ALANINE THREONINE DIFFERENCE IN EVOLUTIONARY DIVERGENCE OF CAMP AND CGMP BINDING-SITES, BIOCHEMISTRY 28: 6122 (1989).
WEBER, I.T., STRUCTURE OF A COMPLEX OF CATABOLITE GENE ACTIVATOR PROTEIN AND CYCLIC-AMP REFINED AT 2.5 A RESOLUTION, JOURNAL OF MOLECULAR BIOLOGY 198: 311 (1987).
Weber, I. T., Proceedings of the National Academy of Sciences of the United States of America 79: 7679 (1982).
WELLS, J.A., ADDITIVITY OF MUTATIONAL EFFECTS IN PROTEINS, BIOCHEMISTRY 29: 8509 (1990).

(0)eLetters

eLetters is a forum for ongoing peer review. eLetters are not edited, proofread, or indexed, but they are screened. eLetters should provide substantive and scholarly commentary on the article. Embedded figures cannot be submitted, and we discourage the use of figures within eLetters in general. If a figure is essential, please include a link to the figure within the text of the eLetter. Please read our Terms of Service before submitting an eLetter.

Log In to Submit a Response

No eLetters have been published for this article yet.

Information & Authors

Information

Published In

Science
Volume 253 | Issue 5016
12 July 1991

Submission history

Published in print: 12 July 1991

Permissions

Request permissions for this article.

Authors

Affiliations

James U. Bowie
Molecular Biology Institute and the Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90024-1570.
Roland Lüthy
Molecular Biology Institute and the Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90024-1570.
David Eisenberg
Molecular Biology Institute and the Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90024-1570.

Metrics & Citations

Metrics

Article Usage

Altmetrics

Citations

Cite as

Export citation

Select the format you want to export the citation of this publication.

Cited by

  1. Distinct control of PERIOD2 degradation and circadian rhythms by the oncoprotein and ubiquitin ligase MDM2, Science Signaling, 11, 556, (2021)./doi/10.1126/scisignal.aau0715
    Abstract
  2. Atomic structures of low-complexity protein segments reveal kinked β sheets that assemble networks, Science, 359, 6376, (698-701), (2021)./doi/10.1126/science.aan6398
    Abstract
  3. DNA-PKcs structure suggests an allosteric mechanism modulating DNA double-strand break repair, Science, 355, 6324, (520-524), (2021)./doi/10.1126/science.aak9654
    Abstract
  4. Learning to Speak the Language of Proteins, Science, 302, 5649, (1347-1348), (2021)./doi/10.1126/science.1092492
    Abstract
  5. Hidden Models in Biopolymers, Science, 282, 5393, (1436-1437), (1998)./doi/10.1126/science.282.5393.1436
    Abstract
  6. Crystal Structure of lac Repressor Core Tetramer and Its Implications for DNA Looping, Science, 268, 5218, (1721-1727), (1995)./doi/10.1126/science.7792597
    Abstract
  7. Architectures of Class-Defining and Specific Domains of Glutamyl-tRNA Synthetase, Science, 267, 5206, (1958-1965), (1995)./doi/10.1126/science.7701318
    Abstract
  8. Solution Structure of the Activator Contact Domain of the RNA Polymerase α Subunit, Science, 270, 5241, (1495-1497), (1995)./doi/10.1126/science.270.5241.1495
    Abstract
  9. Crystal Structure of LacI Member, PurR, Bound to DNA: Minor Groove Binding by α Helices, Science, 266, 5186, (763-770), (1994)./doi/10.1126/science.7973627
    Abstract
  10. The Parallel β Helix of Pectate Lyase C: Something to Sneeze at, Science, 260, 5113, (1444-1445), (1993)./doi/10.1126/science.8502989
    Abstract
  11. See more
Loading...

View Options

Check Access

Log in to view the full text

AAAS ID LOGIN

AAAS login provides access to Science for AAAS Members, and access to other journals in the Science family to users who have purchased individual subscriptions.

Log in via OpenAthens.
Log in via Shibboleth.

More options

Purchase digital access to this article

Download and print this article for your personal scholarly, research, and educational use.

Purchase this issue in print

Buy a single issue of Science for just $15 USD.

View options

PDF format

Download this article as a PDF file

Download PDF

Media

Figures

Multimedia

Tables

Share

Share

Share article link

Share on social media