Advertisement
No access
Research Articles

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Science
21 Nov 2003
Vol 302, Issue 5649
pp. 1364-1368

Abstract

A major challenge of computational protein design is the creation of novel proteins with arbitrarily chosen three-dimensional structures. Here, we used a general computational strategy that iterates between sequence design and structure prediction to design a 93-residue α/β protein called Top7 with a novel sequence and topology. Top7 was found experimentally to be folded and extremely stable, and the x-ray crystal structure of Top7 is similar (root mean square deviation equals 1.2 angstroms) to the design model. The ability to design a new protein fold makes possible the exploration of the large regions of the protein universe not yet observed in nature.

Get full access to this article

View all available purchase options and get full access to this article.

Supplementary Material

File (1089427s.pdf)

References and Notes

1
B. I. Dahiyat, S. L. Mayo, Science278, 82 (1997).
2
J. R. Desjarlais, T. M. Handel, Protein Sci.4, 2006 (1995).
3
J. W. Ponder, F. M. Richards, J. Mol. Biol.193, 775 (1987).
4
J. Reinaet al., Nature Struct. Biol.9, 621 (2002).
5
L. L. Looger, M. A. Dwyer, J. J. Smith, H. W. Hellinga, Nature423, 185 (2003).
6
S. M. Malakauskas, S. L. Mayo, Nature Struct. Biol.5, 470 (1998).
7
E. C. Johnson, G. A. Lazar, J. R. Desjarlais, T. M. Handel, Structure Fold. Des.7, 967 (1999).
8
P. Koehl, M. Levitt, J. Mol. Biol.293, 1161 (1999).
9
B. Kuhlman, D. Baker, Proc. Natl. Acad. Sci. U.S.A.97, 10383 (2000).
10
A. Jaramillo, L. Wernisch, S. Hery, S. J. Wodak, Proc. Natl. Acad. Sci. U.S.A.99, 13554 (2002).
11
K. Raha, A. M. Wollacott, M. J. Italia, J. R. Desjarlais, Protein Sci.9, 1106 (2000).
12
A. Su, S. L. Mayo, Protein Sci.6, 1701 (1997).
13
S. M. Larson, J. L. England, J. R. Desjarlais, V. S. Pande, Protein Sci.11, 2804 (2002).
14
B. Kuhlman, J. W. O'Neill, D. E. Kim, K. Y. Zhang, D. Baker, J. Mol. Biol.315, 471 (2002).
15
P. B. Harbury, J. J. Plecs, B. Tidor, T. Alber, P. S. Kim, Science282, 1462 (1998).
16
A. E. Keating, V. N. Malashkevich, B. Tidor, P. S. Kim, Proc. Natl. Acad. Sci. U.S.A.98, 14825 (2001).
17
The TOPS database is available at www.tops.leeds.ac.uk/.
18
P. M. Bowers, C. E. Strauss, D. Baker, J. Biomol. NMR18, 311 (2000).
19
T. Kortemme, A. V. Morozov, D. Baker, J. Mol. Biol.326, 1239 (2003).
20
T. Lazaridis, M. Karplus, Proteins Struct. Func. Genet.35, 132 (1999).
21
Materials and methods are available as supporting material on Science Online.
22
R. L. Dunbrack, F. E. Cohen, Protein Sci.6, 1661 (1997).
23
R. Bonneauet al., Proteins (suppl. 5), 119 (2001).
24
C. Rohl, C. E. M. Straus, K. M. S. Misura, D. Baker, Methods Enzymol., in press.
25
We used the Davidson-Fletcher-Powell algorithm as described by W. H. Press et al., in Numerical Recipes in C (Cambridge Univ. Press, Cambridge, ed. 2, 1992), pp. 428–429.
26
B. Kuhlmanet al., data not shown.
27
The sequence of Top7 is mgDIQVQVNIDDNGKNFDYTYTVTTESELQKVLNELKDYIKKQGAKRVRISITARTKKEAEKFAAILIKVFAELGYNDINVTFDGDTVTVEGQLEggslehhhhhh; the computationally designed sequence is in uppercase and residues added to allow expression and purification are in lowercase. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr.
28
S. F. Altschulet al., Nucleic Acids Res.25, 3389 (1997).
29
A synthetic gene with the Top7 sequence under the control of the T7 promoter, a C-terminal 6X His tag, and codon usage optimized for Escherichia coli was obtained from BlueHeron Biotechnologies (Bothell, WA). After expression in E. coli, the protein was readily purified to >95% homogeneity with the use of nickel affinity chromatography followed by anion exchange chromatography.
30
J. K. Myers, C. N. Pace, J. M. Scholtz, Protein Sci.4, 2138 (1995).
31
K. W. Plaxco, K. T. Simons, D. Baker, J. Mol. Biol.277, 985 (1998).
32
The structure of Top7_K35M was solved by molecular replacement with the program EPMR (39) and by direct rebuilding into an unbiased SAD electron density map and residual difference Fourier maps. For molecular replacement, 19 large surface residues such as Lys, Arg, Gln, and Glu were truncated to Ala in the search model. The correlation coefficient for the initial MR search, using data to 4.0 Å resolution, was 0.52, compared with background of 0.36. For SAD phasing, the position of SeMet 37 was determined from an anomalous difference Patterson map. The initial phasing power and figure of merit for SAD phasing were 1.99 and 0.24, respectively, before density modification. An interpretable electron density map was obtained after density modification with solvent flipping with a solvent content of 43% with the use of CNS (40). An initial model was built with the use of XtalView (41) and O (42). The model was refined with CNS with the use of the mlhl target (maximum likelihood, Hendrickson-Lattman coefficients) with 5% of the data excluded for the calculation of the cross-validating free R. Of all the built residues, 88% are in the most favorable regions of Ramachandran space, and 12% are in the allowed regions. Statistics from phasing and refinement are shown in table S2.
33
L. Holm, C. Sander, Trends Biochem. Sci.20, 478 (1995).
34
T. J. Hubbard, A. G. Murzin, S. E. Brenner, C. Chothia, Nucleic Acids Res.25, 236 (1997).
35
A. Murzin, personal communication.
36
J. J. Havranek, P. B. Harbury, Nature Struct. Biol.10, 45 (2003).
37
W. Jin, O. Kambara, H. Sasakawa, A. Tamura, S. Takada, Structure (Cambridge)11, 581 (2003).
38
L. Mirny, E. Shakhnovich, J. Mol. Biol.308, 123 (2001).
39
EPMR: A program for crystallographic molecular replacement by evolutionary search (C. R. Kissinger, D. K. Gehlhaar, Agouron Pharmaceuticals, La Jolla, CA).
40
A. T. Brüngeret al., Acta Crystallogr. D. Biol. Crystallogr.54, 905 (1998).
41
D. E. McRee, J. Struct. Biol.125, 156 (1999).
42
T. A. Jones, J. Y. Zou, S. W. Cowan, M. Kjeldgaard, Acta Crystallagr. A47, 110 (1991).
43
S. Mori, C. Abeygunawardana, M. O. Johnson, P. C. van Zijl, J. Magn. Reson. B.108, 94 (1995).
44
We acknowledge the expert assistance of B. Shen in crystallographic phasing, modeling, and refinement of the TOP7 structure, C. Rohl for aiding in the incorporation of RosettaDesign into Rosetta, C. Strauss for helping to generate the initial models of Top7, T. Leeper for help with two-dimensional NMR studies, and R. Klevit and the Klevit laboratory for help with preliminary NMR characterization of Top7. Academic users can obtain licensing information for RosettaDesign at www.unc.edu/kuhlmanpg/rosettadesign.htm. The coordinates and structure factors for the Top7 x-ray crystal structure have been deposited in the PDB with accession code 1QYS. B.K. was supported by a fellowship from the Cancer Research Fund of the Damon Runyon–Walter Winchell Foundation. This work was also supported by NIH grants to G.V., B.L.S., and D.B.

(0)eLetters

eLetters is a forum for ongoing peer review. eLetters are not edited, proofread, or indexed, but they are screened. eLetters should provide substantive and scholarly commentary on the article. Embedded figures cannot be submitted, and we discourage the use of figures within eLetters in general. If a figure is essential, please include a link to the figure within the text of the eLetter. Please read our Terms of Service before submitting an eLetter.

Log In to Submit a Response

No eLetters have been published for this article yet.

Information & Authors

Information

Published In

Science
Volume 302 | Issue 5649
21 November 2003

Submission history

Received: 21 July 2003
Accepted: 25 September 2003
Published in print: 21 November 2003

Permissions

Request permissions for this article.

Notes

Supporting Online Material
www.sciencemag.org/cgi/content/full/302/5649/1364/DC1
Materials and Methods
Tables S1 to S6

Authors

Affiliations

Brian Kuhlman*,†
Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
Gautam Dantas*
Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
Gregory C. Ireton
Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA.
Gabriele Varani
Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
Department of Chemistry, University of Washington, Seattle, WA 98195, USA.
Barry L. Stoddard
Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA.
David Baker [email protected]
Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.

Notes

To whom correspondence should be addressed. E-mail: [email protected]

Metrics & Citations

Metrics

Article Usage

Altmetrics

Citations

Cite as

Export citation

Select the format you want to export the citation of this publication.

Cited by

  1. Generative power of a protein language model trained on multiple sequence alignments, eLife, 12, (2023).https://doi.org/10.7554/eLife.79854
    Crossref
  2. Computational Insights into the Allosteric Modulation of a Phthalate-Degrading Hydrolase by Distal Mutations, Biomolecules, 13, 3, (443), (2023).https://doi.org/10.3390/biom13030443
    Crossref
  3. Molecular determinants of μ-conotoxin KIIIA interaction with the human voltage-gated sodium channel NaV1.7, Frontiers in Pharmacology, 14, (2023).https://doi.org/10.3389/fphar.2023.1156855
    Crossref
  4. Quantitative comparison of protein-protein interaction interface using physicochemical feature-based descriptors of surface patches, Frontiers in Molecular Biosciences, 10, (2023).https://doi.org/10.3389/fmolb.2023.1110567
    Crossref
  5. From nature to industry: Harnessing enzymes for biocatalysis, Science, 382, 6673, (2023)./doi/10.1126/science.adh8615
    Abstract
  6. Structural features of sensory two component systems: a synthetic biology perspective, Biochemical Journal, 480, 2, (127-140), (2023).https://doi.org/10.1042/BCJ20210798
    Crossref
  7. Computational design of a cyclic peptide that inhibits the CTLA4 immune checkpoint, RSC Medicinal Chemistry, (2023).https://doi.org/10.1039/D2MD00409G
    Crossref
  8. Computational design of constitutively active cGAS, Nature Structural & Molecular Biology, 30, 1, (72-80), (2023).https://doi.org/10.1038/s41594-022-00862-z
    Crossref
  9. Network of hotspot interactions cluster tau amyloid folds, Nature Communications, 14, 1, (2023).https://doi.org/10.1038/s41467-023-36572-3
    Crossref
  10. Computational Protein Design for COVID-19 Research and Emerging Therapeutics, ACS Central Science, (2023).https://doi.org/10.1021/acscentsci.2c01513
    Crossref
  11. See more
Loading...

View Options

Check Access

Log in to view the full text

AAAS ID LOGIN

AAAS login provides access to Science for AAAS Members, and access to other journals in the Science family to users who have purchased individual subscriptions.

Log in via OpenAthens.
Log in via Shibboleth.

More options

Purchase digital access to this article

Download and print this article for your personal scholarly, research, and educational use.

Purchase this issue in print

Buy a single issue of Science for just $15 USD.

View options

PDF format

Download this article as a PDF file

Download PDF

Full Text

FULL TEXT

Media

Figures

Multimedia

Tables

Share

Share

Share article link

Share on social media