Cell
Volume 149, Issue 7, 22 June 2012, Pages 1607-1621
Journal home page for Cell

Theory
Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing

https://doi.org/10.1016/j.cell.2012.04.012 Get rights and content
Under an Elsevier user license
open archive

Summary

We show that amino acid covariation in proteins, extracted from the evolutionary sequence record, can be used to fold transmembrane proteins. We use this technique to predict previously unknown 3D structures for 11 transmembrane proteins (with up to 14 helices) from their sequences alone. The prediction method (EVfold_membrane) applies a maximum entropy approach to infer evolutionary covariation in pairs of sequence positions within a protein family and then generates all-atom models with the derived pairwise distance constraints. We benchmark the approach with blinded de novo computation of known transmembrane protein structures from 23 families, demonstrating unprecedented accuracy of the method for large transmembrane proteins. We show how the method can predict oligomerization, functional sites, and conformational changes in transmembrane proteins. With the rapid rise in large-scale sequencing, more accurate and more comprehensive information on evolutionary constraints can be decoded from genetic variation, greatly expanding the repertoire of transmembrane proteins amenable to modeling by this method.

Highlights

► 3D structure prediction of 11 membrane proteins with no known structure ► Algorithm finds evolutionary couplings from genetic variation and massive sequencing ► Accuracy tested by blind prediction of known 3D protein structures from 23 families ► Computation is fast; provides clues for functional sites, conformations, and oligomers

Cited by (0)