Improved de novo structure prediction in CASP11 by incorporating coevolution information into Rosetta

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):67-75. doi: 10.1002/prot.24974. Epub 2016 Feb 24.

Abstract

We describe CASP11 de novo blind structure predictions made using the Rosetta structure prediction methodology with both automatic and human assisted protocols. Model accuracy was generally improved using coevolution derived residue-residue contact information as restraints during Rosetta conformational sampling and refinement, particularly when the number of sequences in the family was more than three times the length of the protein. The highlight was the human assisted prediction of T0806, a large and topologically complex target with no homologs of known structure, which had unprecedented accuracy-<3.0 Å root-mean-square deviation (RMSD) from the crystal structure over 223 residues. For this target, we increased the amount of conformational sampling over our fully automated method by employing an iterative hybridization protocol. Our results clearly demonstrate, in a blind prediction scenario, that coevolution derived contacts can considerably increase the accuracy of template-free structure modeling. Proteins 2016; 84(Suppl 1):67-75. © 2015 Wiley Periodicals, Inc.

Keywords: ab initio prediction; coevolution; contact prediction; protein structure prediction; rosetta.

MeSH terms

  • Amino Acid Sequence
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data*
  • Crystallography, X-Ray
  • Directed Molecular Evolution
  • Escherichia coli / chemistry
  • Escherichia coli Proteins / chemistry*
  • Humans
  • Internet
  • Models, Molecular*
  • Models, Statistical*
  • Protein Folding
  • Protein Interaction Domains and Motifs
  • Protein Structure, Secondary
  • Sequence Alignment
  • Software*

Substances

  • Escherichia coli Proteins
  • YaaA protein, E coli