TY - JOUR
T1 - Contact prediction for beta and alpha-beta proteins using integer linear optimization and its impact on the first principles 3d structure prediction method ASTRO-FOLD
AU - Rajgaria, R.
AU - Wei, Y.
AU - Floudas, C. A.
PY - 2010/6
Y1 - 2010/6
N2 - An integer linear optimization model is presented to predict residue contacts in β, α + β, and α/β proteins. The total energy of a protein is expressed as sum of a C α-C α distance dependent contact energy contribution and a hydrophobic contribution. The model selects contact that assign lowest energy to the protein structure as satisfying a set of constraints that are included to enforce certain physically observed topological information. A new method based on hydrophobicity is proposed to find the β-sheet alignments. These β-sheet alignments are used as constraints for contacts between residues of β-sheets. This model was tested on three independent protein test sets and CASP8 test proteins consisting of β, α + β, α/β proteins and it was found to perform very well. The average accuracy of the predictions (separated by at least six residues) was ∼61%. The average true positive and false positive distances were also calculated for each of the test sets and they are 7.58 Å and 15.88 ̊, respectively. Residue contact prediction can be directly used to facilitate the protein tertiary structure prediction. This proposed residue contact prediction model is incorporated into the first principles protein tertiary structure prediction approach, ASTROFOLD. The effectiveness of the contact prediction model was further demonstrated by the improvement in the quality of the protein structure ensemble generated using the predicted residue contacts for a test set of 10 proteins.
AB - An integer linear optimization model is presented to predict residue contacts in β, α + β, and α/β proteins. The total energy of a protein is expressed as sum of a C α-C α distance dependent contact energy contribution and a hydrophobic contribution. The model selects contact that assign lowest energy to the protein structure as satisfying a set of constraints that are included to enforce certain physically observed topological information. A new method based on hydrophobicity is proposed to find the β-sheet alignments. These β-sheet alignments are used as constraints for contacts between residues of β-sheets. This model was tested on three independent protein test sets and CASP8 test proteins consisting of β, α + β, α/β proteins and it was found to perform very well. The average accuracy of the predictions (separated by at least six residues) was ∼61%. The average true positive and false positive distances were also calculated for each of the test sets and they are 7.58 Å and 15.88 ̊, respectively. Residue contact prediction can be directly used to facilitate the protein tertiary structure prediction. This proposed residue contact prediction model is incorporated into the first principles protein tertiary structure prediction approach, ASTROFOLD. The effectiveness of the contact prediction model was further demonstrated by the improvement in the quality of the protein structure ensemble generated using the predicted residue contacts for a test set of 10 proteins.
KW - CASP8
KW - CSA
KW - Force field
KW - Hydrophobic
KW - Integer linear optimization
KW - αBb
UR - http://www.scopus.com/inward/record.url?scp=77953489790&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77953489790&partnerID=8YFLogxK
U2 - 10.1002/prot.22696
DO - 10.1002/prot.22696
M3 - Article
C2 - 20225257
AN - SCOPUS:77953489790
SN - 0887-3585
VL - 78
SP - 1825
EP - 1846
JO - Proteins: Structure, Function and Bioinformatics
JF - Proteins: Structure, Function and Bioinformatics
IS - 8
ER -