Solution structure of a de novo protein from a designed combinatorial library

Yinan Wei, Seho Kim, David Fela, Jean Baum, Michael H. Hecht

Research output: Contribution to journalArticlepeer-review

106 Scopus citations


Combinatorial libraries of de novo amino acid sequences can provide a rich source of diversity for the discovery of novel proteins. Randomly generated sequences, however, rarely fold into well ordered protein-like structures. To enhance the quality of a library, diversity must be focused into those regions of sequence space most likely to yield well folded structures. We have constructed focused libraries of de novo sequences by designing the binary pattern of polar and nonpolar amino acids to favor structures that contain abundant secondary structure, while simultaneously burying hydrophobic side chains in the protein interior and exposing hydrophilic side chains to solvent. Because binary patterning specifies only the polar/nonpolar periodicity, but not the identities of the side chains, detailed structural features, including packing interactions, cannot be designed a priori. Can binary patterned libraries nonetheless encode well folded proteins? An unambiguous answer to this question requires determination of a 3D structure. We used NMR spectroscopy to determine the structure of S-824, a novel protein from a recently constructed library of 102-residue sequences. This library is "naive" in that it has not been subjected to high-throughput screens or directed evolution. The experimentally determined structure of S-824 is a four-helix bundle, as specified by the design. As dictated by the binary-code strategy, nonpolar side chains are buried in the protein interior, and polar side chains are exposed to solvent. The polypeptide backbone and buried side chains are well ordered, demonstrating that S-824 is not a molten globule and forms a unique structure. These results show that amino acid sequences that have neither been selected by evolution, nor designed by computer, nor isolated by high-throughput screening, can form native-like structures. These findings validate the binary-code strategy as an effective method for producing vast collections of well folded de novo proteins.

Original languageEnglish
Pages (from-to)13270-13273
Number of pages4
JournalProceedings of the National Academy of Sciences of the United States of America
Issue number23
StatePublished - Nov 11 2003

ASJC Scopus subject areas

  • General


Dive into the research topics of 'Solution structure of a de novo protein from a designed combinatorial library'. Together they form a unique fingerprint.

Cite this