TY - JOUR
T1 - A 3D building blocks approach to analyzing and predicting structure of proteins
AU - Unger, Ron
AU - Harel, David
AU - Wherland, Scot
AU - Sussman, Joel L.
PY - 1989
Y1 - 1989
N2 - A new approach is introduced for analyzing and ultimately predicting protein structures, defined at the level of Cα coordinates. We analyze hexamers (oligopeptides of six amino acid residues) and show that their structure tends to concentrate in specific clusters rather than vary continuously. Thus, we can use a limited set ofstandard structural building blocks taken from these clusters as representatives of the repertoire of observed hexamers. We demonstrate that protein structures can be approximated by concatenating such building blocks. We have identified about 100 building blocks by applying clustering algorithms, and have shown that they can “replace” about 76% ofall hexamers in well‐refined known proteins with an error of less than 1 Å, and can be joined together to cover 99% of the residues. After replacing each hexamer by a standard building block with similar conformation, we can approximately reconstruct the actual structure by smoothly joining the overlapping building blocks into a full protein. The reconstructed structures show, in most cases, high resemblance to the original structure, although using a limited number of building blocks and local criteria of concatenating them is not likely to produce a very precise global match. Since these building blocks reflect, in many cases, some sequence dependency, it may be possible to use the results of this study as a basis for a protein structure prediction procedure.
AB - A new approach is introduced for analyzing and ultimately predicting protein structures, defined at the level of Cα coordinates. We analyze hexamers (oligopeptides of six amino acid residues) and show that their structure tends to concentrate in specific clusters rather than vary continuously. Thus, we can use a limited set ofstandard structural building blocks taken from these clusters as representatives of the repertoire of observed hexamers. We demonstrate that protein structures can be approximated by concatenating such building blocks. We have identified about 100 building blocks by applying clustering algorithms, and have shown that they can “replace” about 76% ofall hexamers in well‐refined known proteins with an error of less than 1 Å, and can be joined together to cover 99% of the residues. After replacing each hexamer by a standard building block with similar conformation, we can approximately reconstruct the actual structure by smoothly joining the overlapping building blocks into a full protein. The reconstructed structures show, in most cases, high resemblance to the original structure, although using a limited number of building blocks and local criteria of concatenating them is not likely to produce a very precise global match. Since these building blocks reflect, in many cases, some sequence dependency, it may be possible to use the results of this study as a basis for a protein structure prediction procedure.
KW - prediction
KW - primary
KW - protein
KW - secondary
KW - structure
UR - http://www.scopus.com/inward/record.url?scp=0024395940&partnerID=8YFLogxK
U2 - 10.1002/prot.340050410
DO - 10.1002/prot.340050410
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 2798411
AN - SCOPUS:0024395940
SN - 0887-3585
VL - 5
SP - 355
EP - 373
JO - Proteins: Structure, Function and Bioinformatics
JF - Proteins: Structure, Function and Bioinformatics
IS - 4
ER -