A 3D building blocks approach to analyzing and predicting structure of proteins

Ron Unger, David Harel, Scot Wherland, Joel L. Sussman

Research output: Contribution to journalArticlepeer-review

225 Scopus citations

Abstract

A new approach is introduced for analyzing and ultimately predicting protein structures, defined at the level of Cα coordinates. We analyze hexamers (oligopeptides of six amino acid residues) and show that their structure tends to concentrate in specific clusters rather than vary continuously. Thus, we can use a limited set ofstandard structural building blocks taken from these clusters as representatives of the repertoire of observed hexamers. We demonstrate that protein structures can be approximated by concatenating such building blocks. We have identified about 100 building blocks by applying clustering algorithms, and have shown that they can “replace” about 76% ofall hexamers in well‐refined known proteins with an error of less than 1 Å, and can be joined together to cover 99% of the residues. After replacing each hexamer by a standard building block with similar conformation, we can approximately reconstruct the actual structure by smoothly joining the overlapping building blocks into a full protein. The reconstructed structures show, in most cases, high resemblance to the original structure, although using a limited number of building blocks and local criteria of concatenating them is not likely to produce a very precise global match. Since these building blocks reflect, in many cases, some sequence dependency, it may be possible to use the results of this study as a basis for a protein structure prediction procedure.

Original languageEnglish
Pages (from-to)355-373
Number of pages19
JournalProteins: Structure, Function and Bioinformatics
Volume5
Issue number4
DOIs
StatePublished - 1989
Externally publishedYes

Keywords

  • prediction
  • primary
  • protein
  • secondary
  • structure

Fingerprint

Dive into the research topics of 'A 3D building blocks approach to analyzing and predicting structure of proteins'. Together they form a unique fingerprint.

Cite this