Abstract
We study the statistical properties of contact vectors, a construct to characterize a protein’s structure. The contact vector of an N-residue protein is a list of N integers [formula presented] representing the number of residues in contact with residue i. We study analytically (at mean-field level) and numerically the amount of structural information contained in a contact vector. Analytical calculations reveal that a large variance in the contact numbers reduces the degeneracy of the mapping between contact vectors and structures. Exact enumeration for lengths up to [formula presented] on the three-dimensional cubic lattice indicates that the growth rate of number of contact vectors as a function of N is only 3% less than that for contact maps. In particular, for compact structures we present numerical evidence that, practically, each contact vector corresponds to only a handful of structures. We discuss how this information can be used for better structure prediction.
Original language | English |
---|---|
Pages (from-to) | 7 |
Number of pages | 1 |
Journal | Physical Review E |
Volume | 65 |
Issue number | 4 |
DOIs | |
State | Published - Apr 2002 |