Abstract
We present a statistical analysis of protein structures based on interatomic C-alpha distances. The overall distance distributions reflect in detail the contents of sequence-specific substructures maintained by local interactions (such as alpha-helixes) and longer range interactions (such as disulfide bridges and beta-sheets). We also show that a volume scaling of the distances makes distance distributions for protein chains of different length superimposable. Distance distributions were also calculated specifically for amino acids separated by a given number of residues. Specific features in these distributions are visible for sequence separations of up to 20 amino acid residues. A simple representation, which preserves most of the information in the distance distributions, was obtained using six parameters only. The parameters give rise to canonical distance intervals and when predicting coarse-grained distance constraints by methods such as data-driven artificial neural networks, these should preferable be selected from these intervals. We discuss the use of the six parameters for determining or reconstructing 3-D protein structures.
Original language | English |
---|---|
Journal | Protein Engineering |
Volume | 9 |
Issue number | 9 |
Pages (from-to) | 733-740 |
ISSN | 0269-2139 |
DOIs | |
Publication status | Published - 1996 |