Prediction of local structure in proteins using a library of sequence-structure motifs

被引:244
作者
Bystroff, C [1 ]
Baker, D [1 ]
机构
[1] Univ Washington, Dept Biochem, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
I-sites library; sequence patterns; clustering; initiation sites; protein folding;
D O I
10.1006/jmbi.1998.1943
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe a new method for local protein structure prediction based on a library of short sequence pattern that correlate strongly with protein three-dimensional structural elements. The library: was generated using an automated method for finding correlations between protein sequence and local structure, and contains most previously described local sequence-structure correlations as well as new relationships, including a diverging type-II beta-turn, a frayed helix, and a proline-terminated helix. The query sequence is scanned for segments 7 to 19 residues in length that strongly match one of the 82 patterns in the library. Matching segments are assigned the three-dimensional structure characteristic of the corresponding sequence pattern, and backbone torsion angles for the entire query sequence are then predicted by piecing together mutually compatible segment predictions. In predictions of local structure in a test set of 55 proteins, about 50% of all residues, and 76% of residues covered by high-confidence predictions, were found in eight-residue segments within 1.4 Angstrom of their true structures. The predictions are complementary to traditional secondary structure predictions because they are considerably more specific in turn regions, and may contribute to ab initio tertiary structure prediction and fold recognition. (C) 1998 Academic Press.
引用
收藏
页码:565 / 577
页数:13
相关论文
共 49 条
[11]  
Everitt B., 1993, CLUSTER ANAL
[12]  
Fischer D, 1996, PROTEIN SCI, V5, P947
[13]  
GRIBSKOV M, 1990, METHOD ENZYMOL, V183, P146
[14]   Global properties of the mapping between local amino acid sequence and local structure in proteins [J].
Han, KF ;
Baker, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (12) :5814-5818
[15]   RECURRING LOCAL SEQUENCE MOTIFS IN PROTEINS [J].
HAN, KF ;
BAKER, D .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 251 (01) :176-187
[16]   Three-dimensional structures and contexts associated with recurrent amino acid sequence patterns [J].
Han, KF ;
Bystroff, C ;
Baker, D .
PROTEIN SCIENCE, 1997, 6 (07) :1587-1590
[17]  
Hart P.E., 1973, Pattern recognition and scene analysis
[18]  
HOBOHM U, 1992, PROTEIN SCI, V1, P409
[19]   A REVISED SET OF POTENTIALS FOR BETA-TURN FORMATION IN PROTEINS [J].
HUTCHINSON, EG ;
THORNTON, JM .
PROTEIN SCIENCE, 1994, 3 (12) :2207-2216
[20]   SYNTHETIC PEPTIDES PROBE FOLDING INITIATION SITES IN PLATELET FACTOR-IV - STABLE CHAIN REVERSAL FOUND WITHIN THE HYDROPHOBIC SEQUENCE LIATLKNGRKISL [J].
ILYINA, E ;
MILIUS, R ;
MAYO, KH .
BIOCHEMISTRY, 1994, 33 (45) :13436-13444