Protein secondary structure prediction based on position-specific scoring matrices

被引:4425
作者
Jones, DT [1 ]
机构
[1] Univ Warwick, Dept Biol Sci, Coventry CV4 7AL, W Midlands, England
关键词
protein structure prediction; secondary structure; protein folding; sequence analysis; neural network;
D O I
10.1006/jmbi.1999.3091
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A two-stage neural network has been used to predict protein secondary structure based on the position specific scoring matrices generated by PSI-BLAST. Despite the simplicity and convenience of the approach used, the results are found to be superior to those produced by other methods, including the popular PHD method according to our own benchmarking results and the results from the recent Critical Assessment of Techniques for Protein Structure Prediction experiment (CASP3), where the method was evaluated by stringent blind testing. Using a new testing set based on a set of 187 unique folds, and three-way cross-validation based on structural similarity criteria rather than sequence similarity criteria used previously (no similar folds were present in both the testing and training sets) the method presented here (PSIPRED) achieved an average Q(3) score of between 76.5% to 78.3% depending on the precise definition of observed secondary structure used, which is the highest published score for any method to date. Given the success of the method in CASP3, it is reasonable to be confident that the evaluation presented here gives a fair indication of the performance of the method in general. (C) 1999 Academic Press.
引用
收藏
页码:195 / 202
页数:8
相关论文
共 32 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   PATTERNS OF DIVERGENCE IN HOMOLOGOUS PROTEINS AS INDICATORS OF SECONDARY AND TERTIARY STRUCTURE - A PREDICTION OF THE STRUCTURE OF THE CATALYTIC DOMAIN OF PROTEIN-KINASES [J].
BENNER, SA ;
GERLOFF, D .
ADVANCES IN ENZYME REGULATION, 1991, 31 :121-181
[3]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[4]   PREDICTION OF PROTEIN CONFORMATION [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :222-245
[5]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
[6]  
2-4
[7]   Incorporation of non-local interactions in protein secondary structure prediction from the amino acid sequence [J].
Frishman, D ;
Argos, P .
PROTEIN ENGINEERING, 1996, 9 (02) :133-142
[8]  
Frishman D, 1997, PROTEINS, V27, P329, DOI 10.1002/(SICI)1097-0134(199703)27:3<329::AID-PROT1>3.0.CO
[9]  
2-8
[10]   ANALYSIS OF ACCURACY AND IMPLICATIONS OF SIMPLE METHODS FOR PREDICTING SECONDARY STRUCTURE OF GLOBULAR PROTEINS [J].
GARNIER, J ;
OSGUTHORPE, DJ ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1978, 120 (01) :97-120