DETECTING FRAME SHIFTS BY AMINO-ACID-SEQUENCE COMPARISON

被引:25
作者
CLAVERIE, JM
机构
[1] National Center for Biotechnology Information, National Library of Medicine, National institutes of Health, Bethesda
关键词
AMINO ACID SEQUENCE; COMPUTER ANALYSIS; FRAME SHIFT; EVOLUTION; ERROR;
D O I
10.1006/jmbi.1993.1666
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Various amino acid substitution scoring matrices are used in conjunction with local alignments programs to detect regions of similarity and infer potential common ancestry between proteins. The usual scoring schemes derive from the implicit hypothesis that related proteins evolve from a common ancestor by the accumulation of point mutations and that amino acids tend to be progressively substituted by others with similar properties. However, other frequent single mutation events, like nucleotide insertion or deletion and gene inversion, change the translation reading frame and cause previously encoded amino acid sequences to become unrecognizable at once. Here, I derive five new types of scoring matrix, each capable of detecting a specific frame shift (deletion, insertion and inversion in 3 frames) and use them with a regular local alignments program to detect amino acid sequences that may have derived from alternative reading frames of the same nucleotide sequence. Frame shifts are inferred from the sole comparison of the protein sequences. The five scoring matrices were used with the BLASTP program to compare all the protein sequences in the Swissprot database. Surprisingly, the searches revealed hundreds of highly significant frame shift matches, of which many are likely to represent sequencing errors. Others provide some evidence that frame shift mutations might be used in protein evolution as a way to create new amino acid sequences from pre-existing coding regions. © 1993 Academic Press Limited.
引用
收藏
页码:1140 / 1157
页数:18
相关论文
共 33 条
[11]   ALIGNING AMINO-ACID SEQUENCES - COMPARISON OF COMMONLY USED METHODS [J].
FENG, DF ;
JOHNSON, MS ;
DOOLITTLE, RF .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 21 (02) :112-125
[12]   LOSS OF RAS ACTIVITY IN SACCHAROMYCES-CEREVISIAE IS SUPPRESSED BY DISRUPTIONS OF A NEW KINASE GENE, YAKI, WHOSE PRODUCT MAY ACT DOWNSTREAM OF THE CAMP-DEPENDENT PROTEIN-KINASE [J].
GARRETT, S ;
BROACH, J .
GENES & DEVELOPMENT, 1989, 3 (09) :1336-1348
[13]   ANCIENT CONSERVED REGIONS IN NEW GENE-SEQUENCES AND THE PROTEIN DATABASES [J].
GREEN, P ;
LIPMAN, D ;
HILLIER, L ;
WATERSTON, R ;
STATES, D ;
CLAVERIE, JM .
SCIENCE, 1993, 259 (5102) :1711-1716
[14]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[15]   THE RAPID GENERATION OF MUTATION DATA MATRICES FROM PROTEIN SEQUENCES [J].
JONES, DT ;
TAYLOR, WR ;
THORNTON, JM .
COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (03) :275-282
[16]   ORIGINS OF GENES - BIG-BANG OR CONTINUOUS CREATION [J].
KEESE, PK ;
GIBBS, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (20) :9489-9493
[17]   MPF FROM STARFISH OOCYTES AT 1ST MEIOTIC METAPHASE IS A HETERODIMER CONTAINING 1 MOLECULE OF CDC2 AND 1 MOLECULE OF CYCLIN-B [J].
LABBE, JC ;
CAPONY, JP ;
CAPUT, D ;
CAVADORE, JC ;
DERANCOURT, J ;
KAGHAD, M ;
LELIAS, JM ;
PICARD, A ;
DOREE, M .
EMBO JOURNAL, 1989, 8 (10) :3053-3058
[19]   TESTS FOR COMPARING RELATED AMINO-ACID SEQUENCES CYTOCHROME-C AND CYTOCHROME-C551 [J].
MCLACHLAN, AD .
JOURNAL OF MOLECULAR BIOLOGY, 1971, 61 (02) :409-+
[20]   BIRTH OF A UNIQUE ENZYME FROM AN ALTERNATIVE READING FRAME OF THE PREEXISTED, INTERNALLY REPETITIOUS CODING SEQUENCE [J].
OHNO, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1984, 81 (08) :2421-2425