PairWise and SearchWise: Finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames

Cited by: 128
Authors
Birney, E [1]
Thompson, JD [1]
Gibson, TJ [1]
Affiliations
[1] UNIV OXFORD BALLIOL COLL, OXFORD OX1 3BJ, ENGLAND
DOI
10.1093/nar/24.14.2730
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
DNA translation frames can be disrupted for several reasons, including: (i) errors in sequence determination; (ii) RNA processing, such as intron removal and guide RNA editing; (iii) less commonly, polymerase frameshifting during transcription or ribosomal frameshifting during translation. Frameshifts frequently confound computational activities involving homologous sequences, such as database searches and inferences on structure, function or phylogeny made from multiple alignments. A dynamic alignment algorithm is reported here which compares a protein profile (a residue scoring matrix for one or more aligned sequences) against the three translation frames of a DNA strand, allowing frameshifting. The algorithm has been incorporated into a new package, WiseTools, for comparison of biological sequences. A protein profile can be compared against either a DNA sequence or a protein sequence. The program PairWise may be used interactively for alignment of any two sequence inputs. SearchWise can perform combinations of searches through DNA or protein databases by a protein profile or DNA sequence. Routine application of the programs has revealed a set of database entries with frameshifts caused by errors in sequence determination.
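The idea of comparing a protein against all three translation frames at once, with frameshifts allowed, can be illustrated with a minimal sketch. This is not the WiseTools implementation: it is a simplified local-alignment recurrence written for illustration, and it makes several assumptions not taken from the abstract. A single protein sequence with identity scoring stands in for a profile, gaps use a flat per-position penalty, and the function names (`translate_frames`, `best_local_score`) and all penalty values are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_frames(dna):
    """Return the three forward-frame translations of one DNA strand."""
    frames = []
    for offset in range(3):
        codons = (dna[k:k + 3] for k in range(offset, len(dna) - 2, 3))
        frames.append("".join(CODON_TABLE.get(c, "X") for c in codons))
    return frames


def best_local_score(protein, dna, match=2, mismatch=-1, gap=-4, frameshift=-3):
    """Best local score of a protein against DNA, frames considered jointly.

    S[i][j] is the best score of an alignment that ends after protein
    residue i and DNA nucleotide j. Because j counts nucleotides rather
    than codons, all three frames live in the same table, and skipping
    one or two nucleotides moves the alignment into another frame.
    All penalty values are illustrative assumptions.
    """
    n, m = len(protein), len(dna)
    S = [[0] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(n + 1):
        for j in range(m + 1):
            cands = [0]  # local alignment: may restart anywhere
            if i >= 1 and j >= 3:
                # Align residue i with the codon ending at nucleotide j.
                aa = CODON_TABLE.get(dna[j - 3:j], "X")
                score = match if aa == protein[i - 1] else mismatch
                cands.append(S[i - 1][j - 3] + score)
            if i >= 1:
                cands.append(S[i - 1][j] + gap)      # residue with no codon
            if j >= 3:
                cands.append(S[i][j - 3] + gap)      # codon with no residue
            if j >= 1:
                cands.append(S[i][j - 1] + frameshift)  # skip 1 nucleotide
            if j >= 2:
                cands.append(S[i][j - 2] + frameshift)  # skip 2 nucleotides
            S[i][j] = max(cands)
            best = max(best, S[i][j])
    return best
```

For example, with the protein `MKVLA` and a DNA sequence in which a single extra nucleotide has been inserted after the second codon (`ATGAAACGTTCTGGCT`), no single frame contains the whole protein, but the recurrence can cross from frame 0 to frame 1 at the cost of one frameshift penalty and still align all five residues.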
Pages: 2730-2739
Number of pages: 10