Pattern-constrained multiple polypeptide sequence alignment

被引：4

作者：

Du, ZH ^{[1
]}

Lin, F ^{[1
]}

机构：

[1] Nanyang Technol Univ, BioInformat Res Ctr, Singapore 639798, Singapore

来源：

COMPUTATIONAL BIOLOGY AND CHEMISTRY | 2005年 / 29卷 / 04期

关键词：

multiple sequence alignment; prosite databank; structural information; domain knowledge; regular expression;

D O I：

10.1016/j.compbiolchem.2005.06.002

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Multiple sequence alignment (MSA) is one of the fundamental research topics in computational biology. The alignments help us to find functional assignment, evolutionary history and conserved region. Previous methods use a substitution matrix and do not incorporate knowledge of the sequences being aligned. Therefore, they do not assure the alignment of similar structures and common patterns in the sequences. We have been investigating into the solution to the problem in multiple and making use of knowledge of the sequences being aligned, including patterns in the Prosite databank, Blocks+, eBlocks databases, as well as motif and structural information. A pattern-constrained algorithm has been developed. Experiments with protein sequences have shown more accurate alignments with incorporation of the domain knowledge available in the sequences. (c) 2005 Elsevier Ltd. All rights reserved.

引用

页码：303 / 307

页数：5

共 22 条

[11]

GUSFIELD D, 1991, CSE914 U CAL

[12] Increased coverage of protein families with the Blocks Database servers [J].

Henikoff, JG ;

Greene, EA ;

Pietrokovski, S ;

Henikoff, S .

NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :228-230

[13] Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations [J].

Henikoff, S ;

Henikoff, JG ;

Pietrokovski, S .

BIOINFORMATICS, 1999, 15 (06) :471-479

[14] AUTOMATED ASSEMBLY OF PROTEIN BLOCKS FOR DATABASE SEARCHING [J].

HENIKOFF, S ;

HENIKOFF, JG .

NUCLEIC ACIDS RESEARCH, 1991, 19 (23) :6565-6572

[15] The PROSITE database, its status in 1999 [J].

Hofmann, K ;

Bucher, P ;

Falquet, L ;

Bairoch, A .

NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :215-219

[16] MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform [J].

Katoh, K ;

Misawa, K ;

Kuma, K ;

Miyata, T .

NUCLEIC ACIDS RESEARCH, 2002, 30 (14) :3059-3066

[17] Multiple DNA and protein sequence alignment based on segment-to-segment comparison [J].

Morgenstern, B ;

Dress, A ;

Werner, T .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (22) :12098-12103

[18] A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].

NEEDLEMAN, SB ;

WUNSCH, CD .

JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+

[19] COFFEE: An objective function for multiple sequence alignments [J].

Notredame, C ;

Holm, L ;

Higgins, DG .

BIOINFORMATICS, 1998, 14 (05) :407-422

[20]

Smith TF., 1981, Advances in applied mathematics, V2, P482, DOI DOI 10.1016/0196-8858(81)90046-4

← 1 2 3 →