Fast recovery of evolutionary trees with thousands of nodes

被引:21
作者
Csúrös, M [1 ]
机构
[1] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
关键词
phylogeny; evolutionary tree reconstruction; distance-based method; harmonic greedy triplets;
D O I
10.1089/10665270252935467
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel distance-based algorithm for evolutionary tree reconstruction. Our algorithm reconstructs the topology of a tree with n leaves in O(n 2) time using O(n) working space. In the general Markov model of evolution, the algorithm recovers the topology successfully with (1 - o(l)) probability from sequences with polynomial length in n. Moreover, for almost all trees, our algorithm achieves the same success probability on polylogarithmic sample sizes. The theoretical results are supported by simulation experiments involving trees with 500, 1,895, and 3,135 leaves. The topologies of the trees are recovered with high success from 2,000 bp DNA sequences.
引用
收藏
页码:277 / 297
页数:21
相关论文
共 37 条
[1]  
[Anonymous], 1971, STAT DECISION THEORY
[2]  
[Anonymous], THESIS YALE U
[3]   The performance of neighbor-joining methods of phylogenetic reconstruction [J].
Atteson, K .
ALGORITHMICA, 1999, 25 (2-3) :251-278
[4]   RECONSTRUCTING THE SHAPE OF A TREE FROM OBSERVED DISSIMILARITY DATA [J].
BANDELT, HJ ;
DRESS, A .
ADVANCES IN APPLIED MATHEMATICS, 1986, 7 (03) :309-343
[5]   Phylogeny - Deep Green rewrites evolutionary history of plants [J].
Brown, KS .
SCIENCE, 1999, 285 (5430) :990-991
[6]   Weighted neighbor joining: A likelihood-based approach to distance-based phylogeny reconstruction [J].
Bruno, WJ ;
Socci, ND ;
Halpern, AL .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) :189-197
[7]  
BRZUSTOWSKI J, 1998, QCLUST V0 2
[8]  
Buneman P., 1971, Mathematics in the Archaeological and Historical Sciences, P387
[9]   PHYLOGENETICS OF SEED PLANTS - AN ANALYSIS OF NUCLEOTIDE-SEQUENCES FROM THE PLASTID GENE RBCL [J].
CHASE, MW ;
SOLTIS, DE ;
OLMSTEAD, RG ;
MORGAN, D ;
LES, DH ;
MISHLER, BD ;
DUVALL, MR ;
PRICE, RA ;
HILLS, HG ;
QIU, YL ;
KRON, KA ;
RETTIG, JH ;
CONTI, E ;
PALMER, JD ;
MANHART, JR ;
SYTSMA, KJ ;
MICHAELS, HJ ;
KRESS, WJ ;
KAROL, KG ;
CLARK, WD ;
HEDREN, M ;
GAUT, BS ;
JANSEN, RK ;
KIM, KJ ;
WIMPEE, CF ;
SMITH, JF ;
FURNIER, GR ;
STRAUSS, SH ;
XIANG, QY ;
PLUNKETT, GM ;
SOLTIS, PS ;
SWENSEN, SM ;
WILLIAMS, SE ;
GADEK, PA ;
QUINN, CJ ;
EGUIARTE, LE ;
GOLENBERG, E ;
LEARN, GH ;
GRAHAM, SW ;
BARRETT, SCH ;
DAYANANDAN, S ;
ALBERT, VA .
ANNALS OF THE MISSOURI BOTANICAL GARDEN, 1993, 80 (03) :528-580
[10]   Evolutionary trees can be learned in polynomial time in the two-state general Markov model [J].
Cryan, M ;
Goldberg, LA ;
Goldberg, PW .
39TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1998, :436-445