UNROOTED GENEALOGICAL TREE PROBABILITIES IN THE INFINITELY-MANY-SITES MODEL

被引:76
作者
GRIFFITHS, RC [1 ]
TAVARE, S [1 ]
机构
[1] UNIV SO CALIF,DEPT MATH & BIOL SCI,LOS ANGELES,CA 90089
基金
美国国家科学基金会;
关键词
D O I
10.1016/0025-5564(94)00044-Z
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The infinitely-many-sites process is often used to model the sequence variability observed in samples of DNA sequences. Despite its popularity, the sampling theory of the process is rather poorly understood. We describe the tree structure underlying the model and show how this may be used to compute the probability of a sample of sequences. We show how to produce the unrooted genealogy from a set of sites in which the ancestral labeling is unknown and from this the corresponding rooted genealogies. We derive recursions for the probability of the configuration of sequences (equivalently, of trees) in both the rooted and unrooted cases. We give a computational method based on Monte Carlo recursion that provides approximants to sampling probabilities for samples of any size. Among several applications, this algorithm may be used to find maximum likelihood estimators of the substitution rate, both when the ancestral labeling of sites is known and when it is unknown.
引用
收藏
页码:77 / 98
页数:22
相关论文
共 11 条
[11]  
Watterson, On the number of segregating sites in genetical models without recombination, Theor. Pop. Biol., 7, pp. 256-276, (1975)