Automating resequencing-based detection of insertion-deletion polymorphisms

被引:65
作者
Bhangale, Tushar R. [1 ]
Stephens, Matthew
Nickerson, Deborah A.
机构
[1] Univ Washington, Dept Bioengn, Seattle, WA 98195 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[3] Univ Washington, Dept Stat, Seattle, WA 98195 USA
关键词
D O I
10.1038/ng1925
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Structural and insertion-deletion (indel) variants have received considerable recent attention, partly because of their phenotypic consequences. Among these variants, the most common are small indels (similar to 1-30 bp). Identifying and genotyping indels using sequence traces obtained from diploid samples requires extensive manual review, which makes large-scale studies inconvenient. We report a new algorithm, implemented in available software (PolyPhred version 6.0), to help automate detection and genotyping of indels from sequence traces. The algorithm identifies heterozygous individuals, which permits the discovery of low-frequency indels. It finds 80% of all indel polymorphisms with almost no false positives and finds 97% with a false discovery rate of 10%. Additionally, genotyping accuracy exceeds 99%, and it correctly infers indel length in 96% of the cases. Using this approach, we identify indels in the HapMap ENCODE regions, providing the first report of these polymorphisms in this data set.
引用
收藏
页码:1457 / 1462
页数:6
相关论文
共 32 条
[1]   CLONING OF THE PUTATIVE TUMOR-SUPPRESSOR GENE FOR HEREDITARY MULTIPLE EXOSTOSES (EXT1) [J].
AHN, J ;
JOSEFLUDECKE, H ;
LINDOW, S ;
HORTON, WA ;
LEE, B ;
WAGNER, MJ ;
HORSTHEMKE, B ;
WELLS, DE .
NATURE GENETICS, 1995, 11 (02) :137-143
[2]   ON THE FORMATION OF SPONTANEOUS DELETIONS - THE IMPORTANCE OF SHORT SEQUENCE HOMOLOGIES IN THE GENERATION OF LARGE DELETIONS [J].
ALBERTINI, AM ;
HOFER, M ;
CALOS, MP ;
MILLER, JH .
CELL, 1982, 29 (02) :319-328
[3]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[4]   Comprehensive identification and characterization of diallelic insertion-deletion polymorphisms in 330 human candidate genes [J].
Bhangale, TR ;
Rieder, MJ ;
Livingston, RJ ;
Nickerson, DA .
HUMAN MOLECULAR GENETICS, 2005, 14 (01) :59-69
[5]   Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans [J].
Carlson, CS ;
Eberle, MA ;
Rieder, MJ ;
Smith, JD ;
Kruglyak, L ;
Nickerson, DA .
NATURE GENETICS, 2003, 33 (04) :518-521
[6]   A high-resolution survey of deletion polymorphism in the human genome [J].
Conrad, DF ;
Andrews, TD ;
Carter, NP ;
Hurles, ME ;
Pritchard, JK .
NATURE GENETICS, 2006, 38 (01) :75-81
[7]   Familial PAX8 small deletion (c.989_992delACCC) associated with extreme phenotype variability [J].
de Sanctis, L ;
Corrias, A ;
Romagnolo, D ;
Di Palma, T ;
Biava, A ;
Borgarello, G ;
Gianino, P ;
Silvestro, L ;
Zannini, M ;
Dianzani, I .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2004, 89 (11) :5669-5674
[8]   Widening the spectrum of human genetic variation [J].
Eichler, EE .
NATURE GENETICS, 2006, 38 (01) :9-11
[9]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[10]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185