Personalized copy number and segmental duplication maps using next-generation sequencing

被引:513
作者
Alkan, Can [1 ,2 ]
Kidd, Jeffrey M. [1 ]
Marques-Bonet, Tomas [1 ,3 ]
Aksay, Gozde [1 ]
Antonacci, Francesca [1 ]
Hormozdiari, Fereydoun [4 ]
Kitzman, Jacob O. [1 ]
Baker, Carl [1 ]
Malig, Maika [1 ]
Mutlu, Onur [5 ]
Sahinalp, S. Cenk [4 ]
Gibbs, Richard A. [6 ]
Eichler, Evan E. [1 ,2 ]
机构
[1] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[2] Howard Hughes Med Inst, Seattle, WA USA
[3] UPF, CSIC, Inst Biol Evolut, Barcelona, Catalonia, Spain
[4] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[5] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
[6] Baylor Coll Med, Houston, TX 77030 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
LARGE-SCALE VARIATION; HUMAN-GENOME; STRUCTURAL VARIATION; SUSCEPTIBILITY; POLYMORPHISMS; HYBRIDIZATION; PREDISPOSES; REGIONS; SNPS;
D O I
10.1038/ng.437
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Despite their importance in gene innovation and phenotypic variation, duplicated regions have remained largely intractable owing to difficulties in accurately resolving their structure, copy number and sequence content. We present an algorithm (mrFAST) to comprehensively map next-generation sequence reads, which allows for the prediction of absolute copy-number variation of duplicated segments and genes. We examine three human genomes and experimentally validate genome-wide copy number differences. We estimate that, on average, 73-87 genes vary in copy number between any two individuals and find that these genic differences overwhelmingly correspond to segmental duplications (odds ratio = 135; P < 2.2 x 10(-16)). Our method can distinguish between different copies of highly identical genes, providing a more accurate assessment of gene content and insight into functional constraint without the limitations of array-based technology.
引用
收藏
页码:1061 / U29
页数:8
相关论文
共 40 条
[1]   Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans [J].
Aitman, TJ ;
Dong, R ;
Vyse, TJ ;
Norsworthy, PJ ;
Johnson, MD ;
Smith, J ;
Mangion, J ;
Roberton-Lowe, C ;
Marshall, AJ ;
Petretto, E ;
Hodges, MD ;
Bhangal, G ;
Patel, SG ;
Sheehan-Rooney, K ;
Duda, M ;
Cook, PR ;
Evans, DJ ;
Domin, J ;
Flint, J ;
Boyle, JJ ;
Pusey, CD ;
Cook, HT .
NATURE, 2006, 439 (7078) :851-855
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   Recent segmental duplications in the human genome [J].
Bailey, JA ;
Gu, ZP ;
Clark, RA ;
Reinert, K ;
Samonte, RV ;
Schwartz, S ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Eichler, EE .
SCIENCE, 2002, 297 (5583) :1003-1007
[4]   Segmental duplications: Organization and impact within the current Human Genome Project assembly [J].
Bailey, JA ;
Yavor, AM ;
Massa, HF ;
Trask, BJ ;
Eichler, EE .
GENOME RESEARCH, 2001, 11 (06) :1005-1017
[5]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[6]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[7]   Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing [J].
Campbell, Peter J. ;
Stephens, Philip J. ;
Pleasance, Erin D. ;
O'Meara, Sarah ;
Li, Heng ;
Santarius, Thomas ;
Stebbings, Lucy A. ;
Leroy, Catherine ;
Edkins, Sarah ;
Hardy, Claire ;
Teague, Jon W. ;
Menzies, Andrew ;
Goodhead, Ian ;
Turner, Daniel J. ;
Clee, Christopher M. ;
Quail, Michael A. ;
Cox, Antony ;
Brown, Clive ;
Durbin, Richard ;
Hurles, Matthew E. ;
Edwards, Paul A. W. ;
Bignell, Graham R. ;
Stratton, Michael R. ;
Futreal, P. Andrew .
NATURE GENETICS, 2008, 40 (06) :722-729
[8]  
Chiang DY, 2009, NAT METHODS, V6, P99, DOI [10.1038/nmeth.1276, 10.1038/NMETH.1276]
[9]   Systematic assessment of copy number variant detection via genome-wide SNP genotyping [J].
Cooper, Gregory M. ;
Zerr, Troy ;
Kidd, Jeffrey M. ;
Eichler, Evan E. ;
Nickerson, Deborah A. .
NATURE GENETICS, 2008, 40 (10) :1199-1203
[10]   Chromosomal regions containing high-density and ambiguously mapped putative single nucleotide polymorphisms (SNPs) correlate with segmental duplications in the human genome [J].
Estivill, X ;
Cheung, J ;
Pujana, MA ;
Nakabayashi, K ;
Scherer, SW ;
Tsui, LC .
HUMAN MOLECULAR GENETICS, 2002, 11 (17) :1987-1995