Accurate whole human genome sequencing using reversible terminator chemistry

被引:2401
作者
Bentley, David R. [1 ]
Balasubramanian, Shankar [2 ]
Swerdlow, Harold P. [1 ]
Smith, Geoffrey P. [1 ]
Milton, John [1 ]
Brown, Clive G. [1 ]
Hall, Kevin P. [1 ]
Evers, Dirk J. [1 ]
Barnes, Colin L. [1 ,2 ]
Bignell, Helen R. [1 ]
Boutell, Jonathan M. [1 ]
Bryant, Jason [1 ]
Carter, Richard J. [1 ]
Cheetham, R. Keira [1 ]
Cox, Anthony J. [1 ]
Ellis, Darren J. [1 ]
Flatbush, Michael R. [3 ]
Gormley, Niall A. [1 ]
Humphray, Sean J. [1 ]
Irving, Leslie J. [1 ]
Karbelashvili, Mirian S. [3 ]
Kirk, Scott M. [3 ]
Li, Heng [4 ]
Liu, Xiaohai [1 ,2 ]
Maisinger, Klaus S. [1 ]
Murray, Lisa J. [1 ]
Obradovic, Bojan [1 ]
Ost, Tobias [1 ]
Parkinson, Michael L. [1 ]
Pratt, Mark R. [3 ]
Rasolonjatovo, Isabelle M. J. [1 ]
Reed, Mark T. [3 ]
Rigatti, Roberto [1 ]
Rodighiero, Chiara [1 ]
Ross, Mark T. [1 ]
Sabot, Andrea [1 ]
Sankar, Subramanian V. [3 ]
Scally, Aylwyn [4 ]
Schroth, Gary P. [3 ]
Smith, Mark E. [1 ]
Smith, Vincent P. [1 ]
Spiridou, Anastassia [1 ]
Torrance, Peta E. [1 ]
Tzonev, Svilen S. [3 ]
Vermaas, Eric H. [3 ]
Walter, Klaudia [4 ]
Wu, Xiaolin [1 ]
Zhang, Lu [3 ]
Alam, Mohammed D. [3 ]
Anastasi, Carole [1 ]
机构
[1] Illumina Cambridge Ltd, Saffron Walden CB10 1XL, Essex, England
[2] Univ Cambridge, Dept Chem, Univ Chem Lab, Cambridge CB2 1EW, England
[3] Illumina Hayward, Hayward, CA 94343 USA
[4] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[5] Manteia Predict Med SA, CH-1267 Coinsins, Switzerland
[6] Illumina Inc, Corp Headquarters, San Diego, CA 92121 USA
[7] NHGRI, NIH, Bethesda, MD 20892 USA
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会; 英国惠康基金; 美国国家卫生研究院;
关键词
D O I
10.1038/nature07517
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long ( 400 - 800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re- sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high- quality sequence. We demonstrate application of this approach to human genome sequencing on flow- sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from. 303 average depth of paired 35- base reads. We characterize four million single- nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole- genome re- sequencing and many other biomedical applications.
引用
收藏
页码:53 / 59
页数:7
相关论文
共 32 条
[11]   Single-molecule DNA sequencing of a viral genome [J].
Harris, Timothy D. ;
Buzby, Phillip R. ;
Babcock, Hazen ;
Beer, Eric ;
Bowers, Jayson ;
Braslavsky, Ido ;
Causey, Marie ;
Colonell, Jennifer ;
DiMeo, James ;
Efcavitch, J. William ;
Giladi, Eldar ;
Gill, Jaime ;
Healy, John ;
Jarosz, Mirna ;
Lapen, Dan ;
Moulton, Keith ;
Quake, Stephen R. ;
Steinmann, Kathleen ;
Thayer, Edward ;
Tyurina, Anastasia ;
Ward, Rebecca ;
Weiss, Howard ;
Xie, Zheng .
SCIENCE, 2008, 320 (5872) :106-109
[12]   Whole-genome sequencing and variant discovery in C-elegans [J].
Hillier, LaDeana W. ;
Marth, Gabor T. ;
Quinlan, Aaron R. ;
Dooling, David ;
Fewell, Ginger ;
Barnett, Derek ;
Fox, Paul ;
Glasscock, Jarret I. ;
Hickenbotham, Matthew ;
Huang, Weichun ;
Magrini, Vincent J. ;
Richt, Ryan J. ;
Sander, Sacha N. ;
Stewart, Donald A. ;
Stromberg, Michael ;
Tsung, Eric F. ;
Wylie, Todd ;
Schedl, Tim ;
Wilson, Richard K. ;
Mardis, Elaine R. .
NATURE METHODS, 2008, 5 (02) :183-188
[13]   Genome-wide in situ exon capture for selective resequencing [J].
Hodges, Emily ;
Xuan, Zhenyu ;
Balija, Vivekanand ;
Kramer, Melissa ;
Molla, Michael N. ;
Smith, Steven W. ;
Middle, Christina M. ;
Rodesch, Matthew J. ;
Albert, Thomas J. ;
Hannon, Gregory J. ;
McCombie, W. Richard .
NATURE GENETICS, 2007, 39 (12) :1522-1527
[14]   The Ensembl genome database project [J].
Hubbard, T ;
Barker, D ;
Birney, E ;
Cameron, G ;
Chen, Y ;
Clark, L ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Down, T ;
Durbin, R ;
Eyras, E ;
Gilbert, J ;
Hammond, M ;
Huminiecki, L ;
Kasprzyk, A ;
Lehvaslaiho, H ;
Lijnzaad, P ;
Melsopp, C ;
Mongin, E ;
Pettett, R ;
Pocock, M ;
Potter, S ;
Rust, A ;
Schmidt, E ;
Searle, S ;
Slater, G ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Stupka, E ;
Ureta-Vidal, A ;
Vastrik, I ;
Clamp, M .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :38-41
[15]   Genome-wide mapping of in vivo protein-DNA interactions [J].
Johnson, David S. ;
Mortazavi, Ali ;
Myers, Richard M. ;
Wold, Barbara .
SCIENCE, 2007, 316 (5830) :1497-1502
[16]   Mapping and sequencing of structural variation from eight human genomes (Reprinted from Nature, vol 453, pg 56-64, 2008) [J].
Kidd, Jeffrey M. ;
Cooper, Gregory M. ;
Donahue, William F. ;
Hayden, Hillary S. ;
Sampas, Nick ;
Graves, Tina ;
Hansen, Nancy ;
Teague, Brian ;
Alkan, Can ;
Antonacci, Francesca ;
Haugen, Eric ;
Zerr, Troy ;
Yamada, N. Alice ;
Tsang, Peter ;
Newman, Tera L. ;
Tuzun, Eray ;
Cheng, Ze ;
Ebling, Heather M. ;
Tusneem, Nadeem ;
David, Robert ;
Gillett, Will ;
Phelps, Karen A. ;
Weaver, Molly ;
Saranga, David ;
Brand, Adrianne ;
Tao, Wei ;
Gustafson, Erik ;
McKernan, Kevin ;
Chen, Lin ;
Malig, Maika ;
Smith, Joshua D. ;
Korn, Joshua M. ;
McCarroll, Steven A. ;
Altshuler, David A. ;
Peiffer, Daniel A. ;
Dorschner, Michael ;
Stamatoyannopoulos, John ;
Schwartz, David ;
Nickerson, Deborah A. ;
Mullikin, James C. ;
Wilson, Richard K. ;
Bruhn, Laurakay ;
Olson, Maynard V. ;
Kaul, Rajinder ;
Smith, Douglas R. ;
Eichler, Evan E. .
NATURE GENETICS, 2009, :S22-S30
[17]  
Korbel JO, 2007, SCIENCE, V318, P420, DOI 10.1126/science.1149504
[18]   The diploid genome sequence of an individual human [J].
Levy, Samuel ;
Sutton, Granger ;
Ng, Pauline C. ;
Feuk, Lars ;
Halpern, Aaron L. ;
Walenz, Brian P. ;
Axelrod, Nelson ;
Huang, Jiaqi ;
Kirkness, Ewen F. ;
Denisov, Gennady ;
Lin, Yuan ;
MacDonald, Jeffrey R. ;
Pang, Andy Wing Chun ;
Shago, Mary ;
Stockwell, Timothy B. ;
Tsiamouri, Alexia ;
Bafna, Vineet ;
Bansal, Vikas ;
Kravitz, Saul A. ;
Busam, Dana A. ;
Beeson, Karen Y. ;
Mclntosh, Tina C. ;
Remington, Karin A. ;
Abril, Josep F. ;
Gill, John ;
Borman, Jon ;
Rogers, Yu-Hui ;
Frazier, Marvin E. ;
Scherer, Stephen W. ;
Strausberg, Robert L. ;
Venter, J. Craig .
PLOS BIOLOGY, 2007, 5 (10) :2113-2144
[19]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[20]   Highly integrated single-base resolution maps of the epigenome in Arabidopsis [J].
Lister, Ryan ;
O'Malley, Ronan C. ;
Tonti-Filippini, Julian ;
Gregory, Brian D. ;
Berry, Charles C. ;
Millar, A. Harvey ;
Ecker, Joseph R. .
CELL, 2008, 133 (03) :523-536