Computation of haplotypes on SNPs subsets: advantage of the "global method"

被引:6
作者
Coulonges, Cedric
Delaneau, Olivier
Girard, Manon
Do, Herve
Adkins, Ronald
Spadoni, Jean-Louis
Zagury, Jean-Francois
机构
[1] INSERM U736, Equipe Genom Bioinformat & Pathol Syst Immunitair, F-75006 Paris, France
[2] Conservatoire Natl Arts & Metiers, Chaire Bioinformat, F-75003 Paris, France
[3] Univ Tennessee, Childrens Fdn Res Ctr, Memphis, TN USA
[4] Univ Tennessee, Ctr Genom & Bioinformat, Memphis, TN USA
关键词
D O I
10.1186/1471-2156-7-50
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Genetic association studies aim at finding correlations between a disease state and genetic variations such as SNPs or combinations of SNPs, termed haplotypes. Some haplotypes have a particular biological meaning such as the ones derived from SNPs located in the promoters, or the ones derived from non synonymous SNPs. All these haplotypes are "subhaplotypes" because they refer only to a part of the SNPs found in the gene. Until now, subhaplotypes were directly computed from the very SNPs chosen to constitute them, without taking into account the rest of the information corresponding to the other SNPs located in the gene. In the present work, we describe an alternative approach, called the "global method", which takes into account all the SNPs known in the region and compare the efficacy of the two "direct" and "global" methods. Results: We used empirical haplotypes data sets from the GHI promoter and the APOE gene, and 10 simulated datasets, and randomly introduced in them missing information (from 0% up to 20%) to compare the 2 methods. For each method, we used the PHASE haplotyping software since it was described to be the best. We showed that the use of the "global method" for subhaplotyping leads always to a better error rate than the classical direct haplotyping. The advantage provided by this alternative method increases with the percentage of missing genotyping data (diminution of the average error rate from 25% to less than 10%). We applied the global method software on the GRIV cohort for AIDS genetic associations and some associations previously identified through direct subhaplotyping were found to be erroneous. Conclusion: The global method for subhaplotyping can reduce, sometimes dramatically, the error rate on patient resolutions and haplotypes frequencies. One should thus use this method in order to minimise the risk of a false interpretation in genetic studies involving subhaplotypes. In practice the global method is always more efficient than the direct method, but a combination method taking into account the level of missing information in each subject appears to be even more interesting when the level of missing information becomes larger (> 10%).
引用
收藏
页数:12
相关论文
共 44 条
[1]   Comparison of the accuracy of methods of computational haplotype inference using a large empirical dataset [J].
Adkins, RM .
BMC GENETICS, 2004, 5 (1)
[2]   E-cadherin Polymorphisms and haplotypes influence risk for prostate cancer [J].
Bonilla, C ;
Mason, T ;
Long, LR ;
Ahaghotu, C ;
Chen, WD ;
Zhao, AQ ;
Coulibaly, A ;
Bennett, F ;
Aiken, W ;
Tullock, T ;
Coard, K ;
Freeman, V ;
Kittles, RA .
PROSTATE, 2006, 66 (05) :546-556
[3]   Clone-based systematic haplotyping (CSH): A procedure for physical haplotyping of whole genomes [J].
Burgtorf, C ;
Kepper, P ;
Hoehe, M ;
Schmitt, C ;
Reinhardt, R ;
Lehrach, H ;
Sauer, S .
GENOME RESEARCH, 2003, 13 (12) :2717-2724
[4]   A comparison of five methods for selecting tagging single-nucleotide polymorphisms [J].
Burkett, KM ;
Ghadessi, M ;
McNeney, B ;
Graham, J ;
Daley, D .
BMC GENETICS, 2005, 6 (Suppl 1)
[5]   TagSNP analyses of the PON gene cluster: effects on PON1 activity, LDL oxidative susceptibility, and vascular disease [J].
Carlson, Christopher S. ;
Heagerty, Patrick J. ;
Hatsukami, Thomas S. ;
Richter, Rebecca J. ;
Ranchalis, Jane ;
Lewis, Julieann ;
Bacus, Tamara J. ;
McKinstry, Laura A. ;
Schellenberg, Gerard D. ;
Rieder, Mark ;
Nickerson, Deborah ;
Furlong, Clement E. ;
Chait, Alan ;
Jarvik, Gail P. .
JOURNAL OF LIPID RESEARCH, 2006, 47 (05) :1014-1024
[6]   Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium [J].
Carlson, CS ;
Eberle, MA ;
Rieder, MJ ;
Yi, Q ;
Kruglyak, L ;
Nickerson, DA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 74 (01) :106-120
[7]  
CHEONG HS, 2006, J HUM GENET
[8]  
COULONGES C, 2006, SUBHAP
[9]   Selection of SNP subsets for association studies in candidate genes: comparison of the power of different strategies to detect single disease susceptibility locus effects [J].
Cousin, E ;
Deleuze, JF ;
Genin, E .
BMC GENETICS, 2006, 7 (1) :1-9
[10]   Direct molecular haplotyping of long-range genomic DNA with M1-PCR [J].
Ding, CM ;
Cantor, CR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (13) :7449-7453