Abstract: To remove false-positive results from genome-wide association studies (GWAS) of livestock and poultry, and to identify the most suitable hypothesis testing approach for the multiple comparison problem in such studies, we compared seven hypothesis testing methods commonly used in current GWAS research with the Bayes factor method. Analyses of simulated data and publicly available data showed that, when the Bayes factor is used for inference in livestock and poultry GWAS, its good statistical performance is essentially independent of the number of hypothesis tests (the number of SNPs) and of the minor allele frequency (MAF), and that in some respects it outperforms the other methods, all of which are based on p-values. This study lays a foundation for further work on the multiple comparison problem in genome-wide association studies of livestock and poultry.
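The contrast drawn in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes Wakefield's asymptotic approximation to the Bayes factor, which may differ from the formulation used in the study. The prior standard deviation `prior_sd`, the function names, and the per-SNP effect estimates are all hypothetical. Bonferroni and Benjamini-Hochberg stand in for the p-value-based corrections. The abstract does not list the seven methods compared.

```python
import math


def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic z."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def bonferroni(pvals):
    """Bonferroni-adjusted p-values: each p is multiplied by the number of tests m."""
    m = len(pvals)
    return [min(1.0, p * m) for p in pvals]


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted p-values (step-up FDR procedure)."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def log10_bayes_factor(beta_hat, se, prior_sd=0.2):
    """log10 Bayes factor for association (H1 versus H0).

    Assumption: Wakefield-style approximation with prior beta ~ N(0, prior_sd^2).
    The default prior_sd = 0.2 is a hypothetical choice, not taken from the paper.
    """
    v = se ** 2              # sampling variance of the effect estimate
    w = prior_sd ** 2        # prior variance of the effect
    z2 = (beta_hat / se) ** 2
    ln_bf10 = 0.5 * math.log(v / (v + w)) + 0.5 * z2 * w / (v + w)
    return ln_bf10 / math.log(10.0)


# Hypothetical (effect estimate, standard error) pairs for five SNPs.
snps = [(0.30, 0.05), (0.02, 0.05), (-0.01, 0.05), (0.12, 0.05), (0.05, 0.05)]

pvals = [two_sided_p(b / s) for b, s in snps]
p_bonf = bonferroni(pvals)
p_bh = benjamini_hochberg(pvals)

for (b, s), p, pb, ph in zip(snps, pvals, p_bonf, p_bh):
    print(f"beta={b:+.2f}  p={p:.3g}  bonf={pb:.3g}  bh={ph:.3g}  "
          f"log10BF={log10_bayes_factor(b, s):.2f}")
```

In this sketch the adjusted p-values depend on the number of tests `m`, so the evidence reported for a given SNP changes as more SNPs are tested. The Bayes factor for a SNP is computed from that SNP's estimate, standard error, and prior alone, which is one way to read the abstract's observation that the method's behaviour is largely independent of the number of SNPs.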
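Basic information:
DOI: 10.13417/j.gab.033.001211
Chinese Library Classification (CLC) number: Q75
Citation:
[1] Mei Bujun (梅步俊), Wang Zhihua (王志华). Application of the Bayes factor method in genome-wide association studies of livestock and poultry [J]. Genomics and Applied Biology (基因组学与应用生物学), 2014, 33(06): 1211-1216. DOI: 10.13417/j.gab.033.001211.
Funding:
Jointly supported by the National Natural Science Foundation of China (31460594) and the Hetao College Teaching Research Project (HTXYJZ14005)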
2014-12-28