nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2018, 08, v.37 3304-3312
家畜全基因组分析中稀有变异上位效应检测方法
基金项目(Foundation): 国家自然科学基金(31460594;31760660);; 河套学院教学研究项目(HTXYJZ14005)共同资助
邮箱(Email): meibujun@163.com;
DOI: 10.13417/j.gab.037.003304
摘要:

下一代测序技术的出现和检测费用不断下降清除了稀有变异检测的技术障碍,可以检测出包括常见变异和稀有变异在内的数以千万的遗传变异,其中绝大部分的变异都是稀有变异。这种情况对统计分析方法和结果解释提出了新的挑战。当前最为流行的全基因组分析方法主要针对常见变异间上位效应的检测问题,家畜育种中较少涉及稀有变异间、稀有变异-常见变异上位效应的研究。本综述探讨使用贝叶斯多元回归方法将上位效应检测单位由成对SNP扩展到基因组窗口间的检测,整合基因组信息进一步缩减数据集维度,并使用基因组窗口后验关联概率控制假阳性比例。这种新的研究策略无疑具有以下两个优良特性:1)这种方法将基因组窗口中所有SNP作为一个整体,可以利用该区域内的所有信息检测上位效应;2)该方法可以大幅度减少多重检测数量。其次,中国畜牧企业表型数据丰富,缺乏基因组测序数据,本研究借鉴单步基因组预测原理,设计检测包括上位效应在内的"穷"GWAS方法。

Abstract:

The emergence of next generati on sequencing technology and the declining detection cost have removed the technical barriers to rare mutation detection.Tens of millions of genetic variations,including common and rare variations,can be detected,and most of them are rare variations..Such situation brings a new challenge to statistical analysis methods and interpretation of results.At present,the most popular method for whole genome analysis is mainly aimed at the detection of epistatic effects between common variants.Researches on the epistatic effects between rare variants or between rare and common variants are seldom involved in livestock breeding.In this review,the basic unit of interaction analysis was extended from a pair of SNPs to the genomic windows by using Bayesian multivariate linear regression(BMR).Genomic information was integrated to further reduce the dataset dimension,and the false positive ratio(FPR) was controlled by using the genome window posterior probability of association(WPPA).This new paradigm of epistasis analysis might have the following two excellent features:(1) this method could take all SNP in the genome window as a whole and use all the information in the region to detect the epistatic effect;(2) This method could largely reduce the number of multiple detections.Moreover,there are abundant phenotypic data and lack of genome sequencing data in China's livestock enterprises.Therefore,the"poor"GWAS method,including the detection of epistatic effects would be designed based on the theory of single-step genomic BLUP(ss GBLUP).

参考文献

Bomba L.,Walter K.,and Soranzo N.,2017,The impact of rare and low-frequency genetic variants in common disease,Genome Biol.,18(1):77

Chatterjee N.,Kalaylioglu Z.,Moslehi R.,Peters U.,and Wacholder S.,2006,Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions,Am.J.Hum.Genet.,79(6):1002-1016

Cordell H.J.,2009,Detecting gene-gene interactions that underlie human diseases,Nat.Rev.Genet.,10(6):392-404

Dickson S.P.,Wang K.,Krantz I.,Hakonarson H.,and Goldstein D.B.,2010,Rare variants create synthetic genome-wide associations,PLo S Biol.,8(1):e1000294

Do R.,Kathiresan S.,and Abecasis G.R.,2012,Exome sequencing and complex disease:practical aspects of rare variant association studies,Hum.Mol.Genet.,21(R1):R1-R9

Gauderman W.J.,2002,Sample size requirements for association studies of gene-gene interaction,Am.J.Epidemiol.,155(5):478-484

Gibson G.,2012,Rare and common variants:twenty arguments,Nat.Rev.Genet.,13(2):135-145

Hodge S.E.,Hager V.R.,and Greenberg D.A.,2016,Correction:using linkage analysis to detect gene-gene interactions,improved reliability and extension to more-complex models,PLo S One,11(3):e0151686

Lee S.,Abecasis G.R.,Boehnke M.,and Lin X.,2014,Rare-variant association analysis:study designs and statistical tests,Am.J.Hum.Genet.,95(1):5-23

Li D.,and Won S.,2016,Efficient strategy to identify gene-gene interactions and its application to type 2 diabetes,Genomics&Informatics,14(4):160-165

Li J.,Malley J.D.,Andrew A.S.,Karagas M.R.,and Moore J.H.,2016,Detecting gene-gene interactions using a permutation-based random forest method,Bio Data Mining,9:14

Marei H.E.,Althani A.,Suhonen J.,El Zowalaty M.E.,Albanna M.A.,Cenciarelli C.,Wang T.,and Caceci T.,2016,Common and rare genetic variants associated with alzheimer's disease,J.Cell.Physiol.,231(7):1432-1437

Matullo G.,Gaetano C.D.,and Guarrera S.,2013,Next generation sequencing and rare genetic variants:from human population studies to medical genetics,Environ.Mol.Mutagen.,54(7):518-532

Millstein J.,2013,Screening-testing approaches for gene-gene and gene-environment interactions using independent statistics,Front.Genet.,4(13):306

Moore J.H.,and Williams S.M.,2002,New strategies for identifying gene-gene interactions in hypertension,Ann.Med.,34(2):88-95

Oualkacha K.,Dastani Z.,Li R.,Cingolani P.E.,Spector T.D.,Hammond C.J.,Richards J.B.,Ciampi A.,and Greenwood C.M.,2013,Adjusted sequence kernel association test for rare variants controlling for cryptic and family relatedness,Genet.Epidemiol.,37(4):366-376

Ritchie M.D.,Hahn L.W.,and Moore J.H.,2003,Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error,missing data,phenocopy,and genetic heterogeneity,Genet.Epidemiol.,24(2):150-157

Solomon T.,Smith E.N.,Matsui H.,Braekkan S.K.,Wilsgaard T.,Njolstad I.,Mathiesen E.B.,Hansen J.B.,Frazer K.A.,and Invent C.,2016,Associations between common and rare exonic genetic variants and serum levels of 20 cardiovascular-related proteins:the tromso study,Circ.Cardiovasc.Genet.,9(4):375-383

Stanislas V.,Dalmasso C.,and Ambroise C.,2017,Eigen-epistasis for detecting gene-gene interactions,BMC Bioinformatics,18(1):54

Stratz P.,Baes,C.,Ruckert C.,Preuss S.,and Bennewitz J.,2013,A two-step approach to map quantitative trait loci for meat quality in connected porcine F(2)crosses considering main and epistatic effects,Anim.Genet.,44(1):14-23

Sung P.Y.,Wang Y.T.,Yu Y.W.,and Chung R.H.,2016,An efficient gene-gene interaction test for genome-wide association studies in trio families,Bioinformatics,32(12):1848-1855

Teng W.,Li W.,Zhang Q.I.,Wu D.,Zhao X.,Li H.,Han Y.,and Li W.,2017,Identification of quantitative trait loci underlying seed protein content of soybean including main,epistatic and qtl X environment effects in different regions of northeast china,Genome,60(8):649-655

Timbers T.A.,Garland S.J.,Mohan S.,Flibotte S.,Edgley M.,Muncaster Q.,Au V.,Li-Leger E.,Rosell F.I.,Cai J.,Rademak ers S.,Jansen G.,Moerman D.G.,and Leroux M.R.,2016,Accelerating gene discovery by phenotyping whole-Genome sequenced multi-mutation strains and using the sequence kernelassociation test(Skat),PLo SGenet.,12(8):e1006235

Urrutia E.,Lee S.,Maity A.,Zhao N.,Shen J.,Li Y.,and Wu M.C.,2015,Rare variant testing across methods and thresholds ising the multi-kernel sequence kernel association test(Mk-Skat),Stat.Interface.,8(4):495-505

Wagner M.J.,2013,Rare-variant genome-wide association studies:a new frontier in genetic analysis of complex traits,Pharmacogenomics,14(4):413-424

Wang D.,El-Basyoni I.S.,Baenziger P.S.,Crossa J.,Eskridge K.M.,and Dweikat I.,2012,Prediction of genetic values of quantitative traits with epistatic effects in plant breeding populations,Heredity(Edinb),109(5):313-319

Wu B.,and Pankow J.S.,2016,Sequence kernel association test of multiple continuous phenotypes,Genet.Epidemiol.,40(2):91-100

Wu M.C.,Lee S.,Cai T.,Li Y.,Boehnke M.,and Lin X.,2011,Rare-variant association testing for sequencing data with the sequence kernel association test,Am.J.Hum.Genet.,89(1):82-93

Xin D.,Qi Z.,Jiang H.,Hu Z.,Zhu R.,Hu J.,Han H.,Hu G.,Liu C.,and Chen Q.,2016,Qtl location and epistatic effect analysis of 100-seed weight using wild soybean(Glycine Soja Sieb.&Zucc.)chromosome segment substitution lines,PLo S One,11(3):e0149380

Xu J.,Yuan Z.,Ji J.,Zhang X.,Li H.,Wu X.,Xue F.,and Liu Y.,2016,A powerful score-based test statistic for detecting gene-gene co-association,BMC Genet.,17(1):31

Zhang A.M.,Song H.,Shen Y.H.,and Liu Y.,2015,Construction of a gene-gene interaction network with a combined score across multiple approaches,Genet.Mol.Res.,14(2):7018-7030

Zhao J.,Zhu Y.,and Xiong M.,2016,Genome-wide gene-gene interaction analysis for next-generation sequencing,Eur.J.Hum.Genet.,24(3):421-428

Zuk O.,Schaffner S.F.,Samocha K.,Do R.,Hechter E.,Kathiresan S.,Daly M.J.,Neale B.M.,Sunyaev S.R.,and Lander E.S.,2014,Searching for missing heritability:designing rare variant association studies,Proc.Natl.Acad.Sci.USA,111(4):E455-E464

Mei B.J.,and Wang Z.H.,2014,Application of bayesian factor method in genome-wide association studies in animal,Jiyinzuxue Yu Yingyong Shengwuxue(Genomics and Applied Biology),33(6):1211-1216(梅步俊,王志华,2014,贝叶斯因子法在畜禽全基因组关联分析中的应用,基因组学与应用生物学,33(6):1211-1216)

Mei B.J.,and Wang Z.H.,2015,Genomic selection methods in livestock and poultry based on the whole genome sequencing,Heilongjiang Xumu Shouyi(Heilongjiang Animal Science and Veterinary Medicine),(9):95-97(梅步俊,王志华,2015,基于全基因组测序的畜禽基因组选择方法,黑龙江畜牧兽医,(9):95-97)

Mei B.J.,and Wang Z.H.,2016,Single-step bayesian method for genome-wide selection in domestic animals,Jiyinzuxue Yu Yingyong Shengwuxue(Genomics and Applied Biology),35(10):2668-2675(梅步俊,王志华,2016,家畜全基因组选择中的单步贝叶斯方法,基因组学与应用生物学,35(10):2668-2675)

基本信息:

DOI:10.13417/j.gab.037.003304

中图分类号:S813.1

引用信息:

[1]梅步俊,王志华.家畜全基因组分析中稀有变异上位效应检测方法[J].基因组学与应用生物学,2018,37(08):3304-3312.DOI:10.13417/j.gab.037.003304.

基金信息:

国家自然科学基金(31460594;31760660);; 河套学院教学研究项目(HTXYJZ14005)共同资助

发布时间:

2017-10-14

出版时间:

2017-10-14

网络发布时间:

2017-10-14

检 索 高级检索