| 125 | 1 | 32 |
| 下载次数 | 被引频次 | 阅读次数 |
下一代测序技术的出现和检测费用不断下降清除了稀有变异检测的技术障碍,可以检测出包括常见变异和稀有变异在内的数以千万的遗传变异,其中绝大部分的变异都是稀有变异。这种情况对统计分析方法和结果解释提出了新的挑战。当前最为流行的全基因组分析方法主要针对常见变异间上位效应的检测问题,家畜育种中较少涉及稀有变异间、稀有变异-常见变异上位效应的研究。本综述探讨使用贝叶斯多元回归方法将上位效应检测单位由成对SNP扩展到基因组窗口间的检测,整合基因组信息进一步缩减数据集维度,并使用基因组窗口后验关联概率控制假阳性比例。这种新的研究策略无疑具有以下两个优良特性:1)这种方法将基因组窗口中所有SNP作为一个整体,可以利用该区域内的所有信息检测上位效应;2)该方法可以大幅度减少多重检测数量。其次,中国畜牧企业表型数据丰富,缺乏基因组测序数据,本研究借鉴单步基因组预测原理,设计检测包括上位效应在内的"穷"GWAS方法。
Abstract:The emergence of next generati on sequencing technology and the declining detection cost have removed the technical barriers to rare mutation detection.Tens of millions of genetic variations,including common and rare variations,can be detected,and most of them are rare variations..Such situation brings a new challenge to statistical analysis methods and interpretation of results.At present,the most popular method for whole genome analysis is mainly aimed at the detection of epistatic effects between common variants.Researches on the epistatic effects between rare variants or between rare and common variants are seldom involved in livestock breeding.In this review,the basic unit of interaction analysis was extended from a pair of SNPs to the genomic windows by using Bayesian multivariate linear regression(BMR).Genomic information was integrated to further reduce the dataset dimension,and the false positive ratio(FPR) was controlled by using the genome window posterior probability of association(WPPA).This new paradigm of epistasis analysis might have the following two excellent features:(1) this method could take all SNP in the genome window as a whole and use all the information in the region to detect the epistatic effect;(2) This method could largely reduce the number of multiple detections.Moreover,there are abundant phenotypic data and lack of genome sequencing data in China's livestock enterprises.Therefore,the"poor"GWAS method,including the detection of epistatic effects would be designed based on the theory of single-step genomic BLUP(ss GBLUP).
Bomba L.,Walter K.,and Soranzo N.,2017,The impact of rare and low-frequency genetic variants in common disease,Genome Biol.,18(1):77
Chatterjee N.,Kalaylioglu Z.,Moslehi R.,Peters U.,and Wacholder S.,2006,Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions,Am.J.Hum.Genet.,79(6):1002-1016
Cordell H.J.,2009,Detecting gene-gene interactions that underlie human diseases,Nat.Rev.Genet.,10(6):392-404
Dickson S.P.,Wang K.,Krantz I.,Hakonarson H.,and Goldstein D.B.,2010,Rare variants create synthetic genome-wide associations,PLo S Biol.,8(1):e1000294
Do R.,Kathiresan S.,and Abecasis G.R.,2012,Exome sequencing and complex disease:practical aspects of rare variant association studies,Hum.Mol.Genet.,21(R1):R1-R9
Gauderman W.J.,2002,Sample size requirements for association studies of gene-gene interaction,Am.J.Epidemiol.,155(5):478-484
Gibson G.,2012,Rare and common variants:twenty arguments,Nat.Rev.Genet.,13(2):135-145
Hodge S.E.,Hager V.R.,and Greenberg D.A.,2016,Correction:using linkage analysis to detect gene-gene interactions,improved reliability and extension to more-complex models,PLo S One,11(3):e0151686
Lee S.,Abecasis G.R.,Boehnke M.,and Lin X.,2014,Rare-variant association analysis:study designs and statistical tests,Am.J.Hum.Genet.,95(1):5-23
Li D.,and Won S.,2016,Efficient strategy to identify gene-gene interactions and its application to type 2 diabetes,Genomics&Informatics,14(4):160-165
Li J.,Malley J.D.,Andrew A.S.,Karagas M.R.,and Moore J.H.,2016,Detecting gene-gene interactions using a permutation-based random forest method,Bio Data Mining,9:14
Marei H.E.,Althani A.,Suhonen J.,El Zowalaty M.E.,Albanna M.A.,Cenciarelli C.,Wang T.,and Caceci T.,2016,Common and rare genetic variants associated with alzheimer's disease,J.Cell.Physiol.,231(7):1432-1437
Matullo G.,Gaetano C.D.,and Guarrera S.,2013,Next generation sequencing and rare genetic variants:from human population studies to medical genetics,Environ.Mol.Mutagen.,54(7):518-532
Millstein J.,2013,Screening-testing approaches for gene-gene and gene-environment interactions using independent statistics,Front.Genet.,4(13):306
Moore J.H.,and Williams S.M.,2002,New strategies for identifying gene-gene interactions in hypertension,Ann.Med.,34(2):88-95
Oualkacha K.,Dastani Z.,Li R.,Cingolani P.E.,Spector T.D.,Hammond C.J.,Richards J.B.,Ciampi A.,and Greenwood C.M.,2013,Adjusted sequence kernel association test for rare variants controlling for cryptic and family relatedness,Genet.Epidemiol.,37(4):366-376
Ritchie M.D.,Hahn L.W.,and Moore J.H.,2003,Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error,missing data,phenocopy,and genetic heterogeneity,Genet.Epidemiol.,24(2):150-157
Solomon T.,Smith E.N.,Matsui H.,Braekkan S.K.,Wilsgaard T.,Njolstad I.,Mathiesen E.B.,Hansen J.B.,Frazer K.A.,and Invent C.,2016,Associations between common and rare exonic genetic variants and serum levels of 20 cardiovascular-related proteins:the tromso study,Circ.Cardiovasc.Genet.,9(4):375-383
Stanislas V.,Dalmasso C.,and Ambroise C.,2017,Eigen-epistasis for detecting gene-gene interactions,BMC Bioinformatics,18(1):54
Stratz P.,Baes,C.,Ruckert C.,Preuss S.,and Bennewitz J.,2013,A two-step approach to map quantitative trait loci for meat quality in connected porcine F(2)crosses considering main and epistatic effects,Anim.Genet.,44(1):14-23
Sung P.Y.,Wang Y.T.,Yu Y.W.,and Chung R.H.,2016,An efficient gene-gene interaction test for genome-wide association studies in trio families,Bioinformatics,32(12):1848-1855
Teng W.,Li W.,Zhang Q.I.,Wu D.,Zhao X.,Li H.,Han Y.,and Li W.,2017,Identification of quantitative trait loci underlying seed protein content of soybean including main,epistatic and qtl X environment effects in different regions of northeast china,Genome,60(8):649-655
Timbers T.A.,Garland S.J.,Mohan S.,Flibotte S.,Edgley M.,Muncaster Q.,Au V.,Li-Leger E.,Rosell F.I.,Cai J.,Rademak ers S.,Jansen G.,Moerman D.G.,and Leroux M.R.,2016,Accelerating gene discovery by phenotyping whole-Genome sequenced multi-mutation strains and using the sequence kernelassociation test(Skat),PLo SGenet.,12(8):e1006235
Urrutia E.,Lee S.,Maity A.,Zhao N.,Shen J.,Li Y.,and Wu M.C.,2015,Rare variant testing across methods and thresholds ising the multi-kernel sequence kernel association test(Mk-Skat),Stat.Interface.,8(4):495-505
Wagner M.J.,2013,Rare-variant genome-wide association studies:a new frontier in genetic analysis of complex traits,Pharmacogenomics,14(4):413-424
Wang D.,El-Basyoni I.S.,Baenziger P.S.,Crossa J.,Eskridge K.M.,and Dweikat I.,2012,Prediction of genetic values of quantitative traits with epistatic effects in plant breeding populations,Heredity(Edinb),109(5):313-319
Wu B.,and Pankow J.S.,2016,Sequence kernel association test of multiple continuous phenotypes,Genet.Epidemiol.,40(2):91-100
Wu M.C.,Lee S.,Cai T.,Li Y.,Boehnke M.,and Lin X.,2011,Rare-variant association testing for sequencing data with the sequence kernel association test,Am.J.Hum.Genet.,89(1):82-93
Xin D.,Qi Z.,Jiang H.,Hu Z.,Zhu R.,Hu J.,Han H.,Hu G.,Liu C.,and Chen Q.,2016,Qtl location and epistatic effect analysis of 100-seed weight using wild soybean(Glycine Soja Sieb.&Zucc.)chromosome segment substitution lines,PLo S One,11(3):e0149380
Xu J.,Yuan Z.,Ji J.,Zhang X.,Li H.,Wu X.,Xue F.,and Liu Y.,2016,A powerful score-based test statistic for detecting gene-gene co-association,BMC Genet.,17(1):31
Zhang A.M.,Song H.,Shen Y.H.,and Liu Y.,2015,Construction of a gene-gene interaction network with a combined score across multiple approaches,Genet.Mol.Res.,14(2):7018-7030
Zhao J.,Zhu Y.,and Xiong M.,2016,Genome-wide gene-gene interaction analysis for next-generation sequencing,Eur.J.Hum.Genet.,24(3):421-428
Zuk O.,Schaffner S.F.,Samocha K.,Do R.,Hechter E.,Kathiresan S.,Daly M.J.,Neale B.M.,Sunyaev S.R.,and Lander E.S.,2014,Searching for missing heritability:designing rare variant association studies,Proc.Natl.Acad.Sci.USA,111(4):E455-E464
Mei B.J.,and Wang Z.H.,2014,Application of bayesian factor method in genome-wide association studies in animal,Jiyinzuxue Yu Yingyong Shengwuxue(Genomics and Applied Biology),33(6):1211-1216(梅步俊,王志华,2014,贝叶斯因子法在畜禽全基因组关联分析中的应用,基因组学与应用生物学,33(6):1211-1216)
Mei B.J.,and Wang Z.H.,2015,Genomic selection methods in livestock and poultry based on the whole genome sequencing,Heilongjiang Xumu Shouyi(Heilongjiang Animal Science and Veterinary Medicine),(9):95-97(梅步俊,王志华,2015,基于全基因组测序的畜禽基因组选择方法,黑龙江畜牧兽医,(9):95-97)
Mei B.J.,and Wang Z.H.,2016,Single-step bayesian method for genome-wide selection in domestic animals,Jiyinzuxue Yu Yingyong Shengwuxue(Genomics and Applied Biology),35(10):2668-2675(梅步俊,王志华,2016,家畜全基因组选择中的单步贝叶斯方法,基因组学与应用生物学,35(10):2668-2675)
基本信息:
DOI:10.13417/j.gab.037.003304
中图分类号:S813.1
引用信息:
[1]梅步俊,王志华.家畜全基因组分析中稀有变异上位效应检测方法[J].基因组学与应用生物学,2018,37(08):3304-3312.DOI:10.13417/j.gab.037.003304.
基金信息:
国家自然科学基金(31460594;31760660);; 河套学院教学研究项目(HTXYJZ14005)共同资助
2017-10-14
2017-10-14
2017-10-14