Abstract: In proteomics, differential expression analysis helps identify disease-related proteins and biomarkers. Most widely used methods test one protein at a time, yet many complex diseases and clinical phenotypes arise from the accumulated effect of subtle expression changes across multiple proteins in key modules or pathways. In this study, we systematically compared five module-based differential expression statistics (Mean, L2Norm, GM, FM, and WKS), which, compared with single-protein analysis, would be expected to find more cancer-related functional modules. On simulated datasets, the Mean-based method showed good statistical power across the different simulation settings; L2Norm, GM, and FM had roughly equal power; and WKS had poor power. We then applied the five methods to protein expression profiles from colorectal cancer patient samples. Both the single-protein analysis and the module-based methods identified a telomerase-related functional module, whereas the combined ranking across the five methods identified more cancer-associated pathways, including the module for the intrinsic apoptotic signaling pathway by p53 class mediator.
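The abstract does not define the five statistics, so the following is only a minimal sketch of the general idea behind module-level testing, not the authors' implementation. It assumes that a module score is built from per-protein Welch t-statistics, that "Mean" and "L2Norm" correspond to the absolute mean and the Euclidean norm of those statistics, and that significance is assessed by permuting sample labels. All function names (`protein_t_stats`, `module_scores`, `module_pvalues`) are hypothetical.

```python
import numpy as np

def protein_t_stats(x, labels):
    """Welch t-statistic per protein.

    x: array of shape (n_proteins, n_samples); labels: 0/1 array per sample.
    """
    a, b = x[:, labels == 1], x[:, labels == 0]
    se = np.sqrt(a.var(axis=1, ddof=1) / a.shape[1]
                 + b.var(axis=1, ddof=1) / b.shape[1])
    return (a.mean(axis=1) - b.mean(axis=1)) / se

def module_scores(t, module_idx):
    """Assumed module-level summaries of the member proteins' t-statistics."""
    m = t[module_idx]
    return {"mean": abs(m.mean()), "l2norm": np.sqrt((m ** 2).sum())}

def module_pvalues(x, labels, module_idx, n_perm=1000, seed=0):
    """Permutation p-values: shuffle sample labels, recompute each module score."""
    rng = np.random.default_rng(seed)
    observed = module_scores(protein_t_stats(x, labels), module_idx)
    exceed = {k: 0 for k in observed}
    for _ in range(n_perm):
        perm = rng.permutation(labels)
        null = module_scores(protein_t_stats(x, perm), module_idx)
        for k in observed:
            exceed[k] += int(null[k] >= observed[k])
    # Add-one correction keeps the estimated p-value strictly positive.
    return {k: (exceed[k] + 1) / (n_perm + 1) for k in observed}
```

Under these assumptions, a module whose proteins each shift only slightly in the same direction can reach a small module-level p-value even when no single protein passes a per-protein threshold, which is the motivation the abstract gives for module-based analysis.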
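Basic information:
DOI: 10.13417/j.gab.036.004134
CLC number: R730.2
Citation:
[1] Lei Mingli (雷明莉), Wang Bo (王博), Li Jing (李婧). Comparison and application of module-based statistical models in protein expression profile analysis [J]. Genomics and Applied Biology (基因组学与应用生物学), 2017, 36(10): 4134-4140. DOI: 10.13417/j.gab.036.004134.
Funding:
Supported by the National Natural Science Foundation of China (31271416)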
2017-10-25
2017-10-25