| 320 | 0 | 212 |
| 下载次数 | 被引频次 | 阅读次数 |
本研究旨在基于多源转录组数据与机器学习算法,筛选出能准确识别阿尔茨海默病(Alzheimer′s disease, AD)的特征基因并构建竞争性内源RNA(competing endogenous RNA, ceRNA)调控网络,揭示其在AD病例进程中的表达机制。通过整合GEO数据库中AD人脑组织转录组数据(训练集:GSE37263、 GSE122063、 GSE138260;验证集:GSE5281),采用主成分分析(principal component analysis, PCA)进行聚类并筛选差异表达基因(differentially expressed genes, DEGs),联合最小绝对值收缩和选择算子(least absolute shrinkage and selection operator, LASSO)回归、支持向量机(support vector machine, SVM)与随机森林(random forest, RF)机器学习算法,筛选出AD相关特征基因,并进行GO和KEGG分析,显示其富集的功能通路。同时对特征基因CARTPT高/低表达组DEGs进行基因集富集分析(gene set enrichment analysis, GSEA)与基因集变异分析(gene set variation analysis, GSVA),得出高/低表达组中富集通路的差异。最后进行免疫细胞浸润分析,构建ceRNA调控网络。鉴定出4个AD相关特征基因(CARTPT、FCGBP、NPTX2、RBM3),多基因联合诊断模型AUC达0.945(95%CI:0.904~0.977)。其中,CARTPT在独立验证集中表现出最优的疾病识别效能,CARTPT基因调控网络与AD病理进程之间密切相关。CARTPT可能通过调节内源性大麻素信号、神经活性配体信号通路、核苷酸代谢、嘌呤代谢等,减缓AD进展。基于GSEA和GSVA的多维度解析显示:CARTPT高表达组与促炎、组织重构及NOTCH信号驱动的细胞命运调控相关;而低表达组呈现氧化磷酸化代谢亢进与帕金森病相关通路富集,导致神经代谢紊乱和能量代谢抑制。通过免疫微环境分析,发现CARTPT的表达水平与初始B细胞丰度呈显著负相关,而仅有静息肥大细胞在AD病理进程中存在差异性,CARTPT可能并非通过调控免疫微环境参与AD的病理进程。最后,通过构建ceRNA调控网络,从系统层面解析基因表达的调控机制。本研究整合多组学数据与机器学习算法,筛选出4个AD潜在关联基因(CARTPT、FCGBP、NPTX2、RBM3),构建的多基因诊断模型在训练集中显示优异性能(AUC=0.945)。其中CARTPT在独立验证集(AUC=0.793)中展现诊断潜力,其参与的神经递质与代谢通路(如内源性大麻素信号、嘌呤代谢)为探索AD机制提供有价值的参考。
Abstract:This study aims to screen signature genes capable of accurately identifying Alzheimer′s disease(AD) and construct a competing endogenous RNA(ceRNA) regulatory network based on multi-source transcriptomic data and machine learning algorithms, thereby revealing their expression mechanisms during AD disease progression. By integrating AD human brain tissue transcriptomic data from the GEO database(training sets: GSE37263, GSE122063, GSE138260; validation set: GSE5281), principal component analysis(PCA) was used for clustering and screening differentially expressed genes(DEGs), least absolute shrinkage and selection operator(LASSO) regression, support vector machine(SVM), and random forest(RF) machine learning algorithms were combined to screen AD-related feature genes. Gene ontology(GO) and Kyoto encyclopedia of genes and genomes(KEGG) analyses revealed enriched functional pathways. Gene set enrichment analysis(GSEA) and gene set variation analysis(GSVA) of DEGs in high/low CARTPT expression groups identified differentially enriched pathways. Immune cell infiltration analysis was performed, and a ceRNA regulatory network was constructed. Four AD-related feature genes(CARTPT, FCGBP, NPTX2, RBM3) were identified, with the multigene diagnostic model achieving an AUC of 0.945(95% CI: 0.904~0.977). CARTPT demonstrated optimal disease-discrimination perfor-mance in the independent validation set, and its regulatory network was closely associated with AD pathology. CARTPT may alleviate AD progression by regulating endocannabinoid signaling, neuroactive ligand-receptor interactions, nucleotide metabolism, and purine metabolism. Multidimensional GSEA and GSVA analyses showed: high CARTPT expression correlated with pro-inflammatory responses, tissue remodeling, and NOTCH signaling-driven cell fate regulation, while low expression exhibited hyperactive oxidative phosphorylation, Parkinson′s disease-related pathway enrichment, neurometabolic dysregulation, and suppressed energy metabolism. Immune microenvironment analysis revealed a significant negative correlation between CARTPT expression and na6ve B-cell abundance. Only resting mast cells showed differential infiltration in AD pathology, suggesting CARTPT may not participate in AD pathogenesis via immune microenvironment regulation. Finally, a ceRNA regulatory network was built to systematically analyze gene expression regulation mechanisms. This study integrates multi-omics data with machine learning algorithms to identify four genes potentially associated with Alzheimer′s disease(CARTPT, FCGBP, NPTX2, RBM3). The constructed polygenic diagnostic model demonstrated excellent performance in the training set(AUC=0.945). Among these, CARTPT demonstrated potential diagnostic utility in an independent validation set(AUC=0.793). Its involvement in neurotransmitter and metabolic pathways(e.g., endocannabinoid signaling, purine metabolism) provides valuable insights for exploring the mechanisms of AD.
ANDREIS D T,SINGER M,2016.Catecholamines for inflammatory shock:a Jekyll-and-Hyde conundrum.Intensive Care Med.,42(9):1387-1397.
BARKI M,XUE H,2022.GABRB2,a key player in neuropsychiatric disorders and beyond.Gene,809:146021.
BETTCHER B M,TANSEY M G,DOROTHéE G,et al.,2021.Peripheral and central immune system crosstalk in Alzheimer disease-a research prospectus.Nat.Rev.Neurol.,17(11):689-701.
DOROSZKIEWICZ J,MROCZKO B,2022.New possibilities in the therapeutic approach to Alzheimer′s disease.Int.J.Mol.Sci.,23(16):8902.
GENG P Y,YE F,DOU P,et al.,2024.HIF-1α-HPRT1 axis promotes tumorigenesis and gefitinib resistance by enhancing purine metabolism in EGFR-mutant lung adenocarcinoma.J.Exp.Clin.Cancer Res.,43(1):269.
HUANG C,WEN X Y,XIE H S,et al.,2022.Identification and experimental validation of marker genes between diabetes and Alzheimer′s disease.Oxid.Med.Cell.Longev.,2022(1):8122532.
KHO M,SMITH J A,VERWEIJ N,et al.,2020.Genome-wide association meta-analysis of individuals of European ancestry identifies suggestive loci for sodium intake,potassium intake,and their ratio measured from 24-hour or half-day urine samples.J.Nutr.,150(10):2635-2645.
KUMAR S,REDDY P H,2020.The role of synaptic microRNAs in Alzheimer′s disease.Biochim.Biophys.Acta Mol.Basis Dis.,1866(12):165937.
LANE C A,HARDY J,SCHOTT J M,2018.Alzheimer′s disease.Eur.J.Neurol.,25(1):59-70.
LIAN P P,CAI X,WANG C L,et al.,2024.Identification and experimental validation of m7G-related molecular subtypes,immune signature,and feature genes in Alzheimer′s disease.Heliyon,10(13):e33836.
LU T X,ROTHENBERG M E,2018.microRNA.J.Allergy Clin.Immunol.,141(4):1202-1207.
LUO J,WANG H S,WANG L,et al.,2021.lncRNA GAS6-AS1 inhibits progression and glucose metabolism reprogramming in LUAD via repressing E2F1-mediated transcription of GLUT1.Mol.Ther.Nucleic Acids,25:11-24.
PETRELLA C,DI CERTO M G,BARBATO C,et al.,2019.Neuropeptides in Alzheimer′s disease:an update.Curr.Alzheimer Res.,16(6):544-558.
SACAI H,SAKOORI K,KONNO K,et al.,2020.Autism spectrum disorder-like behavior caused by reduced excitatory synaptic transmission in pyramidal neurons of mouse prefrontal cortex.Nat.Commun.,11(1):5140.
SALMENA L,POLISENO L,TAY Y,et al.,2011.A ceRNA hypothesis:the Rosetta Stone of a hidden RNA language?Cell,146(3):353-358.
SCHELTENS P,DE STROOPER B,KIVIPELTO M,et al.,2021.Alzheimer′s disease.Lancet,397(10284):1577-1590.
SILVESTRO S,BRAMANTI P,MAZZON E,2019.Role of miRNAs in Alzheimer′s disease and possible fields of application.Int.J.Mol.Sci.,20(16):3979.
SüDHOF T C,2023.Cerebellin-neurexin complexes instructing synapse properties.Curr.Opin.Neurobiol.,81:102727.
TIWARI S,ATLURI V,KAUSHIK A,et al.,2019.Alzheimer′s disease:pathogenesis,diagnostics,and therapeutics.Int.J.Nanomedicine,14:5541-5554.
TWAROWSKI B,HERBET M,2023.Inflammatory processes in Alzheimer′s disease-pathomechanism,diagnosis and treatment:a review.Int.J.Mol.Sci.,24(7):6518.
WANG W,GAO W W,GONG P,et al.,2023.Neuronal-specific TNFAIP1 ablation attenuates postoperative cognitive dysfunction via targeting SNAP25 for K48-linked ubiquitination.Cell Commun.Signal.,21(1):356.
WINKLE M,EL-DALY S M,FABBRI M,et al.,2021.Noncoding RNA therapeutics:challenges and potential solutions.Nat.Rev.Drug Discov.,20(8):629-651.
YOSTEN G L C,HADDOCK C J,HARADA C M,et al.,2021.Past,present and future of cocaine- and amphetamine-regulated transcript peptide.Physiol.Behav.,235:113380.
YU L,TASAKI S,SCHNEIDER J A,et al.,2020.Cortical proteins associated with cognitive resilience in community-dwelling older persons.JAMA Psychiatry,77(11):1172-1180.
ZHENG Q Y,WANG X,2025.Alzheimer′s disease:insights into pathology,molecular mechanisms,and therapy.Protein Cell,16(2):83-120.
ZHOU S F,ZHANG D,GUO J N,et al.,2021.Deficiency of NEAT1 prevented MPP+-induced inflammatory response,oxidative stress and apoptosis in dopaminergic SK-N-SH neuroblastoma cells via miR-1277-5p/ARHGAP26 axis.Brain Res.,1750:147156.
ZITNIK M,NGUYEN F,WANG B,et al.,2019.Machine learning for integrating data in biology and medicine:principles,practice,and opportunities.Inf.Fusion,50:71-91.
基本信息:
DOI:10.13417/j.gab.044.000964
中图分类号:TP181;R749.16
引用信息:
[1]朱奕霖,于田,李婷婷,等.整合多源转录组数据与机器学习挖掘阿尔茨海默病特征基因[J].基因组学与应用生物学,2025,44(09):964-978.DOI:10.13417/j.gab.044.000964.
基金信息:
国家自然科学基金项目(82302547)资助
2025-07-21
2025-07-21
2025-07-21