您好,欢迎访问湖北省农业科学院 机构知识库!

A Feature Selection Approach Guided an Early Prediction of Anthocyanin Accumulation Using Massive Untargeted Metabolomics Data in Mulberry

文献类型: 外文期刊

作者: Yu, Cui 1 ; Dong, Zhaoxia 1 ; Jemaa, Essemine 2 ; Zhu, Zhixian 1 ; Mo, Rongli 1 ; Li, Yong 1 ; Deng, Wen 1 ; Hu, Xingming 1 ; Zhang, Cheng 1 ; Han, Guangming 1 ;

作者机构: 1.Hubei Acad Agr Sci, Ind Crops Inst, 43 Nanhu Rd, Wuhan 430064, Hubei, Peoples R China

2.Chinese Acad Sci, CAS Ctr Excellence Mol Plant Sci, Natl Key Lab Plant Mol Genet, 300 Fenglin Rd, Shanghai 200032, Peoples R China

关键词: Anthocyanin; Feature selection; Machine learning; Metabolomics; Mulberry; Univariate statistics

期刊名称:PLANT AND CELL PHYSIOLOGY ( 影响因子:4.937; 五年影响因子:5.783 )

ISSN: 0032-0781

年卷期: 2022 年 63 卷 5 期

页码:

收录情况: SCI

摘要: Identifying the early predictive biomarkers or compounds represents a pivotal task for guiding a targeted agricultural practice. Despite the various available tools, it remains challenging to define the ideal compound combination and thereby elaborate an effective predictive model fitting that. Hence, we employed a stepwise feature selection approach followed by a maximum relevance and minimum redundancy (MRMR) on the untargeted metabolism in four mulberry genotypes at different fruit developmental stages (FDSs). Thus, we revealed that 7 out of 226 differentially abundant metabolites (DAMs) explained up to 80% variance of anthocyanin based on linear regression model and stepwise feature selection approach accompanied by an MRMR across the genotypes over the FDSs. Among them, the phosphoenolpyruvate, d-mannose and shikimate show the top 3 attribution indexes to the accumulation of anthocyanin in the fruits of these genotypes across the four FDSs. The obtained results were further validated by assessing the regulatory genes expression levels and the targeted metabolism approach. Taken together, our findings provide valuable evidences on the fact that the anthocyanin biosynthesis is somehow involved in the coordination between the carbon metabolism and secondary metabolic pathway. Our report highlights as well the importance of using the feature selection approach for the predictive biomarker identification issued from the untargeted metabolomics data.

  • 相关文献
作者其他论文 更多>>