A Feature Selection Approach Guided an Early Prediction of Anthocyanin Accumulation Using Massive Untargeted Metabolomics Data in Mulberry

文献类型: 外文期刊

第一作者: Yu, Cui

作者: Yu, Cui;Dong, Zhaoxia;Zhu, Zhixian;Mo, Rongli;Li, Yong;Deng, Wen;Hu, Xingming;Zhang, Cheng;Han, Guangming;Jemaa, Essemine

作者机构:

关键词: Anthocyanin; Feature selection; Machine learning; Metabolomics; Mulberry; Univariate statistics

期刊名称:PLANT AND CELL PHYSIOLOGY ( 影响因子:4.937; 五年影响因子:5.783 )

ISSN: 0032-0781

年卷期: 2022 年 63 卷 5 期

页码:

收录情况: SCI

摘要: Identifying the early predictive biomarkers or compounds represents a pivotal task for guiding a targeted agricultural practice. Despite the various available tools, it remains challenging to define the ideal compound combination and thereby elaborate an effective predictive model fitting that. Hence, we employed a stepwise feature selection approach followed by a maximum relevance and minimum redundancy (MRMR) on the untargeted metabolism in four mulberry genotypes at different fruit developmental stages (FDSs). Thus, we revealed that 7 out of 226 differentially abundant metabolites (DAMs) explained up to 80% variance of anthocyanin based on linear regression model and stepwise feature selection approach accompanied by an MRMR across the genotypes over the FDSs. Among them, the phosphoenolpyruvate, d-mannose and shikimate show the top 3 attribution indexes to the accumulation of anthocyanin in the fruits of these genotypes across the four FDSs. The obtained results were further validated by assessing the regulatory genes expression levels and the targeted metabolism approach. Taken together, our findings provide valuable evidences on the fact that the anthocyanin biosynthesis is somehow involved in the coordination between the carbon metabolism and secondary metabolic pathway. Our report highlights as well the importance of using the feature selection approach for the predictive biomarker identification issued from the untargeted metabolomics data.

分类号:

  • 相关文献
作者其他论文 更多>>