Machine Learning-Driven Identification of Key Environmental Factors Influencing Fiber Yield and Quality Traits in Upland Cotton

文献类型: 外文期刊

第一作者: Souaibou, Mohamadou

作者: Souaibou, Mohamadou;Yan, Haoliang;Pan, Jingtao;Li, Yang;Shi, Yuzhen;Gong, Wankui;Shang, Haihong;Gong, Juwu;Yuan, Youlu;Yan, Haoliang;Pan, Jingtao;Li, Yang;Shi, Yuzhen;Shang, Haihong;Gong, Juwu;Yuan, Youlu;Dai, Panhong;Yuan, Youlu

作者机构:

关键词: cotton; machine learning; genotype environment interaction; SHAP interpretation; environmental factors

期刊名称:PLANTS-BASEL ( 影响因子:4.1; 五年影响因子:4.5 )

ISSN: 2223-7747

年卷期: 2025 年 14 卷 13 期

页码:

收录情况: SCI

摘要: Understanding the influence of environmental factors on cotton performance is crucial for enhancing yield and fiber quality in the context of climate change. This study investigates genotype-by-environment (GxE) interactions in cotton, using data from 250 recombinant inbred lines (CCRI70 RILs) cultivated across 14 diverse environments in China's major cotton cultivation areas. Our findings reveal that environmental effects predominantly influenced yield-related traits (boll weight, lint percentage, and the seed index), contributing to 34.7% to 55.7% of their variance. In contrast fiber quality traits showed lower environmental sensitivity (12.3-27.0%), with notable phenotypic plasticity observed in the boll weight, lint percentage, and fiber micronaire. Employing six machine learning models, Random Forest demonstrated superior predictive ability (R2 = 0.40-0.72; predictive Pearson correlation = 0.63-0.86). Through SHAP-based interpretation and sliding-window regression, we identified key environmental drivers primarily active during mid-to-late growth stages. This approach effectively reduced the number of influential input variables to just 0.1-2.4% of the original dataset, spanning 2-9 critical time windows per trait. Incorporating these identified drivers significantly improved cross-environment predictions, enhancing Random Forest accuracy by 0.02-0.15. These results underscore the strong potential of machine learning to uncover critical temporal environmental factors underlying GxE interactions and to substantially improve predictive modeling in cotton breeding programs, ultimately contributing to more resilient and productive cotton cultivation.

分类号:

  • 相关文献
作者其他论文 更多>>