您好,欢迎访问北京市农林科学院 机构知识库!

An explainable XGBoost model improved by SMOTE-ENN technique for maize lodging detection based on multi-source unmanned aerial vehicle images

文献类型: 外文期刊

作者: Han, Liang 1 ; Yang, Guijun 2 ; Yang, Xiaodong 2 ; Song, Xiaoyu 2 ; Xu, Bo 2 ; Li, Zhenhai 2 ; Wu, Jintao 2 ; Yang, Hao 2 ; Wu, Jianwei 4 ;

作者机构: 1.Shanxi Datong Univ, Coll Architecture & Geomat Engn, Datong 037009, Peoples R China

2.Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Key Lab Quantitat Remote Sensing Agr, Minist Agr & Rural Affairs, Beijing 100097, Peoples R China

3.Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China

4.Beijing PAIDE Sci & Technol Dev Co Ltd, Beijing 100097, Peoples R China

5.Changan Univ, Sch Geol Engn & Surveying & Mapping, Xian 710054, Peoples R China

关键词: Lodging; XGBoost; SHAP; Remote sensing; SMOTE

期刊名称:COMPUTERS AND ELECTRONICS IN AGRICULTURE ( 影响因子:6.757; 五年影响因子:6.817 )

ISSN: 0168-1699

年卷期: 2022 年 194 卷

页码:

收录情况: SCI

摘要: Remote sensing image is becoming an increasingly popular tool for crop lodging detection because it conveniently provides features for building machine learning models and predicting lodging. However, difficulties in interpreting machine learning models and their predictions limit the confidence of using remote sensing images to detect lodging. In addition, the lodging datasets used for modeling are difficult to balance under natural conditions. Designing a robust and interpretable classification model for the detection of lodging in an imbalanced distribution dataset poses a particularly difficult challenge. In this study, visible and multi-spectral images were collected with a UAV to extract relevant features from remote sensing images.In a preliminary step, Synthetic Minority Oversampling Technique (SMOTE) and Edited Nearest Neighbors (ENN) method were used to treat imbalanced datasets. The SMOTE-ENN-XGBoost model is proposed for the efficient identification of maize lodging at the plot scale. The SMOTE-ENN-XGBoost model achieved an F1-score of 0.930 and a recall of 0.899 on a testing set, suggesting that it can be used for modeling lodging detection. Additionally, the SHapley Additive exPlanations (SHAP) approach was employed to interpret the identification and prioritization of features that determine lodging classification and activity prediction. The results showed that canopy structure and textural features are relatively stable compared with spectral features, which are susceptible to the external environment when modeling is employed to detect lodging. This work also showed that canopy structural, spectral, and textural information should be considered simultaneously rather than separately when detecting crop lodging in a crop breeding program in order to prevent differences in expression controlled by the interaction between genotype and environment obscuring the change in a single feature before and after lodging. For practical applications of machine learning models in crop lodging detection, such insights are of critical relevance. Taken together, the results of this study encourage further applications of remote sensing techniques to build interpretable machine learning models.

  • 相关文献
作者其他论文 更多>>