Improving model parsimony and accuracy by modified greedy feature selection in digital soil mapping
文献类型: 外文期刊
第一作者: Zhang, Xianglin
作者: Zhang, Xianglin;Chen, Songchao;Zhang, Xianglin;Chen, Songchao;Wang, Nan;Xiao, Yi;Chen, Qianqian;Hong, Yongsheng;Shi, Zhou;Xue, Jie;Zhou, Yin;Teng, Hongfen;Hu, Bifeng;Zhuo, Zhiqing;Ji, Wenjun;Huang, Yuanfang;Gou, Yuxuan;Richer-de-Forges, Anne C.;Arrouays, Dominique
作者机构:
关键词: Digital soil mapping; Variable selection; Quantile regression forests; Computation efficiency; Northeast and North China
期刊名称:GEODERMA ( 影响因子:6.1; 五年影响因子:7.0 )
ISSN: 0016-7061
年卷期: 2023 年 432 卷
页码:
收录情况: SCI
摘要: In the context of increasing soil degradation worldwide, spatially explicit soil information is urgently needed to support decision-making for sustaining limited soil resources. Digital soil mapping (DSM) has been proven as an efficient way to deliver soil information from local to global scales. The number of environmental covariates used for DSM has rapidly increased due to the growing volume of remote sensing data, therefore variable selection is necessary to deal with multicollinearity and improve model parsimony. Compared with Boruta, recursive feature elimination (RFE), and variance inflation factor (VIF) analysis, we proposed the use of modified greedy feature selection (MGFS), for DSM regression. For this purpose, using quantile regression forest, 402 soil samples and 392 environmental covariates were used to map the spatial distribution of soil organic carbon density (SOCD) in Northeast and North China. The result showed that MGFS selected the most parsimonious model with only 9 covariates (e.g., brightness index, mean annual temperature), much lower than RFE (22 covariates), VIF (30 covariates), and Boruta (76 covariates). The repeated validation (50 times) showed that the MGFS derived model performed better (R2 of 0.60, LCCC of 0.74, RMSE of 13.80 t ha -1) than these using full covariates, Boruta, RFE and VIF (R2 of 0.48-0.57, LCCC of 0.64-0.72, RMSE of 14.24-15.79 t ha -1). Despite the similar performance of the uncertainty estimate (PICP), the model using MGFS and RFE had the lowest global uncertainty (0.86) as indicated by the uncertainty index. In addition, MGFS had the best computation efficiency when considering the steps of variable selection and map prediction. Given these advantages over Boruta, RFE and VIF, MGFS has a high potential in fine-resolution soil mapping practices, especially for these studies at a broad scale involving heavy computation on millions or billions of pixels.
分类号:
- 相关文献
作者其他论文 更多>>
-
Critical review and recent advances of emerging real-time and non-destructive strategies for meat spoilage monitoring
作者:Chen, Jiaci;Zhang, Juan;Wang, Nan;Xiao, Bin;Sun, Xiaoyun;Yang, Longrui;Pang, Xiangyi;Huang, Fengchun;Chen, Ailiang;Li, Jiapeng;Zhong, Ke;Huang, Fengchun;Chen, Ailiang
关键词:Meat safety; Meat quality monitoring; Meat spoilage; Meat freshness evaluation; Meat spoilage detection
-
Sap flow of two typical woody halophyte species responding to the meteorological and irrigation water conditions in Taklimakan Desert
作者:Liu, Jiao;Liu, Jiao;Zhao, Ying;Wang, Yongdong;Xue, Jie;Wang, Shunke;Liu, Jiao;Zhao, Ying;Zhang, Jianguo;Zhao, Ying;Xue, Jie;Wang, Shunke;Xue, Jie;Wang, Shunke;Chang, Jingjing
关键词:Taklimakan Desert Highway shelterbelt; Woody halophyte; Saline -tolerant plant; Sap flow; Irrigation regimes
-
Direct and indirect impacts of land use/cover change on urban heat environment: a 15-year panel data study across 365 Chinese cities during summer daytime and nighttime
作者:He, Tong;Wang, Nan;Chen, Jiayue;Lu, Yingshuang;Qiao, Zhi;He, Tong;Hao, Yan;Wu, Feng;Xu, Xinliang;Liu, Luo;Han, Dongrui;Sun, Zongyao
关键词:Urban heat environment; Land use/cover change; Contribution index; Regional climate; Direct and indirect impacts
-
Soybean ( Glycine max ) rhizosphere organic phosphorus recycling relies on acid phosphatase activity and specific phosphorusmineralizing-related bacteria in phosphate deficient acidic soils
作者:Chen, Qianqian;Zhao, Qian;Xie, Baoxing;Lu, Xing;Guo, Qi;Liu, Guoxuan;Zhou, Ming;Chen, Kang;Tian, Jiang;Liang, Cuiyue;Tian, Jihui;Lu, Weiguo
关键词:organic phosphorus; acid phosphatase; soybean; bacterial community; phoC -harboring bacteria
-
Genetically optimizing soybean nodulation improves yield and protein content
作者:Zhong, Xiangbin;Wang, Jie;Cai, Chenlin;Zhu, Xiaomin;Su, Jiaqing;Shi, Xiaolei;Yang, Chunyan;Bai, Mengyan;Kuang, Huaqin;Wang, Xin;Kong, Fanjiang;Guan, Yuefeng;Yuan, Cuicui;Wang, Nan;Su, Jiaqing;Guan, Yuefeng;He, Xin;Wang, Ertao;Liu, Xiao;Yang, Wenqiang
关键词:
-
Astragalus Polysaccharide Modulates the Gut Microbiota and Metabolites of Patients with Type 2 Diabetes in an In Vitro Fermentation Model
作者:Zhang, Xin;Jia, Lina;Ma, Qian;Zhang, Tongcun;Qi, Wei;Wang, Nan;Zhang, Xin;Jia, Lina;Ma, Qian;Zhang, Tongcun;Qi, Wei;Wang, Nan;Zhang, Xiaoyuan;Chen, Mian;Liu, Fei;Jia, Weiguo;Zhu, Liying
关键词:Astragalus polysaccharide; type 2 diabetes mellitus; fecal microbiota; metabolites
-
Rapid DNA extraction and microfluidic LAMP system in portable equipment for GM crops detection
作者:Xiao, Bin;Zhang, Juan;Wang, Nan;Li, Liang;Pang, Xiangyi;Liu, Chuan;Huang, Fengchun;Chen, Ailiang;Wang, Mengyu;Fu, Wei;Chen, Hong;Wang, Haoqian
关键词:Fast extraction method; Hand-held chip; LAMP; Portable analyzer; Genetically modified crops; On -spot detection