Semantic Segmentation of Rice Fields in Sub-Meter Satellite Imagery Using an HRNet-CA-Enhanced DeepLabV3+Framework

文献类型: 外文期刊

第一作者: Shao, Yifan

作者: Shao, Yifan;Pan, Pan;Zhao, Hongxin;Li, Jiale;Yu, Guoping;Zhang, Jianhua;Shao, Yifan;Pan, Pan;Zhao, Hongxin;Li, Jiale;Yu, Guoping;Zhang, Jianhua;Zhou, Guomin;Zhou, Guomin;Zhou, Guomin

作者机构:

关键词: rice area; DeepLabV3+; High-Resolution Network; coordinate attention; semantic segmentation

期刊名称:REMOTE SENSING ( 影响因子:4.1; 五年影响因子:4.8 )

ISSN:

年卷期: 2025 年 17 卷 14 期

页码:

收录情况: SCI

摘要: Accurate monitoring of rice-planting areas underpins food security and evidence-based farm management. Recent work has advanced along three complementary lines-multi-source data fusion (to mitigate cloud and spectral confusion), temporal feature extraction (to exploit phenology), and deep-network architecture optimization. However, even the best fusion- and time-series-based approaches still struggle to preserve fine spatial details in sub-meter scenes. Targeting this gap, we propose an HRNet-CA-enhanced DeepLabV3+ that retains the original model's strengths while resolving its two key weaknesses: (i) detail loss caused by repeated down-sampling and feature-pyramid compression and (ii) boundary blurring due to insufficient multi-scale information fusion. The Xception backbone is replaced with a High-Resolution Network (HRNet) to maintain full-resolution feature streams through multi-resolution parallel convolutions and cross-scale interactions. A coordinate attention (CA) block is embedded in the decoder to strengthen spatially explicit context and sharpen class boundaries. The rice dataset consisted of 23,295 images (11,295 rice + 12,000 non-rice) via preprocessing and manual labeling and benchmarked the proposed model against classical segmentation networks. Our approach boosts boundary segmentation accuracy to 92.28% MIOU and raises texture-level discrimination to 95.93% F1, without extra inference latency. Although this study focuses on architecture optimization, the HRNet-CA backbone is readily compatible with future multi-source fusion and time-series modules, offering a unified path toward operational paddy mapping in fragmented sub-meter landscapes.

分类号:

  • 相关文献
作者其他论文 更多>>