您好,欢迎访问北京市农林科学院 机构知识库!

MSOAR-YOLOv10: Multi-Scale Occluded Apple Detection for Enhanced Harvest Robotics

文献类型: 外文期刊

作者: Fu, Heng 1 ; Guo, Zhengwei 3 ; Feng, Qingchun 3 ; Xie, Feng 3 ; Zuo, Yijing 4 ; Li, Tao 1 ;

作者机构: 1.Minist Agr & Rural Affairs, Key Lab Modern Agr Intelligent Equipment South Chi, Guangzhou 510630, Peoples R China

2.Northwest A&F Univ, Coll Mech & Elect Engn, Yangling 712100, Peoples R China

3.Beijing Acad Agr & Forestry Sci, Intelligent Equipment Res Ctr, Beijing 100097, Peoples R China

4.Tianjin Foreign Studies Univ, Binhai Foreign Affairs Coll, Dept Basic Courses, Tianjin 300270, Peoples R China

关键词: YOLOv10; object detection; deep learning; apple

期刊名称:HORTICULTURAE ( 影响因子:3.0; 五年影响因子:3.2 )

ISSN:

年卷期: 2024 年 10 卷 12 期

页码:

收录情况: SCI

摘要: The accuracy of apple fruit recognition in orchard environments is significantly affected by factors such as occlusion and lighting variations, leading to issues such as missed and false detections. To address these challenges, particularly related to occluded apples, this study proposes an improved apple-detection model, MSOAR-YOLOv10, based on YOLOv10. Firstly, a multi-scale feature fusion network is enhanced by adding a 160 x 160 feature scale layer to the backbone network, which increases the model's sensitivity to small local features, particularly for occluded fruits. Secondly, the Squeeze-and-Excitation (SE) attention mechanism is integrated into the C2fCIB convolution module of the backbone network to improve the network's focus on the regions of interest in the input images. Additionally, a Diverse Branch Block (DBB) module is introduced to enhance the performance of the convolutional neural network. Furthermore, a Normalized Wasserstein Distance (NWD) loss function is proposed to effectively reduce missed detections of densely packed and overlapping targets. Experimental results in orchards indicate that the proposed improved YOLOv10 model achieves precision, recall, and mean average precision rates of 89.3%, 89.8%, and 92.8%, respectively, representing increases of 3.1%, 2.2%, and 3.0% compared to the original YOLOv10 model. These results validate that the proposed network significantly enhances apple recognition accuracy in complex orchard environments, particularly improving the operational precision of harvesting robots in real-world conditions.

  • 相关文献
作者其他论文 更多>>