Noise-tolerant RGB-D feature fusion network for outdoor fruit detection
文献类型: 外文期刊
第一作者: Sun, Qixin
作者: Sun, Qixin;Chai, Xiujuan;Zhou, Guomin;Sun, Tan;Sun, Qixin;Chai, Xiujuan;Zhou, Guomin;Sun, Tan;Zeng, Zhikang
作者机构:
关键词: Multi-modal; Feature fusion; Attention mechanism; Object detection; Convolutional neural network
期刊名称:COMPUTERS AND ELECTRONICS IN AGRICULTURE ( 影响因子:6.757; 五年影响因子:6.817 )
ISSN: 0168-1699
年卷期: 2022 年 198 卷
页码:
收录情况: SCI
摘要: In the process of farm automation, fruit detection is the basis and guarantee for yield prediction, automatic picking, and other orchard operations. RGB images can only obtain the two-dimensional information of the scene, which is not sufficient to effectively distinguish fruits that are dense growth and occlusion by branches and leaves. With the development of depth sensors, using RGB-D images with more complementary information can boost the performance of fruit detection. However, due to the nature of sensors and scene configurations, the quality of outdoor depth images is poor, posing a challenge when fusing RGB-D features. Therefore, this paper proposes an end-to-end RGB-D object detection network, termed as noise-tolerant feature fusion network (NTFFN), to utilize the outdoor multi-modal data properly and improve the detection accuracy. Specifically, the NTFFN first uses two structurally identical feature extractors to extract single-modal (color and depth) features, which is the base of the subsequent feature fusion. Then, to avoid introducing too much depth noise and focus the perception on the important part of the features, an attention-based fusion module is designed to adaptively fuse the multi-modal features. Finally, multi-scale features from the color images and the fusion modules are used to predict object position, which not only improves the network's ability to detect multi-scale objects but also further enhances the noise immunity of the network. In addition, this paper constructs an RGB-D citrus fruit dataset, which contributes to comprehensively evaluating the proposed network. Evaluation metrics on the dataset show that the NT-FFN achieves an AP(50) of 95.4% with a real-time speed, which outperforms single-modal methods, common multi-modal fusion strategies, and advanced multi-modal detection methods. The proposed NT-FFN also achieves excellent detection results in other fruit detection tasks, which verifies its generalization ability. This study provides the possibility and foundation for performing multi-modal information fusion in outdoor fruit detection.
分类号:
- 相关文献
作者其他论文 更多>>
-
SGR-YOLO: a method for detecting seed germination rate in wild rice
作者:Yao, Qiong;Yao, Qiong;Zheng, Xiaoming;Zhou, Guomin;Zhang, Jianhua;Zhou, Guomin;Zhang, Jianhua
关键词:wild rice; germination detection; deep learning; SGR-YOLO; BiFPN
-
KASP-IEva: an intelligent typing evaluation model for KASP primers
作者:Chen, Xiaojing;Fan, Jingchao;Yan, Shen;Zhang, Jianhua;Chen, Xiaojing;Huang, Longyu;Fan, Jingchao;Zhou, Guomin;Zhang, Jianhua;Huang, Longyu;Zhou, Guomin;Huang, Longyu
关键词:intelligent evaluation; KASP marker; decision tree; genotyping; cotton; molecular marker-assisted selection
-
ESG-YOLO: A Method for Detecting Male Tassels and Assessing Density of Maize in the Field
作者:Wu, Wendi;Zhang, Yuhang;Wu, Wendi;Zhang, Jianhua;Zhou, Guomin;Zhang, Yuhang;Wang, Jian;Hu, Lin;Zhang, Jianhua;Zhou, Guomin;Wang, Jian;Hu, Lin;Zhou, Guomin
关键词:maize; tassel; target detection; attention mechanism; SPD-Conv; ESG-YOLO; density assessment
-
Development, integration, and field evaluation of an autonomous Agaricus bisporus picking robot
作者:Zhong, Ming;Han, Ruiqing;Liu, Yan;Huang, Bo;Liu, Yaxin;Chai, Xiujuan
关键词:Agaricus bisporus; Agriculture robotics; Overlapping target detection; Picking sequence planning; End-effector
-
A k-mer-based pangenome approach for cataloging seed-storage-protein genes in wheat to facilitate genotype-to-phenotype prediction and improvement of end-use quality
作者:Zhang, Zhaoheng;Liu, Dan;Li, Binyong;Wang, Wenxi;Zhang, Jize;Xin, Mingming;Hu, Zhaorong;Liu, Jie;Du, Jinkun;Peng, Huiru;Ni, Zhongfu;Sun, Qixin;Guo, Weilong;Yao, Yingyin;Zhang, Zhaoheng;Liu, Dan;Li, Binyong;Wang, Wenxi;Zhang, Jize;Xin, Mingming;Hu, Zhaorong;Liu, Jie;Du, Jinkun;Peng, Huiru;Ni, Zhongfu;Sun, Qixin;Guo, Weilong;Yao, Yingyin;Hao, Chenyang;Zhang, Xueyong
关键词:wheat; seed-storage protein; end-use quality; k- mer; pangenome; genomic prediction
-
Design and Experimentation of a Machine Vision-Based Cucumber Quality Grader
作者:Liu, Fanghong;Du, Chengtao;Ren, Xu;Huang, Bo;Zhang, Yanqi;Chai, Xiujuan
关键词:quality grader; cucumber grading; deep learning; mass prediction
-
TaNAM-6A is essential for nitrogen remobilisation and regulates grain protein content in wheat (Triticum aestivum L.)
作者:Meng, Xinhao;Zhai, Shanshan;Zhang, Runqi;Liu, Guoyu;Xu, Weiya;Yu, Jiazheng;Zhang, Yufeng;Ni, Zhongfu;Sun, Qixin;Xing, Jiewen;Li, Baoyun;Lou, Hongyao
关键词:elite haplotype; GPC; nitrate transporter 1/peptide family; nitrogen transport; TaNAM