Noise-tolerant RGB-D feature fusion network for outdoor fruit detection
文献类型: 外文期刊
第一作者: Sun, Qixin
作者: Sun, Qixin;Chai, Xiujuan;Zhou, Guomin;Sun, Tan;Sun, Qixin;Chai, Xiujuan;Zhou, Guomin;Sun, Tan;Zeng, Zhikang
作者机构:
关键词: Multi-modal; Feature fusion; Attention mechanism; Object detection; Convolutional neural network
期刊名称:COMPUTERS AND ELECTRONICS IN AGRICULTURE ( 影响因子:6.757; 五年影响因子:6.817 )
ISSN: 0168-1699
年卷期: 2022 年 198 卷
页码:
收录情况: SCI
摘要: In the process of farm automation, fruit detection is the basis and guarantee for yield prediction, automatic picking, and other orchard operations. RGB images can only obtain the two-dimensional information of the scene, which is not sufficient to effectively distinguish fruits that are dense growth and occlusion by branches and leaves. With the development of depth sensors, using RGB-D images with more complementary information can boost the performance of fruit detection. However, due to the nature of sensors and scene configurations, the quality of outdoor depth images is poor, posing a challenge when fusing RGB-D features. Therefore, this paper proposes an end-to-end RGB-D object detection network, termed as noise-tolerant feature fusion network (NTFFN), to utilize the outdoor multi-modal data properly and improve the detection accuracy. Specifically, the NTFFN first uses two structurally identical feature extractors to extract single-modal (color and depth) features, which is the base of the subsequent feature fusion. Then, to avoid introducing too much depth noise and focus the perception on the important part of the features, an attention-based fusion module is designed to adaptively fuse the multi-modal features. Finally, multi-scale features from the color images and the fusion modules are used to predict object position, which not only improves the network's ability to detect multi-scale objects but also further enhances the noise immunity of the network. In addition, this paper constructs an RGB-D citrus fruit dataset, which contributes to comprehensively evaluating the proposed network. Evaluation metrics on the dataset show that the NT-FFN achieves an AP(50) of 95.4% with a real-time speed, which outperforms single-modal methods, common multi-modal fusion strategies, and advanced multi-modal detection methods. The proposed NT-FFN also achieves excellent detection results in other fruit detection tasks, which verifies its generalization ability. This study provides the possibility and foundation for performing multi-modal information fusion in outdoor fruit detection.
分类号:
- 相关文献
作者其他论文 更多>>
-
Efficient Triple Attention and AttentionMix: A Novel Network for Fine-Grained Crop Disease Classification
作者:Zhang, Yanqi;Zhang, Ning;Chai, Xiujuan;Zhu, Jingbo;Dong, Wei;Sun, Tan
关键词:crop pests and diseases; CNNs; channel attention; spatial attention; data augmentation
-
MMVSL: A multi-modal visual semantic learning method for pig pose and action recognition
作者:Guan, Zhibin;Chai, Xiujuan;Guan, Zhibin;Chai, Xiujuan
关键词:MMVSL; Multi-modal; Pig pose estimation; Action recognition; Improved HRNet
-
AECA-FBMamba: A Framework with Adaptive Environment Channel Alignment and Mamba Bridging Semantics and Details
作者:Chai, Xin;Zhang, Wenrong;Li, Zhaoxin;Zhang, Ning;Chai, Xiujuan
关键词:remote sensing; deep learning; weakly supervised learning; Mamba; Transformer
-
Digital twin-driven system for efficient tomato harvesting in greenhouses
作者:Lang, Yining;Zhang, Yanqi;Sun, Tan;Chai, Xiujuan;Zhang, Ning;Lang, Yining;Zhang, Yanqi;Zhang, Ning
关键词:Tomato; Harvest; Digital-twin; Greenhouse; Reinforcement-learning; Decision
-
Suppression of TaHDA8-mediated lysine deacetylation of TaAREB3 acts as a drought-adaptive mechanism in wheat root development
作者:Liu, Zehui;Yang, Qun;Liu, Xingbei;Li, Jinpeng;Zhang, Lei;Chu, Wei;Lin, Jingchen;Liu, Debiao;Zhao, Danyang;Peng, Xiao;Xin, Mingming;Yao, Yingyin;Peng, Huiru;Ni, Zhongfu;Sun, Qixin;Hu, Zhaorong;Zeng, Chaowu
关键词:wheat; drought resistance; root length; TaHDA8; TaAREB3; deacetylation
-
Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
作者:Fei, Yunqiao;Fan, Jingchao;Fei, Yunqiao;Fei, Yunqiao;Fan, Jingchao;Zhou, Guomin;Zhou, Guomin
关键词:research papers; knowledge extraction; large language models; prompt engineering; fruit tree diseases
-
PAB-Mamba-YOLO: VSSM assists in YOLO for aggressive behavior detection among weaned piglets
作者:Xia, Xue;Zhan, Ning;Guan, Zhibin;Chai, Xin;Ma, Shixin;Chai, Xiujuan;Xia, Xue;Zhan, Ning;Sun, Tan;Guan, Zhibin
关键词:Aggressive behaviors; Weaned piglet; Mamba; YOLO; Hybrid detection model