A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
文献类型: 外文期刊
作者: Feng, Xuguang 1 ; Zhao, Chunjiang 2 ; Wang, Chunshan 1 ; Wu, Huarui 2 ; Miao, Yisheng 2 ; Zhang, Jingjian 5 ;
作者机构: 1.Hebei Agr Univ, Sch Informat Sci & Technol, Baoding, Peoples R China
2.Natl Engn Res Ctr Informat Technol Agr, Beijing, Peoples R China
3.Minist Agr & Rural Affairs Peoples Republ China, Agr Key Lab Digital Village, Beijing, Peoples R China
4.Hebei Key Lab Agr Big Data, Baoding, Peoples R China
5.Cangzhou Acad Agr & Forestry Sci, Cangzhou, Peoples R China
关键词: cross-modal fusion; transformer; few-shot; complex background; disease identification
期刊名称:FRONTIERS IN PLANT SCIENCE ( 影响因子:6.627; 五年影响因子:7.255 )
ISSN: 1664-462X
年卷期: 2022 年 13 卷
页码:
收录情况: SCI
摘要: In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments.
- 相关文献
作者其他论文 更多>>
-
Staggered-Phase Spray Control: A Method for Eliminating the Inhomogeneity of Deposition in Low-Frequency Pulse-Width Modulation (PWM) Variable Spray
作者:Zhang, Chunfeng;Zhao, Chunjiang;Zhang, Chunfeng;Zhai, Changyuan;Zhang, Meng;Zhang, Chi;Zou, Wei;Zhao, Chunjiang;Zhang, Chunfeng;Zou, Wei;Zhai, Changyuan;Zhang, Meng;Zhao, Chunjiang
关键词:precision spray; variable spray; PWM; deposition; duty cycle; frequency
-
A Cucumber Leaf Disease Severity Grading Method in Natural Environment Based on the Fusion of TRNet and U-Net
作者:Yao, Hui;Wang, Chunshan;Liu, Bo;Liang, Fangfang;Yao, Hui;Wang, Chunshan;Liu, Bo;Liang, Fangfang;Wang, Chunshan;Zhang, Lijie;Li, Jiuxi
关键词:cucumber disease; disease spot; fusion of TRNet and U-Net; two-stage segmentation framework; disease severity grading
-
A novel electrochemical sensor for in situ and in vivo detection of sugars based on boronic acid-diol recognition
作者:Liu, Ke;Xu, Tongyu;Zhao, Chunjiang;Liu, Ke;Li, Aixue;Zhao, Chunjiang
关键词:Fructose; Glucose; Electrochemical biosensor; In situ; In vivo; Artificial neural network
-
Eliminating Primacy Bias in Online Reinforcement Learning by Self-Distillation
作者:Li, Jingchen;Wu, Huarui;Zhao, Chunjiang;Shi, Haobin;Hwang, Kao-Shing
关键词:Online reinforcement learning; overfitting; reinforcement learning
-
Using high-throughput phenotype platform MVS-Pheno to reconstruct the 3D morphological structure of wheat
作者:Li, Wenrui;Zhao, Chunjiang;Li, Wenrui;Wu, Sheng;Wen, Weiliang;Lu, Xianju;Liu, Haishen;Zhang, Minggang;Xiao, Pengliang;Guo, Xinyu;Zhao, Chunjiang;Li, Wenrui;Wu, Sheng;Wen, Weiliang;Lu, Xianju;Liu, Haishen;Zhang, Minggang;Xiao, Pengliang;Guo, Xinyu
关键词:3D reconstruction; plant morphology; point cloud segmentation; Wheat
-
Dynamic Compressive Stress Relaxation Model of Tomato Fruit Based on Long Short-Term Memory Model
作者:Ru, Mengfei;Zhao, Chunjiang;Feng, Qingchun;Sun, Na;Li, Yajun;Sun, Jiahui;Li, Jianxun;Ru, Mengfei;Feng, Qingchun;Zhao, Chunjiang
关键词:tomato; stress relaxation; machine learning; LSTM
-
Energy and environmental evaluation and comparison of a diesel-electric hybrid tractor, a conventional tractor, and a hillside mini-tiller using the life cycle assessment method
作者:Liu, Wei;Yang, Rui;Li, Li;Zhao, Chunjiang;Li, Guanglin;Zhao, Chunjiang
关键词:Agricultural machinery; Electrification; Hybrid electric tractor; Environmental impact