A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
Document type: Foreign-language journal
Authors: Feng, Xuguang 1; Zhao, Chunjiang 2; Wang, Chunshan 1; Wu, Huarui 2; Miao, Yisheng 2; Zhang, Jingjian 5
Author affiliations: 1.Hebei Agr Univ, Sch Informat Sci & Technol, Baoding, Peoples R China
2.Natl Engn Res Ctr Informat Technol Agr, Beijing, Peoples R China
3.Minist Agr & Rural Affairs Peoples Republ China, Agr Key Lab Digital Village, Beijing, Peoples R China
4.Hebei Key Lab Agr Big Data, Baoding, Peoples R China
5.Cangzhou Acad Agr & Forestry Sci, Cangzhou, Peoples R China
Keywords: cross-modal fusion; transformer; few-shot; complex background; disease identification
Journal: FRONTIERS IN PLANT SCIENCE (Impact factor: 6.627; 5-year impact factor: 7.255)
ISSN: 1664-462X
Year/Volume/Issue: 2022, Vol. 13
Pages:
Indexed in: SCI
Abstract: In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments.
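Illustrative note: the abstract mentions bidirectional fusion of image and text features and the handling of sequences with different lengths. The sketch below is a generic illustration of cross-attention, not the authors' BiCMT implementation. It shows why attention between two modalities tolerates unequal sequence lengths: the output always has as many rows as the querying sequence. All names (`cross_attention`, `bidirectional_fuse`, `mean_pool`) are hypothetical, learned projections are omitted, and the mean pooling used to obtain a fixed-size vector is an assumption made only for this example.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys_values):
    """Scaled dot-product attention without learned projections (simplification).

    queries:     list of Lq vectors, each of dimension d
    keys_values: list of Lk vectors, each of dimension d (used as keys and values)
    Returns Lq vectors of dimension d, regardless of Lk.
    """
    d = len(queries[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys_values]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, keys_values))
                    for j in range(d)])
    return out


def mean_pool(seq):
    """Average a sequence of vectors into one vector (assumed pooling step)."""
    n = len(seq)
    return [sum(v[j] for v in seq) / n for j in range(len(seq[0]))]


def bidirectional_fuse(image_tokens, text_tokens):
    """Attend in both directions, pool each result, and concatenate."""
    img_attends_text = cross_attention(image_tokens, text_tokens)
    txt_attends_img = cross_attention(text_tokens, image_tokens)
    return mean_pool(img_attends_text) + mean_pool(txt_attends_img)


# Toy data: 6 image tokens and 3 text tokens, both of dimension 4.
rng = random.Random(0)
D = 4
image_tokens = [[rng.gauss(0, 1) for _ in range(D)] for _ in range(6)]
text_tokens = [[rng.gauss(0, 1) for _ in range(D)] for _ in range(3)]

fused = bidirectional_fuse(image_tokens, text_tokens)
print(len(fused))  # 8: two pooled vectors of dimension 4, concatenated
```

In a real model the queries, keys, and values would pass through learned linear projections, and the fused vector would feed a classification head; those parts are left out here.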
- Related literature
Other papers by the authors
-
Overview of Pest Detection and Recognition Algorithms
Authors: Guo, Boyu; Wang, Jianji; Guo, Minghui; Chen, Miao; Chen, Yanan; Miao, Yisheng
Keywords: smart agriculture; pest detection; pest recognition
-
Research on Positioning and Navigation System of Greenhouse Mobile Robot Based on Multi-Sensor Fusion
Authors: Cheng, Bo; Li, Xiaoyue; Zhang, Ning; Song, Weitang; He, Xueying; Wu, Huarui
Keywords: agricultural greenhouse; navigation robot; multi-sensor fusion; ultra-wideband; inertial measurement unit; odometry; rangefinder
-
Recognition of wheat rusts in a field environment based on improved DenseNet
Authors: Chang, Shenglong; Cheng, Jinpeng; Fan, Zehua; Ma, Xinming; Li, Yong; Zhao, Chunjiang; Yang, Guijun; Yang, Xiaodong
Keywords: Plant disease; Wheat rust; Image processing; Deep learning; Computer vision (CV); DenseNet
-
GCVC: Graph Convolution Vector Distribution Calibration for Fish Group Activity Recognition
Authors: Zhao, Zhenxi; Zhao, Chunjiang; Yang, Xinting; Zhou, Chao; Liu, Jintao
Keywords: Fish; Feature extraction; Activity recognition; Calibration; Adhesives; Training; Convolution; Graph convolution vector calibration; fish group activity; activity feature vector calibration; fish activity dataset
-
Adaptive precision cutting method for rootstock grafting of melons: modeling, analysis, and validation
Authors: Chen, Shan; Zhao, Chunjiang; Jiang, Kai; Zheng, Wengang; Jia, Dongdong
Keywords: Melon; Grafting robot; Adaptive cutting; Rootstock pith cavity; Machine vision
-
Long-range infrared absorption spectroscopy and fast mass spectrometry for rapid online measurements of volatile organic compounds from black tea fermentation
Authors: Yang, Chongshan; Li, Guanglin; Zhao, Chunjiang; Fu, Xinglan; Jiao, Leizi; Wen, Xuelin; Lin, Peng; Duan, Dandan; Dong, Daming; Dong, Chunwang
Keywords: Black tea fermentation; Volatile organic compounds; Proton transfer reaction mass spectrometry; Fourier transform infrared spectroscopy; Principal component analysis; Extreme learning machine
-
Navigation line extraction algorithm for corn spraying robot based on YOLOv8s-CornNet
Authors: Guo, Peiliang; Diao, Zhihua; Ma, Shushuai; He, Zhendong; Zhao, Suna; Zhao, Chunjiang; Li, Jiangbo; Zhang, Ruirui; Yang, Ranbing; Zhang, Baohua
Keywords: agricultural robotics; computer vision; deep learning; navigation line extraction; network lightweight