Transfer large models to crop pest recognition-A cross-modal unified framework for parameters efficient fine-tuning
文献类型: 外文期刊
第一作者: Liu, Jianping
作者: Liu, Jianping;Xing, Jialu;Sun, Lulu;Chen, Xi;Liu, Jianping;Zhou, Guomin;Wang, Jian
作者机构:
关键词: Parameters efficient fine-tuning; Cross-modal fusion; Computer vision; Large pre-training model; Pest recognition; CLIP
期刊名称:COMPUTERS AND ELECTRONICS IN AGRICULTURE ( 影响因子:8.9; 五年影响因子:9.3 )
ISSN: 0168-1699
年卷期: 2025 年 237 卷
页码:
收录情况: SCI
摘要: Crop pest recognition is an important direction in agricultural research, which is of great significance for improving crop yield and scientifically classifying pests for precision agriculture. Traditional deep learning pest recognition usually trains proprietary models on single categories and scenes as well as unimodal information, achieving excellent performance. However, this scheme has a weak foundation of general knowledge, insufficient transferability, and unimodal information has limited effect on the recognition of pest background and different life stages. In recent years, transferring the general knowledge of Large pre-trained models (LPTM) to specific domains through full fine-tuning has become an effective solution. However, full fine-tuning requires massive data and operator resources to effectively adapt all parameters. Therefore, this paper proposes a cross-modal parameter efficient fine-tuning (PEFT) unified framework for crop pest recognition with the multimodal large model CLIP as the pre-training model. The proposed method employs CLIP as the encoder for both image and text modalities, introducing the Dual-(PAL)G model. Firstly, learnable Prompt sequences are embedded in the input or hidden layers of the encoder. Secondly, multimodal LoRA is parallelly replaced in the dimension expansion layer of the fully connected layer. Then, the Gate unit integrates three PEFT methods-Prompt, Adapter, and LoRA, to enhance learning ability. We designed the GSC-Adapter and the parameter-efficient Light-GCS-Adapter for cross-modal semantic information fusion. To verify the effectiveness of the method, we conducted a large number of experiments on public datasets for crop pest recognition. Firstly, on the public dataset IP102 (for fine-grained recognition), we surpassed ViT and Swin Transformer with 66% of the sample size. In wolfberry pest dataset WPIT9K, using only about 15% of the sample size, it surpasses the previous state-of-the-art model ITF-WPI, achieving 98% accuracy. It also shows excellent performance on eight general tasks. This study provides a new technical solution for the field of agricultural pest recognition . This solution can efficiently transfer the general knowledge of multimodal LPTM to the specific pest recognition field under the condition of a few samples, with only a minimal number of parameters introduced. At the same time, this method has universality in cross-modal recognition tasks. The code for this study will be posted on GitHub (https://github.com/VcRenOne/Dual-PAL-G)
分类号:
- 相关文献
作者其他论文 更多>>
-
An improved 3D-SwinT-CNN network to evaluate the fermentation degree of black tea
作者:Zhu, Fengle;Wang, Jian;Zhang, Yuqian;Zhao, Zhangfeng;Shi, Jiang;He, Mengzhu
关键词:Black tea fermentation; Hyperspectral imaging; 3D-SwinT-CNN; 3D convolutional neural networks; Swin transformer
-
Natural variation in CTF1 conferring cold tolerance at the flowering stage in rice
作者:Dong, Jingfang;Zhang, Shaohong;Hu, Haifei;Wang, Jian;Li, Risheng;Wu, Jing;Chen, Jiansong;Zhou, Lian;Ma, Yamei;Li, Wenhui;Nie, Shuai;Liu, Bin;Zhao, Junliang;Yang, Tifeng;Li, Risheng;Wu, Jing;Wang, Shaokui;Zhang, Guiquan
关键词:cold tolerance; QTL; single segment substitution line; haplotype analysis; functional site; rice
-
Long short-term search session-based document re-ranking model
作者:Liu, Jianping;Wang, Meng;Wang, Yingfei;Chu, Xintao;Liu, Jianping;Wang, Jian
关键词:Document re-ranking; Long short-term session search; Memory network; BERT; User intent
-
The OsNL1-OsTOPLESS2-OsMOC1/3 pathway regulates high-order tiller outgrowth in rice
作者:Liu, Xin;Zhang, Feng;Xun, Ziqi;Shao, Jiale;Luo, Wenfan;Jiang, Xiaokang;Wang, Jiachang;Wang, Jian;Li, Shuai;Lin, Qibing;Ren, Yulong;Cheng, Zhijun;Wan, Jianmin;Liu, Xin;Zhao, Huixian;Cheng, Zhijun;Wan, Jianmin;Wan, Jianmin
关键词:High-order tiller; OsNL1; OsTOPLESS2; HAN domain; OsMOC1; OsMOC3
-
Comparative transcriptome profiling reveals the key genes and molecular mechanisms involved in rice under blast infection
作者:Li, Gang;Wang, Jian;Cheng, Baoshan;Wang, Di;Gao, Hao;Xu, Weijun;Wang, Wei;Gao, Qingsong;Zhang, Wenxia;Ji, Jianhui;Li, Bianhao;Zhang, Guoliang;Qi, Zhongqiang;Liu, Yongfeng
关键词:Rice; Blast; Transcriptome; Disease resistance; Hormones; Biochemical indicators
-
Preparation and application of porcine broadly neutralizing monoclonal antibodies in an immunoassay for efficiently detecting neutralizing antibodies against foot-and-mouth disease virus serotype O
作者:Cao, Yimei;Li, Fengjuan;Xing, Xiangchuan;Zhang, Huiyan;Zhao, Qiongqiong;Sun, Pu;Fu, Yuanfang;Li, Pinghua;Ma, Xueqing;Zhang, Jing;Zhao, Zhixun;Yuan, Hong;Wang, Jian;Wang, Tao;Bao, Huifang;Bai, Xingwen;Li, Dong;Zhang, Qiang;Li, Kun;Lu, Zengjun
关键词:foot-and-mouth disease virus; porcine broadly neutralizing monoclonal antibody; competitive ELISA; neutralizing antibody; serotype O
-
The SUMO-conjugating enzyme OsSCE1a from wild rice regulates the functional stay-green trait in rice
作者:Yuan, Xuzhao;Luan, Yanfang;Liu, Dong;Wang, Jian;Peng, Jianxiang;Zhao, Jinlei;Li, Lupeng;Su, Jingjing;Xiao, Yang;Li, Yuanjie;Ma, Xin;Zhu, Xiaoyang;Tan, Lubin;Liu, Fengxia;Sun, Hongying;Gu, Ping;Xu, Ran;Zhu, Zuofeng;Sun, Chuanqing;Fu, Yongcai;Zhang, Kun;Yuan, Xuzhao;Xu, Ran;Liu, Dong;Wang, Jian;Peng, Jianxiang;Li, Yuanjie;Sun, Chuanqing;Zhang, Peijiang
关键词:functional stay-green; growth duration;
OsSCE1a ; SUMOylation; wild rice