A survey of efficient fine-tuning methods for Vision-Language Models - Prompt and Adapter
Document type: Foreign-language journal
First author: Xing, Jialu
Authors: Xing, Jialu; Liu, Jianping; Sun, Lulu; Chen, Xi; Gu, Xunxun; Wang, Yingfei; Wang, Jian
Author affiliations:
Keywords: Vision-language; Computer vision; Efficient fine-tuning; Pre-training model; Prompt; Adapter
Journal: COMPUTERS & GRAPHICS-UK (Impact factor: 2.5; 5-year impact factor: 2.2)
ISSN: 0097-8493
Year/volume/issue: 2024, Vol. 119
Pages:
Indexed in: SCI
Abstract: Vision-Language Models (VLMs) are a popular research field at the intersection of computer vision and natural language processing (NLP). With the emergence of transformer networks and massive web data, numerous large-scale VLMs, or Vision-Language Pre-training Models (VLPMs), have achieved state-of-the-art results in many tasks, such as retrieval (CLIP) and generation (DALL-E). Although large models have shown impressive results, the cost of retraining and full fine-tuning is prohibitive for general researchers. In recent years, efficient fine-tuning (EFT), a very low-cost tuning method, has greatly alleviated this problem and driven the development of a new fine-tuning paradigm. Since Prompt and Adapter are the most widely used in the vision-language field, this review focuses on analysing the progress in applying these two methods. First, we review the VLM research paradigm based on the differences in pre-training and fine-tuning methods. Next, we categorize Prompt into 3 types (7 subtypes) of usage patterns based on the modal information involved, and Adapter into 2 types of usage patterns based on whether it plays a role in modal fusion; we further discuss them in vision and vision-language tasks. Finally, we discuss the stability and social ethics of EFT and propose possible future research directions.
Classification number:
- Related literature
Other papers by the authors
-
An improved 3D-SwinT-CNN network to evaluate the fermentation degree of black tea
Authors: Zhu, Fengle; Wang, Jian; Zhang, Yuqian; Zhao, Zhangfeng; Shi, Jiang; He, Mengzhu
Keywords: Black tea fermentation; Hyperspectral imaging; 3D-SwinT-CNN; 3D convolutional neural networks; Swin transformer
-
Comprehensive analysis of Dendrobium catenatum HSP20 family genes and functional characterization of DcHSP20-12 in response to temperature stress
Authors: Wang, Peng; Liu, Wen; Wang, Jian; Zhou, Yang; Li, Yuxin; Zhao, Xi; Hu, Yanping; Zhang, Tingting
Keywords: Dendrobium catenatum; HSP20 gene family; High temperature stress; Low temperature stress
-
ESG-YOLO: A Method for Detecting Male Tassels and Assessing Density of Maize in the Field
Authors: Wu, Wendi; Zhang, Yuhang; Zhang, Jianhua; Zhou, Guomin; Wang, Jian; Hu, Lin
Keywords: maize; tassel; target detection; attention mechanism; SPD-Conv; ESG-YOLO; density assessment
-
Dissipation, accumulation, distribution and risk assessment of fungicides in greenhouse and open-field cowpeas
Authors: Cui, Kai; Guan, Shuai; Liang, Jingyun; Fang, Liping; Ding, Ruiyan; Li, Teng; Dong, Zhan; Wang, Jian; Ma, Guoping; Zhao, Shengying; Hao, Qian
Keywords: Fungicides; Cowpeas; Distribution; Accumulation; Risk assessment
-
GACDNet: Mapping winter wheat by generative adversarial cross-domain networks with transformer integration for zero-sample extraction
Authors: Wang, Chunyang; Gu, Yanan; Xu, Zhaozhao; Li, Kai; Zhao, Zongze; Yang, Wei; Wang, Xinbing; Wang, Jian
Keywords: Domain generalization; Contrast learning; Cross-domain; Image classification; Winter wheat
-
Probabilistic graph model and neural network perspective of click models for web search
Authors: Liu, Jianping; Wang, Yingfei; Wang, Meng; Chu, Xintao; Wang, Jian
Keywords: Click model; Click prediction; Document ranking; Implicit feedback; Web search
-
Glycogen variations and glycometabolism during the gametogenesis cycle of Jinjiang oyster Crassostrea (Magallana) ariakensis
Authors: Li, Zhuanzhuan; Zhao, Liyan; Wang, Yan; Chen, Xi; Ma, Peizhen; Liu, Zhihong; Sun, Xiujun; Zhou, Liqing; Wu, Biao; Ren, Jianfeng; Dou, Yu
Keywords: Gametogenesis; Glycometabolism; Crassostrea (Magallana) ariakensis; Flavor