A New Chinese Named Entity Recognition Method for Pig Disease Domain Based on Lexicon-Enhanced BERT and Contrastive Learning
文献类型: 外文期刊
第一作者: Peng, Cheng
作者: Peng, Cheng;Wang, Xiajun;Li, Qifeng;Yu, Qinyang;Jiang, Ruixiang;Ma, Weihong;Wu, Wenbiao;Meng, Rui;Li, Haiyan;Huai, Heju;Wang, Shuyan;Peng, Cheng;Li, Qifeng;Yu, Qinyang;Jiang, Ruixiang;Ma, Weihong;Wu, Wenbiao;Meng, Rui;Li, Haiyan;Huai, Heju;Wang, Shuyan;Peng, Cheng;Li, Qifeng;Yu, Qinyang;Jiang, Ruixiang;Ma, Weihong;Wu, Wenbiao;Meng, Rui;Li, Haiyan;Huai, Heju;Wang, Shuyan;Wang, Xiajun;He, Longjuan
作者机构:
关键词: pig disease; Chinese named entity recognition; lexicon-enhanced BERT; contrastive learning; small sample
期刊名称:APPLIED SCIENCES-BASEL ( 影响因子:2.5; 五年影响因子:2.7 )
ISSN:
年卷期: 2024 年 14 卷 16 期
页码:
收录情况: SCI
摘要: Featured Application Our work provides reliable technical support for the information extraction of pig diseases in Chinese . It can be applied to other domain - specific fields, thereby facilitating seamless adaptation for named entity identification across diverse contexts .Abstract Named Entity Recognition (NER) is a fundamental and pivotal stage in the development of various knowledge-based support systems, including knowledge retrieval and question-answering systems. In the domain of pig diseases, Chinese NER models encounter several challenges, such as the scarcity of annotated data, domain-specific vocabulary, diverse entity categories, and ambiguous entity boundaries. To address these challenges, we propose PDCNER, a Pig Disease Chinese Named Entity Recognition method leveraging lexicon-enhanced BERT and contrastive learning. Firstly, we construct a domain-specific lexicon and pre-train word embeddings in the pig disease domain. Secondly, we integrate lexicon information of pig diseases into the lower layers of BERT using a Lexicon Adapter layer, which employs char-word pair sequences. Thirdly, to enhance feature representation, we propose a lexicon-enhanced contrastive loss layer on top of BERT. Finally, a Conditional Random Field (CRF) layer is employed as the model's decoder. Experimental results show that our proposed model demonstrates superior performance over several mainstream models, achieving a precision of 87.76%, a recall of 86.97%, and an F1-score of 87.36%. The proposed model outperforms BERT-BiLSTM-CRF and LEBERT by 14.05% and 6.8%, respectively, with only 10% of the samples available, showcasing its robustness in data scarcity scenarios. Furthermore, the model exhibits generalizability across publicly available datasets. Our work provides reliable technical support for the information extraction of pig diseases in Chinese and can be easily extended to other domains, thereby facilitating seamless adaptation for named entity identification across diverse contexts.
分类号:
- 相关文献
作者其他论文 更多>>
-
Effect of combined nitrogen and phosphorus fertilization on summer maize yield and soil fertility in coastal saline-alkali land
作者:Ma, Changjian;Wang, Yue;Liu, Lining;Wang, Xuejun;Sun, Zeqiang;Li, Yan;Ma, Changjian;Wang, Yue;Wu, Wenbiao;Hou, Peng;Li, Bowen;Yuan, Huabin
关键词:Grain yield; Biomass yield; Fertilizer physiological efficiency; Coastal saline-alkali land
-
Chromosome-level and haplotype-resolved genome assembly of Bougainvillea glabra
作者:Lan, Lan;Li, Haiyan;Xu, Shisong;Xu, Yueting;Leng, Qingyun;Yin, Junmei;Niu, Junhai;Lan, Lan;Wu, Zhiqiang;Lan, Lan;Li, Haiyan;Xu, Shisong;Leng, Qingyun;Yin, Junmei;Niu, Junhai;Xu, Yueting;Zhang, Linbi;Wu, Linqiao;Yin, Junmei
关键词:
-
The analysis of the genetic loci affecting phenotypic plasticity of soybean isoflavone content by dQTG.seq model
作者:Yang, Zhenhong;Zhan, Yuhang;Zhu, Yina;Zhu, Hanhan;Li, Haiyan;Teng, Weili;Li, Yongguang;Zhao, Xue;Wang, Yuhe;Han, Yingpeng;Zhou, Changjun;Yuan, Ming;Liu, Miao
关键词:
-
A Novel Quantification Method for Gene-Edited Animal Detection Based on ddPCR
作者:Wang, Kaili;Lan, Hangzhen;Ji, Yi;Peng, Cheng;Wang, Xiaofu;Yang, Lei;Xu, Junfeng;Chen, Xiaoyun;Ji, Yi;Peng, Cheng;Wang, Xiaofu;Yang, Lei;Xu, Junfeng;Chen, Xiaoyun
关键词:gene editing; MSTN; nucleic acid detection; ddPCR
-
DASNet a dual branch multi level attention sheep counting network
作者:Chen, Yini;Gao, Ronghua;Li, Qifeng;Wang, Rong;Ding, Luyu;Li, Xuwen;Chen, Yini;Zhao, Hongtao;Li, Xuwen
关键词:
-
An Ultra-Sensitive Quarantine Pathogen On-Site Detection Based on a One-Pot Asymmetric Recombinase Polymerase Amplification and MNAzyme-Assisted Target Recycling Biosensor (OAR-MNA)
作者:Yang, Lei;Peng, Cheng;Chen, Xiaoyun;Xu, Xiaoli;Wei, Wei;Wang, Xiaofu;Xu, Junfeng;Chen, Guanwei;Yan, Jiatong;Sun, Meihao;Bo, Yongming;Fang, Xiaoxue;Wu, Jian
关键词:asymmetric RPA; bacterial fruit blotch; cucumber green mottle mosaic virus; MNAzyme; one-pot; ssDNA
-
Development of an efficient extraction and enrichment method for total flavonoids compounds from Erigeron breviscapus using ultrasound-assisted extraction and macroporous resin adsorption
作者:Li, Yang;Li, Qifeng;Zhang, Jiayu;Xiong, Ranhua;Huang, Chaobo;Zhao, Wei;Yang, Anquan;Xie, Min
关键词:Adsorption; antioxidant; Erigeron breviscapus; extraction; macroporous resin; total flavonoids