Institutional Repository of the Beijing Academy of Agriculture and Forestry Sciences

Recognition of the Agricultural Named Entities With Multifeature Fusion Based on ALBERT

Document type: Foreign-language journal article

Authors: Zhao, Pengfei ¹; Wang, Wei ¹; Liu, Hai ¹; Han, Mo ¹

Affiliation: 1. Beijing Acad Agr & Forestry Sci, Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China

Keywords: Feature extraction; Semantics; Task analysis; Hidden Markov models; Agriculture; Diseases; Convolutional neural networks; named entity recognition; self-attention; long short-term memory; conditional random field

Journal: IEEE ACCESS (impact factor: 3.476; five-year impact factor: 3.758)

ISSN: 2169-3536

Year and volume: 2022, Vol. 10

Pages:

Indexed in: SCI

Abstract: A high-quality agricultural named entity recognition (NER) model can effectively support agricultural information extraction, semantic retrieval, and related tasks. However, existing models ignore the latent features of Chinese characters and therefore lack character-internal semantics. Moreover, agricultural text sequences are long, so models struggle to capture long-distance dependencies. To address these problems, the paper proposes RSA-CANER, an agricultural NER model with a self-attention mechanism that incorporates the latent features of Chinese characters. First, the model takes character features and the latent features of Chinese characters as input to enrich semantic information: character features are obtained from the pre-trained ALBERT model, radical features are extracted with a convolutional neural network (CNN), and stroke features are extracted with a bidirectional long short-term memory network (BiLSTM). Then a BiLSTM produces the sequence feature matrix, and a self-attention mechanism further strengthens the model's ability to capture long-distance dependencies. Finally, a conditional random field (CRF) generates the globally optimal sequence. The model obtains an F-score of 95.56%. The experimental results show that the model learns semantic information at the fine-grained levels of radicals and strokes, enriches the vector representation of target words, achieves higher recognition precision than the other models compared, and generalizes better.
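The final step of the abstract, generating a globally optimal sequence with a CRF, is commonly done with Viterbi decoding. The following is a minimal sketch of that general technique, not the authors' implementation: the tag set (`B-CROP`, `I-CROP`, `O`), the emission scores, and the transition scores are all made-up values for illustration, and the function name `viterbi_decode` is hypothetical.

```python
from typing import Dict, List, Tuple


def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[Tuple[str, str], float],
    tags: List[str],
) -> Tuple[List[str], float]:
    """Return the highest-scoring tag sequence and its total score.

    emissions[t][tag] is the score of `tag` for the character at position t
    (in a BiLSTM-CRF these would come from the layers below the CRF).
    transitions[(prev, cur)] is the score of moving from `prev` to `cur`;
    pairs that are not listed default to 0.0.
    """
    if not emissions:
        return [], 0.0

    # best[t][tag]: best score of any path that ends in `tag` at position t.
    best = [{tag: emissions[0][tag] for tag in tags}]
    backptr: List[Dict[str, str]] = []

    for t in range(1, len(emissions)):
        scores: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for cur in tags:
            # Pick the predecessor that maximizes path score plus transition.
            prev_best = max(
                tags,
                key=lambda prev: best[-1][prev] + transitions.get((prev, cur), 0.0),
            )
            scores[cur] = (
                best[-1][prev_best]
                + transitions.get((prev_best, cur), 0.0)
                + emissions[t][cur]
            )
            pointers[cur] = prev_best
        best.append(scores)
        backptr.append(pointers)

    # Follow the back-pointers from the best final tag to recover the path.
    last = max(tags, key=lambda tag: best[-1][tag])
    path = [last]
    for pointers in reversed(backptr):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, best[-1][last]


# Illustrative (made-up) scores for a three-character input.
tags = ["B-CROP", "I-CROP", "O"]
transitions = {("O", "I-CROP"): -10.0, ("B-CROP", "I-CROP"): 2.0}
emissions = [
    {"B-CROP": 1.0, "I-CROP": 0.0, "O": 2.0},
    {"B-CROP": 0.0, "I-CROP": 3.0, "O": 1.0},
    {"B-CROP": 0.0, "I-CROP": 0.0, "O": 2.0},
]

path, score = viterbi_decode(emissions, transitions, tags)
print(path, score)  # ['B-CROP', 'I-CROP', 'O'] 8.0
```

Choosing the best tag at each position independently would give `O, I-CROP, O`, which starts an entity with an inside tag. Because the decoder scores whole sequences, the penalty on the `O` to `I-CROP` transition steers it to `B-CROP, I-CROP, O` instead.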

Related literature

Other papers by the authors

Research on the influence factors of sustainable development of plateau characteristic agriculture based on DEMATEL and AISM combined model

Authors: Wang, Wei; Liu, Hai; Zhao, Pengfei; Han, Mo

Keywords:

Bacteria Affect the Distribution of Soil-Dissolved Organic Matter on the Slope: A Long-Term Experiment in Black Soil Erosion

Authors: Cai, Shanshan; Wang, Wei; Sun, Lei; Li, Yumei; Sun, Zhiling; Gao, Zhongchao; Zhang, Jiuming; Li, Yan; Wei, Dan

Keywords: dissolved organic matter; black soil; slope; bacteria; fluorescence spectrum

Whole-genome resequencing of Russian sturgeon (Acipenser gueldenstaedtii) reveals selection signatures associated with caviar color

Authors: Song, Hailiang; Dong, Tian; Wang, Wei; Yan, Xiaoyu; Hu, Hongxia; Jiang, Boyun; Xu, Shijian

Keywords: Acipenser gueldenstaedtii; Whole-genome resequencing; Selection signatures; Caviar color

Adsorption of typical dyes in water by sponge based covalent organic frameworks: Pore size and mechanism

Authors: Wang, Shiyi; Guan, Tong; Zhu, Xingyi; Zhou, Shuangxi; Wang, Wei; Vakili, Mohammadtaghi; Gong, Wenwen

Keywords: Covalent organic frameworks; Pore size; Adsorption mechanism; Methyl orange; Reusability

Cost-effective genomic prediction of critical economic traits in sturgeons through low-coverage sequencing

Authors: Song, Hailiang; Dong, Tian; Wang, Wei; Yan, Xiaoyu; Geng, Chenfan; Bai, Song; Hu, Hongxia; Jiang, Boyun; Xu, Shijian

Keywords: Sturgeon; Low-coverage whole-genome sequencing; Imputation; Genomic prediction; Linkage disequilibrium pruning; Incremental feature selection

Straw mulching alters the composition and loss of dissolved organic matter in farmland surface runoff by inhibiting the fragmentation of soil small macroaggregates

Authors: Cai, Shanshan; Wang, Jingkuan; Sun, Lei; Wang, Wei; Li, Yumei; Zhang, Jiuming; Li, Yan; Ding, Jianli; Jin, Liang; Wei, Dan

Keywords: dissolved organic matter; black soil; surface runoff; aggregates; fluorescence spectrum

Lake Sinai virus is a diverse, globally distributed but not emerging multi-strain honeybee virus

Authors: Hou, Chunsheng; Chen, Chenxiao; Liang, Hao; Zhao, Hongxia; Zhao, Pengfei; Deng, Shuai; Li, Beibei; Yang, Dahe; Yang, Sa; Wilfert, Lena

Keywords: emerging disease; honeybee; varroa; vector; virus
