A Densely Connected GRU Neural Network Based on Coattention Mechanism for Chinese Rice-Related Question Similarity Matching

文献类型: 外文期刊

第一作者: Wang, Haoriqin

作者: Wang, Haoriqin;Xu, Tongyu;Wang, Haoriqin;Wang, Haoriqin;Zhu, Huaji;Wu, Huarui;Wang, Xiaomin;Han, Xiao;Wang, Haoriqin;Zhu, Huaji;Wu, Huarui;Wang, Xiaomin;Han, Xiao

作者机构: Shenyang Agr Univ, Coll Informat & Elect Engn, Shenyang 110866, Peoples R China;Inner Mongolia Univ Nationalities, Coll Comp Sci & Technol, Tongliao 028043, Peoples R China;Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China;Beijing Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China

关键词: rice-related question similarity matching; natural language processing; densely connected GRU; coattention mechanism; question-and-answering communities

期刊名称:AGRONOMY-BASEL ( 2020影响因子:3.417; 五年影响因子:3.64 )

ISSN:

年卷期: 2021 年 11 卷 7 期

页码:

收录情况: SCI

摘要: In the question-and-answer (Q&A) communities of the "China Agricultural Technology Extension Information Platform", thousands of rice-related Chinese questions are newly added every day. The rapid detection of the same semantic question is the key to the success of a rice-related intelligent Q&A system. To allow the fast and automatic detection of the same semantic rice-related questions, we propose a new method based on the Coattention-DenseGRU (Gated Recurrent Unit). According to the rice-related question characteristics, we applied word2vec with the TF-IDF (Term Frequency-Inverse Document Frequency) method to process and analyze the text data and compare it with the Word2vec, GloVe, and TF-IDF methods. Combined with the agricultural word segmentation dictionary, we applied Word2vec with the TF-IDF method, effectively solving the problem of high dimension and sparse data in the rice-related text. Each network layer employed the connection information of features and all previous recursive layers' hidden features. To alleviate the problem of feature vector size increasing due to dense splicing, an autoencoder was used after dense concatenation. The experimental results show that rice-related question similarity matching based on Coattention-DenseGRU can improve the utilization of text features, reduce the loss of features, and achieve fast and accurate similarity matching of the rice-related question dataset. The precision and F1 values of the proposed model were 96.3% and 96.9%, respectively. Compared with seven other kinds of question similarity matching models, we present a new state-of-the-art method with our rice-related question dataset.

分类号:

  • 相关文献
作者其他论文 更多>>