Layered Query Retrieval: An Adaptive Framework for Retrieval-Augmented Generation in Complex Question Answering for Large Language Models

文献类型: 外文期刊

第一作者: Huang, Jie

作者: Huang, Jie;Wang, Mo;Cui, Yunpeng;Liu, Juan;Chen, Li;Wang, Ting;Li, Huan;Wu, Jinming;Huang, Jie;Wang, Mo;Cui, Yunpeng;Liu, Juan;Chen, Li;Wang, Ting;Li, Huan;Wu, Jinming

作者机构:

关键词: retrieval-augmented generation; question answer; adaptive retrieval; complex classification

期刊名称:APPLIED SCIENCES-BASEL ( 影响因子:2.5; 五年影响因子:2.7 )

ISSN:

年卷期: 2024 年 14 卷 23 期

页码:

收录情况: SCI

摘要: Featured Application This work is being used to develop the QA application of agricultural planting and livestock breeding technologies.Abstract Retrieval-augmented generation (RAG) addresses the problem of knowledge cutoff and overcomes the inherent limitations of pre-trained language models by retrieving relevant information in real time. However, challenges related to efficiency and accuracy persist in current RAG strategies. A key issue is how to select appropriate methods for user queries of varying complexity dynamically. This study introduces a novel adaptive retrieval-augmented generation framework termed Layered Query Retrieval (LQR). The LQR framework focuses on query complexity classification, retrieval strategies, and relevance analysis, utilizing a custom-built training dataset to train smaller models that aid the large language model (LLM) in efficiently retrieving relevant information. A central technique in LQR is a semantic rule-based approach to distinguish between different levels of multi-hop queries. The process begins by parsing the user's query for keywords, followed by a keyword-based document retrieval. Subsequently, we employ a natural language inference (NLI) model to assess whether the retrieved document is relevant to the query. We validated our approach on multiple single-hop and multi-hop datasets, demonstrating significant improvements in both accuracy and efficiency compared to existing single-step, multi-step, and adaptive methods. Our method exhibits high accuracy and efficiency, particularly on the HotpotQA dataset, where it outperforms the Adaptive-RAG method by improving accuracy by 9.4% and the F1 score by 16.14%. The proposed approach carefully balances retrieval efficiency with the accuracy of the LLM's responses.

分类号:

  • 相关文献
作者其他论文 更多>>