农科机构知识库联盟

Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

文献类型：外文期刊

第一作者： Liu, Yang

作者： Liu, Yang;Zhou, Ying;He, Ziming;Yang, Yusen;Li, Jingchen;Han, Qingcen

作者机构：

关键词： Multi-objective reinforcement learning; Sample efficiency; Reinforcement learning

期刊名称：KNOWLEDGE-BASED SYSTEMS （影响因子：7.6；五年影响因子：7.6 ）

ISSN： 0950-7051

年卷期： 2024 年 304 卷

页码：

收录情况： SCI

摘要： Multi-objective reinforcement learning (MORL) addresses the challenge of optimizing policies in environments with multiple conflicting objectives. Traditional approaches often rely on scalar utility functions, which require predefined preference weights, limiting their adaptability and efficiency. To overcome this, we propose the Dynamic Preference Inference Network (DPIN), a novel method designed to enhance sample efficiency by dynamically estimating the trajectory decision preference of the agent. DPIN leverages a neural network to predict the most favorable preference distribution for each trajectory, enabling more effective policy updates and improving overall performance in complex MORL tasks. Extensive experiments in various benchmark environments demonstrate that DPIN significantly outperforms existing state-of-the-art methods, achieving higher scalarized returns and hypervolume. Our findings highlight DPIN's ability to adapt to varying preferences, reduce sample complexity, and provide robust solutions in multi-objective settings.

分类号：

相关文献

作者其他论文更多>>

Gene expression profiles of Chinese medaka ( Oryzias sinensis ) primary hepatocytes in response to estrone (E1 ), 17 i3-estradiol (E2 ) and estriol (E3 )

作者：Wang, Yue;Lu, Junhui;Xie, Zhongtang;Huai, Narma;Zhang, Kailun;Zhou, Ying;Reze, Yilihamu;Li, Xiqing;Zhang, Zhaobin;Zhu, Hua

关键词：Oryzias sinensis; Primary hepatocytes; Natural estrogens; Vitellogenin; toxicogenomics
Structure dependence of the enhanced sulfur tolerance of core-shell CoMn oxides for benzene oxidation: Discrepant sulfur species and less affected reactant activation

作者：Liu, Yang;Cheng, Lin;Zhang, Di;Zhan, Jingjing;Shan, Jiajia;Zhou, Hao;Yi, Xianliang;Li, Zhonghong;Shen, Xudong

关键词：VOCs; Catalytic oxidation; Manganese oxide; Cobalt decoration; Sulfur resistance
Triterpenoid saponins in tea plants: A spatial and metabolic analysis using UPLC-QTOFMS, molecular networking, and DESI-MSI

作者：Du, Zhenghua;Zhou, Ying;Guo, Shuang;Dong, Yonghui;Yu, Xiaomin;Du, Zhenghua;Zhou, Ying;Guo, Shuang;Dong, Yonghui;Yu, Xiaomin;Xu, Yongquan

关键词：Camellia sinensis; Triterpenoid saponins; MS/MS fragmentation; Molecular networking; Spatial metabolomics; Mass spectrometry imaging
Increasing Planting Density with Reduced Topdressing Nitrogen Inputs Increased Nitrogen Use Efficiency and Improved Grain Quality While Maintaining Yields in Weak-Gluten Wheat

作者：Zhou, Wenyin;Yan, Suhui;Rehman, Abdul;Li, Haojie;Zhang, Shiya;Yong, Yudong;Liu, Yang;Xiao, Longfei;Li, Wenyang;Zheng, Chengyan

关键词：weak-gluten wheat; nitrogen tracing; planting density; grain yield; nitrogen agronomic efficiency; quality
Population Genetics, Demographic History, and Potential Distributions of the New Important Pests Monolepta signata (Coleoptera: Chrysomelidae) on Corn in China

作者：Liu, Yang;Ge, Yacong;Wang, Liming;Dong, Jingao;Wang, Yuyu;Wang, Zhenying

关键词：Monolepta signata; genetic diversity; phylogeography; potential suitability areas
Attraction and aversion of noctuid moths to fermented food sources coordinated by olfactory receptors from distinct gene families

作者：Hou, Xiao-Qing;Zhao, Hanbo;Wang, Guirong;Liu, Yang;Wang, Guirong;Zhang, Dan-Dan;Lofstedt, Christer;Hou, Xiao-Qing;Lofstedt, Christer

关键词：Isoamyl alcohol; Octanoic acid; Odorant receptor; Ionotropic receptor; Yeast; Functional conservation
A Coupling Coordination Assessment of the Land-Water-Food Nexus in China

作者：Liu, Cong;Wei, Jianmei;Lu, Hui;Li, Qing;Jiang, Wenlai;Liu, Yang

关键词：land-water-food nexus; coupling coordination development; influencing factors; future trend; China

Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

作者其他论文 更多>>

Gene expression profiles of Chinese medaka ( Oryzias sinensis ) primary hepatocytes in response to estrone (E1 ), 17 i3-estradiol (E2 ) and estriol (E3 )

Structure dependence of the enhanced sulfur tolerance of core-shell CoMn oxides for benzene oxidation: Discrepant sulfur species and less affected reactant activation

Triterpenoid saponins in tea plants: A spatial and metabolic analysis using UPLC-QTOFMS, molecular networking, and DESI-MSI

Increasing Planting Density with Reduced Topdressing Nitrogen Inputs Increased Nitrogen Use Efficiency and Improved Grain Quality While Maintaining Yields in Weak-Gluten Wheat

Population Genetics, Demographic History, and Potential Distributions of the New Important Pests Monolepta signata (Coleoptera: Chrysomelidae) on Corn in China

Attraction and aversion of noctuid moths to fermented food sources coordinated by olfactory receptors from distinct gene families

A Coupling Coordination Assessment of the Land-Water-Food Nexus in China

作者其他论文更多>>