Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation
文献类型: 外文期刊
作者: Liu, Yang 1 ; Zhou, Ying 2 ; He, Ziming 2 ; Yang, Yusen 3 ; Han, Qingcen 4 ; Li, Jingchen 3 ;
作者机构: 1.Zhejiang Univ, Coll Opt Sci & Engn, Hangzhou 310058, Zhejiang, Peoples R China
2.Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
3.Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100079, Peoples R China
4.Northwestern Polytech Univ, Elect Informat Coll, Xian 710072, Shaanxi, Peoples R China
关键词: Multi-objective reinforcement learning; Sample efficiency; Reinforcement learning
期刊名称:KNOWLEDGE-BASED SYSTEMS ( 影响因子:7.6; 五年影响因子:7.6 )
ISSN: 0950-7051
年卷期: 2024 年 304 卷
页码:
收录情况: SCI
摘要: Multi-objective reinforcement learning (MORL) addresses the challenge of optimizing policies in environments with multiple conflicting objectives. Traditional approaches often rely on scalar utility functions, which require predefined preference weights, limiting their adaptability and efficiency. To overcome this, we propose the Dynamic Preference Inference Network (DPIN), a novel method designed to enhance sample efficiency by dynamically estimating the trajectory decision preference of the agent. DPIN leverages a neural network to predict the most favorable preference distribution for each trajectory, enabling more effective policy updates and improving overall performance in complex MORL tasks. Extensive experiments in various benchmark environments demonstrate that DPIN significantly outperforms existing state-of-the-art methods, achieving higher scalarized returns and hypervolume. Our findings highlight DPIN's ability to adapt to varying preferences, reduce sample complexity, and provide robust solutions in multi-objective settings.
- 相关文献
作者其他论文 更多>>
-
Gene expression profiles of Chinese medaka ( Oryzias sinensis ) primary hepatocytes in response to estrone (E1 ), 17 i3-estradiol (E2 ) and estriol (E3 )
作者:Wang, Yue;Lu, Junhui;Xie, Zhongtang;Huai, Narma;Zhang, Kailun;Zhou, Ying;Reze, Yilihamu;Li, Xiqing;Zhang, Zhaobin;Zhu, Hua
关键词:Oryzias sinensis; Primary hepatocytes; Natural estrogens; Vitellogenin; toxicogenomics
-
Enhancing potato leaf protein content, carbon-based constituents, and leaf area index monitoring using radiative transfer model and deep learning
作者:Feng, Haikuan;Fan, Yiguang;Ma, Yanpeng;Liu, Yang;Chen, Riqiang;Bian, Mingbo;Fan, Jiejie;Yang, Guijun;Zhao, Chunjiang;Feng, Haikuan;Zhao, Chunjiang;Yue, Jibo;Fu, Yuanyuan;Leng, Mengdie;Jin, Xiuliang;Zhao, Yu
关键词:Potato; Deep learning; Radiative transfer model; Transfer learning; Leaf protein content
-
Segmentation and Fractional Coverage Estimation of Soil, Illuminated Vegetation, and Shaded Vegetation in Corn Canopy Images Using CCSNet and UAV Remote Sensing
作者:Zhang, Shanxin;Yue, Jibo;Shu, Meiyan;Zhang, Shanxin;Wang, Xiaoyan;Feng, Haikuan;Feng, Haikuan;Liu, Yang
关键词:segmentation; digital camera; corn; deep learning
-
A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning
作者:Hu, Chunyang;Gu, Qiong;Wu, Zhao;Ning, Bin;Li, Jingchen;Yang, Yusen
关键词:Unmanned aerial vehicle; Large-scale multi-agent systems; Multi-agent reinforcement learning
-
An Integrated Rapid Detection of Botryosphaeriaceae Species in Grapevine Based on Recombinase Polymerase Amplification, CRISPR/Cas12a, and Lateral Flow Dipstick
作者:Wang, Baoyu;Fan, Anran;Liu, Mei;Zhou, Ying;Zhang, Wei;Yan, Jiye
关键词:Botryosphaeria dieback; field detection; LFD; RPA-CRISPR/Cas12a
-
Estimation of potato above-ground biomass based on the VGC-AGB model and deep learning
作者:Feng, Haikuan;Fan, Yiguang;Bian, Mingbo;Liu, Yang;Chen, Riqiang;Ma, Yanpeng;Fan, Jiejie;Yang, Guijun;Zhao, Chunjiang;Yue, Jibo;Feng, Haikuan;Zhao, Chunjiang
关键词:Hyperspectral; Above-ground biomass; Potato; Deep learning; Leaf area index
-
Utilizing UAV-based hyperspectral remote sensing combined with various agronomic traits to monitor potato growth and estimate yield
作者:Liu, Yang;Feng, Haikuan;Fan, Yiguang;Fan, Jiejie;Ma, Yanpeng;Chen, Riqiang;Bian, Mingbo;Yang, Guijun;Liu, Yang;Yue, Jibo;Yang, Fuqin
关键词:Crop growth monitoring; Potato yield; Crop traits; UAV; Hyperspectral



