北京市农林科学院机构知识库

Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

收藏
分享
全文链接

文献类型：外文期刊

作者： Liu, Yang ¹ ; Zhou, Ying ² ; He, Ziming ² ; Yang, Yusen ³ ; Han, Qingcen ⁴ ; Li, Jingchen ³ ;

作者机构： 1.Zhejiang Univ, Coll Opt Sci & Engn, Hangzhou 310058, Zhejiang, Peoples R China

2.Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China

3.Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100079, Peoples R China

4.Northwestern Polytech Univ, Elect Informat Coll, Xian 710072, Shaanxi, Peoples R China

关键词： Multi-objective reinforcement learning; Sample efficiency; Reinforcement learning

期刊名称：KNOWLEDGE-BASED SYSTEMS （影响因子：7.6；五年影响因子：7.6 ）

ISSN： 0950-7051

年卷期： 2024 年 304 卷

页码：

收录情况： SCI

摘要： Multi-objective reinforcement learning (MORL) addresses the challenge of optimizing policies in environments with multiple conflicting objectives. Traditional approaches often rely on scalar utility functions, which require predefined preference weights, limiting their adaptability and efficiency. To overcome this, we propose the Dynamic Preference Inference Network (DPIN), a novel method designed to enhance sample efficiency by dynamically estimating the trajectory decision preference of the agent. DPIN leverages a neural network to predict the most favorable preference distribution for each trajectory, enabling more effective policy updates and improving overall performance in complex MORL tasks. Extensive experiments in various benchmark environments demonstrate that DPIN significantly outperforms existing state-of-the-art methods, achieving higher scalarized returns and hypervolume. Our findings highlight DPIN's ability to adapt to varying preferences, reduce sample complexity, and provide robust solutions in multi-objective settings.

相关文献

作者其他论文更多>>

Gene expression profiles of Chinese medaka ( Oryzias sinensis ) primary hepatocytes in response to estrone (E1 ), 17 i3-estradiol (E2 ) and estriol (E3 )

作者：Wang, Yue;Lu, Junhui;Xie, Zhongtang;Huai, Narma;Zhang, Kailun;Zhou, Ying;Reze, Yilihamu;Li, Xiqing;Zhang, Zhaobin;Zhu, Hua

关键词：Oryzias sinensis; Primary hepatocytes; Natural estrogens; Vitellogenin; toxicogenomics
Enhancing potato leaf protein content, carbon-based constituents, and leaf area index monitoring using radiative transfer model and deep learning

作者：Feng, Haikuan;Fan, Yiguang;Ma, Yanpeng;Liu, Yang;Chen, Riqiang;Bian, Mingbo;Fan, Jiejie;Yang, Guijun;Zhao, Chunjiang;Feng, Haikuan;Zhao, Chunjiang;Yue, Jibo;Fu, Yuanyuan;Leng, Mengdie;Jin, Xiuliang;Zhao, Yu

关键词：Potato; Deep learning; Radiative transfer model; Transfer learning; Leaf protein content
Segmentation and Fractional Coverage Estimation of Soil, Illuminated Vegetation, and Shaded Vegetation in Corn Canopy Images Using CCSNet and UAV Remote Sensing

作者：Zhang, Shanxin;Yue, Jibo;Shu, Meiyan;Zhang, Shanxin;Wang, Xiaoyan;Feng, Haikuan;Feng, Haikuan;Liu, Yang

关键词：segmentation; digital camera; corn; deep learning
A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning

作者：Hu, Chunyang;Gu, Qiong;Wu, Zhao;Ning, Bin;Li, Jingchen;Yang, Yusen

关键词：Unmanned aerial vehicle; Large-scale multi-agent systems; Multi-agent reinforcement learning
An Integrated Rapid Detection of Botryosphaeriaceae Species in Grapevine Based on Recombinase Polymerase Amplification, CRISPR/Cas12a, and Lateral Flow Dipstick

作者：Wang, Baoyu;Fan, Anran;Liu, Mei;Zhou, Ying;Zhang, Wei;Yan, Jiye

关键词：Botryosphaeria dieback; field detection; LFD; RPA-CRISPR/Cas12a
Estimation of potato above-ground biomass based on the VGC-AGB model and deep learning

作者：Feng, Haikuan;Fan, Yiguang;Bian, Mingbo;Liu, Yang;Chen, Riqiang;Ma, Yanpeng;Fan, Jiejie;Yang, Guijun;Zhao, Chunjiang;Yue, Jibo;Feng, Haikuan;Zhao, Chunjiang

关键词：Hyperspectral; Above-ground biomass; Potato; Deep learning; Leaf area index
Utilizing UAV-based hyperspectral remote sensing combined with various agronomic traits to monitor potato growth and estimate yield

作者：Liu, Yang;Feng, Haikuan;Fan, Yiguang;Fan, Jiejie;Ma, Yanpeng;Chen, Riqiang;Bian, Mingbo;Yang, Guijun;Liu, Yang;Yue, Jibo;Yang, Fuqin

关键词：Crop growth monitoring; Potato yield; Crop traits; UAV; Hyperspectral

Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

作者其他论文 更多>>

Gene expression profiles of Chinese medaka ( Oryzias sinensis ) primary hepatocytes in response to estrone (E1 ), 17 i3-estradiol (E2 ) and estriol (E3 )

Enhancing potato leaf protein content, carbon-based constituents, and leaf area index monitoring using radiative transfer model and deep learning

Segmentation and Fractional Coverage Estimation of Soil, Illuminated Vegetation, and Shaded Vegetation in Corn Canopy Images Using CCSNet and UAV Remote Sensing

A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning

An Integrated Rapid Detection of Botryosphaeriaceae Species in Grapevine Based on Recombinase Polymerase Amplification, CRISPR/Cas12a, and Lateral Flow Dipstick

Estimation of potato above-ground biomass based on the VGC-AGB model and deep learning

Utilizing UAV-based hyperspectral remote sensing combined with various agronomic traits to monitor potato growth and estimate yield

意 见 箱

作者其他论文更多>>

意见箱