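Filters
Research output
Resource type: Foreign-language journal
Author: Liu, Yang (exact match)
Author: Zhou, Ying (exact match)
Author: He, Ziming (exact match)
Author: Yang, Yusen (exact match)
Author: Li, Jingchen (exact match)
Author: Han, Qingcen (exact match)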
1 record
1. Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

Authors:

Institution:

Source: KNOWLEDGE-BASED SYSTEMS

Keywords: Multi-objective reinforcement learning; Sample efficiency; Reinforcement learning

Year: 2024
