A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning
文献类型: 外文期刊
第一作者: Hu, Chunyang
作者: Hu, Chunyang;Gu, Qiong;Wu, Zhao;Ning, Bin;Li, Jingchen;Yang, Yusen
作者机构:
关键词: Unmanned aerial vehicle; Large-scale multi-agent systems; Multi-agent reinforcement learning
期刊名称:INTERNATIONAL JOURNAL OF FUZZY SYSTEMS ( 影响因子:3.6; 五年影响因子:3.1 )
ISSN: 1562-2479
年卷期: 2025 年
页码:
收录情况: SCI
摘要: In unmanned aerial vehicle (UAV) swarm confrontations, the optimal policies obtained through deep reinforcement learning methods face an exponential increase in computational and storage resource consumption with the number of UAVs. To achieve efficient policies in large-scale UAV swarm confrontations while keeping the amount of parameters and floating-point operations within an acceptable range, this study proposes a method based on fuzzy multi-agent reinforcement learning. This method models the confrontation in large-scale UAV swarms as a fuzzy game, establishing the corresponding group decision-making process. With the proof of the Markov property, interactions among UAVs are fuzzified into interactions among a few abstract agents, so that policies assigned to abstract agents rather than individual UAV, while the storage consumption is reduced. Through defuzzification calculations, policies of abstract agents are mapped to specific UAV behaviors, significantly reducing the computing consumption while ensuring policy effectiveness. Comparative experiments with other baseline methods show that our approach significantly reduces the required floating-point operations and parameters in UAV swarm confrontations of various numbers of UAVs, with comparable performance of the learned polices.
分类号:
- 相关文献
作者其他论文 更多>>
-
U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories
作者:Chen, Wenbai;Zhang, Bo;Zhao, Xin;Wang, Yiqun;Li, Jingchen;Shi, Haobin;Gou, Jianping
关键词:Image segmentation; Consumer electronics; Feature extraction; Assembly; Accuracy; Computational modeling; Decoding; salient object segmentation; ghost convolution; polarized self-attention mechanism; multi-scale feature fusion
-
Eliminating Primacy Bias in Online Reinforcement Learning by Self-Distillation
作者:Li, Jingchen;Wu, Huarui;Zhao, Chunjiang;Shi, Haobin;Hwang, Kao-Shing
关键词:Online reinforcement learning; overfitting; reinforcement learning
-
Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation
作者:Liu, Yang;Zhou, Ying;He, Ziming;Yang, Yusen;Li, Jingchen;Han, Qingcen
关键词:Multi-objective reinforcement learning; Sample efficiency; Reinforcement learning
-
Cournot Policy Model: Rethinking centralized training in multi-agent reinforcement learning
作者:Li, Jingchen;Yang, Yusen;Wu, Huarui;He, Ziming;Shi, Haobin;Chen, Wenbai;Wu, Huarui
关键词:Multi-agent reinforcement learning; Machine learning; Multi-agent system
-
Atractylodinol prevents pulmonary fibrosis through inhibiting TGF-β receptor 1 recycling by stabilizing vimentin
作者:Hao, Mengjiao;Zhang, Zhikang;Ai, Haopeng;Peng, Xing;Zhou, Huihao;Xu, Jun;Gu, Qiong;Guan, Zhuoji;Hao, Mengjiao;Gu, Qiong
关键词:
-
GmMKK4-activated GmMPK6 stimulates GmERF113 to trigger resistance to Phytophthora sojae in soybean
作者:Gao, Hong;Jiang, Liangyu;Du, Banghan;Ning, Bin;Ding, Xiaodong;Zhang, Chuanzhong;Song, Bo;Liu, Shanshan;Zhao, Ming;Zhao, Yuxin;Rong, Tianyu;Liu, Dongxue;Xu, Pengfei;Zhang, Shuzhen;Jiang, Liangyu;Wu, Junjiang
关键词:soybean; Phytophthora sojae; Mitogen-activated protein kinase cascade (MAPK); Ethylene response factor (ERF) transcription factor
-
The 26S Proteasome Regulatory Subunit GmPSMD Promotes Resistance to Phytophthora sojae in Soybean
作者:Liu, Tengfei;Wang, Huiyu;Liu, Zhanyu;Pang, Ze;Zhang, Chuanzhong;Zhao, Ming;Ning, Bin;Song, Bo;Liu, Shanshan;He, Zili;Wei, Wanling;Liu, Yaguang;Xu, Pengfei;Zhang, Shuzhen;Wu, Junjiang
关键词:soybean; Phytophthora sojae; GmPIB1; GmPSMD; ROS