农科机构知识库联盟

A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning

文献类型：外文期刊

第一作者： Hu, Chunyang

作者： Hu, Chunyang;Gu, Qiong;Wu, Zhao;Ning, Bin;Li, Jingchen;Yang, Yusen

作者机构：

关键词： Unmanned aerial vehicle; Large-scale multi-agent systems; Multi-agent reinforcement learning

期刊名称：INTERNATIONAL JOURNAL OF FUZZY SYSTEMS （影响因子：3.6；五年影响因子：3.1 ）

ISSN： 1562-2479

年卷期： 2025 年

页码：

收录情况： SCI

摘要： In unmanned aerial vehicle (UAV) swarm confrontations, the optimal policies obtained through deep reinforcement learning methods face an exponential increase in computational and storage resource consumption with the number of UAVs. To achieve efficient policies in large-scale UAV swarm confrontations while keeping the amount of parameters and floating-point operations within an acceptable range, this study proposes a method based on fuzzy multi-agent reinforcement learning. This method models the confrontation in large-scale UAV swarms as a fuzzy game, establishing the corresponding group decision-making process. With the proof of the Markov property, interactions among UAVs are fuzzified into interactions among a few abstract agents, so that policies assigned to abstract agents rather than individual UAV, while the storage consumption is reduced. Through defuzzification calculations, policies of abstract agents are mapped to specific UAV behaviors, significantly reducing the computing consumption while ensuring policy effectiveness. Comparative experiments with other baseline methods show that our approach significantly reduces the required floating-point operations and parameters in UAV swarm confrontations of various numbers of UAVs, with comparable performance of the learned polices.

分类号：

相关文献

作者其他论文更多>>

U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

作者：Chen, Wenbai;Zhang, Bo;Zhao, Xin;Wang, Yiqun;Li, Jingchen;Shi, Haobin;Gou, Jianping

关键词：Image segmentation; Consumer electronics; Feature extraction; Assembly; Accuracy; Computational modeling; Decoding; salient object segmentation; ghost convolution; polarized self-attention mechanism; multi-scale feature fusion
Eliminating Primacy Bias in Online Reinforcement Learning by Self-Distillation

作者：Li, Jingchen;Wu, Huarui;Zhao, Chunjiang;Shi, Haobin;Hwang, Kao-Shing

关键词：Online reinforcement learning; overfitting; reinforcement learning
Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

作者：Liu, Yang;Zhou, Ying;He, Ziming;Yang, Yusen;Li, Jingchen;Han, Qingcen

关键词：Multi-objective reinforcement learning; Sample efficiency; Reinforcement learning
Cournot Policy Model: Rethinking centralized training in multi-agent reinforcement learning

作者：Li, Jingchen;Yang, Yusen;Wu, Huarui;He, Ziming;Shi, Haobin;Chen, Wenbai;Wu, Huarui

关键词：Multi-agent reinforcement learning; Machine learning; Multi-agent system
Atractylodinol prevents pulmonary fibrosis through inhibiting TGF-β receptor 1 recycling by stabilizing vimentin

作者：Hao, Mengjiao;Zhang, Zhikang;Ai, Haopeng;Peng, Xing;Zhou, Huihao;Xu, Jun;Gu, Qiong;Guan, Zhuoji;Hao, Mengjiao;Gu, Qiong

关键词：
GmMKK4-activated GmMPK6 stimulates GmERF113 to trigger resistance to Phytophthora sojae in soybean

作者：Gao, Hong;Jiang, Liangyu;Du, Banghan;Ning, Bin;Ding, Xiaodong;Zhang, Chuanzhong;Song, Bo;Liu, Shanshan;Zhao, Ming;Zhao, Yuxin;Rong, Tianyu;Liu, Dongxue;Xu, Pengfei;Zhang, Shuzhen;Jiang, Liangyu;Wu, Junjiang

关键词：soybean; Phytophthora sojae; Mitogen-activated protein kinase cascade (MAPK); Ethylene response factor (ERF) transcription factor
The 26S Proteasome Regulatory Subunit GmPSMD Promotes Resistance to Phytophthora sojae in Soybean

作者：Liu, Tengfei;Wang, Huiyu;Liu, Zhanyu;Pang, Ze;Zhang, Chuanzhong;Zhao, Ming;Ning, Bin;Song, Bo;Liu, Shanshan;He, Zili;Wei, Wanling;Liu, Yaguang;Xu, Pengfei;Zhang, Shuzhen;Wu, Junjiang

关键词：soybean; Phytophthora sojae; GmPIB1; GmPSMD; ROS

A Large-Scale UAV Swarm Confrontation Method Based on Fuzzy Reinforcement Learning

作者其他论文 更多>>

U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

Eliminating Primacy Bias in Online Reinforcement Learning by Self-Distillation

Dynamic preference inference network: Improving sample efficiency for multi-objective reinforcement learning by preference estimation

Cournot Policy Model: Rethinking centralized training in multi-agent reinforcement learning

Atractylodinol prevents pulmonary fibrosis through inhibiting TGF-β receptor 1 recycling by stabilizing vimentin

GmMKK4-activated GmMPK6 stimulates GmERF113 to trigger resistance to Phytophthora sojae in soybean

The 26S Proteasome Regulatory Subunit GmPSMD Promotes Resistance to Phytophthora sojae in Soybean

作者其他论文更多>>