Formation control without collision in uncertain environment based on deep reinforcement learning
Keywords: deep reinforcement learning, collision avoidance, formation control, multi-agent, neural network
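禹鑫燚 (College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023)
杜丹枫 (College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023)
欧林林 (College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023)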
The purpose of multi-agent formation control is to maintain a formation while avoiding obstacles. To handle the randomness and uncertainty of complex environments, this paper proposes a formation and obstacle-avoidance control method based on deep reinforcement learning. First, a value evaluation network is designed to increase the experience gathered from special actions, such as touching an obstacle or reaching the desired location, so that the agents learn the rules of the environment faster. Second, the action selection strategy is improved on the basis of the greedy strategy, which increases the learning efficiency of the agents. Third, a sample storage space is designed to improve both the efficiency of model training and the utilization of samples, and a multi-step learning algorithm is incorporated to make value estimation more accurate in the decision-making stage. Finally, the proposed method is compared with other algorithms. Simulation results demonstrate that the proposed method achieves collision-free multi-agent formation control and effectively speeds up the agents' learning.
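The abstract names two standard building blocks, greedy-based action selection and multi-step learning, without giving their details. The sketch below illustrates only the textbook versions of these ideas: plain epsilon-greedy selection (not the improved strategy proposed in the paper) and an n-step return target. The function names, parameters, and discount factor are assumptions made for illustration and do not come from the paper.

```python
import random
from typing import Sequence


def epsilon_greedy(q_values: Sequence[float], epsilon: float,
                   rng: random.Random) -> int:
    """Textbook epsilon-greedy (assumed baseline, not the paper's variant).

    With probability epsilon pick a uniformly random action index;
    otherwise pick the index with the largest estimated value.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def n_step_target(rewards: Sequence[float], bootstrap_value: float,
                  gamma: float) -> float:
    """Standard n-step return (hypothetical helper).

    Computes r_0 + gamma*r_1 + ... + gamma^(n-1)*r_(n-1)
    + gamma^n * bootstrap_value, where bootstrap_value is the value
    estimate of the state reached after the n steps.
    """
    target = bootstrap_value
    # Fold the rewards in from the last step back to the first.
    for reward in reversed(rewards):
        target = reward + gamma * target
    return target


if __name__ == "__main__":
    rng = random.Random(0)
    # epsilon = 0 always selects the highest-valued action (index 1 here).
    print(epsilon_greedy([0.1, 0.9, 0.3], epsilon=0.0, rng=rng))
    # Three observed rewards plus a bootstrapped value estimate of 10.0.
    print(n_step_target([1.0, 2.0, 3.0], bootstrap_value=10.0, gamma=0.5))
```

Compared with a one-step target, the n-step target uses several observed rewards before bootstrapping, which is the usual motivation for multi-step learning when value estimates are still inaccurate early in training.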
