Unmanned Surface Vessel Cluster Path Planning Based on Deep Reinforcement Learning

With the wide application of unmanned surface vessels(USVs) in the field of maritime search, the traditional path planning algorithms fail to meet the complex rescue scenarios, which can lead to local optimum, low task completion rate, and slow convergence speed. For this reason, a path planning met...

Full description

Saved in:
Bibliographic Details
Main Authors: Manjiang YU, Jiawei HE, Bowen XING
Format: Article
Language:Chinese
Published: Science Press (China) 2025-04-01
Series:水下无人系统学报
Subjects:
Online Access:https://sxwrxtxb.xml-journal.net/cn/article/doi/10.11993/j.issn.2096-3920.2024-0179
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:With the wide application of unmanned surface vessels(USVs) in the field of maritime search, the traditional path planning algorithms fail to meet the complex rescue scenarios, which can lead to local optimum, low task completion rate, and slow convergence speed. For this reason, a path planning method for USV cluster cooperative search and rescue was proposed. Firstly, a long and short-term memory module was introduced based on the multi-agent deep deterministic policy gradient algorithm to enhance the ability of the USVs to utilize the temporal information in path planning; secondly, a multi-level representational experience pool was designed to improve the training efficiency and data utilization and reduce the interference between different experiences; finally, stochastic network distillation was used as a curiosity mechanism to provide intrinsic rewards for the USVs to explore new regions and solve the convergence due to the sparse rewards. The simulation experiment results show that the improved algorithm improves the convergence speed by about 38.46% compared with the original algorithm, and the path length has been shortened by 27.02%. In addition, the obstacle avoidance ability has been significantly improved.
ISSN:2096-3920