LDA-D3QN-Based Autonomous Navigation for Unmanned Surface Vehicles in Complex Obstacle Scenarios
Guoquan Xiao, Ruijie Rao, Yuanming Chen, Xiaobin HongAutonomous navigation of unmanned surface vehicles (USVs) in complex obstacle scenarios remains challenging due to redundant perception inputs, unstable value estimation, and inefficient policy convergence. To address these problems, this paper proposes LDA-D3QN, an improved deep reinforcement learning method for USV autonomous navigation. The proposed method constructs a compact navigation state representation by combining target-related information with local obstacle features, allowing the agent to retain key decision-making information while reducing unnecessary environmental redundancy. Based on this representation, an enhanced value-learning framework is developed to improve the stability of navigation decisions in cluttered environments. Moreover, a reward-guided and staged training strategy is introduced to help the agent gradually adapt to increasingly complex navigation tasks. The proposed method was evaluated on a Unity–ROS–MATLAB integrated simulation platform. Experimental results show that LDA-D3QN achieves superior overall navigation performance compared with several representative reinforcement learning algorithms. Specifically, the proposed method achieves a final training success rate of 91.4%, outperforming PPO (82.3%), Dueling DQN (78.5%), Double DQN (79.8%), and Rainbow DQN (86.5%). Additional tests in complex multi-obstacle and multi-target scenarios further demonstrate that the learned policy can generate safe, stable, and effective navigation behaviors. Preliminary validation using real-USV sensor data also confirms the feasibility of the LiDAR and GPS data processing procedures, providing a basis for future closed-loop autonomous navigation experiments and multi-sensor fusion deployment.