基于改进DQN算法的移动机器人路径规划

MOBILE ROBOT PATH PLANNING BASED ON IMPROVED DQN ALGORITHM

摘要: 移动机器人在动态未知复杂环境中进行路径规划时，需要保证机器人的实时性。针对DQN算法在移动机器人路径规划中存在的过估计问题以及收敛速度慢的问题，提出一种C-RD3QN算法（Combination-Residual Dueling Double DQN）。该算法在D3QN算法基础上，将卷积层修改为残差网络结构，使用竞争网络结构中的动作优势函数来估计动作值函数，将状态值函数与奖励值结合，使机器人达到更快的收敛速度。经过仿真实验对比分析，表明C-RD3QN算法能够进行更优的路径规划。

Abstract: It is important to guarantee real-time performance when the path planning is carried out in a dynamic and complex environment. Aimed at the overestimation problem and the slow convergence speed of DQN algorithm in mobile robot path planning, a C-RD3QN (combination-residual dueling double DQN) algorithm is proposed. Based on D3QN algorithm, the convolution layer was modified to the residual network structure, the action advantage function in the competitive network structure was used to estimate the action value function, and the state value function was combined with the reward value to achieve faster convergence speed. The simulation results show that the C-RD3QN algorithm can carry out better path planning.