基于深度強(qiáng)化學(xué)習(xí)的帶約束車輛路徑分層優(yōu)化研究

打開文本圖片集
中圖分類號:TP301 文獻(xiàn)標(biāo)志碼:A DOI:10.12305/j.issn.1001-506X.2025.03.15
Hierarchical optimization research of constrained vehicle routing based on deep reinforcement learning
TANG Kaiqiang,F(xiàn)U Huiqiao,LIU Jiasheng,DENG Guizhou,CHEN Chunlin (Schoolof EngineeringManagement,NanjingUniuersity,Nanjing 2loo93,China)
Abstract:For the capacitated vehicle routing problem (CVRP),a method is proposed to decouple the capacityconstraints using a hierarchical structure,split the complex CVRP into constraint planning and path planning,and perform deep reinforcement learning(DRL)optimisation for solving the problem respectively. Firstly,the upper layer alocates the vehicle distribution tasks based on the atention model and sampling mechanism to plan the set of subpaths that satisfy the constraints. Secondly,the lower layer adopts the pretrained unconstrained atention model to plan the paths for the setof subpaths.Finally,the network parameters of the upper layer are optimized through the feedback training and iteration of the Reinforce algorithm. Experimental results show that the method generalizes to CVRPand heterogeneous CVRP tasks of diferent sizes, outperforms thestate-of-the-art DRL method.Moreover,compared with other heuristic methods,in batch computing tasks,the solution speed improved by more than 1O times,while maintaining competitive solutions.
Keywords:deep reinforcement learning (DRL);vehicle routing problem(VRP);attention model;hierarchical optimization
0 引言
二[1-3],廣泛存在于物流、工業(yè)和運輸?shù)榷鄠€領(lǐng)域。(剩余21215字)