基于深度強(qiáng)化學(xué)習(xí)的帶約束車輛路徑分層優(yōu)化研究

打印
收藏

收藏成功

微博 QQ空間微信

打開文本圖片集

中圖分類號：TP301 文獻(xiàn)標(biāo)志碼：A DOI：10.12305/j.issn.1001-506X.2025.03.15

Hierarchical optimization research of constrained vehicle routing based on deep reinforcement learning

TANG Kaiqiang，F(xiàn)U Huiqiao，LIU Jiasheng，DENG Guizhou，CHEN Chunlin （Schoolof EngineeringManagement，NanjingUniuersity，Nanjing 2loo93，China）

Abstract：For the capacitated vehicle routing problem （CVRP），a method is proposed to decouple the capacityconstraints using a hierarchical structure，split the complex CVRP into constraint planning and path planning，and perform deep reinforcement learning（DRL）optimisation for solving the problem respectively. Firstly，the upper layer alocates the vehicle distribution tasks based on the atention model and sampling mechanism to plan the set of subpaths that satisfy the constraints. Secondly，the lower layer adopts the pretrained unconstrained atention model to plan the paths for the setof subpaths.Finally，the network parameters of the upper layer are optimized through the feedback training and iteration of the Reinforce algorithm. Experimental results show that the method generalizes to CVRPand heterogeneous CVRP tasks of diferent sizes， outperforms thestate-of-the-art DRL method.Moreover，compared with other heuristic methods，in batch computing tasks，the solution speed improved by more than 1O times，while maintaining competitive solutions.

Keywords：deep reinforcement learning （DRL）；vehicle routing problem（VRP）；attention model；hierarchical optimization

0 引言

二[1-3]，廣泛存在于物流、工業(yè)和運輸?shù)榷鄠€領(lǐng)域。（剩余21215字）

試讀結(jié)束

購買全文6.00元下一篇基于故障邏輯的民機(jī)液壓狀態(tài)監(jiān)控與故障診斷

系統(tǒng)工程與電子技術(shù)

2025年03期

￥24.00/本

特黄三级爱爱视频|国产1区2区强奸|舌L子伦熟妇aV|日韩美腿激情一区|6月丁香综合久久|一级毛片免费试看|在线黄色电影免费|国产主播自拍一区|99精品热爱视频|亚洲黄色先锋一区

基于深度強(qiáng)化學(xué)習(xí)的帶約束車輛路徑分層優(yōu)化研究