DOI: 10.1145/3822510 ISSN: 2374-0353
Network-Constrained Policy Optimization for Adaptive Multi-agent Vehicle Routing
Fazel Arasteh, Arian Haghparast, Manos PapagelisTraffic congestion in urban road networks is marked by longer trip times and higher emissions, especially during peak periods. While the
Shortest Path First
(SPF) algorithm is optimal for a single vehicle in a static network, it performs poorly in dynamic, multi-vehicle settings, often worsening congestion by routing all vehicles along identical paths. We address dynamic vehicle routing through a
multi-agent reinforcement learning (MARL)
framework for coordinated, network-aware fleet navigation. We first propose
Adaptive Navigation
(AN), a decentralized MARL model where each intersection agent provides routing guidance based on (i) local traffic and (ii) neighborhood state modeled using
Graph Attention Networks
(GAT). To improve scalability in large networks, we further propose
Hierarchical Hub-based Adaptive Navigation
(HHAN), an extension of AN that assigns agents only to key intersections (