Two-Stage Learned Decomposition for Scalable Routing on Multigraphs
Preprint, 2026

Most neural methods for Vehicle Routing Problems (VRPs) are limited to Euclidean settings or simple graphs. In this work, we instead consider multigraphs, where parallel edges represent distinct travel options with varying trade-offs (e.g., distance vs time). Few methods are designed for such formulations and those that do exist face major scalability issues. We mitigate these scalability issues via a Node-Edge Policy Factorization (NEPF) approach, which splits the routing policy into a node permutation stage and an edge selection stage. To enable the decomposition, we introduce a pre-encoding edge aggregation scheme and a non-autoregressive architecture for the edge stage, as well as a hierarchical reinforcement learning method to train the stages jointly. Our experiments across six VRP variants demonstrate that NEPF matches or outperforms the state-of-the-art in terms of solution quality, while being significantly faster in training and inference.

Multigraphs

Multi-Objective Optimization

Graph Learning

Machine Learning

Traveling Salesman Problem

Routing Problems

Graph Neural Networks

Author

Filip Rydin

Chalmers, Electrical Engineering, Systems and control

Morteza Haghir Chehreghani

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

University of Gothenburg

Balázs Adam Kulcsár

Chalmers, Electrical Engineering, Systems and control

LEAR: Robust LEArning methods for electric vehicle Route selection

Swedish Electromobility Centre, 2023-01-01 -- 2026-12-31.

Areas of Advance

Transport

Subject Categories (SSIF 2025)

Transport Systems and Logistics

Discrete Mathematics

Artificial Intelligence

Infrastructure

Chalmers e-Commons (incl. C3SE, 2020-)

More information

Latest update

5/11/2026