MSSTAN: A Multi-Scale Spatio-Temporal Attention Network for Traffic Forecasting
Junji Zhu, Haochen Shi, Shengdong Du, Tianrui Li, Yan Yang, Jie HuTraffic forecasting is pivotal but challenging due to intricate spatio-temporal dynamics. Existing models often apply a uniform spatial mechanism across distinct temporal scales and rely on static feature embeddings. Consequently, they are inadequate in capturing scale-specific spatial heterogeneity and dynamic feature interdependencies. To address these limitations, we propose the Multi-Scale Spatio-Temporal Attention Network (MSSTAN) with a novel dual-branch architecture: (1) A Global-Local Feature Attention Network (GLFAN) that explicitly decouples spatial interactions across decomposed temporal components to capture multi-scale spatial patterns; and (2) A Spatio-Temporal Feature Attention Network (STFAN) that dynamically recalibrates feature importance based on specific spatio-temporal contexts. A dynamic branch fusion mechanism integrates these branches to optimally aggregate their complementary views. Extensive experiments on five real-world datasets demonstrate that MSSTAN achieves state-of-the-art or highly competitive performance, validating its efficacy for traffic forecasting.