DOI: 10.3390/rs18132121 ISSN: 2072-4292

A Wavelet-Guided Frequency–Spatial Decoupling Network for Visible–Infrared UAV Detection

Zeliang Dong, Jiaxin Pan, Xiangpeng Chen, Wuxia Zhang, Huinan Guo

Detecting unmanned aerial vehicles (UAVs) remains a difficult task, primarily due to their tiny size, rapid motion, and complex backgrounds. Fusing visible and infrared imagery offers complementary advantages for robust detection, yet existing methods rely on spatial feature aggregation that overlooks spectral disparities, coupling noise with textures. Moreover, the small scale and high dynamics of UAVs hinder standard convolution from decoupling target signals from background interference due to limited receptive fields. To solve these limitations, the Wavelet-guided Frequency–Spatial Decoupling Network (WFSD-Net) is designed for visible–infrared UAV detection. First, to tackle fusion noise, the Discrete Wavelet Band-Differentiated Fusion (DWBF) module is designed to explicitly decouple noise-dominant sub-bands from information-rich components by performing spectral decomposition. It aligns low-frequency distributions via adaptive spatial weighting and disentangles high-frequency details using physics-aware rules, achieving source-level noise suppression. Second, an Axial Strip Contextual Attention (ASCA) module is proposed. By utilizing anisotropic strip convolution via orthogonal decomposition, this module captures global contextual dependencies to effectively decouple weak target features from background clutter, enhancing the spatial position encoding capability for weak targets. Finally, the proposed WFSD-Net method is validated on Anti-UAV300 and Multi-Sensor and Multi-View Fixed-Wing UAV (MMFW-UAV) datasets, and experiments demonstrate that the proposed method is superior to existing state-of-the-art (SOTA) methods.

More from our Archive