FuDensityNet: Occlusion-Aware Multimodal Activation for Robust Object Detection
Zainab Ouardirhi, Mostapha Zbakh, Mohammed Benjelloun, Sidi Ahmed MahmoudiAccurate object detection remains a major challenge in autonomous systems and surveillance, particularly when objects are partially or fully obscured by occlusions. To address this issue, we revisit FuDensityNet as a multimodal detection framework that jointly leverages 2D RGB images and 3D LiDAR point clouds for robust feature representation. The model integrates spatial and depth cues through low-rank tensor fusion (LRTF) and incorporates an Occlusion Rate (OR) assessment module that estimates the degree of occlusion and dynamically selects the most suitable detection pathway to preserve performance. Experiments on the KITTI and NuScenes datasets indicate that this adaptive strategy improves robustness under high occlusion while maintaining competitive accuracy in less challenging conditions. In particular, FuDensityNet attains 76.6% AP for car detection under “Hard” conditions on KITTI and outperforms several RGB-only and RGB–LiDAR baselines. Owing to its adaptive and modular design, FuDensityNet remains compatible with both 2D and 3D detection pipelines, making it a practical option for real-world environments where visual obstructions are frequent.