A Residual U-Net Architecture for Built-Up Area Segmentation from Sentinel-2 Images
Mehtap ÜlkerAccurate and up-to-date mapping of built-up areas is of great importance for sustainable urban planning, disaster management, and the monitoring of environmental changes. In this study, a residual U-Net-based deep learning architecture named FiveBandTTA is proposed for built-up area segmentation from Sentinel-2 multispectral satellite imagery. The proposed model aims to simultaneously learn spatial and spectral features by jointly processing RGB, NIR (B8), and SWIR (B11) bands within the same encoder–decoder structure. The model incorporates standard residual blocks following the conventional residual learning principle, multi-level skip connection mechanisms, and TTA-based inference strategies. Within the scope of the study, a multi-temporal built-up area dataset was constructed from Sentinel-2 imagery acquired over Kocaeli Province. The performance of the proposed model was comparatively evaluated against RGB Baseline, FiveBand Single, DeepLabV3+, and SegFormer models. Experimental results demonstrated that the proposed model achieved the highest segmentation performance among all compared approaches, obtaining 0.8447 IoU, 0.9124 Dice, and 0.9249 Precision scores. It was observed that the use of multispectral bands together with the residual encoder–decoder structure may contribute to improved representation of small-scale built-up regions and complex boundary structures. Furthermore, the comparative experiments indicated that the NIR and SWIR bands provide complementary spectral information for distinguishing built-up areas, while the TTA-based inference strategy may contribute to improved segmentation stability and prediction consistency. Overall, the obtained results demonstrate that the proposed approach is an effective and robust method for built-up area segmentation from medium-resolution Sentinel-2 imagery.