SE-DBIRNet: Squeeze-and-Excitation Driven Dual-Path Residual Network for Mango Shelf-Life Stages Classification
Ibrar Ahmad, Bushra Siddique, Muhammad Junaid, Mostafa Gouda, Aftab Khaliq, Zia Ul Haq, Zhengjun QiuPost-harvest losses of mango (Mangifera indica L.) in developing economies are estimated at 5% to 30%, largely due to manual management practices that depend on subjective visual assessments. This paper proposes a lightweight deep learning architecture, termed SE-DBIRNet, for real-time classification of mangoes into five shelf-life stages: unripe, semi-ripe, fully ripe, overripe, and perished. The model incorporates three key design strategies: (i) depthwise separable convolutions, achieving an 88.5% reduction in parameters when integrated into the ResNet50 backbone; (ii) a double-branch inverted residual (DBIR) module designed to enhance feature diversity and richness; and (iii) a squeeze-and-excitation (SE) attention mechanism for adaptive channel-wise recalibration. Using a public benchmark dataset of 4428 RGB images (Mendeley Data) under 10-fold cross-validation, SE-DBIRNet achieved 98.24% accuracy. Among lightweight CNN architectures (EfficientNetB0, MobileNetV2, ResNet50), SE-DBIRNet outperformed the best lightweight baseline (EfficientNetB0: 96.57%) by 1.67 percentage points. While dedicated attention-based DenseNet variants (e.g., DSA-DenseNet: 99.20%) achieved higher accuracy, SE-DBIRNet offers a superior trade-off among accuracy, inference speed (56.9 ± 1.8 FPS), and memory efficiency (8871 ± 45 MB CPU memory). EigenCAM activation visualizations revealed that the model focuses on biologically relevant and stage-discriminative features, including surface color gradients, texture uniformity, lenticel patterns, and decay boundaries. Overall, SE-DBIRNet achieves a Pareto-optimal balance among accuracy, speed, and memory efficiency, making it a strong candidate for real-time, edge-deployable post-harvest mango quality-monitoring systems, particularly when computational resources are limited.