A Vision-Based Sensing Framework for PPE Detection and Safety Harness Compliance Recognition in High-Formwork Construction Environments Using YOLO-ILB
Gang Yao, Lang Liu, Yang Yang, Xiaodong CaiAutomated vision-based sensing for personal protective equipment (PPE) compliance in high-formwork support system (HFSS) construction environments faces three compounding challenges: extreme within-image scale variation, dense scaffold occlusions, and small safety hook targets prone to missed detection. Existing sensing systems address only binary presence detection and cannot assess whether safety harnesses are anchored in compliance with regulatory requirements. This paper proposes YOLO-ILB, a lightweight task-specific object detector built on YOLO11n with three targeted improvements. The C3k2_IDWC module replaces standard convolutions with multi-branch Inception depthwise convolutions, improving multi-scale feature discrimination at reduced computational cost. The SPPF_LSKA module embeds large separable kernel attention into the SPPF aggregation path, strengthening global context awareness to suppress scaffold background interference. A BiFPN neck replaces the original PAN structure, enabling bidirectional cross-scale weighted feature fusion to balance detection of small hooks and large harnesses within a sin gle image. A UAV-based sensing dataset was constructed using a DJI Mini 3 Pro (4032 × 3024 px) across 17 real construction sites under varied altitudes, viewing angles, and illumination conditions, yielding 2700 annotated images across five object categories. YOLO-ILB achieves mAP50 = 0.939 with only 1.923 M parameters and 5.7 G FLOPs at 262.3 FPS, outperforming eight mainstream YOLO baselines while remaining deployable on resource-constrained edge computing nodes. A geometry-based compliance algorithm further classifies three harness anchoring states—correct high anchoring, incorrect low anchoring, and unclipped or excessively distant hook—without additional sensors or annotations, achieving 90.82% overall accuracy on 305 field instances and extending the sensing system from presence detection to regulatory compliance assessment.