PPLCNet-YOLOv11: Exploring a Lightweight College Student Pose-Detection Method for Sports Training Under the Concept of General Education
Jie Chen, Zhi Wang, Wenquan HuangHuman pose detection is fundamental to quantitative sports training analysis in college general education courses, enabling an objective assessment of college students’ movement quality and the early identification of sports injury risks among non-professional athletes. At present, those detectors based on YOLO have encountered difficulties in capturing the continuous movement patterns of college athletes in routine training, maintaining the regression accuracy of different size posture targets, and maintaining the real-time calculation speed in the campus sports environment. Furthermore, most existing pose-estimation frameworks are optimized for general scenes and fail to address the unique challenges of college physical education settings, including non-standard student movements, diverse skill levels, and strict cost constraints for large-scale deployment. In order to solve these problems, we put forward PPLCNet-YOLOv11, which is a simplified human posture-estimation framework designed for college physical education. This model is optimized by three key improvements: (1) replacing the original backbone network with PPLCNet to enhance feature extraction, while strictly observing the strict FLOPs and parameter restrictions; (2) an enhanced Multi-Scale Attention Mechanism (MSAM) that combines adaptive scale perception, hierarchical channel attention, and pose-sensitive spatial attention to better represent elongated anatomical structures and multi-scale pose cues; and (3) an improved enhanced IoU loss function that incorporates scale-aware and aspect-ratio-aware penalty terms to refine the bounding box adjustment for atypical and sports-specific gestures. Experiments on both a dedicated college student sports pose dataset and two public benchmark datasets (COCO Keypoints 2017 and MPII Human Pose) demonstrate that PPLCNet-YOLOv11 achieves 77.8% mAP@0.5 and 37.09% mAP@0.95 based on the campus dataset, with 82.34% precision and 75.00% recall, while requiring only 2.62 M parameters and 6.38 GFLOPs. Extensive inference speed tests show that the model achieves 127 FPS on an NVIDIA RTX 4090 GPU, 38 FPS on an Intel i7-12700 CPU, and 16 FPS on a Jetson Nano edge device, meeting the real-time requirements of campus sports monitoring. Compared with mainstream lightweight YOLO variants and state-of-the-art specialized pose-estimation models, our proposed method improves mAP@0.5 by 4.93–12.6 percentage points based on the campus dataset. All experiments were repeated five times with different random seeds, and we report mean values with standard deviations and statistical significance tests to ensure result reliability. These results indicate that PPLCNet-YOLOv11 provides an accurate and resource-efficient solution for real-time pose evaluation in college physical training.