DOI: 10.3390/s26134003 ISSN: 1424-8220

A Multimodal Closed-Loop Framework for Vital Sign Monitoring and Intelligent Diagnosis of Amusement Ride Passengers Under High-Dynamic Motion

Yikun Wu, Yulong Song, Hao Yang, Ming Zhang

High-dynamic amusement ride conditions involving impacts, rapid rotations, and abrupt posture changes introduce severe motion artifacts that degrade vital sign quality and destabilize physiological state recognition. This study aims to develop an engineering-ready closed-loop framework for robust passenger monitoring and intelligent diagnosis. A multimodal sensing and modeling pipeline was designed to jointly leverage physiological signals such as heart rate and SpO2 and kinematic measurements, including acceleration, angular rate, velocity, and attitude. Inertial and PPG signals were preprocessed into supervised samples through wavelet multiresolution denoising and coordinate frame unification, while a strapdown inertial navigation system was used to propagate a 12-channel physical quantity sequence. To ensure interpretability and standards compliance, constraints from GB 8408-2018 were translated into executable threshold rules, enabling standards-driven auto-labeling and rule-based early warning. Building on this foundation, three learning modules were developed: a fusion model for high-dynamic heart rate estimation, a CNN–LSTM dynamic-threshold-enhanced network TAPNet for rapid kinematic anomaly screening, and an attention-augmented hybrid model HS-BANet integrating one-dimensional residual blocks, bidirectional LSTM, and multi-head attention for fine-grained arrhythmia classification. Experimental results demonstrated accurate and consistent heart rate estimation with RMSE of 1.18 bpm on HSSH-I and 1.24 bpm on the independent HSSH-II set, strong agreement with training and testing correlations of 0.9928 and 0.9865, and near-zero bias in Bland–Altman analysis. TAPNet achieved 96.9% validation accuracy and 98.2% test accuracy for kinematic anomaly recognition, maintaining robust generalization under class imbalance. HS-BANet enabled multi-class identification of PVC, PAC, VT, SVT, and AF, achieving an accuracy of 92.37%, an F1-score of 86.87%, a precision of 88.45%, a sensitivity of 88.14%, and a specificity of 89.42%. Overall, the proposed two-stage multimodal closed-loop—fast, interpretable early warning based on physical quantity thresholds followed by fine-grained diagnosis from physiological signals—supports stable feature extraction and reliable decision-making under strong motion artifacts and non-stationary dynamics, balancing responsiveness and diagnostic credibility, while showing potential for practical safety early warning and future deployment-oriented operational support in amusement ride scenarios.

More from our Archive