DOI: 10.3390/en19133153 ISSN: 1996-1073

XGBoost-Guided Spectrogram Pruning with SE-Augmented Residual CNN for Wind Turbine Gearbox Fault Diagnosis Under Unsteady Conditions

Chiheng Huang, Attia Bibi, Wenxian Yang, Fang Duan, Haiyan Miao, Rakesh Mishra

Reliable condition monitoring of wind turbine gearboxes is critical to reducing unplanned downtime and maintenance costs in wind farms. However, this task presents significant challenges due to the non-stationary nature of vibration signals, in which fault-relevant features are sparsely and unevenly distributed across the time–frequency map. Although time–frequency analysis has been widely adopted to represent nonlinear and non-stationary vibration signals, existing deep learning methods typically process the full spectrogram directly, without distinguishing redundant or uninformative regions. This leads to high input dimensionality and exposes the model to substantial spectral noise. Consequently, it increases computational burden and potentially reduces the diagnostic reliability. To address this issue, this paper proposes a two-stage hybrid framework based on complementary selection mechanisms operating on two distinct feature spaces. In the first stage, eXtreme Gradient Boosting (XGBoost) importance scores are used to identify and permanently prune uninformative time–frequency features from the input spectrogram, reducing the input map size by 25%. In the second stage, a Squeeze-and-Excitation (SE) block, inserted after the deepest residual layer, performs soft channel-wise recalibration of the abstract feature maps produced by the residual convolutional neural network (ResCNN), thereby amplifying discriminative representations prior to classification. The proposed method was evaluated in an eight-class variable-speed fault classification task using the MCC5-THU benchmark, where data were collected from a 2.2 kW motor-driven gearbox test rig. The proposed method achieves a mean accuracy of 97.81% ± 0.33% under 5-fold stratified cross-validation (CV), while reducing classifier training time by approximately 23% compared to a baseline model trained on the full spectrogram. These results demonstrate that explicit input-level spectrogram pruning, combined with model-level channel attention, yields a robust and computationally efficient diagnostic framework for wind turbine gearbox condition monitoring.

More from our Archive