DOI: 10.3390/info17070626 ISSN: 2078-2489

CNN–Transformer–KAN: A Hybrid Deep-Learning Framework with an Inspectable KAN Classification Head for Industrial Process Fault Diagnosis

Yujie Wu, Maoyu Zhang, Aoxuan Ding, Yu Hua, Zhehao Jin, Yiyang Dai

Detecting and identifying faults in industrial chemical plants is essential for safe and stable operation, and modern monitoring systems increasingly rely on deep learning to classify faults from multivariate sensor data. A practical obstacle to adoption is trust: most deep-learning diagnosers reach their decisions through a classification layer that operators cannot inspect, making it hard to see how the model maps process signals to a particular fault. This study targets fault diagnosis on the Tennessee Eastman (TE) process, a standard benchmark of simulated chemical-plant sensor data, and asks whether this final decision stage can be made directly inspectable without sacrificing accuracy. We propose CNN–Transformer–KAN (CTKAN), a hybrid model that learns local temporal patterns with a one-dimensional convolutional encoder, captures global inter-time-step dependencies with a Transformer encoder, and classifies faults with a Kolmogorov–Arnold Network (KAN) head whose learnable B-spline activations can be plotted and examined individually, in place of a conventional multi-layer perceptron (MLP). On the TE benchmark, CTKAN attains a Macro-F1 of 91.38 ± 0.26% over ten independent runs, comparable to a CNN + Transformer + MLP ablation (91.21 ± 0.32%) and a capacity-matched MLP-head variant (91.43 ± 0.37%) within seed-to-seed variability. The main finding is therefore not a higher score: at matched capacity the KAN and MLP heads are statistically indistinguishable in accuracy, so the KAN head’s value is to add a directly inspectable view of the classification stage at no measurable accuracy cost, helping process engineers sanity-check how the diagnoser separates faults in safety-critical settings.

More from our Archive