DOI: 10.3390/jmse14131228 ISSN: 2077-1312

Robust Features, Adaptive Thresholds: LightGBM for Fishing Vessel Type Identification from Sparse AIS Data

Shibo Li, Jianghua Sui

Under 10 min sparse Automatic Identification System (AIS) sampling, the reliability of point-wise motion statistics degrades substantially, and conventional classification methods rely on trajectory interpolation, which may introduce spurious motion patterns. This study proposes a feature-driven framework for fishing vessel type identification that eliminates the need for interpolation preprocessing. A 39-dimensional feature set is constructed using robust statistics, including the median and interquartile range, to characterize trajectory-level behavioral patterns. Adaptive speed interval thresholds are derived through a data-driven approach grounded in Bayesian decision boundaries, thereby removing the dependence on manually defined cut-off values. A backward ablation procedure guided by feature importance ranking identifies a lightweight 12-dimensional feature subset that retains 98.7% of the classification accuracy at a compression rate of 69%. Evaluated on 18,320 fishing vessel trajectories in the East China Sea, the full 39-dimensional feature set achieves a 5-fold cross-validation accuracy of 91.92% (Macro-F1 = 0.919, Kappa = 0.879), with inter-fold standard deviations ranging from 0.002 to 0.004. Comparative experiments demonstrate that three tree-based classifiers all exceed 90% accuracy on the same feature set, confirming that feature robustness, rather than model selection, constitutes the dominant performance factor. LightGBM achieves the optimal trade-off between accuracy and training efficiency, whereas the cross-validation standard deviation of LSTM is approximately 7.5 times greater, indicating that hand-crafted robust features provide superior stability under sparse sampling conditions. The proposed framework requires no fishery-specific prior knowledge and offers a transferable paradigm for sparse AIS trajectory analysis.

More from our Archive