A Stability-Driven Framework for Automated Operational Crop Mapping Using Optical and Radar Satellite Image Time Series
Maryam Choukri, Yacine Bouroubi, Jamal-Eddine Ouzemou, Abdelghani Chehbouni, Ahmed LaamraniOperational crop mapping requires classifiers capable of robust generalization across years. While feature importance is routinely used for model optimization, its temporal stability has rarely been systematically investigated, creating a critical gap in deploying reliable monitoring systems. This study moves beyond identifying “most important” features to systematically evaluate and quantify their inter-annual stability for enabling automated classification. Using six agricultural years (2018, 2019, 2020, 2023, 2024 and 2025) of Sentinel-1 and Sentinel-2 data over Morocco, we extracted 156 multi-sensor features across 12 monthly composites and analyzed their importance stability through statistical metrics, clustering, and novel composite indices: the Reliability Index (RI) and Automatic Selection Score (AuSS). This framework automates feature selection by ranking features with RI and AuSS and then applying Pareto optimization to identify a minimal stable feature set—without requiring annual retraining or expert intervention. Our analysis confirms a fundamental tension: the most discriminative features (e.g., NDVI, VH, VV) are also the most volatile, while stable features (e.g., NDRE, MSI, NDMI) offer modest predictive power. Hierarchical clustering revealed four behavioral typologies (Dominant Stable, Performant Volatile, Stable Minor, and Noise), guiding strategic feature management. Crucially, a Pareto analysis demonstrated that a refined portfolio of 6 indices (VH, VV, NDVI, NDRE, GCVI, RVI) captures 57.2% of cumulative predictive importance, filtering out inter-annual noise while preserving discriminative signal. The Voting Ensemble leveraging this Stable Portfolio maintained consistent high accuracy (87.4% accuracy, 87.2% F1-score) with minimal performance degradation during temporal transfer, while models based on volatile top features exhibited significant drops. Entropy analysis confirmed that all features in the Stable Portfolio provide consistent informational certainty, indicating that stability-driven selection does not increase model uncertainty. We conclude that feature stability is not merely a diagnostic metric but a foundational criterion for operational design. We propose a practical, metrics-driven framework for constructing automated crop classification systems that are more resilient to inter-annual climate variability.