A Fairness-Aware and Interpretable Model for Recidivism Prediction

doi:10.3390/a19070509

ISSN: 1999-4893

Stamatis Chatzistamatis, George E. Tsekouras, Anastasios Rigos, Alvaro Garcia-Recuero, Eleni Valari, Andreas Siafakas, Konstantinos Kotis

Recidivism prediction is increasingly embedded in criminal justice decision-making, yet most deployed systems remain opaque and have been shown to discriminate against certain demographic groups. This paper presents a fairness-aware, interpretable framework for recidivism prediction, applied to three real-world datasets from Bulgaria, Greece, and Portugal. The classification core is a one-dimensional convolutional neural network (1D-CNN) trained with a custom objective function that embeds the Equalized Odds fairness criterion as an L1-regularized penalty on gender-based disparities in false positive and false negative rates. Model-level interpretability is provided through Kernel SHAP, which decomposes individual predictions into additive feature attributions grounded in cooperative game theory. Experiments across prediction tasks, each evaluated over randomized runs, show that the baseline model exhibits statistically significant bias against the female group in all datasets. The fairness-constrained model substantially reduces these disparities across all tasks at a moderate and expected cost to classification accuracy. Kernel SHAP analysis reveals the relative contribution of static and dynamic offender attributes to individual risk scores, supporting auditability and contestability. The proposed framework advances the integration of predictive performance, algorithmic fairness, and structural interpretability in criminal justice analytics.
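The abstract does not give the exact form of the objective, so the following is only a minimal sketch of how an Equalized Odds term could be added to a classification loss as an L1 penalty. It assumes a binary group attribute, binary cross-entropy as the base loss, and "soft" false positive and false negative rates computed from predicted probabilities. The names `equalized_odds_penalty`, `fair_loss`, and the weight `lam` are hypothetical and do not come from the paper.

```python
import numpy as np


def equalized_odds_penalty(y_true, y_prob, group):
    """L1 gap in soft FPR and soft FNR between two groups (group coded 0/1).

    Assumption: rates are approximated from predicted probabilities so the
    term stays smooth; the paper's exact formulation may differ.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    group = np.asarray(group)

    def soft_rates(mask):
        neg = mask & (y_true == 0)
        pos = mask & (y_true == 1)
        # Soft FPR: mean predicted probability over true negatives.
        fpr = y_prob[neg].mean() if neg.any() else 0.0
        # Soft FNR: mean (1 - probability) over true positives.
        fnr = (1.0 - y_prob[pos]).mean() if pos.any() else 0.0
        return fpr, fnr

    fpr_a, fnr_a = soft_rates(group == 0)
    fpr_b, fnr_b = soft_rates(group == 1)
    return abs(fpr_a - fpr_b) + abs(fnr_a - fnr_b)


def fair_loss(y_true, y_prob, group, lam=1.0, eps=1e-7):
    """Binary cross-entropy plus lam times the Equalized Odds penalty."""
    y_true = np.asarray(y_true, dtype=float)
    p = np.clip(np.asarray(y_prob, dtype=float), eps, 1.0 - eps)
    bce = -np.mean(y_true * np.log(p) + (1.0 - y_true) * np.log(1.0 - p))
    return bce + lam * equalized_odds_penalty(y_true, p, group)


# Toy batch: group 1 receives systematically higher scores on true negatives
# and lower scores on true positives than group 0.
y_true = [0, 0, 1, 1, 0, 0, 1, 1]
group = [0, 0, 0, 0, 1, 1, 1, 1]
y_prob = [0.2, 0.4, 0.8, 0.6, 0.6, 0.8, 0.4, 0.2]

penalty = equalized_odds_penalty(y_true, y_prob, group)
loss_plain = fair_loss(y_true, y_prob, group, lam=0.0)
loss_fair = fair_loss(y_true, y_prob, group, lam=1.0)
```

In this sketch, setting `lam=0.0` recovers the unconstrained baseline loss, while larger values trade classification accuracy for smaller gaps between the two groups, which is the kind of trade-off the abstract reports.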
