DOI: 10.3390/geosciences16060240 ISSN: 2076-3263

Machine Learning-Based Delineation of Anomalous Gold Zones from Drillhole Geochemistry in a Sulphide-Hosted Orogenic Gold System

Gilbert Yaw Bimpong, Justina Senam Lotsu, Kwaku Boakye

Early stage mineral exploration requires the reliable identification of anomalous gold zones from drillhole geochemistry in data-limited environments. This study applies a machine learning (ML) classification framework to detect anomalous gold zones (Au ≥ 0.68 ppm; 90th percentile) from bulk XRF multielement drillhole geochemistry in a Paleoproterozoic Birimian greenstone belt sulphide-hosted orogenic gold system, West African Craton. A total of 53,126 one-metre diamond core samples from 301 drillholes were preprocessed within a compositional data analysis (CoDA) framework, with Au being explicitly excluded from the centred log-ratio (CLR) transformation to eliminate target–predictor circularity. After Minimum Covariance Determinant (MCD) outlier filtering, 40,385 samples were retained to construct a 19-feature matrix of 10 CLR-transformed elements, 1 rock-type feature, and 8 sulphide–lithology interaction features. Drillhole-based block cross-validation (DH-block CV), validated by an experimental along-hole variogram (practical autocorrelation range ≈ 20 m), ensured spatially honest performance estimates. Four nonlinear classifiers—Random Forest (RF), XGBoost, LightGBM, and Multi-Layer Perceptron (MLP)—were benchmarked against a Logistic Regression (LR) linear baseline. All nonlinear classifiers achieved validation AUC of 0.936–0.938, outperforming LR (AUC = 0.931) with F1-score improvements of +0.09 to +0.11 and precision gains of up to +35 percentage points—directly reducing wasted drill holes in applied exploration. MLP recorded the highest F1-score (0.666) and precision (0.765), and XGBoost the highest recall (0.787). Permutation importance identified S-Ti (ΔAUC = 0.028), S-Fe (0.021), and S-Al (0.013) as the top-ranked features, confirming that sulphide enrichment relative to lithological background is the primary discriminating signal. Partial dependence analysis revealed a threshold-driven non-monotonic Fe dependence at CLR(Fe) ≈ 3, marking the transition from lithological dilutant to sulphide co-indicator—a nonlinear pattern inaccessible to linear classifiers.

More from our Archive