Predicting Mineralogy with Hyperspectral Data: A Benchmark Dataset and Machine Learning Framework to Enable Hyperspectral Geometallurgy
Samuel T. Thiele, Moritz Kirsch, Max Frenzel, Raimon Tolosana-Delgado, Akshay V. Kamath, Bradley M. Guy, Yonghwi Kim, Laura Tuşa, Tom Járóka, Richard GloaguenMineralogical data acquired from drillcores provide important constraints for resource estimation, geometallurgical modelling, mineral exploration, and geological interpretation. While hyperspectral imaging is rapidly gaining traction for these applications, it lacks the ability to accurately quantify mineral abundances without extensive calibration data. Here, we build on previous work to demonstrate and benchmark workflows that combine scanning electron microscope (SEM) mineral maps with large-extent multimodal hyperspectral imaging data. The goal is to relate hyperspectral features and mineral abundances using supervised machine learning models, and then apply these models to infer mineralogy across entire drillcores. We adapt the learning process to the non-uniform (unbalanced) composition of most rocks, and achieve reasonable accuracy for most rock-forming minerals. However, we also find that prediction accuracy depends strongly on the representativity of training data—so models often fail to produce accurate maps of rare and accessory minerals. Robust, adaptive and ideally semi-automated sampling approaches might address this shortcoming by identifying locations which ensure optimal coverage of hyperspectral variance. We also emphasise that upscaling from SEM to drillcore scale inevitably involves extrapolation, meaning predictions should always be validated. However, once validated, upscaled mineralogy predictions could provide crucial quantitative data that bridge the scale gap between petrographic observations, metallurgical tests, and geometallurgical models.