Comparison of data handling techniques for modeling bat acoustic activity
Zackary W. Isenhour, Elizabeth A. Hunter, Jesse L. De La Cruz, W. Mark FordAbstract
With the proliferation of acoustic sampling to investigate bat distribution and ecology, researchers have implemented a myriad of statistical modeling approaches to interpret findings. Bats are taxa of high conservation concern; therefore, ensuring the accuracy of species‐level habitat association models is critical for informing management. We sought to determine prediction differences among statistical approaches to modeling counts of acoustic detections, using generalized linear mixed models with 8 acoustic data‐handling techniques. We applied each approach or combination of approaches to a rare species, the northern long‐eared bat ( Myotis septentrionalis ), and a common species, the eastern red bat ( Lasiurus borealis ), from summer survey results on a landscape in south‐central Pennsylvania, USA, 2024. We evaluated the accuracy of habitat association models of bat acoustic activity at the species level using cross‐validation and compared resulting predictions of models using spatial correlations. We determined that filtering data by automated identification software (Kaleidoscope Pro), the maximum likelihood estimate (MLE) P ‐value thresholds reduced relative mean absolute error (rMAE) in cross‐validation of northern long‐eared bat models. Using the MLE‐retained data produced the most accurate predictions over using raw data or the overly conservative match ratio data. However, for the eastern red bat, the results from the most conservative approach of only retaining data with at least a 90% match ratio from software development training sets had the lowest rMAE. We have provided evidence that the current standard of filtering data by nightly MLE can result in more accurate and informative habitat‐use acoustic activity models for rare bat species.