A Sensor-Based TinyML Acoustic Monitoring System for Edge-Side Animal Sound Recognition on Resource-Constrained Microcontrollers
Zhiqing Wang, Guicai YuEdge-side acoustic monitoring enables animal sound recognition in remote environments, but microcontroller deployment remains constrained by feature extraction, numerical consistency, memory, latency, and energy consumption. This study presents a sensor-based tiny machine learning (TinyML) acoustic monitoring system on an Arduino Nano 33 BLE Sense Rev2 platform, integrating onboard pulse-density modulation (PDM) microphone acquisition, Mel-frequency cepstral coefficient (MFCC) feature extraction, deployment-side standardization, 8-bit integer (INT8) neural-network inference, and edge-side decision output. To reduce training-to-deployment feature drift, consistent frame parameters, mirrored C++ feature operators, and exported standardization parameters are used to align personal-computer-side and microcontroller-side feature representations. A source-isolated seven-class protocol was constructed for six target animal classes and one compound background-noise class. In the single-run baseline comparison, the proposed multilayer perceptron achieved 98.28% test accuracy and 97.21% test macro-F1, while the ten-seed stability analysis yielded 98.64% ± 0.26% test accuracy and 97.87% ± 0.38% test macro-F1. The deployed INT8 model occupied approximately 26.9 KB, with a post-window latency of about 303 ms. System-level input power was 0.783–0.825 W, corresponding to an estimated autonomy of 7.63–8.03 h under the reference battery setting.