RSSI Data Augmentation Algorithm Based on Polynomial Regression and Stochastic Signal Fade Modeling
Mateusz Sumorek, Adam Idźkowski, Krzysztof KonopkoThis article presents a simple, original data augmentation algorithm for Received Signal Strength Indicator (RSSI), dedicated to indoor localization systems. The aim of the research was to develop a synthetic data generation method to serve as a regularization technique, making models more robust against measurement noise. The proposed approach combines propagation modeling using polynomial regression with the individual statistical characteristics of each Access Point (AP), accounting for signal fluctuations and a probabilistic signal outage mechanism. The effectiveness of the proposed solution was experimentally verified by evaluating K-NN and MLP neural network models in both classification and regression variants. The study was conducted on datasets with different measurement grid granularities, demonstrating the algorithm’s ability to improve the generalization properties of estimators, even with a limited number of samples in the training set. The results showed that the use of augmentation reduced the Mean Absolute Error (MAE) by an average of approximately 20% for the dense training set and about 17% for the sparse set. Within the evaluated test environment, models trained on the augmented sparse measurement grid, which contained 67% fewer physical calibration points (30 points compared to the dense grid’s 92), reached a precision comparable to models trained on the dense real-world dataset. Analysis of histograms and Cumulative Distribution Functions (CDF) of the error confirmed the preservation of the signal’s statistical integrity and the effective mitigation of gross errors. The proposed solution constitutes an efficient and easy-to-implement alternative to complex generative models (e.g., GANs). These findings serve as a successful proof-of-concept and pilot study, laying the foundation for further development and validation in larger, more complex spatial environments.