DOI: 10.3390/en19122908 ISSN: 1996-1073

Integrated Transfer Learning and Reinforcement Learning for Reactive Current Injection During Voltage Sags

Mohana Fathollahi, Antonio Camacho Santiago, Cecilio Angulo

Modern power grids with high renewable energy penetration are vulnerable to fast voltage disturbances caused by grid faults. Among these, voltage sags are critical because they develop within milliseconds and require rapid reactive current support to maintain grid stability and power reliability. Reinforcement learning has previously shown potential for reactive current injection control during voltage sag events because of its fast response and adaptability to changing system conditions. However, existing approaches rely on separate policies for specific subsets of the operating space, which limits their ability to provide optimal actions when the system operates across broader or combined state regions. To address this limitation, this paper proposes a unified Soft Actor–Critic (SAC) target policy trained over the full state and action space by integrating multi-source transfer learning with potential-based reward shaping. Results show that the proposed multi-source transfer approach enables the target agent to converge faster and reach a higher-reward solution than both the baseline SAC and the single-source transfer approach. The trained policy also improved prediction accuracy, achieving reactive-current errors below 0.2 A with respect to a ground-truth reference generated through extensive simulations over the full observation and action space. This reference follows the grid-code requirement for minimum reactive current injection during faults and provides a benchmark for evaluating prediction accuracy. The resulting controller can help distributed generation sources respond more effectively during severe perturbations such as voltage sags, support voltage recovery, and reduce the risk of cascaded disconnections that could lead to blackouts. In addition, the inference execution time is fast enough to satisfy the response-time requirement of voltage sag events, confirming the real-time feasibility of the proposed controller.
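For readers unfamiliar with potential-based reward shaping, the following is a minimal Python sketch of the general technique, not the authors' implementation. It uses the standard shaping term F(s, s') = γΦ(s') − Φ(s). Everything specific here is an assumption made for illustration: the scalar state (a per-unit voltage), the two toy potentials standing in for knowledge transferred from source policies, and the weighted-average rule used to combine them. The abstract does not specify how the paper builds its potential.

```python
from typing import Callable, Sequence

Potential = Callable[[float], float]


def shaping_term(phi_s: float, phi_next: float, gamma: float) -> float:
    """Standard potential-based shaping term: gamma * Phi(s') - Phi(s)."""
    return gamma * phi_next - phi_s


def shaped_reward(reward: float, phi_s: float, phi_next: float, gamma: float) -> float:
    """Environment reward plus the shaping term."""
    return reward + shaping_term(phi_s, phi_next, gamma)


def combined_potential(potentials: Sequence[Potential],
                       weights: Sequence[float]) -> Potential:
    """Hypothetical multi-source potential: weighted average of source potentials.

    The averaging rule is an assumption for illustration only.
    """
    total = sum(weights)

    def phi(state: float) -> float:
        return sum(w * p(state) for w, p in zip(weights, potentials)) / total

    return phi


def discounted_shaping_sum(phi: Potential, states: Sequence[float], gamma: float) -> float:
    """Discounted sum of shaping terms along a trajectory of states."""
    return sum(
        gamma ** t * shaping_term(phi(states[t]), phi(states[t + 1]), gamma)
        for t in range(len(states) - 1)
    )


# Toy source potentials (hypothetical): both prefer voltages close to 1.0 p.u.
phi = combined_potential(
    [lambda v: -abs(1.0 - v), lambda v: -(1.0 - v) ** 2],
    [1.0, 1.0],
)

# A made-up voltage recovery trajectory in per-unit.
trajectory = [0.5, 0.7, 0.9, 1.0]
gamma = 0.99
total_shaping = discounted_shaping_sum(phi, trajectory, gamma)
```

Along any trajectory the discounted shaping terms telescope to γ^T Φ(s_T) − Φ(s_0), which depends only on the start and end states. This is why shaping of this form can speed up learning without changing which policy is optimal.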
