Fault-Tolerant Attitude Control of Flexible Spacecraft via Reinforcement Learning
Zhuoyue Peng, Qiang ShenThis paper proposes an integrated attitude control framework for flexible spacecraft subject to external disturbances, rigid–flexible dynamic coupling, and actuator faults. The control framework combines the Twin Delayed Deep Deterministic Policy Gradient (TD3) reinforcement learning algorithm with an adaptive fault-tolerant (AFT) compensator. First, a rigid–flexible coupling dynamic model is formulated using Modified Rodrigues Parameters. Second, an observer-based TD3 attitude controller is designed, where a hierarchical reward function incorporating the observer-estimated flexible modal displacement η^ is constructed to train the agent for simultaneous attitude convergence and vibration suppression. Third, a composite fault-tolerant control structure is developed by integrating the trained TD3 policy with an adaptive sliding mode compensator that handles both partial loss-of-effectiveness faults and time-varying additive faults. The proposed framework is evaluated under a progressive five-scenario uncertainty evaluation framework encompassing measurement noise, parameter mismatch, external disturbances, and actuator faults. Simulation results demonstrate that (i) the η^-augmented reward enables substantial improvements in vibration suppression over the baseline reward, achieving a better balance between pointing accuracy and vibration attenuation; (ii) under the most demanding fault scenario, the AFT compensator proves essential for precise convergence, and the composite TD3+AFT architecture achieves the best overall performance among the four compared control schemes.