Task-Relevant Representation Decoupling for Visual Reinforcement Learning Generalization
Jinwen Wang, Youfang Lin, Xiaobo Hu, Qian Xu, Shuo Wang, Zhuo Chen, Kai Lv
Visual Reinforcement Learning (VRL) has achieved considerable success in solving control tasks. However, generalizing learned policies to new environments remains a major challenge, as agents often overfit to task-irrelevant features in the training environment. To solve this problem, we introduce the concept of decoupling observations into task-relevant and task-irrelevant representations. Building on this idea, we propose a self-supervised
T
ask-
R
elevant
R
epresentation
D
ecoupling (T2RD) algorithm for VRL. This algorithm consists of three components: