Efficient decorrelation of features using Gramian in Reinforcement Learning. (arXiv:1911.08610v1 [cs.LG])
Learning good representations is a long standing problem in reinforcement learning (RL). One of the conventional ways to achieve this goal in the supervised setting is through regularization of the parameters. Extending some of these ideas to the RL setting has not yielded similar improvements in learning. In this paper, we develop an online regularization…