ltlhuuu / A2PR

Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method, in PyTorch
23 · Updated 5 months ago

Related projects

Alternatives and complementary repositories for A2PR