acyclics / MPO

PyTorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for discrete Gym environments
27 · Updated 4 years ago

Alternatives and similar repositories for MPO:

Users who are interested in MPO compare it to the libraries listed below.