wisnunugroho21 / reinforcement_learning_v_mpo

Deep reinforcement learning using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
16 · Updated 3 years ago

Related projects

Alternatives and complementary repositories for reinforcement_learning_v_mpo