ronsailer / A2OC_A2C

PyTorch implementation of Advantage Actor-Critic (A2C), Asynchronous Advantage Option-Critic (A2OC), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
8Updated 6 years ago

Related projects

Alternatives and complementary repositories for A2OC_A2C