hmomin / TD3-Bipedal-Walker

Trains an agent with Twin Delayed Deep Deterministic Policy Gradient (TD3) to solve the Bipedal Walker challenge from OpenAI
12Updated last year

Related projects

Alternatives and complementary repositories for TD3-Bipedal-Walker