openai / gym-soccer
☆305Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gym-soccer
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆360Updated 4 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆416Updated 11 months ago
- Implementation of TRPO and related algorithms☆620Updated 6 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆275Updated 6 years ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆669Updated last year
- Half Field Offense in Robocup 2D Soccer☆228Updated 2 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆690Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆479Updated last year
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆654Updated 6 months ago
- Constrained Policy Optimization☆307Updated 7 years ago
- Tools for accelerating safe exploration research.☆506Updated last year
- Actor-critic with experience replay☆252Updated 2 years ago
- Mean Field Multi-Agent Reinforcement Learning☆377Updated 4 years ago
- Multi Agent Reinforcement Learning using MalmÖ☆246Updated 4 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆307Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆295Updated 2 years ago
- Proximal Policy Optimization implementation with TensorFlow☆104Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated last month
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆131Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆297Updated 5 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆354Updated last year
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- ☆337Updated 6 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆396Updated last year
- Prioritized Experience Replay (PER) implementation in PyTorch☆305Updated 4 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆655Updated 4 years ago
- implement of prioritized experience replay☆156Updated 6 years ago
- A3C LSTM Atari with Pytorch plus A3G design☆562Updated last year