openai / gym-soccer
☆303 Updated 2 years ago
Alternatives and similar repositories for gym-soccer:
Users who are interested in gym-soccer are comparing it to the libraries listed below:
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆361 Updated 4 years ago
- Half Field Offense in Robocup 2D Soccer ☆232 Updated 2 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆422 Updated last year
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir… ☆274 Updated 7 years ago
- Actor-critic with experience replay ☆252 Updated 2 years ago
- Code for the paper "Generative Adversarial Imitation Learning" ☆710 Updated 6 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning. ☆656 Updated 5 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆180 Updated 6 years ago
- Implementation of TRPO and related algorithms ☆629 Updated 6 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code ☆684 Updated 11 months ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms. ☆304 Updated 2 years ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL). ☆693 Updated 2 years ago
- Lua/Torch implementation of DQN (Nature, 2015) ☆600 Updated 8 years ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow. Deep MaxEnt, MaxEnt, LPIRL ☆627 Updated 11 months ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆489 Updated 2 years ago
- A continuous action space version of A3C LSTM in PyTorch plus A3G design ☆259 Updated 6 months ago
- A basic 2D maze environment where an agent starts from the top left corner and tries to find its way to the bottom left corner. ☆366 Updated last year
- Accompanying repository for Let's make a DQN / A3C series. ☆394 Updated 6 years ago
- This repo replicates the results Horgan et al. obtained in "Distributed Prioritized Experience Replay" ☆189 Updated 6 years ago
- TensorFlow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir… ☆125 Updated 5 years ago
- Implementation of algorithms for continuous control (DDPG and NAF). ☆309 Updated 4 years ago
- An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le… ☆132 Updated 7 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016) ☆214 Updated 7 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆318 Updated 3 years ago
- Constrained Policy Optimization ☆320 Updated 7 years ago
- ICML 2018 Self-Imitation Learning ☆277 Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆107 Updated 6 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆259 Updated 4 years ago
- Implementation of an Inverse Reinforcement Learning algorithm on a toy car in a 2D world problem (Apprenticeship Learning via Inverse Reinf… ☆177 Updated 3 years ago
- A customizable framework to create maze and gridworld environments ☆265 Updated 6 years ago