spring01 / drlbox
Interfacing RL agents with user-definable neural networks and OpenAI-gym environments.
☆12Updated 5 years ago
Related projects:
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 4 years ago
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend☆88Updated 4 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- Proximal Policy Optimization implementation with TensorFlow☆101Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- Implementation of prioritized experience replay☆156Updated 6 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆129Updated last year
- Tensorflow implementation of Generative Adversarial Imitation Learning (GAIL) with discrete actions☆111Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆60Updated 3 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- ☆91Updated 3 years ago
- General purpose environment wrappers for openai gym☆23Updated 5 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environments☆101Updated 5 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆47Updated 6 years ago
- C51-DDQN in Keras☆125Updated 6 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆26Updated 4 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients☆19Updated 7 years ago
- Basic reinforcement learning implementation with tensorflow version 2.0☆52Updated 4 years ago
- TensorFlow 2.0 for Deep Reinforcement Learning.☆82Updated last year
- An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le…☆128Updated 6 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆128Updated 5 years ago
- ☆69Updated 5 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆82Updated 3 years ago
- An implementation of the A3C deep reinforcement learning method using an LSTM layer. Created with Tensorflow.☆29Updated 6 years ago