AdrienLE / intuitive_policy_gradient
☆20Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for intuitive_policy_gradient
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆33Updated 8 years ago
- ☆32Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆41Updated 5 years ago
- Bandits Environments for the OpenAI Gym☆89Updated 4 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 4 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 4 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆49Updated 6 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆31Updated 6 years ago
- Codebase for Efficient yet simple Reinforcement Learning Research Framework☆28Updated last year
- Code for human intervention reinforcement learning☆33Updated 6 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Reinforcement Learning and Deep Learning Resources☆16Updated 6 years ago
- Differentiable Neural Computer in TensorFlow☆27Updated 7 years ago
- Variational Recurrent Auto-Encoder using LSTM encoder/decoder networks☆54Updated 8 years ago
- A simple Gridworld environment for Open AI gym☆24Updated 6 years ago
- ☆38Updated 2 months ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- Reinforcement learning algorithms in RLlib☆56Updated 6 months ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- ☆14Updated 5 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 4 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆115Updated last year
- ☆44Updated 5 years ago
- [2019] (Neurips workshop paper) Blending behavioral cloning and RL☆9Updated last year