Feryal / automated-curriculum-rl
☆32Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for automated-curriculum-rl
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆24Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- krazy grid world☆25Updated 4 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated last year
- Generalised UDRL☆37Updated 2 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- A repository for code of reinforcement learning algorithms with PyTorch☆29Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆52Updated 5 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 5 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago
- ☆81Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆37Updated last year
- Revisiting Rainbow☆73Updated 3 years ago