HumanCompatibleAI / deep-rlsp
Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.
☆26Updated 3 years ago
Related projects: ⓘ
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 3 years ago
- Gym environments for Robots that learn to interact with the environment autonomously☆34Updated last year
- Generalised UDRL☆37Updated 2 years ago
- ☆42Updated 2 years ago
- ☆53Updated 2 years ago
- Code to reproduce Neural Game Engine experiments and pre-trained models☆40Updated 2 years ago
- Official code for "Task-Embedded Control Networks for Few-Shot Imitation Learning".☆44Updated 4 years ago
- ☆27Updated 3 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆33Updated last year
- Variational Reinforcement Learning☆16Updated last month
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆39Updated last year
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 2 years ago
- Reinforcement Learning with Latent Flow☆42Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆18Updated 5 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation"☆11Updated 3 years ago
- ☆35Updated this week
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆44Updated last year
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Updated 5 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆26Updated 4 years ago
- ☆113Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆56Updated last year
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 4 years ago
- A PyTorch implementation of visual interaction networks☆12Updated 5 years ago
- Reinforcement Learning with Videos: Combining Offline Observations with Interaction☆28Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Clockwork VAEs in JAX/Flax☆31Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago