erwanbou / sf-deep-rl
Project on Successor Features in Deep Reinforcement Learning and Transfer Learning
☆24 Updated 7 years ago
Alternatives and similar repositories for sf-deep-rl:
Users who are interested in sf-deep-rl are comparing it to the repositories listed below.
- A Tensorflow implementation of the Option-Critic Architecture ☆72 Updated 7 years ago
- Implementation of USFAs: https://arxiv.org/pdf/1812.07626.pdf ☆9 Updated 6 years ago
- ☆31 Updated 5 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated last month
- ☆53 Updated last year
- A reusable framework for successor features for transfer in deep reinforcement learning using keras. ☆43 Updated 3 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆57 Updated 3 years ago
- Implementation of the Option-Critic Architecture ☆39 Updated 6 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆54 Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆94 Updated 2 years ago
- ☆61 Updated 6 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆22 Updated 5 years ago
- ☆83 Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space… ☆23 Updated 6 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills" ☆37 Updated 5 years ago
- ☆26 Updated 2 years ago
- Gridworld for MARL experiments ☆139 Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆40 Updated 4 years ago
- NeurIPS Reproducibility Challenge 2019 ☆20 Updated 5 years ago
- ☆83 Updated 4 years ago
- Gym environments modified with adversarial agents ☆36 Updated 8 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆45 Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 Updated 5 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1 ☆16 Updated 5 years ago
- A simple RNN meta-learner ☆10 Updated 6 years ago
- ☆31 Updated 4 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆18 Updated 2 years ago