chanb / metalearning_RL
☆18 Updated last year
Related projects:
- ☆44 Updated 3 years ago
- A simple RNN meta-learner ☆10 Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆34 Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Meta RL codebase for Unstable Baselines ☆20 Updated last year
- ☆19 Updated 3 years ago
- Assignments for CS294-112. ☆30 Updated 5 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ… ☆26 Updated last year
- ☆40 Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆51 Updated last year
- Value-Decomposition Multi-Agent Actor-Critics ☆39 Updated last year
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆54 Updated 2 years ago
- ☆51 Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆87 Updated 3 years ago
- ☆13 Updated 4 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments ☆12 Updated last year
- Code for a model-based version of Constrained Policy Optimization ☆10 Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021 ☆61 Updated 3 years ago
- ☆39 Updated 2 years ago
- PyTorch IMPALA implementation ☆24 Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆91 Updated 2 years ago
- ☆28 Updated 2 years ago
- ☆28 Updated 3 years ago
- Source code for paper: Efficient deep reinforcement learning via adaptive policy transfer ☆14 Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆57 Updated 2 years ago
- ☆53 Updated 6 months ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning" ☆13 Updated 3 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆29 Updated last year
- ☆30 Updated 2 years ago
- Code for FOCAL Paper Published at ICLR 2021 ☆49 Updated 9 months ago