chanb / metalearning_RLLinks
☆20Updated 2 years ago
Alternatives and similar repositories for metalearning_RL
Users that are interested in metalearning_RL are comparing it to the libraries listed below
Sorting:
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- ☆27Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- ☆49Updated 4 years ago
- ☆43Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- A simple RNN meta-learner☆10Updated 6 years ago
- ☆20Updated 4 years ago
- There will be updates later☆85Updated 6 years ago
- ☆40Updated 3 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆50Updated last year
- PyTorch implementation of discrete version of Soft Actor-Critic.☆36Updated 4 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆69Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆25Updated 10 months ago
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆56Updated 2 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Updated 5 years ago
- Assignments for CS294-112.☆30Updated 6 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 10 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆97Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆82Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 5 months ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆100Updated 6 years ago
- ☆32Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆58Updated 3 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆25Updated 2 years ago