RobvanGastel / meta-rl-algorithms
A collection of Meta-Reinforcement Learning algorithms in PyTorch
☆51 · Updated last year
Alternatives and similar repositories for meta-rl-algorithms
Users interested in meta-rl-algorithms are comparing it to the repositories listed below.
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆72 · Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆41 · Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆111 · Updated 3 years ago
- ☆20 · Updated 2 years ago
- ☆27 · Updated 5 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆171 · Updated last year
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆36 · Updated 4 years ago
- Collection of OpenAI parametrized action-space environments. ☆69 · Updated 10 months ago
- Value-Decomposition Multi-Agent Actor-Critics ☆42 · Updated 3 years ago
- A collection of offline reinforcement learning algorithms. ☆208 · Updated last year
- ☆48 · Updated 3 years ago
- Code snippets of Meta Reinforcement Learning algorithms ☆39 · Updated 2 years ago
- ☆40 · Updated 4 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR) ☆27 · Updated last year
- There will be updates later ☆88 · Updated 6 years ago
- ☆49 · Updated 4 years ago
- Code for Weighted QMIX ☆145 · Updated 5 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆167 · Updated 2 years ago
- ☆42 · Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆86 · Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆219 · Updated last year
- PyTorch implementation of Constrained Policy Optimization ☆56 · Updated 4 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021 ☆67 · Updated 4 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆55 · Updated last year
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning ☆104 · Updated 2 years ago
- Solving POMDP using Recurrent networks ☆92 · Updated 5 years ago
- Single-file pytorch implementation of hybrid-SAC ☆63 · Updated 4 years ago
- Prioritized Experience Replay implementation with proportional prioritization ☆86 · Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆112 · Updated 4 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments ☆12 · Updated 3 years ago