TroddenSpade / Meta-Reinforcement-LearningLinks
Code snippets of Meta Reinforcement Learning algorithms
☆38Updated last year
Alternatives and similar repositories for Meta-Reinforcement-Learning
Users that are interested in Meta-Reinforcement-Learning are comparing it to the libraries listed below
Sorting:
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆98Updated 2 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆60Updated 4 years ago
- BranchingDQN☆50Updated 6 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆42Updated 9 months ago
- qmix☆23Updated 5 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 3 years ago
- Code for Weighted QMIX☆138Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆157Updated last year
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆48Updated last year
- ☆40Updated 3 years ago
- ☆42Updated 3 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆32Updated last year
- ☆89Updated 3 years ago
- ☆101Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆23Updated 8 months ago
- PyTorch implementation of MATD3☆13Updated 5 years ago
- The pytorch implementation of DGN on grid world and Starcraft☆144Updated 3 years ago
- Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble…☆49Updated last year
- The code for maddpg using pytorch☆170Updated 4 years ago
- ☆27Updated 4 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆91Updated 4 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆219Updated 6 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆181Updated last year
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆72Updated 3 years ago
- ☆62Updated last year
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆159Updated 2 years ago
- pytorch实现的一些MARL算法☆67Updated 4 years ago
- Nash Q Learning☆31Updated 4 years ago
- ☆214Updated 2 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆53Updated 3 years ago