TroddenSpade / Meta-Reinforcement-Learning
Code snippets of Meta Reinforcement Learning algorithms
☆34Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Meta-Reinforcement-Learning
- Implementation of Decision Transformer, Conservative Q-Learning, and Behavior Cloning in Offline Reinforcement Learning setting☆24Updated 2 years ago
- Study of paper "Meta reinforcement learning for sim-to-real domain adaptation"☆16Updated 2 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆50Updated 3 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆32Updated last month
- ☆39Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆43Updated 3 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆133Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆25Updated last year
- Jax and Torch Multi-Agent SAC on PettingZoo API☆61Updated last year
- Parallel training on multiple Deep RL agents with Federated Learning approach to gain higher rewards☆30Updated 2 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆110Updated 7 months ago
- BranchingDQN☆48Updated 5 years ago
- ☆90Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆121Updated 5 months ago
- Code for Weighted QMIX☆123Updated 4 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆37Updated 2 years ago
- ☆26Updated 4 years ago
- ☆186Updated last year
- pytorch实现的一些MARL算法☆64Updated 3 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆192Updated 5 years ago
- This is the official implementation of Multi-Agent PPO.☆93Updated last year
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆139Updated last year
- QMIX implemented in TensorFlow 2☆16Updated 3 years ago
- MiniMax Multi-Agent Deep Deterministic Policy Gradient (M3DDPG) pytorch implementation☆10Updated 3 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆75Updated 3 years ago
- Implement reinforcement learning algorithms in Pytorch☆31Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- The pytorch implementation of DGN on grid world and Starcraft☆134Updated 2 years ago
- Nash Q Learning☆30Updated 3 years ago