henrycharlesworth / multi_action_head_PPO
PPO with multi-head/autoregressive action outputs
☆39 · Updated 3 years ago
Alternatives and similar repositories for multi_action_head_PPO:
Users who are interested in multi_action_head_PPO are comparing it to the libraries listed below.
- Collection of OpenAI parametrized action-space environments. ☆62 · Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆134 · Updated 6 months ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆116 · Updated 3 months ago
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021. ☆14 · Updated last year
- ☆47 · Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆57 · Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆44 · Updated 2 years ago
- Revisiting Discrete Gradient Estimation in MADDPG ☆24 · Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆49 · Updated 3 years ago
- Implementation for mSAC methods in PyTorch ☆40 · Updated 3 years ago
- There will be updates later ☆84 · Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 · Updated 4 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆46 · Updated 6 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆149 · Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 · Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 · Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆99 · Updated 3 years ago
- Value-Decomposition Multi-Agent Actor-Critics ☆40 · Updated 2 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆42 · Updated 5 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆99 · Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆98 · Updated 5 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆122 · Updated 6 months ago
- ☆42 · Updated 3 years ago
- Distributional Soft Actor Critic ☆51 · Updated 4 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details ☆45 · Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆56 · Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆64 · Updated 5 months ago
- Code for Weighted QMIX ☆129 · Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆102 · Updated 4 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆53 · Updated 3 years ago