tiagoCuervo / CommonsGameLinks
An OpenAI gym multi-agent environment implementing the Commons Game proposed in "A multi-agent reinforcement learning model of common-pool resource appropriation"
☆22Updated 5 years ago
Alternatives and similar repositories for CommonsGame
Users that are interested in CommonsGame are comparing it to the libraries listed below
Sorting:
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆194Updated 3 years ago
- Gridworld for MARL experiments☆144Updated 5 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆202Updated last year
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆120Updated last year
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆168Updated 3 years ago
- ☆78Updated last year
- Value-Decomposition Multi-Agent Actor-Critics☆42Updated 3 years ago
- Repo for reproduction of sequential social dilemmas☆412Updated 11 months ago
- Lightweight multi-agent gridworld Gym environment☆213Updated 2 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆121Updated last year
- Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.☆215Updated 2 years ago
- Mean Field Multi-Agent Reinforcement Learning☆415Updated 5 years ago
- Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…☆249Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Updated 5 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.