lns / memoireLinks
☆18Updated 6 years ago
Alternatives and similar repositories for memoire
Users that are interested in memoire are comparing it to the libraries listed below
Sorting:
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 6 years ago
- ☆108Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- ☆33Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 5 years ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆24Updated 5 years ago
- FEN Code☆40Updated 6 years ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 3 years ago
- ☆135Updated last year
- ☆30Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 11 months ago
- ☆27Updated 6 years ago
- ☆62Updated 7 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆133Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 6 years ago
- ☆18Updated 3 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- ☆54Updated last year
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆35Updated 6 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated 9 months ago
- Learning to Incentivize Other Learning Agents☆35Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Updated 6 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- ☆88Updated last year
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Updated 7 years ago