LantaoYu / 1M-agents-RL
A preliminary platform for up to 1 million reinforcement learning agents
☆11Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for 1M-agents-RL
- A Multi-agent Learning Framework☆62Updated 3 years ago
- FEN Code☆37Updated 5 years ago
- reproduce some RL or Multi-Agent models☆35Updated 5 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 2 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆18Updated 5 years ago
- ☆26Updated 4 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆66Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated 3 years ago
- ☆71Updated 5 months ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆144Updated last year
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- ☆11Updated 5 years ago
- ☆29Updated 2 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆44Updated last year
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- Trust Region Policy Optimization (TRPO) in pure TensorFlow☆18Updated 6 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Updated 8 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- ☆97Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- Environments with IC3Net paper☆12Updated 5 years ago
- Neurosymbolic transformers for multi-agent communication.☆20Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆97Updated 2 years ago
- ☆118Updated 3 months ago