jidiai / Competition_3v3snakesLinks
☆38Updated 2 years ago
Alternatives and similar repositories for Competition_3v3snakes
Users that are interested in Competition_3v3snakes are comparing it to the libraries listed below
Sorting:
- ☆17Updated 3 years ago
- ☆124Updated 3 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆4Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- ☆165Updated last year
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 5 years ago
- Multi-Agent Determinantal Q-Learning☆43Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆41Updated 3 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- RLlib超参数详解(中文)☆18Updated 3 years ago
- ☆25Updated 3 years ago
- Assignments for CS294-112.☆30Updated 5 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆26Updated 5 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- ☆32Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆86Updated 2 years ago
- ☆39Updated 2 years ago
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning☆15Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆24Updated last year
- ☆44Updated 4 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆56Updated 3 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆29Updated 2 years ago
- ☆17Updated 5 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 6 months ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year