staghuntrpg / agarLinks
This is the source code of Agar.io environment.
☆24Updated 3 years ago
Alternatives and similar repositories for agar
Users that are interested in agar are comparing it to the libraries listed below
Sorting:
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 3 years ago
- ☆89Updated 3 years ago
- ☆25Updated 3 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆75Updated 8 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆61Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆68Updated 3 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Updated 4 years ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆32Updated 4 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆46Updated last year
- Official code repository for Prompt-DT.☆116Updated 3 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆21Updated 3 years ago
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 3 years ago
- ☆54Updated last year
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆37Updated 3 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 3 years ago
- ☆36Updated 2 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆21Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…☆24Updated 8 months ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Updated 7 months ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆21Updated 3 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated last year