NeuralMMO / baselinesLinks
Baselines for Neural MMO -- new users should treat this repo as a starter project
☆51Updated last year
Alternatives and similar repositories for baselines
Users that are interested in baselines are comparing it to the libraries listed below
Sorting:
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- Learning to Incentivize Other Learning Agents☆35Updated 3 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆64Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆61Updated 8 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆105Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆75Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆48Updated 2 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆56Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆166Updated 2 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆157Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆121Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- ☆246Updated last year
- Object Centric Atari games☆96Updated 3 weeks ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 5 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- ☆133Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆76Updated 9 months ago
- Official code repository for Prompt-DT.☆119Updated 3 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 4 years ago
- Synthetic Experience Replay☆107Updated last year
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆34Updated last year
- ☆116Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆46Updated 3 years ago