towzeur / gym-abalone
An environment of the board game Abalone using OpenAI's Gym API
☆25Updated last year
Alternatives and similar repositories for gym-abalone:
Users that are interested in gym-abalone are comparing it to the libraries listed below
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆43Updated 3 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆12Updated 2 years ago
- PAIRED in PyTorch 🔥☆58Updated last year
- Code and links for over 25,000 trained Atari agents☆95Updated 6 months ago
- ☆16Updated 3 years ago
- ☆28Updated 2 years ago
- Accompanying code for "Learning and Planning in Average-Reward Markov Decision Processes"☆14Updated 4 years ago
- Synchronized Curriculum Learning for RL Agents☆33Updated this week
- Multi-agent simulator in Jax for research and teaching in AI & ALife☆15Updated last week
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 6 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 9 months ago
- A tool for aggregating and plotting MARL experiment data.☆71Updated last month
- A minimal implementation of Go-Explore without domain knowledge☆13Updated 3 years ago
- Supervised and RL Models for No Press Diplomacy☆63Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆48Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated 10 months ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- ☆100Updated last year
- Baselines for gymnax 🤖☆63Updated last year
- Code for magnetic mirror descent.☆15Updated last year
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆36Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆102Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 3 months ago
- Nethack Learning Environment Wrapper for Language Interface☆36Updated last year
- Neuro-evolution for OpenAI Gym environments☆56Updated 3 years ago
- A set of competitive environments for Reinforcement Learning research.☆29Updated 2 years ago
- Revisiting Rainbow☆74Updated 3 years ago