jondeaton / AgarLE
Agar.io OpenAI Gym Learning Environment
☆11 · Updated last year
Related projects:
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆61 · Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework ☆63 · Updated last month
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling ☆95 · Updated 10 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆189 · Updated 2 weeks ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆153 · Updated 3 years ago
- fast + parallel AlphaZero in JAX ☆80 · Updated 5 months ago
- Gridworld domains in the gym interface ☆24 · Updated 10 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆87 · Updated last month
- Classic MCTS example with mctx ☆15 · Updated last year
- ♟️ Vectorized RL game environments in JAX ☆391 · Updated this week
- ☆282 · Updated last year
- Benchmarking the Spectrum of Agent Capabilities ☆373 · Updated 7 months ago
- AlphaZero in JAX ☆68 · Updated 5 months ago
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆46 · Updated 5 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game ☆37 · Updated last year
- ☆192 · Updated 7 months ago
- An environment of the board game Go using OpenAI's Gym API ☆164 · Updated 2 years ago
- A simple implementation of the MuZero algorithm for the Connect4 game ☆93 · Updated 4 years ago
- PyTorch implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆50 · Updated 10 months ago
- ☆46 · Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆29 · Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆102 · Updated 3 weeks ago
- impact-driven-exploration ☆125 · Updated 11 months ago
- An API conversion tool for popular external reinforcement learning environments ☆131 · Updated 3 months ago
- An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row ☆19 · Updated last year
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS) ☆227 · Updated 2 months ago
- ☆59 · Updated last month
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆82 · Updated last year
- PyTorch implementation of MuZero ☆335 · Updated last year
- The NetHack Learning Environment ☆42 · Updated this week