Max-We / alphazero-tetris
An implementation of AlphaZero and MCTS with neural networks for Tetris
☆19Updated last month
Alternatives and similar repositories for alphazero-tetris
Users who are interested in alphazero-tetris are comparing it to the repositories listed below
- ☆10Updated this week
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆312Updated 2 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆161Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- ☆77Updated last month
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month
- Baselines for gymnax 🤖☆66Updated 2 years ago
- ☆79Updated 6 months ago
- Efficient baselines for autocurricula in JAX.☆187Updated 8 months ago
- Accelerated minigrid environments with JAX☆135Updated this week
- General Modules for JAX☆65Updated last month
- An Open-Ended Agentic Simulator☆49Updated 9 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆97Updated 6 months ago
- Synchronized Curriculum Learning for RL Agents☆45Updated last month
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆51Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 3 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- fast + parallel AlphaZero in JAX☆96Updated 4 months ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆236Updated last month
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆55Updated 2 years ago
- Simplest and Cleanest DreamerV3 implementation out there☆63Updated last month
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated 3 weeks ago
- Reproduction of DreamerV1 and V2 in PyTorch for the DeepMind Control Suite☆39Updated 2 years ago
- ☆41Updated 10 months ago
- JAX implementation of RL algorithms and vectorized environments☆43Updated last year
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆50Updated last week
- A fast and robust algorithm for temporal difference learning☆15Updated 2 months ago
- PyTorch implementation of Stochastic MuZero for gym environments. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year