Max-We / alphazero-tetris
An implementation of AlphaZero and MCTS with neural networks for Tetris
☆19Updated last month
Alternatives and similar repositories for alphazero-tetris:
Users that are interested in alphazero-tetris are comparing it to the libraries listed below
- Simple single-file baselines for Q-Learning in pure-GPU setting☆156Updated last month
- ☆75Updated 5 months ago
- ☆76Updated last month
- Baselines for gymnax 🤖☆66Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- Accelerated minigrid environments with JAX☆134Updated 8 months ago
- General Modules for JAX☆64Updated 2 weeks ago
- JAX implementation of RL algorithms and vectorized environments☆43Updated last year
- Efficient baselines for autocurricula in JAX.☆187Updated 8 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Simplest and Cleanest DreamerV3 implementation out there☆60Updated last month
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆308Updated 2 months ago
- An Open-Ended Agentic Simulator☆47Updated 8 months ago
- Various reinforcement learning algorithms written in Jax + Flax☆24Updated last year
- fast + parallel AlphaZero in JAX☆95Updated 4 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆89Updated 2 weeks ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- ☆41Updated 9 months ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- Cost aware hyperparameter tuning algorithm☆150Updated 9 months ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆82Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆38Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆48Updated this week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated this week
- ☆43Updated 7 months ago
- ☆51Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆45Updated 3 years ago
- Synchronized Curriculum Learning for RL Agents☆45Updated last month
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆233Updated 3 weeks ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆81Updated last month