Max-We / alphazero-tetris
An implementation of AlphaZero and MCTS with neural networks for Tetris
☆19Updated last week
Alternatives and similar repositories for alphazero-tetris:
Users that are interested in alphazero-tetris are comparing it to the libraries listed below
- Simple single-file baselines for Q-Learning in pure-GPU setting☆150Updated last week
- ☆74Updated 4 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆295Updated last month
- ☆74Updated last week
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆80Updated last month
- Accelerated minigrid environments with JAX☆132Updated 7 months ago
- Efficient baselines for autocurricula in JAX.☆186Updated 7 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆99Updated last year
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated last year
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆228Updated 2 weeks ago
- General Modules for JAX☆64Updated last month
- An Open-Ended Agentic Simulator☆45Updated 7 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆71Updated 7 months ago
- fast + parallel AlphaZero in JAX☆94Updated 3 months ago
- Scaling scaling laws with board games.☆48Updated last year
- Simplest and Cleanest DreamerV3 implementation out there☆50Updated last week
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆94Updated 6 months ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆82Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆78Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆128Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆130Updated 7 months ago
- Cost aware hyperparameter tuning algorithm☆148Updated 9 months ago
- ☆50Updated last year
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆304Updated this week
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"☆14Updated 9 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Awesome Open-ended AI☆207Updated 6 months ago