patrik-ha / explainable-minichessLinks
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
☆17Updated 2 years ago
Alternatives and similar repositories for explainable-minichess
Users that are interested in explainable-minichess are comparing it to the libraries listed below
Sorting:
- Scaling scaling laws with board games.☆50Updated 2 years ago
- Code for the paper "Understanding RL Vision"☆48Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- ☆44Updated 10 months ago
- ☆28Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆73Updated last year
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- ☆52Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- INTeractive learning via REPresentatIon Discovery☆34Updated last year
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆83Updated last year
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Web application where humans can play Overcooked with AI agents.☆59Updated 2 years ago
- fast + parallel AlphaZero in JAX☆97Updated 7 months ago
- ☆15Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 7 months ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆20Updated 2 years ago
- ☆54Updated 8 months ago
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated last year
- Fully differentiable RL environments, written in Ivy.☆65Updated last year
- General Modules for JAX☆66Updated 3 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆129Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆97Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- ☆45Updated last year
- A tool for recording RL trajectories.☆103Updated 8 months ago
- AlphaZero in JAX☆78Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago