patrik-ha / explainable-minichess
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
☆14 Updated last year
Related projects
Alternatives and complementary repositories for explainable-minichess
- Scaling scaling laws with board games. ☆43 Updated last year
- ☆20 Updated 7 months ago
- Causal Analysis of Agent Behavior for AI Safety ☆17 Updated last year
- Automatically generate simple meta-learning tasks from a very large space ☆15 Updated last year
- Interpreting how transformers simulate agents performing RL tasks ☆73 Updated last year
- Explainable Reinforcement Learning (XRL) Resources ☆33 Updated last month
- A web based platform for collecting human actions in reinforcement learning environments ☆27 Updated last year
- Code for the paper "Understanding RL Vision" ☆43 Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 6 months ago
- Repo to reproduce the First-Explore paper results ☆36 Updated 2 weeks ago
- AlphaZero in JAX ☆69 Updated 7 months ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆31 Updated 2 years ago
- Documentation for dynamic machine learning systems. ☆27 Updated 2 months ago
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper. ☆34 Updated last year
- ☆28 Updated 2 years ago
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners" ☆19 Updated last year
- ☆29 Updated 2 years ago
- Write simple games in Numpy! ☆12 Updated 2 years ago
- A PyTorch Implementation of Skipper ☆20 Updated last month
- ☆53 Updated 2 weeks ago
- ☆48 Updated last year
- ☆17 Updated 5 months ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ☆91 Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des… ☆22 Updated 4 months ago
- ☆42 Updated 2 years ago
- fast + parallel AlphaZero in JAX ☆84 Updated 7 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆26 Updated 2 years ago
- Web application where humans can play Overcooked with AI agents. ☆57 Updated last year
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- AlphaZero for continuous control tasks ☆23 Updated last year