patrik-ha / explainable-minichessLinks
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
☆17Updated 2 years ago
Alternatives and similar repositories for explainable-minichess
Users that are interested in explainable-minichess are comparing it to the libraries listed below
Sorting:
- ☆23Updated last year
- Scaling scaling laws with board games.☆53Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety☆18Updated 2 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- INTeractive learning via REPresentatIon Discovery☆34Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated 2 years ago
- ☆37Updated 2 years ago
- Documentation for dynamic machine learning systems.☆29Updated 11 months ago
- Explore and Control with Adversarial Surprise☆10Updated 4 years ago
- An environment for learning formal mathematical reasoning from scratch☆72Updated last year
- ☆31Updated 3 years ago
- Code for the paper "Understanding RL Vision"☆48Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated last year
- ☆52Updated 2 years ago
- Generalized AI to perform a multitude of tasks written in python3☆21Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 8 months ago
- ☆89Updated 7 months ago
- AlphaZero in JAX☆78Updated last year
- ☆62Updated 9 months ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- ☆44Updated 11 months ago
- Logic Reinforcement Learning☆17Updated last year
- Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space…☆71Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 3 years ago
- ☆57Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- Web application where humans can play Overcooked with AI agents.☆59Updated 2 years ago
- ☆55Updated 9 months ago
- Submissions for AI and Efficiency SOTA's☆56Updated 5 years ago