patrik-ha / explainable-minichessLinks
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
☆17Updated last year
Alternatives and similar repositories for explainable-minichess
Users that are interested in explainable-minichess are comparing it to the libraries listed below
Sorting:
- ☆23Updated last year
- Scaling scaling laws with board games.☆49Updated last year
- An environment for learning formal mathematical reasoning from scratch☆70Updated 10 months ago
- ☆56Updated last year
- Documentation for dynamic machine learning systems.☆29Updated 9 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- A (fairly modular and easily expandable) novelty search implementation for gym-based environments☆12Updated 4 years ago
- Play chess against large language models.☆47Updated last year
- Adaptive Subgoal Search☆19Updated 2 years ago
- ☆51Updated 2 years ago
- Levin tree search guided by both a policy and a heuristic function☆19Updated last year
- ☆31Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 2 years ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆58Updated 4 months ago
- ☆10Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆18Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆28Updated 3 months ago
- AlphaZero in JAX☆77Updated last year
- Repository of machine learning benchmarks☆36Updated 3 weeks ago
- ☆37Updated 9 months ago
- Interpreting how transformers simulate agents performing RL tasks☆85Updated last year
- Gym wrapper for pysc2☆10Updated 2 years ago
- fast + parallel AlphaZero in JAX☆97Updated 6 months ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆32Updated 3 years ago
- PAIRED in PyTorch 🔥☆60Updated 2 years ago