A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆87Dec 11, 2024Updated last year
Alternatives and similar repositories for alphazero-general
Users that are interested in alphazero-general are comparing it to the libraries listed below
Sorting:
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆46Dec 27, 2022Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available☆62Oct 3, 2025Updated 5 months ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 4 years ago
- ☆16Feb 28, 2025Updated last year
- General Board Game Playing☆25Jun 16, 2025Updated 8 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆925Dec 20, 2023Updated 2 years ago
- Repo for "AlphaResearch: Accelerating New Algorithm Discovery with Language Models"☆54Nov 12, 2025Updated 3 months ago
- MuZero☆2,777Sep 3, 2024Updated last year
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆19Nov 19, 2024Updated last year
- ☆14Sep 1, 2021Updated 4 years ago
- Clean, tested, & modular AlphaZero implementation with multiplayer support.☆18Apr 22, 2019Updated 6 years ago
- ☆17Dec 4, 2019Updated 6 years ago
- Simple working implementation for google-deepmind FunSearch algorithm☆23Jan 25, 2024Updated 2 years ago
- This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"☆29May 6, 2025Updated 10 months ago
- We study toy models of skill learning.☆32Feb 3, 2026Updated last month
- qmix☆23May 28, 2020Updated 5 years ago
- Selfplay In MultiPlayer Environments☆329Jun 12, 2024Updated last year
- Contains the codebase for Quantum Natural Language Generation project☆24Nov 2, 2022Updated 3 years ago
- Pytorch Implementation of MuZero☆352Jul 23, 2023Updated 2 years ago
- Python syntax generator based on Object-Oriented Programing, type hints, and simplicity☆10Sep 26, 2021Updated 4 years ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆70Jan 12, 2026Updated last month
- ☆37Apr 27, 2023Updated 2 years ago
- Code for "Exploring Dynamic Selection of Branch Expansion Orders for Code Generation" (ACL 2021)☆31Apr 11, 2022Updated 3 years ago
- A structured implementation of MuZero☆206Jun 4, 2022Updated 3 years ago
- ☆12Mar 13, 2025Updated 11 months ago
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 2 years ago
- ☆10Apr 5, 2024Updated last year
- Deeply supervised density regression for automatic cell counting in microscopy images☆12Jan 31, 2022Updated 4 years ago
- This is a simple example of how to run the android ADK feature on a basic Arduino Uno with USB Host Shield.☆14May 24, 2011Updated 14 years ago
- TIme series DiscoverY BENCHmark (tidybench)☆38Feb 21, 2024Updated 2 years ago
- Automatically places bets on profitable each way horse races, can also lay arbitrage bets using Betfair☆11Dec 8, 2022Updated 3 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- LLM Skirmish☆44Feb 3, 2026Updated last month
- ☆14Feb 2, 2025Updated last year
- Official code for "Blind Image Deblurring Based on Dual Attention Network and 2D Blur Kernel Estimation" (ICIP 2021)☆13Nov 11, 2025Updated 3 months ago