A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
☆189 · Oct 26, 2024 · Updated last year
Alternatives and similar repositories for alpha_zero
Users that are interested in alpha_zero are comparing it to the libraries listed below.
- Clean, tested, & modular AlphaZero implementation with multiplayer support. ☆18 · Apr 22, 2019 · Updated 7 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆89 · Dec 11, 2024 · Updated last year
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more. ☆4,466 · Jan 1, 2025 · Updated last year
- A 9x9 Go (Weiqi/Baduk) Engine. ☆12 · Nov 5, 2021 · Updated 4 years ago
- PyTorch implementation of AlphaZero Chess from scratch. ☆184 · Aug 7, 2024 · Updated last year
- A collection of tutorials for robotoc, efficient optimal control solvers for robotic systems. ☆19 · Nov 1, 2022 · Updated 3 years ago
- Single player Alpha Zero implementation. ☆42 · Mar 7, 2022 · Updated 4 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general. ☆46 · Dec 27, 2022 · Updated 3 years ago
- ☆19 · Jan 16, 2025 · Updated last year
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game. ☆36 · Feb 18, 2020 · Updated 6 years ago
- ☆11 · Jun 22, 2023 · Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆66 · Nov 14, 2024 · Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 · Aug 14, 2022 · Updated 3 years ago
- ☆20 · Jun 14, 2022 · Updated 4 years ago
- An environment for learning formal mathematical reasoning from scratch. ☆71 · Aug 18, 2024 · Updated last year
- Trade using DRL algorithms on tensorflow2 and tf-agents. ☆11 · Oct 10, 2025 · Updated 8 months ago
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward. ☆20 · Dec 5, 2022 · Updated 3 years ago
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language… ☆46 · Oct 13, 2024 · Updated last year
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization. ☆16 · Apr 21, 2025 · Updated last year
- ☆11 · Mar 18, 2021 · Updated 5 years ago
- Implementing the supervised learning policy networks of AlphaGo. ☆12 · Jan 16, 2018 · Updated 8 years ago
- The repository is created for AR2L algorithm used for solving online 3D-BPP. ☆16 · Sep 22, 2025 · Updated 8 months ago
- Learning Formal Mathematics from Intrinsic Motivation. ☆36 · Jul 10, 2025 · Updated 11 months ago
- A simplified version of Go game in Python, with AI agents built-in and GUI to play. ☆22 · May 3, 2019 · Updated 7 years ago
- Flow Contrastive Estimation (FCE) PyTorch Implementation on 2D data. ☆11 · May 20, 2022 · Updated 4 years ago
- An implementation of MuZero in JAX. ☆58 · Nov 8, 2022 · Updated 3 years ago
- MISO: Learning Multiple Initial Solutions to Optimization Problems. ☆17 · Nov 8, 2024 · Updated last year
- ☆10 · Feb 3, 2016 · Updated 10 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings. ☆11 · Feb 28, 2023 · Updated 3 years ago
- Advanced Model Predictive Control in Python. ☆29 · Feb 2, 2026 · Updated 4 months ago
- Kino-dynamic optimization algorithm for multiped robots. ☆48 · Oct 25, 2021 · Updated 4 years ago
- List of awesome JAX resources. ☆13 · Dec 8, 2022 · Updated 3 years ago
- Examples using MetaProgramming for writing tactics etc. ☆19 · Nov 26, 2025 · Updated 6 months ago
- Problems and Results of IWLS 2023 Programming Contest. ☆17 · Apr 12, 2025 · Updated last year
- 🌳 Python implementation of single-player Monte-Carlo Tree Search. ☆67 · Jun 25, 2021 · Updated 4 years ago
- A Lisp bytecode interpreter for ZX-Spectrum. ☆16 · Jul 3, 2018 · Updated 7 years ago
- Enemies for your LLM. ☆37 · Jan 20, 2026 · Updated 4 months ago
- Model predictive control in Python based on quadratic programming. ☆51 · May 26, 2026 · Updated 3 weeks ago
- MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl. ☆16 · Jun 14, 2024 · Updated 2 years ago