petosa/simple-alpha-zero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/petosa/simple-alpha-zero)

petosa / simple-alpha-zero

Clean, tested, & modular AlphaZero implementation with multiplayer support.

☆18

Alternatives and similar repositories for simple-alpha-zero

Users that are interested in simple-alpha-zero are comparing it to the libraries listed below

Sorting:

jackdawkins11 / pytorch-alpha-zero
View on GitHub
☆10May 8, 2023Updated 2 years ago
dmkyr20 / spider
View on GitHub
The simple C/C++ library for hexapod (Robot spider with 6 legs) on Arduino.
☆13Dec 27, 2018Updated 7 years ago
int8 / gomcts
View on GitHub
Monte carlo tree search in Go language
☆30Apr 22, 2018Updated 7 years ago
vint-1 / dreamsmooth
View on GitHub
DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)
☆12May 6, 2024Updated last year
holken / polite
View on GitHub
code for polite
☆11Feb 28, 2024Updated 2 years ago
llmskirmish / skirmish
View on GitHub
LLM Skirmish
☆44Feb 3, 2026Updated last month
RDzRyan / src
View on GitHub
Hexapod Robot Control
☆10May 8, 2023Updated 2 years ago
DeepthiSudharsan / Stock-Prediction-using-Deep-Learning
View on GitHub
(Semester 4) Mathematics for Intelligent Systems - End Semester Project
☆12Apr 10, 2022Updated 3 years ago
Improbable-AI / orso
View on GitHub
☆16Feb 22, 2025Updated last year
LARK-AI-Lab / CodeScaler
View on GitHub
The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"
☆29Feb 23, 2026Updated last week
samstokes / deferrable_gratification
View on GitHub
Rich declarative API extensions for Ruby Deferrables.
☆56Oct 27, 2011Updated 14 years ago
wassname / rl_2d_walker.js
View on GitHub
Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)
☆10Sep 7, 2020Updated 5 years ago
nirgreshler / bayesian-online-planning
View on GitHub
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13Jun 17, 2024Updated last year
hassox / rack-rescue
View on GitHub
Catches and handles exceptions in rack
☆26Aug 31, 2010Updated 15 years ago
ReidarRiveland / Instruct-RNN
View on GitHub
☆14Mar 21, 2024Updated last year
inboxedshoe / RP-DQN
View on GitHub
☆11Jan 11, 2022Updated 4 years ago
OuAzusaKou / imagination_mechanism
View on GitHub
About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"
☆13Oct 7, 2023Updated 2 years ago
nerves-project-attic / nerves_network_interface
View on GitHub
Discover, setup, and get stats on network interfaces
☆11Nov 17, 2023Updated 2 years ago
meiji163 / bokego
View on GitHub
A 9x9 Go (Weiqi/Baduk) Engine
☆12Nov 5, 2021Updated 4 years ago
PatrickKorus / mcts-general
View on GitHub
General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.
☆42Oct 8, 2020Updated 5 years ago
JacquesCarette / QuantumPi
View on GitHub
Code repository for our work on Quantum Pi
☆10Jun 4, 2024Updated last year
TheMorpheus407 / Neural-Payload
View on GitHub
Neural Networks for penetration testing. Part of active research.
☆13Jun 21, 2022Updated 3 years ago
tmoer / MCTS-T
View on GitHub
Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'
☆12May 24, 2018Updated 7 years ago
sanjeevan / codelovely
View on GitHub
Full sourcecode for the website
☆11Nov 26, 2011Updated 14 years ago
hmishfaq / LSAC
View on GitHub
The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025
☆13May 28, 2025Updated 9 months ago
bigyak / wild-yak
View on GitHub
The Yak
☆16May 11, 2018Updated 7 years ago
kaiqi123 / SQAKD
View on GitHub
☆13May 3, 2024Updated last year
rwsh / AI-Aggregate
View on GitHub
Агрегированный проект методов искусственного интеллекта и машинного обучения
☆11Oct 16, 2017Updated 8 years ago
nslyubaykin / relax
View on GitHub
ReLAx - Reinforcement Learning Applications Library
☆15Feb 19, 2023Updated 3 years ago
jack-willturner / gymnastics
View on GitHub
A "gym" style toolkit for building lightweight NAS systems.
☆13Jun 13, 2022Updated 3 years ago
Xeeshanmalik / deep_ml_esn
View on GitHub
My Very Own Deep Multiple Layered Echo State Network
☆13Jan 2, 2021Updated 5 years ago
utra-robosoccer / Bez_IsaacGym
View on GitHub
Isaac Gym Reinforcement Learning Environments for humanoid robot Bez
☆10Jul 27, 2022Updated 3 years ago
seongun-kim / vcrl
View on GitHub
[ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
☆12Jul 15, 2023Updated 2 years ago
ankurhanda / dexpilot
View on GitHub
paper on dexpilot
☆15Oct 14, 2019Updated 6 years ago
cobookman / blockchainToAvro
View on GitHub
Bitcoin blockchain to avro file
☆12Feb 8, 2018Updated 8 years ago
ucd-dare / IN-RIL
View on GitHub
Submission Under Review
☆17May 15, 2025Updated 9 months ago
unifloc / unifloc_py
View on GitHub
unifloc on python
☆15Nov 14, 2020Updated 5 years ago
JdeRobot / WebSim2D
View on GitHub
Robot simulator using web technologies, just JavaScript
☆10Feb 13, 2020Updated 6 years ago
ayrat555 / rock
View on GitHub
Elixir implementation of ROCK: A Robust Clustering Algorithm for Categorical Attributes
☆12Jul 14, 2020Updated 5 years ago