Clean, tested, & modular AlphaZero implementation with multiplayer support.
☆18 · Apr 22, 2019 · Updated 6 years ago
Alternatives and similar repositories for simple-alpha-zero
Users who are interested in simple-alpha-zero are comparing it to the libraries listed below.
- ☆10 · May 8, 2023 · Updated 2 years ago
- The simple C/C++ library for hexapod (Robot spider with 6 legs) on Arduino. ☆13 · Dec 27, 2018 · Updated 7 years ago
- Monte Carlo tree search in Go language ☆30 · Apr 22, 2018 · Updated 7 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024) ☆12 · May 6, 2024 · Updated last year
- code for polite ☆11 · Feb 28, 2024 · Updated 2 years ago
- LLM Skirmish ☆44 · Feb 3, 2026 · Updated last month
- Hexapod Robot Control ☆10 · May 8, 2023 · Updated 2 years ago
- (Semester 4) Mathematics for Intelligent Systems - End Semester Project ☆12 · Apr 10, 2022 · Updated 3 years ago
- ☆16 · Feb 22, 2025 · Updated last year
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models" ☆29 · Feb 23, 2026 · Updated last week
- Rich declarative API extensions for Ruby Deferrables. ☆56 · Oct 27, 2011 · Updated 14 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning) ☆10 · Sep 7, 2020 · Updated 5 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024. ☆13 · Jun 17, 2024 · Updated last year
- Catches and handles exceptions in rack ☆26 · Aug 31, 2010 · Updated 15 years ago
- ☆14 · Mar 21, 2024 · Updated last year
- ☆11 · Jan 11, 2022 · Updated 4 years ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning" ☆13 · Oct 7, 2023 · Updated 2 years ago
- Discover, setup, and get stats on network interfaces ☆11 · Nov 17, 2023 · Updated 2 years ago
- A 9x9 Go (Weiqi/Baduk) Engine ☆12 · Nov 5, 2021 · Updated 4 years ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments. ☆42 · Oct 8, 2020 · Updated 5 years ago
- Code repository for our work on Quantum Pi ☆10 · Jun 4, 2024 · Updated last year
- Neural Networks for penetration testing. Part of active research. ☆13 · Jun 21, 2022 · Updated 3 years ago
- Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees' ☆12 · May 24, 2018 · Updated 7 years ago
- Full source code for the website ☆11 · Nov 26, 2011 · Updated 14 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025 ☆13 · May 28, 2025 · Updated 9 months ago
- The Yak ☆16 · May 11, 2018 · Updated 7 years ago
- ☆13 · May 3, 2024 · Updated last year
- Aggregated project of artificial intelligence and machine learning methods ☆11 · Oct 16, 2017 · Updated 8 years ago
- ReLAx - Reinforcement Learning Applications Library ☆15 · Feb 19, 2023 · Updated 3 years ago
- A "gym" style toolkit for building lightweight NAS systems. ☆13 · Jun 13, 2022 · Updated 3 years ago
- My Very Own Deep Multiple Layered Echo State Network ☆13 · Jan 2, 2021 · Updated 5 years ago
- Isaac Gym Reinforcement Learning Environments for humanoid robot Bez ☆10 · Jul 27, 2022 · Updated 3 years ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills ☆12 · Jul 15, 2023 · Updated 2 years ago
- paper on dexpilot ☆15 · Oct 14, 2019 · Updated 6 years ago
- Bitcoin blockchain to avro file ☆12 · Feb 8, 2018 · Updated 8 years ago
- Submission Under Review ☆17 · May 15, 2025 · Updated 9 months ago
- unifloc on python ☆15 · Nov 14, 2020 · Updated 5 years ago
- Robot simulator using web technologies, just JavaScript ☆10 · Feb 13, 2020 · Updated 6 years ago
- Elixir implementation of ROCK: A Robust Clustering Algorithm for Categorical Attributes ☆12 · Jul 14, 2020 · Updated 5 years ago