A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆46 · Dec 27, 2022 · Updated 3 years ago
Alternatives and similar repositories for fast-alphazero-general
Users who are interested in fast-alphazero-general are comparing it to the libraries listed below.
- A modified AlphaZero implementation in C++, where performance matters. ☆19 · May 22, 2026 · Updated last week
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆89 · Dec 11, 2024 · Updated last year
- Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board. ☆21 · Aug 12, 2022 · Updated 3 years ago
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more. ☆4,453 · Jan 1, 2025 · Updated last year
- ☆13 · May 21, 2026 · Updated last week
- This is the PyTorch implementation of the UAI 2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Oct 9, 2023 · Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆43 · Sep 19, 2022 · Updated 3 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search. ☆117 · Apr 27, 2023 · Updated 3 years ago
- Hex board game with MCTS implementation. ☆11 · Aug 15, 2023 · Updated 2 years ago
- mcts-simple is a Python3 library that implements Monte Carlo Tree Search and its variants to solve a host of problems, most commonly for … ☆32 · Aug 8, 2025 · Updated 9 months ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks". ☆15 · Jul 28, 2021 · Updated 4 years ago
- A curated list of reinforcement learning in NLP. :-) ☆21 · Oct 30, 2021 · Updated 4 years ago
- A Gobang (also known as "Five in a Row" and "Gomoku") game equipped with an AlphaGo-like AI. ☆14 · May 1, 2020 · Updated 6 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice. ☆12 · Aug 30, 2020 · Updated 5 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework. ☆129 · May 9, 2026 · Updated 2 weeks ago
- An implementation of the AlphaGo Zero and AlphaZero algorithms for playing Othello. ☆22 · Jul 21, 2021 · Updated 4 years ago
- Code for the paper "Alpha Zero in Continuous Action Space" (A0C) (https://arxiv.org/pdf/1805.09613.pdf). ☆15 · Jan 19, 2021 · Updated 5 years ago
- Various dotfiles I use on my machines. ☆20 · Feb 9, 2026 · Updated 3 months ago
- Scalable Training of Propositional Logical Neural Networks. ☆15 · Feb 4, 2022 · Updated 4 years ago
- Pallet loading problem solver with recursive partitioning approach for the packing of different rectangles in a rectangle. ☆13 · Oct 1, 2012 · Updated 13 years ago
- Distributed DRL by Ray and TensorFlow Tutorial. ☆10 · Dec 26, 2019 · Updated 6 years ago
- Can't time series forecasting be done with attention? ☆10 · Apr 30, 2021 · Updated 5 years ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games. ☆187 · Oct 26, 2024 · Updated last year
- A C++ implementation of the scalar-valued autograd engine micrograd. ☆23 · May 16, 2020 · Updated 6 years ago
- Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion. ☆25 · Mar 16, 2023 · Updated 3 years ago
- A platform for Applied Reinforcement Learning (Applied RL). ☆14 · Jan 19, 2019 · Updated 7 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al. ☆29 · Feb 25, 2021 · Updated 5 years ago
- Android SDK for the Rover platform. ☆11 · Apr 29, 2026 · Updated last month
- Using Hotel Data to predict High Value And Potential VIP Guests. ☆11 · Dec 27, 2021 · Updated 4 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari". ☆18 · Sep 27, 2019 · Updated 6 years ago
- RV32I by cats. ☆15 · Sep 4, 2023 · Updated 2 years ago
- ☆17 · Apr 30, 2023 · Updated 3 years ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments. ☆42 · Oct 8, 2020 · Updated 5 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other. ☆19 · Oct 8, 2016 · Updated 9 years ago
- MongoDB, ExpressJS, VueJS, NodeJS (MEVN) Boilerplate. ☆15 · May 28, 2017 · Updated 9 years ago
- Summarization with Pointer-Generator Networks. ☆15 · Sep 1, 2020 · Updated 5 years ago
- Project sources of homemade Arduino Glockenspiel. ☆10 · Feb 14, 2019 · Updated 7 years ago
- ☆12 · Oct 18, 2020 · Updated 5 years ago
- ☆27 · May 30, 2024 · Updated last year