A structured implementation of MuZero
☆206Jun 4, 2022Updated 3 years ago
Alternatives and similar repositories for MuZero
Users that are interested in MuZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- ☆66Nov 3, 2021Updated 4 years ago
- Pytorch Implementation of MuZero☆353Jul 23, 2023Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MuZero☆2,799Sep 3, 2024Updated last year
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆19Jun 30, 2021Updated 4 years ago
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆928Dec 20, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- An implementation of MuZero in JAX.☆58Nov 8, 2022Updated 3 years ago
- An environment of the board game Go using OpenAI's Gym API☆176May 3, 2022Updated 3 years ago
- GPU Monte Carlo Tree Search with MPI☆26Jan 9, 2019Updated 7 years ago
- Udacity Deep Reinforcement Learning Nanodegree Program☆11Jul 12, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more☆4,414Jan 1, 2025Updated last year
- A C++ pytorch implementation of MuZero☆40May 1, 2024Updated last year
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- This code illustrates the use of genetic programming to evolve financial trading strategies for a single equity stock. Individuals (strat…☆25Feb 24, 2019Updated 7 years ago
- A library of reinforcement learning components and agents☆3,968Apr 8, 2026Updated last week
- ☆18Aug 24, 2024Updated last year
- ☆13Jan 16, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- Revisiting Rainbow☆76Jun 9, 2021Updated 4 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆87Mar 14, 2025Updated last year
- Library of effective Go routines☆48Oct 11, 2010Updated 15 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 2 years ago
- OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.☆5,138Mar 26, 2026Updated 3 weeks ago
- Simple code for running and visualizing replicator dynamics☆11Jan 31, 2024Updated 2 years ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,021Mar 13, 2019Updated 7 years ago
- Double Q-learning reinforcement learning agent on NES Super Mario Bros☆42May 4, 2019Updated 6 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Jan 25, 2020Updated 6 years ago
- ☆313Feb 11, 2026Updated 2 months ago
- An extension for VS Code which provides support for the Nim language.☆13Sep 24, 2020Updated 5 years ago
- This project was moved to: https://github.com/coax-dev/coax☆161Nov 28, 2022Updated 3 years ago