A structured implementation of MuZero
☆206Jun 4, 2022Updated 3 years ago
Alternatives and similar repositories for MuZero
Users that are interested in MuZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- ☆66Nov 3, 2021Updated 4 years ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- Pytorch Implementation of MuZero☆353Jul 23, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Mar 28, 2021Updated 5 years ago
- MuZero☆2,819Sep 3, 2024Updated last year
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆19Jun 30, 2021Updated 4 years ago
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆933Dec 20, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- An implementation of MuZero in JAX.☆58Nov 8, 2022Updated 3 years ago
- An environment of the board game Go using OpenAI's Gym API☆176May 3, 2022Updated 4 years ago
- GPU Monte Carlo Tree Search with MPI☆26Jan 9, 2019Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more☆4,453Jan 1, 2025Updated last year
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- This code illustrates the use of genetic programming to evolve financial trading strategies for a single equity stock. Individuals (strat…☆25Feb 24, 2019Updated 7 years ago
- A library of reinforcement learning components and agents☆3,989Apr 8, 2026Updated last month
- ☆18Aug 24, 2024Updated last year
- ☆13Jan 16, 2025Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- Revisiting Rainbow☆76Jun 9, 2021Updated 4 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆88Mar 14, 2025Updated last year
- Library of effective Go routines☆48Oct 11, 2010Updated 15 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 3 years ago
- OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.☆5,237Updated this week
- Simple code for running and visualizing replicator dynamics☆11Jan 31, 2024Updated 2 years ago
- Classic MCTS example with mctx☆25May 25, 2023Updated 3 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,026Mar 13, 2019Updated 7 years ago
- Tasking 2.0☆17Nov 1, 2021Updated 4 years ago
- Double Q-learning reinforcement learning agent on NES Super Mario Bros☆43May 4, 2019Updated 7 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Jan 25, 2020Updated 6 years ago
- ☆315Feb 11, 2026Updated 3 months ago
- An extension for VS Code which provides support for the Nim language.☆13Sep 24, 2020Updated 5 years ago
- Advanced Deep Learning and Reinforcement Learning 2018 Assignments☆18Nov 24, 2018Updated 7 years ago