tmoer / alphazero_singleplayer
Single player Alpha Zero implementation
☆42Updated 3 years ago
Alternatives and similar repositories for alphazero_singleplayer:
Users that are interested in alphazero_singleplayer are comparing it to the libraries listed below
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆158Updated 4 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆115Updated 8 months ago
- MultiTask Environments for Reinforcement Learning.☆74Updated 2 years ago
- Tabular methods for reinforcement learning☆36Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆39Updated 4 years ago
- Modular framework for Reinforcement Learning in python☆172Updated 2 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆34Updated 3 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- A collection of RL algorithms written in JAX.☆97Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆44Updated 2 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆58Updated 4 months ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆122Updated 11 months ago
- ☆100Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆60Updated 5 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"☆17Updated 2 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- A structured implementation of MuZero☆207Updated 2 years ago
- ☆298Updated 3 months ago
- Clone of OpenAI's Spinning Up in PyTorch☆148Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆30Updated 7 months ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Updated 5 years ago
- Reinforcement learning algorithms in RLlib☆57Updated 11 months ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆49Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago