michaelnny / muzero
A PyTorch implementation of DeepMind's MuZero agent
☆27Updated 11 months ago
Related projects
Alternatives and complementary repositories for muzero
- ☆48Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated last week
- fast + parallel AlphaZero in JAX☆84Updated 7 months ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments.☆35Updated 4 years ago
- Emergence of complex strategies through multiagent competition☆42Updated last year
- Baselines for gymnax 🤖☆60Updated last year
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆47Updated last year
- Adaptable tools to make reinforcement learning and evolutionary computation algorithms.☆53Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆22Updated 4 months ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆42Updated 4 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆20Updated last year
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆37Updated 3 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- ☆28Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆26Updated 3 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆91Updated last year
- Wrapper for vectorizing OpenAI Gym environments with Ray☆22Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆30Updated this week
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆81Updated last year
- Modular Single-file Reinforcement Learning Algorithms Library☆37Updated last year
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Optimization, in PyTorch☆46Updated this week
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆72Updated last month
- Neuro-evolution for OpenAI Gym environments☆56Updated 3 years ago
- Accelerated minigrid environments with JAX☆119Updated 3 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆63Updated 3 months ago
- Reinforcement learning training framework for entity-gym environments.☆15Updated 8 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆70Updated 11 months ago