DHDev0 / Muzero
Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
β16Updated last year
Related projects β
Alternatives and complementary repositories for Muzero
- Neuroevolution Benchmark in JAX π¦β36Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Desβ¦β23Updated 4 months ago
- β17Updated 4 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observβ¦β27Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obserβ¦β56Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ105Updated 2 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. aβ¦β20Updated 3 years ago
- The source code for the gym-microrts paper.β42Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.β27Updated 3 months ago
- An implementation of MuZero in JAX.β53Updated 2 years ago
- Reinforcement learning algorithms in RLlibβ56Updated 6 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β30Updated 11 months ago
- β48Updated last year
- Gym wrapper for pysc2β10Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithmsβ46Updated 3 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learningβ14Updated 6 years ago
- Baselines for gymnax π€β59Updated last year
- A2C is a special case of PPO!β19Updated 2 years ago
- Scaling scaling laws with board games.β40Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimizationβ24Updated 4 years ago
- Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"β17Updated 2 years ago
- π€ Reinforcement Learning paper summaries, notebooks, and articles.β26Updated 4 years ago
- TaskMet Task-driven Metric Learning for Model Learningβ18Updated 9 months ago
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorchβ44Updated last week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"β12Updated last week
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M β¦β43Updated 2 years ago
- MultiTask Environments for Reinforcement Learning.β74Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)β26Updated 2 years ago
- β25Updated 2 weeks ago