enpasos / muzeroLinks
☆13Updated 5 months ago
Alternatives and similar repositories for muzero
Users that are interested in muzero are comparing it to the libraries listed below
Sorting:
- Implementation of Proximal Policy Optimization in Jax+Flax☆19Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆12Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆21Updated 6 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆36Updated last year
- Official repository of Action-Free Guide☆11Updated 2 years ago
- A C++ pytorch implementation of MuZero☆38Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- unofficial code reproducing Agent57☆36Updated last year
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆20Updated 2 years ago
- A collection of matrix games in JAX☆11Updated 7 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆18Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 3 months ago
- Generalised UDRL☆37Updated 3 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆26Updated 3 years ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆39Updated 2 years ago
- flexible meta-learning in jax☆14Updated last year
- An unofficial implementation for online decision transformer☆40Updated 2 years ago