timoklein / alphazero-gym
AlphaZero for continuous control tasks
β23Updated last year
Related projects β
Alternatives and complementary repositories for alphazero-gym
- Baselines for gymnax π€β60Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β49Updated 2 years ago
- A collection of RL algorithms written in JAX.β95Updated 2 years ago
- Vectorization techniques for fast population-based training.β54Updated 2 years ago
- Standard interface for entity based reinforcement learning environments.β36Updated 8 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environmβ¦β39Updated 2 years ago
- General Modules for JAXβ58Updated 3 months ago
- Docker containers of baseline agents for the Crafter environmentβ28Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorchβ30Updated this week
- Implicit Normalizing Flows + Reinforcement Learningβ60Updated 5 years ago
- β48Updated last year
- An implementation of MuZero in JAX.β53Updated 2 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.β50Updated 3 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ46Updated last year
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354β25Updated 3 years ago
- JAX implementations of core Deep RL algorithmsβ79Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.β132Updated 9 months ago
- Model-based reinforcement learning in TensorFlowβ54Updated 3 years ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy dβ¦β35Updated 3 years ago
- Fully differentiable RL environments, written in Ivy.β63Updated last year
- Deep Hierarchical Planning from Pixelsβ90Updated last year
- AGAC: Adversarially Guided Actor-Criticβ47Updated 3 years ago
- β29Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ105Updated 2 months ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according β¦β35Updated 6 months ago
- On the model-based stochastic value gradient for continuous reinforcement learningβ55Updated last year
- β63Updated 3 months ago
- GPT implementation in Flaxβ18Updated 2 years ago
- β65Updated 2 weeks ago
- JAX implementation of RL algorithms and vectorized environmentsβ34Updated 10 months ago