levilelis / h-levin
Levin tree search guided by both a policy and a heuristic function
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for h-levin
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆14Updated 10 months ago
- Standard interface for entity based reinforcement learning environments.☆36Updated 8 months ago
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 2 years ago
- ☆48Updated last year
- General Modules for JAX☆58Updated 3 months ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆32Updated 4 years ago
- AlphaZero for continuous control tasks☆23Updated last year
- ☆20Updated 5 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆76Updated 5 years ago
- An Open-Ended Agentic Simulator☆28Updated 3 months ago
- Accelerated replay buffers in JAX☆40Updated 2 years ago
- Procgen2: A community maintained fork of procgen☆11Updated 2 years ago
- Baselines for gymnax 🤖☆60Updated last year
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- ☆65Updated 2 weeks ago
- ☆35Updated 6 years ago
- flexible meta-learning in jax☆12Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆39Updated 2 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- ☆63Updated 3 months ago
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch☆46Updated this week
- Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021☆33Updated 2 years ago
- krazy grid world☆25Updated 4 years ago
- ☆28Updated 2 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆76Updated 4 years ago