levilelis / h-levin
Levin tree search guided by both a policy and a heuristic function
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for h-levin
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.☆50Updated 6 months ago
- Procgen2: A community maintained fork of procgen☆11Updated 2 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆31Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- krazy grid world☆25Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆30Updated 2 weeks ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆40Updated last week
- Train self-modifying neural networks with neuromodulated plasticity☆76Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆66Updated 4 years ago
- Logarithmic Reinforcement Learning☆26Updated last year
- Neuronal Circuit Policies☆39Updated 2 years ago
- fast + parallel AlphaZero in JAX☆84Updated 7 months ago
- Baselines for gymnax 🤖☆58Updated last year
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- ☆48Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆56Updated last year
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- ☆20Updated 5 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆14Updated 9 months ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆16Updated last year
- A collection of RL algorithms written in JAX.☆94Updated 2 years ago
- AlphaZero for continuous control tasks☆23Updated last year
- ☆24Updated 2 years ago
- General Modules for JAX☆58Updated 3 months ago
- Accelerated replay buffers in JAX☆39Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆144Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago