levilelis / h-levinLinks
Levin tree search guided by both a policy and a heuristic function
☆19Updated 2 years ago
Alternatives and similar repositories for h-levin
Users that are interested in h-levin are comparing it to the libraries listed below
Sorting:
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- Scaling scaling laws with board games.☆54Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆25Updated 7 months ago
- ☆53Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- fast + parallel AlphaZero in JAX☆107Updated 11 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 3 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆118Updated last year
- Accelerated replay buffers in JAX☆45Updated 3 years ago
- A collection of papers on divergence and quality diversity☆78Updated 3 years ago
- Standard interface for entity based reinforcement learning environments.☆38Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆123Updated 4 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆40Updated 9 months ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆58Updated 4 years ago
- ☆88Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 6 years ago
- An Open-Ended Agentic Simulator☆56Updated last year
- Baselines for gymnax 🤖☆73Updated 2 years ago
- General Modules for JAX☆71Updated 3 months ago
- Neuroevolution Benchmark in JAX 🦕☆41Updated 2 years ago
- krazy grid world☆25Updated 5 years ago
- A C++ pytorch implementation of MuZero☆41Updated last year
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆71Updated last year
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Updated 2 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆60Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- Building blocks for productive research☆64Updated 4 months ago
- Fully differentiable RL environments, written in Ivy.☆66Updated 2 years ago
- ☆89Updated 3 months ago