levilelis / h-levin
Levin tree search guided by both a policy and a heuristic function
☆18Updated last year
Alternatives and similar repositories for h-levin:
Users that are interested in h-levin are comparing it to the libraries listed below
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆17Updated last year
- Learning diverse options through the Laplacian representation.☆23Updated last year
- General Modules for JAX☆64Updated last month
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- ☆74Updated last week
- Scaling scaling laws with board games.☆48Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆53Updated 4 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆25Updated 9 months ago
- Neurosymbolic transformers for multi-agent communication.☆22Updated 4 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (…☆21Updated 2 years ago
- flexible meta-learning in jax☆12Updated last year
- Baselines for gymnax 🤖☆66Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 3 years ago
- Neuroevolution Benchmark in JAX 🦕☆38Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated 11 months ago
- ☆20Updated 9 months ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆19Updated last month
- ☆20Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆71Updated 7 months ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆99Updated last year
- ☆50Updated last year
- Generalised UDRL☆37Updated 2 years ago