levilelis / h-levinLinks
Levin tree search guided by both a policy and a heuristic function
☆19Updated last year
Alternatives and similar repositories for h-levin
Users that are interested in h-levin are comparing it to the libraries listed below
Sorting:
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆20Updated last month
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Scaling scaling laws with board games.☆49Updated last year
- A simple hypernetwork implementation in jax using haiku.☆23Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- General Modules for JAX☆65Updated 2 months ago
- flexible meta-learning in jax☆14Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 10 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Standard interface for entity based reinforcement learning environments.☆38Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 2 years ago
- ☆80Updated 7 months ago
- ☆82Updated 3 months ago
- ☆51Updated 2 years ago
- fast + parallel AlphaZero in JAX☆97Updated 6 months ago
- A minimal implementation of Go-Explore without domain knowledge☆15Updated 4 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆32Updated 3 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆28Updated 11 months ago
- ☆23Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆113Updated 10 months ago
- ☆54Updated 7 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 7 months ago
- Neuroevolution Benchmark in JAX 🦕☆39Updated last year
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Baselines for gymnax 🤖☆67Updated 2 years ago