ShangtongZhang / rl-theory-in-leanLinks
Towards Formalizing RL Theory
☆40Updated 2 months ago
Alternatives and similar repositories for rl-theory-in-lean
Users that are interested in rl-theory-in-lean are comparing it to the libraries listed below
Sorting:
- Learn online intrinsic rewards from LLM feedback☆45Updated last year
- ☆91Updated 4 months ago
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆17Updated 10 months ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆38Updated 5 months ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025☆81Updated 10 months ago
- Learning diverse options through the Laplacian representation.☆23Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Updated last month
- An Open-Ended Agentic Simulator☆58Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆30Updated 3 weeks ago
- ☆89Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆130Updated 6 months ago
- ☆16Updated last year
- A collection of matrix games in JAX☆13Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆108Updated last year
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆15Updated last year
- Efficient baselines for autocurricula in JAX.☆206Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆231Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithms☆189Updated 3 weeks ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆22Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 3 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆165Updated last month
- JAX implementation of RL algorithms and vectorized environments☆51Updated 2 years ago
- ☆21Updated last month
- Various reinforcement learning algorithms written in Jax + Flax☆26Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆62Updated last week
- Distributional Successor Features Enable Zero-Shot Policy Optimization☆13Updated 9 months ago