ShangtongZhang / rl-theory-in-lean
Towards Formalizing RL Theory
☆29Updated this week
Alternatives and similar repositories for rl-theory-in-lean
Users who are interested in rl-theory-in-lean are comparing it to the libraries listed below.
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆28Updated 6 months ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆21Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆117Updated 4 months ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆15Updated last year
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆60Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- Agar.io for Continual Reinforcement Learning☆23Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆104Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆117Updated last year
- A collection of matrix games in JAX☆12Updated 11 months ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆36Updated 3 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆21Updated last year
- Unified Implementations of Offline Reinforcement Learning Algorithms☆117Updated 3 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆188Updated 7 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆42Updated 2 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆74Updated this week
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆18Updated last year
- Clean single-file implementation of offline RL algorithms in JAX☆159Updated 10 months ago
- Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch☆53Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆61Updated last month
- JAX implementation of RL algorithms and vectorized environments☆49Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)☆80Updated 8 months ago
- Simple JAX Graphics Library.☆36Updated last year
- Accelerated replay buffers in JAX☆43Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated last year
- Synchronized Curriculum Learning for RL Agents☆114Updated 2 months ago