ShangtongZhang/rl-theory-in-lean

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ShangtongZhang/rl-theory-in-lean)

ShangtongZhang / rl-theory-in-lean

Towards Formalizing RL Theory

☆55

Alternatives and similar repositories for rl-theory-in-lean

Users that are interested in rl-theory-in-lean are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

esraaelelimy / gtd_algos
View on GitHub
Algorithms for Gradient TD updates
☆19Feb 21, 2026Updated 5 months ago
bit1029public / HRSSM
View on GitHub
Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models
☆25May 11, 2024Updated 2 years ago
ThibautTheate / Risk-Sensitive-Policy-with-Distributional-Reinforcement-Learning
View on GitHub
Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…
☆16Dec 19, 2022Updated 3 years ago
google-deepmind / egg
View on GitHub
☆19Apr 15, 2026Updated 3 months ago
rystrauss / dopamax
View on GitHub
Reinforcement learning in pure JAX.
☆13Jun 24, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
itstyren / InteractionMARL-Coop
View on GitHub
Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24.
☆14Feb 9, 2025Updated last year
zaiyan-x / RFQI
View on GitHub
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
☆25Nov 9, 2024Updated last year
haoyuzhao123 / LeanIneqComp
View on GitHub
An inequality benchmark for theorem proving
☆22Feb 1, 2026Updated 5 months ago
ccr-cheng / riemannian-consistency-model
View on GitHub
Official implementation of the NeurIPS 25 paper of Riemannian Consistency Model (RCM) for few-step generation on Riemannian manifolds.
☆16Nov 2, 2025Updated 8 months ago
JasonMa2016 / CODAC
View on GitHub
Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
☆22Aug 1, 2021Updated 4 years ago
danieldritter / OAPL
View on GitHub
☆30Feb 24, 2026Updated 4 months ago
SafeRL-Lab / Robust-Gymnasium
View on GitHub
[ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.
☆100Mar 20, 2026Updated 4 months ago
marekpetrik / RAAM
View on GitHub
Robust and Approximate Markov Decision Processes
☆11Jul 21, 2017Updated 9 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dunnolab / awesome-in-context-rl
View on GitHub
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆305Sep 8, 2025Updated 10 months ago
danijar / diamond_env
View on GitHub
Standardized Minecraft Diamond Environment for Reinforcement Learning
☆40May 19, 2023Updated 3 years ago
seohongpark / horizon-reduction
View on GitHub
The official implementation of "Horizon Reduction Makes RL Scalable"
☆200Aug 2, 2025Updated 11 months ago
JanTempus / tokenisation_lp
View on GitHub
☆15May 20, 2026Updated 2 months ago
argumentcomputer / FFaCiL.lean
View on GitHub
Finite Fields and Curves in Lean
☆14Apr 6, 2023Updated 3 years ago
ml-jku / LRAM
View on GitHub
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆36Oct 31, 2024Updated last year
hercky / ACER_tf
View on GitHub
Implementation for ACER in tensorflow and sonnet by deepmind
☆11Aug 28, 2017Updated 8 years ago
StanfordASL / RSIRL
View on GitHub
Risk-sensitive Inverse Reinforcement Learning
☆11Sep 11, 2019Updated 6 years ago
Carbon225 / mctx-classic
View on GitHub
Classic MCTS example with mctx
☆25May 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SafeRL-Lab / Robust-RL-Baselines
View on GitHub
Robust Reinforcement Learning Benchmark
☆13Sep 22, 2024Updated last year
gauthamvasan / avg
View on GitHub
Action Value Gradient Algorithm
☆28May 18, 2025Updated last year
LAVA-LAB / safe-slac
View on GitHub
Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.
☆11Mar 1, 2023Updated 3 years ago
vwxyzjn / cleanba
View on GitHub
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆125Aug 22, 2024Updated last year
andnp / PyExpUtils
View on GitHub
Experiment utility code, specifically designed for use with Compute Canada.
☆11Jan 27, 2025Updated last year
tiwari-research-group / Koopman-with-KANs
View on GitHub
☆11Jun 4, 2024Updated 2 years ago
Beneficial-AI-Foundation / vericoding-benchmark
View on GitHub
☆40Jun 5, 2026Updated last month
BY571 / IQN-and-Extensions
View on GitHub
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆94Mar 4, 2023Updated 3 years ago
exoshuffle / raysort
View on GitHub
☆16Sep 4, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
end3r / Gamepad-API-Content-Kit
View on GitHub
Gamepad API Content Kit
☆14Jun 1, 2016Updated 10 years ago
bramgrooten / automatic-noise-filtering
View on GitHub
[AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"
☆12Feb 22, 2024Updated 2 years ago
AntheaLi / open-research-seeds
View on GitHub
☆26Jun 10, 2026Updated last month
liuzuxin / OSRL
View on GitHub
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆246Sep 13, 2024Updated last year
trishullab / clever
View on GitHub
CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning
☆46Apr 3, 2026Updated 3 months ago
SherylHYX / MSGNN
View on GitHub
Official code for the LoG2022 paper -- MSGNN: A Spectral Graph Neural Network Based on a Novel Magnetic Signed Laplacian.
☆14Feb 8, 2025Updated last year
CLAIRE-Labo / StructuredFFN
View on GitHub
The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"
☆20Jul 24, 2024Updated last year