ran-weii/cleanil

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ran-weii/cleanil)

ran-weii / cleanil

High quality implementations of imitation and inverse reinforcement learning algorithms

☆24

Alternatives and similar repositories for cleanil

Users that are interested in cleanil are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MichaelTMatthews / purejaxgcrl
View on GitHub
GCRL in JAX. Official repository for LEO (ICML 2026).
☆28Jun 20, 2026Updated last month
kvfrans / jaxtransformer
View on GitHub
Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...
☆16May 28, 2025Updated last year
vmoens / mujoco-torch
View on GitHub
☆44Jul 17, 2026Updated last week
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
jsikyoon / OCRL
View on GitHub
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12Feb 23, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
metadriverse / pvp
View on GitHub
Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlig…
☆34Jan 16, 2025Updated last year
kristianceder / DRL-Traj-Planner
View on GitHub
☆14Feb 10, 2026Updated 5 months ago
anh-tong / nanoGPT-equinox
View on GitHub
nanoGPT using Equinox
☆15Mar 3, 2023Updated 3 years ago
google-deepmind / dmc_vision_benchmark
View on GitHub
☆34Jun 21, 2024Updated 2 years ago
machado-research / AgarCL
View on GitHub
Agar.io for Continual Reinforcement Learning
☆24Jul 24, 2025Updated last year
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242Nov 24, 2025Updated 8 months ago
ZishunYu / Actor-Critic-Alignment
View on GitHub
Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''
☆13Oct 12, 2023Updated 2 years ago
dunnolab / harmony
View on GitHub
[ICML 2026 GenBio Workshop] Official Implementation for "Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking"
☆15Jun 30, 2026Updated 3 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
robintyh1 / icml2021-pengqlambda
View on GitHub
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15Jul 23, 2021Updated 5 years ago
StoneT2000 / rl-robotics-speedrun
View on GitHub
speed-running solving robot manipulation tasks
☆24Oct 31, 2024Updated last year
osudrl / masked-humanoid-controller
View on GitHub
☆16Mar 7, 2025Updated last year
prasoongoyal / PixL2R
View on GitHub
☆17Dec 21, 2020Updated 5 years ago
epignatelli / navix
View on GitHub
Accelerated minigrid environments with JAX
☆175Oct 20, 2025Updated 9 months ago
Miffyli / minecraft-bc
View on GitHub
Submission code of UEFDRL team to NeurIPS 2019 MineRL challenge (5th place)
☆13Nov 13, 2020Updated 5 years ago
facebookresearch / gen_dgrl
View on GitHub
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29Apr 8, 2026Updated 3 months ago
frt03 / mxt_bench
View on GitHub
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)
☆14Feb 3, 2023Updated 3 years ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
peterdavidfagan / mujoco_robot_environments
View on GitHub
Prototyping mujoco simulation environments.
☆11Feb 20, 2025Updated last year
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
mohmdelsayed / weight-clipping
View on GitHub
☆17Aug 20, 2025Updated 11 months ago
robfiras / s2pg
View on GitHub
Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
☆25May 5, 2024Updated 2 years ago
nmonette / NCC-UED
View on GitHub
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆17Nov 24, 2025Updated 8 months ago
Cloud0723 / Offline-MLIRL
View on GitHub
☆22Dec 18, 2023Updated 2 years ago
jax-state-spaces / mamba2-jax
View on GitHub
mamba2-jax: A pure JAX/Flax implementation of Mamba-2 for language modeling and time series forecasting.
☆16Jun 23, 2026Updated last month
sdan / nanoEBM
View on GitHub
minimal Energy-based transformer
☆44Dec 11, 2025Updated 7 months ago
MichalOp / MineRL2020
View on GitHub
☆16Aug 7, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
roger-creus / stable-deep-rl-at-scale
View on GitHub
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆39Oct 24, 2025Updated 9 months ago
drbh / yamoe
View on GitHub
🔀 yet another mixture of experts
☆23Jun 5, 2026Updated last month
dunnolab / vintix-II
View on GitHub
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner - - = ICLR 2026
☆16Apr 8, 2026Updated 3 months ago
bryanoliveira / sliding-puzzles-gym
View on GitHub
A scalable benchmark for state representation learning in visual reinforcement learning.
☆17Jun 23, 2025Updated last year
adaptive-intelligent-robotics / Kheperax
View on GitHub
High-performance JAX-powered simulator for robotic navigation in 2D mazes, optimized for Quality-Diversity algorithm research and benchma…
☆21Jun 19, 2025Updated last year
kvfrans / rlbase_stable
View on GitHub
☆46Jul 12, 2024Updated 2 years ago
KTS-Innovation-Labs / eurekasim
View on GitHub
EurekaSim | Scientific and Engineering Simulation Application
☆11May 27, 2026Updated last month