High-quality implementations of imitation and inverse reinforcement learning algorithms
☆24 · Aug 19, 2025 · Updated 10 months ago
Alternatives and similar repositories for cleanil
Users that are interested in cleanil are comparing it to the libraries listed below.
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆16 · May 28, 2025 · Updated last year
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation. ☆11 · Aug 21, 2023 · Updated 2 years ago
- nanoGPT using Equinox ☆15 · Mar 3, 2023 · Updated 3 years ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies" ☆33 · Dec 12, 2025 · Updated 6 months ago
- Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning" ☆13 · Oct 12, 2023 · Updated 2 years ago
- speed-running solving robot manipulation tasks ☆24 · Oct 31, 2024 · Updated last year
- Prototyping mujoco simulation environments. ☆11 · Feb 20, 2025 · Updated last year
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments… ☆39 · Oct 24, 2025 · Updated 8 months ago
- Xtructure: data structures for use in JAX ☆23 · Jun 27, 2026 · Updated last week
- ☆36 · Aug 26, 2025 · Updated 10 months ago
- Franka simulator in Drake compatible with existing libfranka programs ☆24 · Aug 29, 2025 · Updated 10 months ago
- Fast reinforcement learning 💨 ☆29 · Jul 15, 2025 · Updated 11 months ago
- ☆20 · Jun 9, 2025 · Updated last year
- EurekaSim | Scientific and Engineering Simulation Application ☆11 · May 27, 2026 · Updated last month
- Code accompanying the latent-action-priors paper. ☆12 · Mar 5, 2025 · Updated last year
- This repository provides the code for training the position constrained generative grasp sampler from the paper Constrained Generative Sa… ☆22 · Dec 4, 2024 · Updated last year
- Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlig… ☆34 · Jan 16, 2025 · Updated last year
- Accelerated minigrid environments with JAX ☆171 · Oct 20, 2025 · Updated 8 months ago
- minimal Energy-based transformer ☆44 · Dec 11, 2025 · Updated 6 months ago
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face. ☆26 · Feb 20, 2025 · Updated last year
- Vision package for robot manipulation and learning research ☆26 · Apr 21, 2024 · Updated 2 years ago
- Imitation and relaxation reinforcement learning ☆30 · Sep 26, 2022 · Updated 3 years ago
- Code release for ICLR 2023 paper "NTFields: Neural Time Fields for Physics-Informed Robot Motion Planning" ☆55 · Sep 25, 2025 · Updated 9 months ago
- Robotics: Science and Systems (RSS) 2025 | Action Flow Matching for Continual Robot Learning | Online and Non-episodic Robot Dynamics Mod… ☆35 · Dec 18, 2025 · Updated 6 months ago
- IsaacGymGrasp runs a robot grasping physics simulator that can visualize, execute, and evaluate numerous robot grasps in simultaneous env… ☆18 · Mar 14, 2023 · Updated 3 years ago
- ☆11 · Feb 6, 2018 · Updated 8 years ago
- H-Net Dynamic Hierarchical Architecture ☆81 · Sep 11, 2025 · Updated 9 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆16 · Apr 30, 2025 · Updated last year
- Clean single-file implementation of offline RL algorithms in JAX ☆182 · Jun 5, 2026 · Updated 3 weeks ago
- JAxtar is a project with a JAX-native implementation of a parallelizable A* & Q* solver for neural heuristic search research. ☆50 · Jun 13, 2026 · Updated 3 weeks ago
- Code for the paper "Kinematic Motion Retargeting via Neural Latent Optimization for Learning Sign Language", RAL with ICRA 2022 ☆45 · Jun 13, 2022 · Updated 4 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns. ☆25 · May 20, 2024 · Updated 2 years ago
- ☆35 · Jun 21, 2024 · Updated 2 years ago
- Semantic Synthesis of Pedestrian Locomotion ☆13 · Sep 13, 2023 · Updated 2 years ago
- Submission Under Review ☆17 · May 15, 2025 · Updated last year
- A digital data-generation pipeline that synthesizes humanoid loco-manipulation data from 3D assets and video priors. ☆349 · Jun 9, 2026 · Updated 3 weeks ago
- Repository for our paper: Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions. (ICRA 2023) ☆54 · Apr 6, 2024 · Updated 2 years ago
- [TMLR 2025] A collection of research papers on constraint inference within the field of RL ☆11 · May 9, 2025 · Updated last year
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs? ☆20 · Mar 9, 2025 · Updated last year