Lei-Kun / Uni-O4Links

Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"

☆51

Alternatives and similar repositories for Uni-O4

Users that are interested in Uni-O4 are comparing it to the libraries listed below

Sorting:

Improbable-AI / pql
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
☆72Updated 2 years ago
jayeshs999 / sapg
Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)
☆54Updated 10 months ago
realquantumcookie / APRL
Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization
☆76Updated last year
hengyuan-hu / ibrl
☆58Updated 10 months ago
StoneT2000 / rfcl
(ICLR 2024) Reverse Forward Curriculum Learning
☆48Updated 8 months ago
zhouzypaul / wsrl
JAX implementation of WSRL and RL baselines | ICLR 2025
☆101Updated 3 weeks ago
google-deepmind / language_to_reward_2023
☆146Updated 11 months ago
MaxSobolMark / PolicyAgnosticRL
☆71Updated last month
imgeorgiev / PWM
PWM: Policy Learning with Large World Models
☆55Updated 5 months ago
ankile / robust-rearrangement
From Imitation to Refinement -- Residual RL for Precise Visual Assembly
☆143Updated 8 months ago
robobase-org / robobase
☆43Updated 7 months ago
penn-pal-lab / scaffolder
Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…
☆29Updated last year
siddhanthaldar / ROT
Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport
☆80Updated 2 years ago
kevinzakka / ibc
A PyTorch implementation of Implicit Behavioral Cloning
☆108Updated 3 years ago
wertyuilife2 / bmpc
[ICLR 2025] Bootstrapped Model Predictive Control
☆20Updated last week
wang-kevin3290 / scaling-crl
☆47Updated 4 months ago
adrialopezescoriza / demo3
Official implementation of DEMO3
☆54Updated 2 months ago
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆76Updated last year
mihdalal / planseqlearn
[ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks
☆111Updated 11 months ago
notmahi / bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
☆128Updated 2 years ago
jianlanluo / SAQ
☆33Updated last month
younggyoseo / CQN
Coarse-to-fine Q-Network
☆48Updated 11 months ago
youngwoon / robot-learning
☆53Updated 2 years ago
jonzamora / awesome-robot-learning-envs
A list of awesome and popular robot learning environments
☆111Updated 11 months ago
seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆121Updated last month
pd-perry / RLIF
☆27Updated last year
heatz123 / tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆33Updated 9 months ago
sukhijab / maxinforl_torch
☆44Updated 7 months ago
gemcollector / RL-ViGen
This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization"
☆107Updated 5 months ago
agiachris / STAP
Official repository for "STAP: Sequencing Task-Agnostic Policies," presented at ICRA 2023.
☆47Updated 6 months ago