Lei-Kun / Uni-O4Links
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆49Updated 4 months ago
Alternatives and similar repositories for Uni-O4
Users that are interested in Uni-O4 are comparing it to the libraries listed below
Sorting:
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆50Updated 8 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆70Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆28Updated last year
- (ICLR 2024) Reverse Forward Curriculum Learning☆47Updated 6 months ago
- ☆57Updated 8 months ago
- PWM: Policy Learning with Large World Models☆49Updated 3 months ago
- ☆43Updated 5 months ago
- ☆44Updated 5 months ago
- ☆32Updated this week
- ☆17Updated 4 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆71Updated last year
- A PyTorch implementation of Implicit Behavioral Cloning☆106Updated 2 years ago
- Official repo for paper "TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint"☆56Updated 3 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆31Updated 7 months ago
- ☆55Updated 4 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆55Updated 3 months ago
- Implementation of the transformer from the paper: "Real-World Humanoid Locomotion with Reinforcement Learning"☆40Updated last month
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆123Updated 2 years ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆74Updated last year
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆59Updated last year
- ☆117Updated this week
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆80Updated 2 years ago
- ☆19Updated last year
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆48Updated this week
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆31Updated 7 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆78Updated last year
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.☆60Updated 4 months ago
- [ICLR 2025] Bootstrapped Model Predictive Control☆14Updated last month
- Finetuning Offline World Models in the Real World☆58Updated last year
- ☆27Updated 4 months ago