Lei-Kun / Uni-O4
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆47Updated 3 months ago
Alternatives and similar repositories for Uni-O4:
Users that are interested in Uni-O4 are comparing it to the libraries listed below
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆47Updated 7 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆69Updated last year
- ☆17Updated 2 months ago
- ☆57Updated 6 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆27Updated 11 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆30Updated 6 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆70Updated last year
- ☆48Updated 2 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆48Updated 2 months ago
- ☆42Updated 3 months ago
- Code for the Behavior Retrieval Paper☆34Updated last year
- PWM: Policy Learning with Large World Models☆42Updated last month
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆78Updated 2 years ago
- ☆42Updated 4 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆44Updated 4 months ago
- Official repository for "STAP: Sequencing Task-Agnostic Policies," presented at ICRA 2023.☆42Updated 2 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆72Updated 10 months ago
- Official repo for paper "TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint"☆52Updated 2 months ago
- Official Repository for "Eurekaverse: Environment Curriculum Generation via Large Language Models" (CoRL 2024)☆69Updated 2 months ago
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks☆96Updated 7 months ago
- ☆18Updated last year
- ☆31Updated last year
- ☆42Updated 8 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆53Updated 6 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆53Updated 2 years ago
- Implementation of the transformer from the paper: "Real-World Humanoid Locomotion with Reinforcement Learning"☆40Updated 2 weeks ago
- ☆25Updated last year
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆120Updated last year
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆29Updated 6 months ago