Lei-Kun / Uni-O4
Authors' PyTorch implementation of the ICLR 2024 paper "Uni-O4"
☆48 Updated 3 months ago
Alternatives and similar repositories for Uni-O4:
Users who are interested in Uni-O4 are comparing it to the repositories listed below:
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024) ☆48 Updated 7 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation ☆69 Updated last year
- ☆57 Updated 7 months ago
- PWM: Policy Learning with Large World Models ☆46 Updated 2 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆27 Updated last year
- ☆17 Updated 3 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆31 Updated 6 months ago
- ☆51 Updated 3 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning ☆46 Updated 5 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents ☆50 Updated 2 months ago
- ☆32 Updated last year
- ☆19 Updated last year
- ☆38 Updated 9 months ago
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models ☆30 Updated 6 months ago
- ☆43 Updated 4 months ago
- From Imitation to Refinement -- Residual RL for Precise Visual Assembly ☆116 Updated 5 months ago
- ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Man… ☆17 Updated 2 years ago
- Implementation of the transformer from the paper: "Real-World Humanoid Locomotion with Reinforcement Learning" ☆40 Updated last month
- ☆44 Updated last month
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models ☆55 Updated 7 months ago
- Coarse-to-fine Q-Network ☆47 Updated 9 months ago
- ☆43 Updated 4 months ago
- Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research. ☆61 Updated 4 months ago
- Skill-based Model-based Reinforcement Learning (CoRL 2022) ☆60 Updated 2 years ago
- Official repo for paper "TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint" ☆54 Updated 3 months ago
- Official implementation of DEMO3 ☆46 Updated last month
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport ☆79 Updated 2 years ago
- Official Code Repo for GENIMA ☆71 Updated 7 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆82 Updated 5 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization ☆70 Updated last year