Lei-Kun / Uni-O4
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆37Updated last week
Related projects ⓘ
Alternatives and complementary repositories for Uni-O4
- PWM: Policy Learning with Large World Models☆37Updated 3 months ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆42Updated 2 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆38Updated last month
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆24Updated last month
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆62Updated last year
- From Imitation to Refinement -- Residual RL for Precise Visual Assembly☆53Updated last week
- OGBench: Benchmarking Offline Goal-Conditioned RL☆79Updated 3 weeks ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆59Updated 7 months ago
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆17Updated last month
- (ICLR 2024) Reverse Forward Curriculum Learning☆38Updated 2 months ago
- ☆38Updated last month
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks☆74Updated 3 months ago
- Coarse-to-fine Q-Network☆32Updated 3 months ago
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆109Updated last year
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆73Updated last year
- ☆36Updated 3 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆73Updated 6 months ago
- Official release of CompoSuite, a compositional RL benchmark☆46Updated 9 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆32Updated 3 weeks ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆24Updated 2 months ago
- [NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning the…☆59Updated 3 months ago
- ☆18Updated last month
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆35Updated last month
- Official Repository for "Eurekaverse: Environment Curriculum Generation via Large Language Models" (CoRL 2024)☆39Updated 2 weeks ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆62Updated 5 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆64Updated last year
- ☆52Updated last year
- Official Code Repo for GENIMA☆54Updated last month
- ☆28Updated last year
- Code for Compositional Diffusion-Based Continuous Constraint Solvers (CoRL 23)☆47Updated 9 months ago