Lei-Kun / Uni-O4
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆29Updated 4 months ago
Related projects: ⓘ
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆53Updated 5 months ago
- PWM: Policy Learning with Large World Models☆32Updated last month
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆55Updated last year
- (ICLR 2024) Reverse Forward Curriculum Learning☆36Updated 2 weeks ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆37Updated this week
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆71Updated 4 months ago
- [GenRL] Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequenc…☆41Updated last month
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆26Updated this week
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆45Updated 10 months ago
- ☆28Updated 11 months ago
- Official release of CompoSuite, a compositional RL benchmark☆44Updated 7 months ago
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆70Updated last year
- Official implementation of Diffusion Policy Policy Optimization, arxiv 2024☆127Updated this week
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆52Updated 3 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆24Updated last year
- Coarse-to-fine Q-Network☆26Updated last month
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆48Updated last year
- Chain-of-Thought Predictive Control☆54Updated last year
- ☆51Updated last year
- [ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to effi…☆34Updated 3 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆14Updated 4 months ago
- ☆37Updated 2 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆62Updated 10 months ago
- ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Man…☆15Updated last year
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆36Updated 7 months ago
- ☆21Updated last month
- Code for "DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks"☆14Updated 4 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆81Updated 10 months ago
- Finetuning Offline World Models in the Real World☆42Updated 10 months ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆29Updated last year