EmptyJackson / policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆115Updated 2 months ago
Related projects: ⓘ
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆29Updated 4 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆81Updated 10 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆53Updated 5 months ago
- ☆43Updated 3 months ago
- Official implementation of Diffusion Policy Policy Optimization, arxiv 2024☆127Updated this week
- Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"☆140Updated 2 months ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆20Updated 3 weeks ago
- Body Transformer: Leveraging Robot Embodiment for Policy Learning☆87Updated last month
- a simple and scalable agent for training adaptive policies with sequence-based RL☆79Updated this week
- Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robo…☆87Updated 2 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆45Updated 10 months ago
- [NeurIPS 2023] Efficient Diffusion Policy☆74Updated 10 months ago
- PWM: Policy Learning with Large World Models☆32Updated last month
- A minimal and stable PPO.☆96Updated 7 months ago
- A list of awesome and popular robot learning environments☆87Updated last month
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆145Updated last year
- ☆71Updated last year
- ☆47Updated 6 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆52Updated 3 months ago
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆106Updated 7 months ago
- [GenRL] Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequenc…☆41Updated last month
- Codebase for Extracting Reward Functions from Diffusion Models☆11Updated 9 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆36Updated 2 weeks ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023☆42Updated 4 months ago
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)☆90Updated 6 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆36Updated 7 months ago
- ☆24Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)☆65Updated 5 months ago
- ☆48Updated 6 months ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆27Updated 5 months ago