EmptyJackson / policy-guided-diffusionLinks
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆152Updated last year
Alternatives and similar repositories for policy-guided-diffusion
Users that are interested in policy-guided-diffusion are comparing it to the libraries listed below
Sorting:
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆73Updated last year
- PWM: Policy Learning with Large World Models☆65Updated 6 months ago
- The official implementation of flow Q-learning (FQL)☆272Updated 6 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆180Updated 6 months ago
- Q-learning with Adjoint Matching☆40Updated last week
- [NeurIPS 2025 Spotlight] Generative Trajectory Stitching through Diffusion Composition☆67Updated 5 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆76Updated last year
- ☆263Updated 2 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Updated 2 weeks ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆33Updated 4 months ago
- ☆50Updated 4 months ago
- Official implementation of "Flow Based Policy for Online Reinforcement Learning"☆65Updated 3 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated 2 years ago
- Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022☆127Updated 3 years ago
- off-policy RL on long sequences☆158Updated this week
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆49Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy☆114Updated 2 years ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Updated 10 months ago
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆32Updated 9 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆134Updated last year
- ☆83Updated 3 weeks ago
- Repo for Implicit Diffusion Q-Learning☆123Updated 2 years ago
- JAX implementation of WSRL and RL baselines | ICLR 2025☆130Updated 6 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82Updated last year
- ☆81Updated 8 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆105Updated 4 months ago
- ☆48Updated last year
- Decoupled Q-Chunking☆52Updated 3 weeks ago
- official implementation of QVPO☆60Updated 2 weeks ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Updated last year