EmptyJackson / policy-guided-diffusionLinks
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆137Updated 11 months ago
Alternatives and similar repositories for policy-guided-diffusion
Users that are interested in policy-guided-diffusion are comparing it to the libraries listed below
Sorting:
- The official implementation of flow Q-learning (FQL)☆160Updated 3 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆65Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year
- PWM: Policy Learning with Large World Models☆52Updated 4 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆50Updated 5 months ago
- ☆45Updated 11 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆189Updated last week
- ☆46Updated 3 months ago
- ☆44Updated 6 months ago
- Generative Trajectory Stitching through Diffusion Composition☆23Updated last month
- [ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆62Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆25Updated 9 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆28Updated last year
- ☆60Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆128Updated 3 weeks ago
- Repo for Implicit Diffusion Q-Learning☆109Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy☆104Updated last year
- ☆92Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)☆88Updated last year
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆44Updated last year
- Transformer-based World Models☆82Updated 2 years ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆51Updated 9 months ago
- A minimal and stable PPO.☆138Updated last year
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆75Updated last year
- official implementation of QVPO☆36Updated 8 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆62Updated last month
- ☆102Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆32Updated 8 months ago
- ☆58Updated last year
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆131Updated last year