Alescontrela / score_matching_rlLinks

Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"

☆25

Alternatives and similar repositories for score_matching_rl

Users that are interested in score_matching_rl are comparing it to the libraries listed below

Sorting:

ZibinDong / AlignDiff-ICLR2024
☆31Updated last year
devinluo27 / comp_diffuser_release
Generative Trajectory Stitching through Diffusion Composition
☆28Updated last week
imgeorgiev / PWM
PWM: Policy Learning with Large World Models
☆55Updated this week
diffuserlite / diffuserlite.github.io
☆43Updated last year
Fang-Lin93 / DAC
DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.
☆20Updated last year
marc-rigter / polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
☆66Updated last year
haosulab / RPG
Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization
☆27Updated 2 years ago
seohongpark / fql
The official implementation of flow Q-learning (FQL)
☆202Updated 2 weeks ago
thuml / HarmonyDream
Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344
☆41Updated last year
quantumiracle / Consistency_Model_For_Reinforcement_Learning
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24
☆26Updated 11 months ago
thu-ml / SRPO
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
☆46Updated last year
penn-pal-lab / scaffolder
Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…
☆29Updated last year
changchencc / Simple-Hierarchical-Planning-with-Diffusion
☆29Updated last year
kvfrans / cfgrl
☆43Updated 2 months ago
TEA-Lab / diffusion_reward
[ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"
☆108Updated last year
EmptyJackson / policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆137Updated last year
Liang-ZX / AdaptDiffuser
[ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"
☆64Updated last year
sail-sg / edp
[NeurIPS 2023] Efficient Diffusion Policy
☆106Updated last year
nakamotoo / dsrl_pi0
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆17Updated this week
mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆78Updated 4 months ago
StoneT2000 / rfcl
(ICLR 2024) Reverse Forward Curriculum Learning
☆48Updated 8 months ago
t6-thu / awesome-cross-domain-policy-transfer-for-embodied-agents
[IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents
☆55Updated 5 months ago
tinnerhrhe / MTDiff
☆62Updated 8 months ago
elicassion / sugarl
Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"
☆52Updated 9 months ago
schmidtdominik / LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
☆115Updated last year
Streaming-Diffusion-Policy / streaming_diffusion_policy
Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models
☆65Updated 2 months ago
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆76Updated last year
wadx2019 / qvpo
official implementation of QVPO
☆46Updated 9 months ago
sukhijab / maxinforl_torch
☆44Updated 7 months ago
MaxSobolMark / PolicyAgnosticRL
☆71Updated this week