escontra/score_matching_rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/escontra/score_matching_rl)

escontra / score_matching_rl

Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"

☆34

Alternatives and similar repositories for score_matching_rl

Users that are interested in score_matching_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wadx2019 / qvpo
View on GitHub
official implementation of QVPO
☆66Jan 23, 2026Updated 6 months ago
happy-yan / DACER-Diffusion-with-Online-RL
View on GitHub
NeurIPS 2024 DACER
☆182Feb 28, 2026Updated 5 months ago
BellmanTimeHut / DIPO
View on GitHub
☆130May 30, 2023Updated 3 years ago
ALRhub / DIME
View on GitHub
☆37Aug 26, 2025Updated 11 months ago
Fang-Lin93 / DAC
View on GitHub
DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.
☆30Jun 3, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ami-iit / paper_romualdi_viceconte_2024_humanoids_dnn-mpc-walking
View on GitHub
[Humanoids 2024 award finalist] Online DNN-Driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment
☆20Feb 5, 2025Updated last year
sloganking / BvhToMimic
View on GitHub
Converting .bvh files to DeepMimic animations
☆14Jan 23, 2021Updated 5 years ago
qiayuanl / legged_control2_doc
View on GitHub
☆20Mar 6, 2026Updated 4 months ago
CarolinaBianchi / UFastSLAM
View on GitHub
Implementation of Unscented Fast SLAM algorithm for Applied Estimation (EL2320) - KTH
☆10Jan 28, 2019Updated 7 years ago
kevinzakka / dm_env_wrappers
View on GitHub
Standalone library of frequently-used wrappers for dm_env environments.
☆19Jul 9, 2024Updated 2 years ago
diffusionyes / MaxEntDP
View on GitHub
☆20Jan 30, 2025Updated last year
xizaoqu / blender_for_UniHSI
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
irom-princeton / dppo
View on GitHub
Official implementation of Diffusion Policy Policy Optimization, arxiv 2024
☆842Feb 4, 2025Updated last year
scottemmons / rvs
View on GitHub
Reinforcement Learning via Supervised Learning
☆72May 16, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
minruixu / MAFRL
View on GitHub
code for
☆11Apr 10, 2021Updated 5 years ago
wadx2019 / genpo
View on GitHub
official implementation of GenPO
☆46Jan 7, 2026Updated 6 months ago
ami-iit / paper_elobaid_2024_stable-centroidal-mpc
View on GitHub
☆32May 30, 2025Updated last year
ZibinDong / AlignDiff-ICLR2024
View on GitHub
☆33Mar 10, 2024Updated 2 years ago
hongbiaozhu / Distributed_DRL_MIMO-NOMA_VEC
View on GitHub
☆14Apr 12, 2022Updated 4 years ago
408794550 / Mechine-Learning-In-Action
View on GitHub
Python语言编写，记录电子书Mechine Learning In Action中的源码，并附有每行代码的详细注释，方便初学者阅读。
☆13Sep 24, 2017Updated 8 years ago
akanazawa / fpo
View on GitHub
Implementation of Flow Policy Optimization (FPO)
☆454Jan 13, 2026Updated 6 months ago
sukhijab / maxinforl_torch
View on GitHub
☆51Sep 18, 2025Updated 10 months ago
zyrived / UAV_MEC_SYSTEM
View on GitHub
A UAVs empowered MEC SYSTEM
☆10Sep 22, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gobinda22 / UAV-assisted-MEC
View on GitHub
☆10Jul 31, 2021Updated 4 years ago
kondela / sngfaces-dataset
View on GitHub
Dataset containing high quality images of oil portrait paintings made on canvas.
☆16Oct 25, 2020Updated 5 years ago
Aaktzy / MEC_DAG_PPO
View on GitHub
Simulate one server for one user, use PPO.
☆15Nov 21, 2021Updated 4 years ago
montrealrobotics / one4all
View on GitHub
An end-to-end fully parametric method for image-goal navigation that leverages self-supervised and manifold learning to replace the topol…
☆12Jun 18, 2024Updated 2 years ago
AILWQ / DySymNet
View on GitHub
[ICML 2024] Official Pytorch implementation of the paper "A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions…
☆22Nov 15, 2025Updated 8 months ago
haraldger / DRL-DecisionTransformer
View on GitHub
Research project for Deep Reinforcement Learning using Decision Transformer
☆16May 12, 2023Updated 3 years ago
nubot-nudt / RFSG
View on GitHub
☆12Mar 17, 2025Updated last year
Dribble-HRL / Dribble_HRL
View on GitHub
☆12Apr 9, 2026Updated 3 months ago
apourchot / ERL-pytorch
View on GitHub
Combining Evolutionary Algorithms and deep Reinforcement Learning
☆19Jul 17, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhyang2226 / DMBP
View on GitHub
[ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.
☆17May 24, 2024Updated 2 years ago
danijar / teleport
View on GitHub
Efficiently send large arrays across machines
☆15Jul 24, 2024Updated 2 years ago
PierreMarza / dynamic_implicit_representations
View on GitHub
Code for ICCV 2023 paper "Multi-Object Navigation with dynamically learned neural implicit representations"
☆14Mar 20, 2024Updated 2 years ago
HongyangDu / AttentionQoE
View on GitHub
☆15May 15, 2024Updated 2 years ago
Xuanphu108 / Deep-Reinforcement-Learning-for-MEC
View on GitHub
☆16Jun 3, 2019Updated 7 years ago
end3r / Gamepad-API-Content-Kit
View on GitHub
Gamepad API Content Kit
☆14Jun 1, 2016Updated 10 years ago
Alestaubin / stable-imitation-policy-with-waypoints
View on GitHub
Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i…
☆14Oct 12, 2024Updated last year