thu-ml/SRPO

Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).

☆48

Alternatives and similar repositories for SRPO

Users that are interested in SRPO are comparing it to the libraries listed below.

ChenDRAG / CEP-energy-guided-diffusion
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)
☆55 · Aug 26, 2023 · Updated 2 years ago
philippe-eecs / IDQL
Repo for Implicit Diffusion Q-Learning
☆126 · Dec 5, 2023 · Updated 2 years ago
thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15 · Oct 29, 2024 · Updated last year
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 · Nov 8, 2024 · Updated last year
twitter / diffusion-rl
☆80 · Dec 9, 2022 · Updated 3 years ago
filteredcophy / FilteredCoPhy
☆10 · Nov 17, 2022 · Updated 3 years ago
Zhendong-Wang / Diffusion-Policies-for-Offline-RL
☆430 · Apr 29, 2024 · Updated 2 years ago
intuitive-robots / vdd
[NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts"
☆17 · Dec 7, 2024 · Updated last year
ryanxhr / IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46 · Jul 27, 2023 · Updated 2 years ago
Improbable-AI / harness-offline-rl
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16 · Feb 14, 2024 · Updated 2 years ago
intuitive-robots / beso
[RSS 2023] Official code for "Goal Conditioned Imitation Learning using Score-based Diffusion Policies"
☆91 · Dec 1, 2023 · Updated 2 years ago
Lifelong-ML / offline-compositional-rl-datasets
☆21 · Mar 19, 2024 · Updated 2 years ago
LAMDA-RL / ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17 · Feb 10, 2024 · Updated 2 years ago
quantumiracle / Consistency_Model_For_Reinforcement_Learning
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24
☆27 · Aug 28, 2024 · Updated last year
Roythuly / OBAC
☆22 · May 27, 2024 · Updated 2 years ago
NUStreaming / BoB
☆12 · Oct 18, 2022 · Updated 3 years ago
FanmingL / SmartLogger
☆12 · May 14, 2024 · Updated 2 years ago
ChenDRAG / SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.or…
☆42 · Oct 11, 2023 · Updated 2 years ago
chenran-li / RQL-release
(NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value
☆35 · Mar 29, 2024 · Updated 2 years ago
thu-ml / CEP-energy-guided-diffusion
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction
☆35 · Nov 3, 2023 · Updated 2 years ago
tinkoff-ai / cnf
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12 · Jan 31, 2023 · Updated 3 years ago
WentDong / Adapt
Actuator Degeneration Adaptation Transformer
☆14 · Sep 19, 2023 · Updated 2 years ago
mazpie / redundancy-action-spaces
[RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation
☆23 · Jun 6, 2024 · Updated 2 years ago
polixir / d3pe
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluate a large set of candidate policies from a fixed dataset to select the best ones.
☆10 · Jun 2, 2022 · Updated 4 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆169 · Sep 12, 2023 · Updated 2 years ago
SafeRL-Lab / Safety-MuJoCo
[AAAI 2024 (Oral)] Safety-MuJoCo Environments.
☆12 · Jun 4, 2024 · Updated 2 years ago
ML-Group-SDU / DiffAIL
☆24 · Sep 27, 2024 · Updated last year
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28 · Mar 24, 2023 · Updated 3 years ago
luke-ck / vgpmp
Variational Gaussian Process Motion Planning
☆21 · Jul 30, 2024 · Updated last year
RoozbehRazavi / BIMRL
Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)
☆10 · Dec 1, 2022 · Updated 3 years ago
KAIST-AILab / imitation-dice
☆17 · Dec 30, 2024 · Updated last year
mansicer / Q-Adapter
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18 · Oct 5, 2024 · Updated last year
liziniu / HyperDQN
Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)
☆12 · Nov 28, 2023 · Updated 2 years ago
gouxiangchen / ac-ppo
Actor-Critic and OpenAI clipped PPO in the gym cartpole-v0 and pendulum-v0 environments
☆27 · Aug 2, 2020 · Updated 5 years ago
n13eho / Schaferct
The source code of team 🥇Schaferct in 2nd Bandwidth Prediction of MMSys'24.
☆17 · May 13, 2024 · Updated 2 years ago
heatz123 / tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆36 · Jan 24, 2026 · Updated 5 months ago
zwfightzw / Meta-Critic
☆11 · Oct 19, 2020 · Updated 5 years ago
intuitive-robots / flower_vla_pret
[CoRL 2025] Pretraining code for FLOWER VLA on OXE
☆41 · Sep 22, 2025 · Updated 10 months ago
ALRhub / X_IL
X-IL: Exploring the Design Space of Imitation Learning Policies
☆62 · Mar 7, 2025 · Updated last year