TianyuCodings / Diffusion_Trusted_Q_Learning

[NeurIPS 2024 DTQL] Diffusion Trusted Q-Learning for Offline RL: Official PyTorch Implementation

☆16

Alternatives and similar repositories for Diffusion_Trusted_Q_Learning

Users who are interested in Diffusion_Trusted_Q_Learning are comparing it to the repositories listed below.


sail-sg / edp
[NeurIPS 2023] Efficient Diffusion Policy
☆105 Updated last year
tinnerhrhe / MTDiff
☆62 Updated 8 months ago
Liang-ZX / AdaptDiffuser
[ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"
☆63 Updated last year
wadx2019 / qvpo
official implementation of QVPO
☆41 Updated 8 months ago
cccedric / cpql
This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".
☆42 Updated last year
BellmanTimeHut / DIPO
☆104 Updated 2 years ago
ZibinDong / AlignDiff-ICLR2024
☆31 Updated last year
zbzhu99 / madiff
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆79 Updated 3 weeks ago
thu-ml / SRPO
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
☆44 Updated last year
zoharri / mamba
Meta-RL Model-Based Algorithm
☆38 Updated 2 months ago
devinluo27 / comp_diffuser_release
Generative Trajectory Stitching through Diffusion Composition
☆24 Updated 2 months ago
Fang-Lin93 / DAC
DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.
☆19 Updated last year
ZhengYinan-AIR / FISOR
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
☆107 Updated 5 months ago
Alescontrela / score_matching_rl
Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"
☆23 Updated 3 months ago
marc-rigter / polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
☆66 Updated last year
diffuserlite / diffuserlite.github.io
☆43 Updated 11 months ago
weipu-zhang / STORM
☆93 Updated last year
bit1029public / HRSSM
PyTorch implementation of "Learning Latent Dynamic Robust Representations for World Models"
☆19 Updated last year
jrobine / twm
Transformer-based World Models
☆83 Updated 2 years ago
quantumiracle / Consistency_Model_For_Reinforcement_Learning
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24
☆26 Updated 10 months ago
thuml / HarmonyDream
Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344
☆41 Updated last year
EmptyJackson / policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆137 Updated 11 months ago
imgeorgiev / PWM
PWM: Policy Learning with Large World Models
☆53 Updated 4 months ago
zhyang2226 / DMBP
[ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.
☆14 Updated last year
conglu1997 / SynthER
Synthetic Experience Replay
☆94 Updated last year
zhaohengyin / EfficientImitate
Codebase of NeurIPS 2022 paper "Planning for Sample Efficient Imitation Learning"
☆40 Updated 2 years ago
NVlabs / DRAIL
[NeurIPS'24] The Official PyTorch implementation of DRAIL
☆42 Updated 7 months ago
elicassion / StARformer
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
☆95 Updated 2 years ago
ldcq / ldcq
☆35 Updated 2 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆70 Updated last year