supersglzc / ddiffpg

Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient

☆15

Alternatives and similar repositories for ddiffpg

Users who are interested in ddiffpg are comparing it to the repositories listed below.

wadx2019 / qvpo
Official implementation of QVPO
☆44 Updated 9 months ago
FanmingL / Recurrent-Offpolicy-RL
Implementations of SAC and TD3 based on various RNNs and Transformers.
☆22 Updated 9 months ago
TianyuCodings / Diffusion_Trusted_Q_Learning
[NeurIPS 2024 DTQL] Diffusion Trusted Q-Learning for Offline RL: Official PyTorch Implementation
☆16 Updated last year
cheryyunl / Make-An-Agent
☆45 Updated last year
NVlabs / DRAIL
[NeurIPS'24] The Official PyTorch implementation of DRAIL
☆42 Updated 7 months ago
chrisyrniu / Recent-Advances-in-Multi-Agent-Reinforcement-Learning
A collection of recent MARL papers
☆94 Updated 8 months ago
zoharri / mamba
Meta-RL Model-Based Algorithm
☆40 Updated 2 months ago
conglu1997 / SynthER
Synthetic Experience Replay
☆94 Updated last year
Guozheng-Ma / Adaptive-Replay-Ratio
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆12 Updated 9 months ago
YushuoLi / Gato-A-Generalist-Agent
Minimal code for A Generalist Agent
☆42 Updated 2 years ago
facebookresearch / mtm
MTM: Masked Trajectory Models for Prediction, Representation, and Control.
☆157 Updated 2 years ago
NJU-RL / Meta-DT
[NeurIPS 2024] Official Implementation of Meta-DT
☆44 Updated 9 months ago
martius-lab / HiTS
Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021
☆34 Updated 3 years ago
heatz123 / tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆32 Updated 9 months ago
sail-sg / edp
[NeurIPS 2023] Efficient Diffusion Policy
☆105 Updated last year
metadriverse / policydissect
[NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"
☆49 Updated 2 years ago
qiwang067 / CoWorld
[NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…
☆19 Updated last month
marc-rigter / polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
☆66 Updated last year
alirezakazemipour / DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
☆100 Updated 2 months ago
LunjunZhang / world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
☆66 Updated 4 years ago
Alescontrela / score_matching_rl
Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"
☆24 Updated 3 months ago
twni2016 / Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆63 Updated last year
sukhijab / maxinforl_torch
☆44 Updated 7 months ago
philippe-eecs / IDQL
Repo for Implicit Diffusion Q-Learning
☆112 Updated last year
jhejna / few-shot-preference-rl
☆35 Updated 2 years ago
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆27 Updated 2 years ago
thu-ml / SRPO
Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
☆45 Updated last year
wkh923 / m3pc
M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025
☆16 Updated 4 months ago
zbzhu99 / madiff
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆79 Updated 3 weeks ago
cccedric / cpql
This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".
☆42 Updated last year