supersglzc / ddiffpg
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
☆19Updated last year
Alternatives and similar repositories for ddiffpg
Users that are interested in ddiffpg are comparing it to the libraries listed below
- [NeurIPS'24] The Official PyTorch implementation of DRAIL☆52Updated last year
- official implementation of QVPO☆55Updated last week
- Implementation of SAC and TD3 based on various RNN and Transformer.☆28Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆19Updated 9 months ago
- ☆46Updated 3 months ago
- [RA-L/ICRA2025] Official implementation for paper "Diverse Controllable Diffusion Policy with Signal Temporal Logic."☆32Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆23Updated 2 months ago
- [NeurIPS2024 DTQL] Diffusion Trusted Q-Learning for Offline RL — Official PyTorch Implementation☆22Updated last year
- Exploiting Transformer in Reinforcement Learning for Interpretable Temporal Logic Motion Planning (RAL 2023)☆13Updated 2 years ago
- ☆35Updated last year
- ☆11Updated last month
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans☆21Updated last year
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments.☆10Updated last year
- [ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆65Updated 2 years ago
- Official repo for Offline RL for Online RL☆18Updated 2 years ago
- [CoRL 2022] Official implementation of the publication Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinfo…☆26Updated 2 years ago
- ☆48Updated last year
- [TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving☆12Updated 2 years ago
- ☆45Updated last year
- ☆32Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Updated 2 years ago
- [IROS 22'] Model-free Neural Lyapunov Control☆27Updated 2 years ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆72Updated last year
- A multi-subtask reinforcement learning method where complex tasks can be decomposed into low-level subtasks.☆37Updated 3 years ago
- Model Predictive Actor-Critic Reinforcement Learning☆68Updated 4 years ago
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆29Updated 8 months ago
- Official implementation for the LABOR (LAnguage-model-based Bimanual ORchestration) Agent.☆21Updated last year
- [ICML2025] Official implementation of Efficient Online Reinforcement Learning for Diffusion Policies appearing in ICML 2025.☆37Updated 4 months ago
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'☆15Updated last year
- ☆31Updated last year