supersglzc / ddiffpgLinks
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
☆15Updated 8 months ago
Alternatives and similar repositories for ddiffpg
Users that are interested in ddiffpg are comparing it to the libraries listed below
Sorting:
- official implementation of QVPO☆44Updated 9 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆22Updated 9 months ago
- [NeuIPS2024 DTQL] Diffusion Trusted Q-Learning for Offline RL — Official PyTorch Implementation☆16Updated last year
- ☆45Updated last year
- [NeurIPS'24] The Official PyTorch implementation of DRAIL☆42Updated 7 months ago
- A collection of recent MARL papers☆94Updated 8 months ago
- Meta-RL Model-Based Algorithm☆40Updated 2 months ago
- Synthetic Experience Replay☆94Updated last year
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆12Updated 9 months ago
- Minimal code for A Generalist Agent☆42Updated 2 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆157Updated 2 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆44Updated 9 months ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆34Updated 3 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆32Updated 9 months ago
- [NeurIPS 2023] Efficient Diffusion Policy☆105Updated last year
- [NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"☆49Updated 2 years ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆19Updated last month
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆66Updated last year
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆100Updated 2 months ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆66Updated 4 years ago
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆24Updated 3 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆63Updated last year
- ☆44Updated 7 months ago
- Repo for Implicit Diffusion Q-Learning☆112Updated last year
- ☆35Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆45Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆16Updated 4 months ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆79Updated 3 weeks ago
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆42Updated last year