Fang-Lin93 / DACLinks
DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.
☆19Updated last year
Alternatives and similar repositories for DAC
Users that are interested in DAC are comparing it to the libraries listed below
Sorting:
- ☆28Updated last year
- ☆24Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆95Updated 10 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆84Updated 6 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆74Updated last year
- ☆25Updated 9 months ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆25Updated 9 months ago
- ☆47Updated 6 months ago
- ☆61Updated 6 months ago
- Transformer-based World Models☆82Updated 2 years ago
- ☆102Updated 2 years ago
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆31Updated last year
- ☆23Updated last year
- official implementation of QVPO☆36Updated 7 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆22Updated 8 months ago
- ☆10Updated last year
- ☆26Updated 11 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆36Updated last year
- ☆33Updated 2 years ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆48Updated last week
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆59Updated last year
- [ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…☆26Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆28Updated last year
- ☆87Updated last year
- [ICLR 2025] Bootstrapped Model Predictive Control☆14Updated last month
- Synthetic Experience Replay☆92Updated last year
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆39Updated 11 months ago
- ☆29Updated last year
- ☆13Updated 6 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year