wadx2019 / qvpoLinks
official implementation of QVPO
☆45Updated 10 months ago
Alternatives and similar repositories for qvpo
Users that are interested in qvpo are comparing it to the libraries listed below
Sorting:
- NeurIPS 2024 DACER☆134Updated last week
- ☆111Updated 2 years ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆108Updated 6 months ago
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆43Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy☆107Updated last year
- [ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆64Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆78Updated last month
- ☆43Updated last year
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models☆75Updated last year
- ☆62Updated 9 months ago
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.☆19Updated last year
- [NeuIPS2024 DTQL] Diffusion Trusted Q-Learning for Offline RL — Official PyTorch Implementation☆18Updated last year
- ☆30Updated last year
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning…☆52Updated 3 months ago
- ☆244Updated last month
- ☆95Updated last year
- ☆31Updated last year
- ☆83Updated last month
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆41Updated last year
- [NeurIPS'24] The Official PyTorch implementation of DRAIL☆42Updated 8 months ago
- Generative Trajectory Stitching through Diffusion Composition☆28Updated 3 weeks ago
- Meta-RL Model-Based Algorithm☆40Updated 3 months ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆62Updated last week
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆25Updated 4 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆22Updated 10 months ago
- Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"☆139Updated last year
- ☆52Updated 2 months ago
- [CoRL 2024] Official implementation for "MaIL: Improving Imitation Learning with Selective State Space Models"☆42Updated 6 months ago
- ☆380Updated last year
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆23Updated 9 months ago