wadx2019 / qvpo
Official implementation of QVPO
☆20 Updated 3 months ago
Alternatives and similar repositories for qvpo:
Users who are interested in qvpo are comparing it to the libraries listed below.
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners" ☆54 Updated last year
- NeurIPS 2024 DACER ☆66 Updated last month
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆83 Updated this week
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models" ☆51 Updated last week
- [NeurIPS 2023] Efficient Diffusion Policy ☆91 Updated last year
- ☆87 Updated last year
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning… ☆39 Updated 5 months ago
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models ☆54 Updated 9 months ago
- ☆57 Updated 2 months ago
- This is a source repository for Multi-Agent Reinforcement Learning for Autonomous Driving research ☆22 Updated 4 months ago
- xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing ☆15 Updated 3 months ago
- ☆20 Updated 9 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆42 Updated 11 months ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24 ☆24 Updated 5 months ago
- ☆25 Updated 10 months ago
- ☆45 Updated last week
- [NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning ☆53 Updated last year
- ☆35 Updated last month
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning. ☆12 Updated 7 months ago
- ☆29 Updated last year
- ☆18 Updated 3 months ago
- [ICML 2024] Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided poli… ☆23 Updated 8 months ago
- Codebase for Extracting Reward Functions from Diffusion Models ☆15 Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆31 Updated 10 months ago
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans ☆18 Updated last year
- ☆19 Updated 3 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆26 Updated last year
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023) ☆17 Updated last year
- ☆70 Updated 7 months ago
- Codebase of NeurIPS 2022 paper "Planning for Sample Efficient Imitation Learning" ☆38 Updated 2 years ago