zbzhu99 / madiff
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆68 Updated 2 months ago
Alternatives and similar repositories for madiff:
Users who are interested in madiff are comparing it to the repositories listed below.
- ☆95 Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆95 Updated 2 months ago
- ☆61 Updated 5 months ago
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024 ☆16 Updated 5 months ago
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning… ☆48 Updated 8 months ago
- [NeurIPS 2023] Efficient Diffusion Policy ☆98 Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆34 Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆61 Updated 10 months ago
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy". ☆35 Updated 11 months ago
- ☆31 Updated last year
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models ☆64 Updated last year
- This repo relates to the survey paper "Goal-Conditioned Reinforcement Learning: Problems and Solutions". We collect widely used benchmar… ☆124 Updated last year
- ☆23 Updated 6 months ago
- Model-based Offline Policy Optimization, re-implemented entirely in PyTorch ☆31 Updated last year
- NeurIPS 2024 DACER ☆101 Updated last week
- ☆46 Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆82 Updated last year
- ☆21 Updated 11 months ago
- ☆29 Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆52 Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆88 Updated 7 months ago
- ☆44 Updated 3 weeks ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆75 Updated last year
- official implementation of QVPO ☆30 Updated 6 months ago
- Synthetic Experience Replay ☆91 Updated 10 months ago
- [ICML 2024] The official implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage… ☆25 Updated 10 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer. ☆20 Updated 6 months ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆25 Updated 3 weeks ago
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning. ☆17 Updated 10 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆29 Updated 4 months ago