machine-teaching-group / neurips2022_exploration-guided-reward-shaping
☆11Updated last year
Related projects:
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆123Updated 8 months ago
- This is the official implementation of Multi-Agent PPO.☆89Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆21Updated 10 months ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆70Updated 9 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆43Updated last year
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated last year
- ☆180Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆91Updated 2 years ago
- Constrained Policy Optimization implementation on Safety Gym☆21Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆108Updated 6 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆87Updated 3 years ago
- A collection of offline reinforcement learning algorithms.☆153Updated 3 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆139Updated 5 months ago
- Paper list for constrained policy optimization in reinforcement learning.☆64Updated 10 months ago
- ☆39Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆150Updated last week
- ☆87Updated 2 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020)☆102Updated 2 years ago
- ☆87Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆117Updated 4 months ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆102Updated 5 months ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆38Updated this week
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆99Updated 3 years ago
- There will be updates later☆79Updated 5 years ago
- ☆54Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- A clean and robust PyTorch implementation of PPO on discrete action spaces☆56Updated 3 months ago