raincchio / P3O
☆11Updated 4 months ago
Related projects
Alternatives and complementary repositories for P3O
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Updated last year
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆19Updated 2 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Updated 9 months ago
- Revisiting Discrete Gradient Estimation in MADDPG☆23Updated last year
- ☆17Updated 9 months ago
- ☆13Updated 3 years ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agent Reinforcement Learning".☆18Updated last year
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆15Updated last year
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- ☆28Updated 3 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆26Updated last year
- ☆34Updated 2 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆12Updated 2 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆19Updated 2 years ago
- ☆26Updated 4 years ago
- ☆18Updated 3 years ago
- Meta RL codebase for Unstable Baselines☆20Updated last year
- ☆11Updated 2 years ago
- ☆25Updated 7 months ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆12Updated last year
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆14Updated 5 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated 2 months ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆18Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆73Updated 11 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year
- Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆18Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆27Updated 3 years ago
- ☆24Updated 2 years ago