AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆13Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for MyDiscor
- Meta RL codebase for Unstable Baselines☆20Updated last year
- ☆45Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆47Updated last year
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆17Updated last year
- ☆28Updated last year
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆22Updated last week
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆72Updated 10 months ago
- ☆28Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- ☆18Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆61Updated 3 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆20Updated 2 years ago
- ☆41Updated 3 years ago
- ☆13Updated 3 years ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆37Updated last week
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆64Updated last year
- ☆51Updated last year
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- Model-based Offline Policy Optimization re-implement all by pytorch☆27Updated last year
- ☆28Updated 3 years ago
- There will be updates later☆81Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- ☆88Updated 3 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆38Updated last year
- code for ROMANCE☆12Updated 3 weeks ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆92Updated 3 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆23Updated last year
- Distributional Soft Actor Critic☆49Updated 4 years ago
- ☆34Updated 2 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated last month