j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆30Updated last year
Related projects ⓘ
Alternatives and complementary repositories for dfac
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- ☆71Updated 5 months ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆62Updated 3 years ago
- ☆54Updated 8 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 2 years ago
- ☆47Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆36Updated 3 weeks ago
- Model-Based Offline Reinforcement Learning☆47Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆42Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated last year
- ☆18Updated 2 years ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆21Updated 2 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆17Updated 2 years ago
- ☆52Updated last year
- ☆44Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆25Updated last year
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆34Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆52Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆67Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆19Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning☆29Updated last year
- ☆24Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago