proroklab / ControllingBehavioralDiversity
This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforcement learning.
☆14 Updated last month
Related projects
Alternatives and complementary repositories for ControllingBehavioralDiversity
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL ☆35 Updated 2 months ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning. ☆14 Updated last week
- Model-Based Uncertainty in Value Functions (AISTATS2023) ☆17 Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆55 Updated 10 months ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning" ☆14 Updated last year
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning. ☆46 Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments. ☆32 Updated 3 weeks ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆36 Updated 3 weeks ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆34 Updated 2 years ago
- DecentralizedLearning ☆21 Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆26 Updated 5 months ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- The Starcraft Multi-Agent challenge lite ☆38 Updated 2 months ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor… ☆25 Updated last year
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL ☆27 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆57 Updated 5 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆26 Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆80 Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆22 Updated 7 months ago
- Toolkit of Causal Model-based Reinforcement Learning. ☆32 Updated last year
- Synthetic Experience Replay ☆74 Updated 5 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆30 Updated 8 months ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning ☆11 Updated 6 months ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork ☆13 Updated 6 months ago