moratodpg / imp_marl
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
☆35 · Updated last week
Related projects:
- Simple maze environments using mujoco-py ☆52 · Updated 8 months ago
- The Starcraft Multi-Agent challenge lite ☆32 · Updated last week
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆31 · Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆33 · Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021) ☆31 · Updated last year
- Code for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆49 · Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆45 · Updated 3 months ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆75 · Updated 9 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems. ☆61 · Updated last year
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future" ☆39 · Updated 2 years ago
- Distributional Soft Actor Critic ☆49 · Updated 4 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies ☆57 · Updated 3 weeks ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆70 · Updated 9 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 · Updated 3 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020) ☆37 · Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆53 · Updated 3 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆65 · Updated this week
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆63 · Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL ☆108 · Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆24 · Updated last year
- Toolkit of Causal Model-based Reinforcement Learning. ☆32 · Updated last year
- DecentralizedLearning ☆19 · Updated last year
- Benchmarking RL generalization in an interpretable way. ☆128 · Updated 7 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆16 · Updated 8 months ago