Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
☆33Oct 6, 2022Updated 3 years ago
Alternatives and similar repositories for meta-mapg
Users who are interested in meta-mapg are comparing it to the libraries listed below.
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Nov 18, 2022Updated 3 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 3 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Aug 21, 2018Updated 7 years ago
- Implementation for the ICML 2016 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- Daily Paper Reading☆23Jan 17, 2026Updated 2 months ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆152Apr 13, 2023Updated 2 years ago
- Official PyTorch implementation of the paper "Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts".☆23Nov 22, 2025Updated 4 months ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- [AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks☆10Aug 22, 2024Updated last year
- MARS (Multi-Agent Research Studio) is a library for multi-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Aug 26, 2022Updated 3 years ago
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- Sigma-Point Kalman Filters☆12Aug 21, 2018Updated 7 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 3 years ago
- All the codes and data used in "Inverse design of soft materials via a deep-learning-based evolutionary strategy", by G. M. Coli, E. Boat…☆11Oct 26, 2021Updated 4 years ago
- Unsupervised learning of structure in systems of interacting particles.☆13Nov 13, 2023Updated 2 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Combination of Rapidly-Exploring Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…☆18Mar 13, 2026Updated last week
- Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …☆16Oct 12, 2025Updated 5 months ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Feb 2, 2021Updated 5 years ago
- Tube-Based Zonotopic Data Driven Predictive Control☆12Nov 23, 2022Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 2 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 4 years ago
- MAML implementation (TensorFlow)☆15Apr 25, 2019Updated 6 years ago
- Simple, extensible implementations of some meta-learning algorithms in JAX☆11Oct 6, 2020Updated 5 years ago
- Implementation of the G2RL approach in the POGEMA environment☆14Jun 5, 2024Updated last year
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆14Dec 13, 2024Updated last year
- Multi-Agent PathFinding (MAPF) for 2D Robots moving inventory on a grid - Practice building environment + robots + planning + inventory m…☆16Nov 20, 2023Updated 2 years ago
- Code for Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks (NeurIPS 2025)☆20Mar 16, 2026Updated last week
- ☆10Oct 11, 2022Updated 3 years ago
- A Gymnasium environment for simulating multi-robot planning.☆30Sep 23, 2023Updated 2 years ago
- Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch☆877Dec 27, 2022Updated 3 years ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- ☆25Jan 2, 2019Updated 7 years ago