Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
☆34Oct 6, 2022Updated 3 years ago
Alternatives and similar repositories for meta-mapg
Users that are interested in meta-mapg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 11, 2022Updated 3 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Apr 15, 2026Updated last month
- Daily Paper Reading☆24Jan 17, 2026Updated 4 months ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆23Nov 22, 2025Updated 6 months ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- [AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks☆10Aug 22, 2024Updated last year
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Apr 19, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Mar 8, 2024Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆21Aug 26, 2022Updated 3 years ago
- ☆16Oct 23, 2023Updated 2 years ago
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- ☆26Sep 25, 2022Updated 3 years ago
- Sigma-Point Kalman Filters☆12Aug 21, 2018Updated 7 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- All the codes and data used in "Inverse design of soft materials via a deep-learning-based evolutionary strategy", by G. M. Coli, E. Boat…☆11Oct 26, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles☆10Mar 6, 2026Updated 2 months ago
- Training materials for IGVC and RoboRacing☆17Nov 10, 2019Updated 6 years ago
- Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …☆17Oct 12, 2025Updated 7 months ago
- ☆30Nov 10, 2025Updated 6 months ago
- Tube-Based Zonotopic Data Driven Predictive Control☆12Nov 23, 2022Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Combination of Rapidly-Exporing Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…☆18May 17, 2026Updated last week
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆309Apr 13, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- MAML implementation (tensorflow)☆15Apr 25, 2019Updated 7 years ago
- Implementation of the G2RL approach in the POGEMA environment☆14Jun 5, 2024Updated last year
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆13Dec 13, 2024Updated last year
- Python implementation for Combining Latent Space and Structured Kernels for Bayesian Optimization over Combinatorial Spaces.☆13Nov 30, 2021Updated 4 years ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Multi-Agent PathFinding (MAPF) for 2D Robots moving inventory on a grid - Practice building environment + robots + planning + inventory m…☆16Nov 20, 2023Updated 2 years ago
- Code for Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks (NeurIPS 2025)☆20Mar 16, 2026Updated 2 months ago