Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
☆34Oct 6, 2022Updated 3 years ago
Alternatives and similar repositories for meta-mapg
Users that are interested in meta-mapg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Model-Free Opponent Shaping (ICML 2022)☆24Nov 18, 2022Updated 3 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆98Aug 21, 2018Updated 7 years ago
- Daily Paper Reading☆23Jan 17, 2026Updated 5 months ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆152Apr 13, 2023Updated 3 years ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- [AAAI-23] Improving Pareto Front Learning via Multi-Sample Hypernetworks☆10Aug 22, 2024Updated last year
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Apr 19, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆22Aug 26, 2022Updated 3 years ago
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- Sigma-Point Kalman Filters☆12Aug 21, 2018Updated 7 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- All the codes and data used in "Inverse design of soft materials via a deep-learning-based evolutionary strategy", by G. M. Coli, E. Boat…☆12Oct 26, 2021Updated 4 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles☆10Mar 6, 2026Updated 3 months ago
- Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …☆18Oct 12, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Feb 2, 2021Updated 5 years ago
- ☆31Jun 23, 2026Updated last week
- Tube-Based Zonotopic Data Driven Predictive Control☆12Nov 23, 2022Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Combination of Rapidly-Exporing Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…☆18May 17, 2026Updated last month
- Implementation of "Exponential Natural Evolution Strategies" (xNES) https://arxiv.org/abs/1106.4487☆20Dec 11, 2019Updated 6 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 3 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 5 years ago
- Implementation of the G2RL approach in the POGEMA environment☆15Jun 5, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆13Dec 13, 2024Updated last year
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆43Oct 5, 2022Updated 3 years ago
- Python implementation for Combining Latent Space and Structured Kernels for Bayesian Optimization over Combinatorial Spaces.☆13Nov 30, 2021Updated 4 years ago
- A collection of scripts for pairing OVITO with freud and other Glotzer lab packages☆16Jun 22, 2026Updated last week
- Multi-Agent PathFinding (MAPF) for 2D Robots moving inventory on a grid - Practice building environment + robots + planning + inventory m…☆16Nov 20, 2023Updated 2 years ago
- Code for Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks (NeurIPS 2025)☆20Mar 16, 2026Updated 3 months ago
- ☆10Oct 11, 2022Updated 3 years ago