Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.
☆21Nov 22, 2025Updated 3 months ago
Alternatives and similar repositories for AORPO
Users that are interested in AORPO are comparing it to the libraries listed below
Sorting:
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆64Apr 8, 2025Updated 10 months ago
- ☆13Oct 11, 2022Updated 3 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆41Oct 5, 2022Updated 3 years ago
- DecentralizedLearning☆25Dec 8, 2022Updated 3 years ago
- PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation☆12Dec 28, 2019Updated 6 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 3 months ago
- multi-agent reinforcement learning for competitive environments using pytorch☆14Dec 31, 2019Updated 6 years ago
- ☆17Mar 13, 2021Updated 4 years ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆54Nov 22, 2025Updated 3 months ago
- ☆17Sep 28, 2023Updated 2 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., Neurips 2024).☆24Oct 2, 2025Updated 5 months ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 4 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 2 years ago
- For managing 2P imaging datasets from preprocessing to activity trace extraction☆10Apr 12, 2019Updated 6 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆68Jul 17, 2021Updated 4 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆185Apr 12, 2022Updated 3 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Code for Scene Graph Generation with External Knowledge and Image Reconstruction☆25Dec 1, 2019Updated 6 years ago
- BabyAI++: Towards Grounded language Learning beyond Memorization, ICLR BeTR-RL 2020☆26Jul 28, 2020Updated 5 years ago
- The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆63Dec 12, 2023Updated 2 years ago
- [ICLR 2022 Oral] Official PyTorch Implementation of "Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design".☆73Dec 6, 2023Updated 2 years ago
- Latex Template for Undergraduate Thesis at School of EECS, Peking University☆29Jul 4, 2020Updated 5 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆87Jan 24, 2024Updated 2 years ago
- This repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss…☆32Jan 21, 2022Updated 4 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆41Feb 27, 2024Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- Code to reproduce experiments from:☆10Dec 11, 2020Updated 5 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Solutions to assignments in course- "Bitcoin and Cryptocurrency Technologies", offered by coursera, Princeton University☆11Jun 28, 2018Updated 7 years ago
- ☆11Jun 1, 2017Updated 8 years ago
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Jul 8, 2024Updated last year
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Aug 21, 2018Updated 7 years ago