apexrl/AORPO

Official PyTorch implementation of the paper "Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts".

☆23

Alternatives and similar repositories for AORPO

Users who are interested in AORPO are comparing it to the libraries listed below.

IanRDavies / LeMOL
Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…
☆14 · Apr 26, 2022 · Updated 4 years ago
uoe-agents / LIAM
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆43 · Oct 5, 2022 · Updated 3 years ago
xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32 · Nov 22, 2025 · Updated 8 months ago
hanizaidi110 / Opponent-Modeling-and-Predicting-Opponent-moves-in-Poker
Advanced_Data_Integration_Project
☆11 · Jul 31, 2018 · Updated 7 years ago
yjpark1 / competitiveMARL
Multi-agent reinforcement learning for competitive environments using PyTorch
☆14 · Dec 31, 2019 · Updated 6 years ago
YeTianJHU / GSCU
Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)
☆25 · Aug 4, 2022 · Updated 3 years ago
sii-yingwen / rommeo
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23 · Dec 8, 2022 · Updated 3 years ago
sjtu-marl / ZSC-Eval
This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…
☆56 · Nov 22, 2025 · Updated 8 months ago
julienroyd / coordination-marl
Code to reproduce experiments from:
☆10 · Dec 11, 2020 · Updated 5 years ago
IrisLi17 / self-imitation-via-reduction
☆17 · Mar 13, 2021 · Updated 5 years ago
alleboudy / pointnet
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
☆12 · Dec 28, 2019 · Updated 6 years ago
vint-1 / dreamsmooth
DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)
☆12 · May 6, 2024 · Updated 2 years ago
jsikyoon / bmaml_rl
This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.
☆20 · Jan 19, 2023 · Updated 3 years ago
CatherineMeng / FGYM-user-demo
Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning
☆14 · Aug 12, 2021 · Updated 4 years ago
qu-tan-um / PKU_EECS_UGR_THSS
LaTeX Template for Undergraduate Thesis at School of EECS, Peking University
☆29 · Jul 4, 2020 · Updated 6 years ago
YQ-XiaMLTech / SRv6-GNN
The Incremental Deployment Method of Segment Routing over an IPv6 (SRv6) Network Based on Graph Neural Network (GNN) and Multi-Agent Rein…
☆16 · Jan 24, 2025 · Updated last year
UnrealTracking / ToM2C
The official implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022).
☆76 · Nov 4, 2024 · Updated last year
suyoung-lee / LDM
Latent Dynamics Mixture, NeurIPS 2021
☆18 · Oct 25, 2022 · Updated 3 years ago
hysts / pytorch_resnet_preact
A PyTorch implementation of ResNet-preact
☆12 · Aug 5, 2019 · Updated 6 years ago
carolinewang01 / naht
Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., NeurIPS 2024).
☆29 · Oct 2, 2025 · Updated 9 months ago
hychen-naza / LEAP
☆17 · Sep 28, 2023 · Updated 2 years ago
liy1shu / FlowBotHD
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
☆13 · Dec 13, 2024 · Updated last year
siahuat0727 / ubiquant-market-prediction
A PyTorch Lightning template to try out a wide range of ideas on the Ubiquant Market Prediction competition without modifying any code!
☆12 · Mar 24, 2022 · Updated 4 years ago
shi3z / local_mixture_of_agents
☆12 · Jul 6, 2024 · Updated 2 years ago
LunjunZhang / world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
☆71 · Jul 17, 2021 · Updated 5 years ago
wbbhcb / RL-Stock
📈 How to trade stocks automatically with deep reinforcement learning
☆13 · Mar 31, 2020 · Updated 6 years ago
dkkim93 / meta-mapg
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
☆34 · Oct 6, 2022 · Updated 3 years ago
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆189 · Apr 12, 2022 · Updated 4 years ago
alexis-jacq / LOLA_DiCE
PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆98 · Aug 21, 2018 · Updated 7 years ago
apexrl / bmpo
Implementation of the ICML 2020 paper "Bidirectional Model-based Policy Optimization"
☆23 · Mar 24, 2023 · Updated 3 years ago
facebookresearch / cascade
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
☆30 · Oct 25, 2022 · Updated 3 years ago
ygjin11 / task-hypernet
The official implementation of the paper "Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork".
☆12 · Feb 27, 2024 · Updated 2 years ago
Stanford-ILIAD / ELLA
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21 · Mar 9, 2021 · Updated 5 years ago
ziyadsheeba / qfat
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆11 · Mar 3, 2026 · Updated 4 months ago
zgbkdlm / tme
Taylor moment expansion in Python (JAX and SymPy) and Matlab
☆11 · Nov 26, 2024 · Updated last year
arxrean / SGG_Ex_RC
Code for Scene Graph Generation with External Knowledge and Image Reconstruction
☆25 · Dec 1, 2019 · Updated 6 years ago
ZhecanJamesWang / GLAT_SGG
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11 · Dec 16, 2020 · Updated 5 years ago
Lee-zix / MARLPaR
Code and models for the paper Path Reasoning over Knowledge Graph: A Multi-Agent and Reinforcement Learning Based Method
☆18 · Nov 23, 2020 · Updated 5 years ago
facebookresearch / ego-env
Human-centric environment representations from egocentric video
☆15 · Feb 5, 2026 · Updated 5 months ago