PKU-RL/MBOM

PKU-RL / MBOM

☆13

Alternatives and similar repositories for MBOM

Users who are interested in MBOM are comparing it to the repositories listed below.

YeTianJHU / GSCU
Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)
☆25 · Aug 4, 2022 · Updated 3 years ago
uoe-agents / LIAM
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆43 · Oct 5, 2022 · Updated 3 years ago
hanizaidi110 / Opponent-Modeling-and-Predicting-Opponent-moves-in-Poker
Advanced_Data_Integration_Project
☆11 · Jul 31, 2018 · Updated 7 years ago
Sandholm-Lab / ESCHER
☆16 · Jul 13, 2022 · Updated 4 years ago
ying-wen / gr2
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆14 · Dec 8, 2022 · Updated 3 years ago
apexrl / AORPO
Official PyTorch implementation of the paper "Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts".
☆23 · Nov 22, 2025 · Updated 8 months ago
jiechuanjiang / I2Q
I2Q: A Fully Decentralized Q-Learning Algorithm
☆19 · Nov 10, 2022 · Updated 3 years ago
uoe-agents / PO-GPL
Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"
☆15 · Mar 1, 2023 · Updated 3 years ago
dkkim93 / meta-mapg
Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)
☆34 · Oct 6, 2022 · Updated 3 years ago
PKU-RL / CORRO
[ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
☆40 · Aug 17, 2022 · Updated 3 years ago
jbr-ai-labs / mamba
This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
☆67 · Apr 8, 2025 · Updated last year
chan64 / remote_sensing_image_captioning
Remote sensing Image Captioning is a special case of Image Captioning which solves the difficulties in processing the remote sensing imag…
☆12 · Jun 16, 2021 · Updated 5 years ago
PKU-RL / I2C
☆48 · Jun 29, 2021 · Updated 5 years ago
sii-yingwen / rommeo
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23 · Dec 8, 2022 · Updated 3 years ago
facebookresearch / off-belief-learning
Implementation of the Off Belief Learning algorithm.
☆49 · Aug 18, 2022 · Updated 3 years ago
menglinjian / Deep-FTRL-ORW
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11 · Dec 1, 2022 · Updated 3 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆26 · Nov 29, 2022 · Updated 3 years ago
nigelyaoj / Quality-Similar-Diversity
Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning
☆19 · Dec 26, 2025 · Updated 6 months ago
QPD-NeurIPS2019 / QPD
This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).
☆12 · May 20, 2019 · Updated 7 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆57 · Aug 30, 2024 · Updated last year
risk-sensitive-reachability / ToolboxLS
Clone of Ian M. Mitchell's ToolboxLS repository. (https://bitbucket.org/ian_mitchell/toolboxls)
☆14 · Jan 18, 2020 · Updated 6 years ago
indylab / nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆40 · Aug 27, 2021 · Updated 4 years ago
IanRDavies / LeMOL
Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…
☆14 · Apr 26, 2022 · Updated 4 years ago
lasithagt / admm
☆19 · Aug 8, 2023 · Updated 2 years ago
deeptexas-ai / deep-holdem-ai
Strongest Texas Hold'em AI for online poker: real-time decision engine, GTO solver, equity calculator
☆24 · Updated this week
hiteshK03 / Remote-sensing-image-captioning-with-transformer-and-multilabel-classification
☆18 · Nov 23, 2022 · Updated 3 years ago
CNDOTA / NeurIPS22-ATM
☆15 · Oct 9, 2022 · Updated 3 years ago
jhartford / DeepCognitiveHierarchy
Implementation of Deep Learning for Predicting Human Strategic Behavior
☆15 · Apr 6, 2017 · Updated 9 years ago
sjtu-marl / bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆24 · Feb 27, 2022 · Updated 4 years ago
rpSebastian / AutoCFR
Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)
☆22 · Apr 22, 2024 · Updated 2 years ago
zhoubohan0 / STG-Transformer
[NeurIPS 2023] Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer"
☆17 · Oct 1, 2023 · Updated 2 years ago
quantumiracle / nash-dqn
Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…
☆22 · Aug 26, 2022 · Updated 3 years ago
dickreuter / tf_rl
Reinforcement learning framework
☆15 · Mar 25, 2023 · Updated 3 years ago
liy1shu / FlowBotHD
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
☆13 · Dec 13, 2024 · Updated last year
shi3z / local_mixture_of_agents
☆12 · Jul 6, 2024 · Updated 2 years ago
PKU-RL / FOP-DMAC-MACPF
☆14 · Mar 5, 2023 · Updated 3 years ago
phejohnwang / MRCScheduling
Scheduling Environment for Multi-Robot Coordination Problems
☆18 · May 9, 2022 · Updated 4 years ago
CILAB-MA / Machine_ToM
The Implementation of "Machine Theory of Mind", ICML 2018
☆28 · Mar 14, 2022 · Updated 4 years ago
mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆123 · Aug 3, 2022 · Updated 3 years ago