Jackory/RPBT

(AAAI24 oral) Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO)

☆12

Alternatives and similar repositories for RPBT

Users who are interested in RPBT are comparing it to the repositories listed below.

thu-ml / CEURL
Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)
☆18 · Oct 13, 2024 · Updated last year
thu-rllab / SOG
Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agent Reinforcement Learning".
☆21 · Feb 20, 2023 · Updated 3 years ago
yingchengyang / CPPO
Official implementation for "Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk" (IJCAI 2022)
☆24 · Aug 29, 2024 · Updated last year
Sadie-Zhao / Zero-Sum-Stochastic-Stackelberg-Games-NeurIPS
This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".
☆16 · Oct 12, 2022 · Updated 3 years ago
menglinjian / Deep-FTRL-ORW
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11 · Dec 1, 2022 · Updated 3 years ago
onp / gmcr-py
A Decision Support System (DSS) based on the Graph Model for Conflict Resolution (GMCR).
☆15 · Apr 4, 2020 · Updated 5 years ago
npvoid / OnlineDoubleOracle
☆11 · Apr 23, 2021 · Updated 4 years ago
AnanyaJain3 / Spacecraft-Trajectory-Optimization
Project under CSF407 - AI
☆13 · Jun 24, 2024 · Updated last year
xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32 · Nov 22, 2025 · Updated 3 months ago
Amanda2024 / CARE-SMAC-MA_SAC
Multi-task Multi-agent Soft Actor Critic for SMAC
☆15 · Jan 18, 2022 · Updated 4 years ago
CORE-Robotics-Lab / HetNet
Public implementation of Heterogeneous Policy Networks (HetNet) from AAMAS'22 -- Paper Title: Learning Efficient Diverse Communication fo…
☆21 · Apr 23, 2024 · Updated last year
jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles
☆16 · Oct 6, 2019 · Updated 6 years ago
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum games
☆21 · Jul 30, 2024 · Updated last year
aicenter / openspiel_reproductions
Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works
☆18 · Mar 2, 2021 · Updated 5 years ago
diversepsro / diverse_psro
☆22 · May 20, 2021 · Updated 4 years ago
carolinewang01 / naht
Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., NeurIPS 2024).
☆24 · Oct 2, 2025 · Updated 5 months ago
ssokota / mmd
Code for magnetic mirror descent.
☆17 · Oct 5, 2023 · Updated 2 years ago
samjia2000 / HSP
This is a repository for Hidden-utility Self-Play.
☆26 · Jul 27, 2023 · Updated 2 years ago
tjuHaoXiaotian / pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…
☆173 · Jan 7, 2024 · Updated 2 years ago
jatinarora2702 / gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
☆26 · May 7, 2021 · Updated 4 years ago
zzq-bot / offline-marl-framework-offpymarl
Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.
☆35 · Oct 7, 2024 · Updated last year
sail-sg / rosmo
Code for "Efficient Offline Policy Optimization with a Learned Model" (ICLR 2023)
☆30 · Jul 18, 2023 · Updated 2 years ago
yang-xy20 / async_mappo
☆34 · Apr 11, 2023 · Updated 2 years ago
liuruoze / HierNet-SC2
(AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…
☆31 · Oct 5, 2022 · Updated 3 years ago
hijkzzz / noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
☆76 · Jun 9, 2023 · Updated 2 years ago
rowatc / Diplomacy-AI
We're building an AI to play the board game Diplomacy!
☆35 · Mar 27, 2022 · Updated 3 years ago
Shawn9927 / Hybrid-Particle-Swarm-Optimization-Algorithm
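A hybrid PSO (particle swarm optimization) algorithm combined with VNS (variable neighborhood search) for solving the task assignment problem in interception confrontation. The new algorithm effectively keeps the particle swarm from getting trapped in local convergence.
☆13 · Apr 2, 2022 · Updated 3 years ago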
leizhougetbetter / TemporalNetworks
☆11 · Jun 20, 2022 · Updated 3 years ago
tianoak / RobocupRescueAgentSimulation
A project for simulating rescue operations after a disaster
☆10 · Dec 4, 2020 · Updated 5 years ago
AbdAlazezAhmed / Oribital-Mechanics-Matlab
Some Orbital Mechanics Matlab codes. Heavily based on the book "Orbital Mechanics for Engineering Students" by Howard D. Curtis.
☆10 · Apr 17, 2023 · Updated 2 years ago
tamaskis / solve_riccati_ode-MATLAB
Solves the Riccati differential equation for the finite-horizon linear quadratic regulator.
☆13 · Dec 8, 2022 · Updated 3 years ago
ionmadrazo / Vec2Read
☆10 · Oct 3, 2023 · Updated 2 years ago
yeshenpy / RACE
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…
☆42 · Oct 14, 2023 · Updated 2 years ago
liyang619 / COLE-Platform
Overcooked human-AI experiment platform
☆39 · Dec 21, 2023 · Updated 2 years ago
automaticdai / dag-gen-rnd
dag-gen-rnd: A randomized Multi-DAG task generator for scheduling and allocation research
☆40 · Feb 10, 2026 · Updated 3 weeks ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆42 · Oct 31, 2020 · Updated 5 years ago
christopher-hsu / scalableMARL
Multi-Agent Reinforcement Learning (MARL) method to learn scalable control policies for multi-agent target tracking.
☆42 · Aug 30, 2023 · Updated 2 years ago
indylab / nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆40 · Aug 27, 2021 · Updated 4 years ago
kikojay / EMC
The code for the paper "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration" (NeurIPS 2021).
☆41 · Feb 16, 2023 · Updated 3 years ago