ruizhaogit/maximum_entropy_population_based_training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ruizhaogit/maximum_entropy_population_based_training)

ruizhaogit / maximum_entropy_population_based_training

Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination

☆26

Alternatives and similar repositories for maximum_entropy_population_based_training

Users that are interested in maximum_entropy_population_based_training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LxzGordon / PECAN
View on GitHub
☆12Jan 4, 2024Updated 2 years ago
samjia2000 / HSP
View on GitHub
This is a repository for Hidden-utility Self-Play.
☆27Jul 27, 2023Updated 2 years ago
51616 / marl-lipo
View on GitHub
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19May 10, 2024Updated 2 years ago
sjtu-marl / ZSC-Eval
View on GitHub
This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…
☆56Nov 22, 2025Updated 8 months ago
yanxue7 / E3T-Overcooked
View on GitHub
☆15May 4, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
uoe-agents / LIAM
View on GitHub
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆43Oct 5, 2022Updated 3 years ago
HumanCompatibleAI / human_aware_rl
View on GitHub
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆112Apr 17, 2023Updated 3 years ago
uoe-agents / BRDiv
View on GitHub
Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork
☆13May 2, 2024Updated 2 years ago
RedTachyon / coltra-rl
View on GitHub
A modular implementation of PPO, and soon hopefully other algorithms.
☆27Jan 16, 2024Updated 2 years ago
PKU-RL / MBOM
View on GitHub
☆13Oct 11, 2022Updated 3 years ago
StephAO / HAHA
View on GitHub
Agents to play overcooked ai
☆15Jul 3, 2024Updated 2 years ago
garrett4wade / revisiting_marl
View on GitHub
Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
☆23Jul 16, 2022Updated 4 years ago
liyheng / FOP
View on GitHub
☆14Jul 12, 2021Updated 5 years ago
lamda-bbo / madac
View on GitHub
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26Mar 6, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HumanCompatibleAI / overcooked_ai
View on GitHub
A benchmark environment for fully cooperative human-AI performance.
☆988Mar 22, 2025Updated last year
agakshat / LOLA-pytorch
View on GitHub
Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
☆19Jul 20, 2018Updated 8 years ago
aadharna / UntouchableThunder
View on GitHub
Co-evolution of agents and environments in GVG-AI
☆17Aug 12, 2021Updated 4 years ago
uoe-agents / TED
View on GitHub
Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".
☆13Jan 25, 2023Updated 3 years ago
nmonette / NCC-UED
View on GitHub
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆17Nov 24, 2025Updated 8 months ago
bic4907 / Overcooked-AI
View on GitHub
Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method
☆48Sep 11, 2024Updated last year
max7born / decision-lstm
View on GitHub
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28Mar 24, 2023Updated 3 years ago
ruizhaogit / mep
View on GitHub
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24May 30, 2019Updated 7 years ago
jparkerholder / DvD_ES
View on GitHub
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆45Oct 29, 2020Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
uoe-agents / MATE
View on GitHub
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
☆15Apr 25, 2024Updated 2 years ago
srnbckr / ebpf-network-emulation
View on GitHub
☆12Aug 12, 2022Updated 3 years ago
jbr-ai-labs / mamba
View on GitHub
This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
☆67Apr 8, 2025Updated last year
HumanCompatibleAI / human_ai_robustness
View on GitHub
☆22Jul 15, 2020Updated 6 years ago
icaros-usc / overcooked_env_gen
View on GitHub
Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.
☆16May 1, 2024Updated 2 years ago
amacrutherford / sampling-for-learnability
View on GitHub
Official codebase for "Sampling For Learnability", published at NeurIPS 2024
☆24Oct 21, 2025Updated 9 months ago
lych1233 / GAMMA-human-ai-collaboration
View on GitHub
☆11Jan 13, 2026Updated 6 months ago
kennyderek / adap
View on GitHub
Adaptable Agent Populations via a Generative Model of Policies
☆12Oct 14, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Lifelong-ML / Mendez2022ModularLifelongRL
View on GitHub
Source code to reproduce experiments from Mendez et al., ICLR '22
☆23Jul 29, 2022Updated 3 years ago
Baichenjia / Contrastive-UCB
View on GitHub
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
☆12Jun 16, 2022Updated 4 years ago
JBLanier / pipeline-psro
View on GitHub
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆57Aug 30, 2024Updated last year
SiyuanQi-zz / intentMARL
View on GitHub
Code for ICRA2018 - Intent-aware Multi-agent Reinforcement Learning.
☆22Feb 22, 2018Updated 8 years ago
rojinakashefi / Pacman-AI
View on GitHub
Fundamental of AI course which focuses on search, multiagents, mdp and reinforcement learning algorithms.
☆13Oct 29, 2022Updated 3 years ago
schroederdewitt / multiagent_mujoco
View on GitHub
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆373Mar 16, 2023Updated 3 years ago
rstrivedi / Melting-Pot-Contest-2023
View on GitHub
☆47May 21, 2024Updated 2 years ago