rocanaan / hanabi-ad-hoc-learning

☆6

Alternatives and similar repositories for hanabi-ad-hoc-learning

Users who are interested in hanabi-ad-hoc-learning are comparing it to the repositories listed below.

TARTRL / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.
☆14Updated 2 years ago
facebookresearch / off-belief-learning
Implementation of the Off Belief Learning algorithm.
☆48Updated 2 years ago
uoe-agents / TED
Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".
☆13Updated 2 years ago
uoe-agents / MATE
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
☆13Updated last year
Stanford-ILIAD / Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆16Updated 4 years ago
valeriechen / ask-your-humans
Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"
☆11Updated 4 months ago
ryan-dorazio / mmd-dilated
An implementation of the QRE solver magnetic mirror descent with dilated entropy (MMD).
☆8Updated 3 years ago
cgrivera / ai-arena
The AI Arena: A framework for distributed multi-agent reinforcement learning
☆15Updated 3 years ago
princeton-nlp / SRL-NLC
Safe Reinforcement Learning with Natural Language Constraints
☆15Updated 3 years ago
uoe-agents / derl
The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27Updated 3 years ago
apexrl / COIL
Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"
☆18Updated 2 years ago
npvoid / OnlineDoubleOracle
☆11Updated 4 years ago
aicenter / openspiel_reproductions
Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works
☆16Updated 4 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆28Updated 2 years ago
martius-lab / cid-in-rl
Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…
☆44Updated 3 years ago
frt03 / mxt_bench
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)
☆13Updated 2 years ago
diversepsro / diverse_psro
☆18Updated 4 years ago
philipjball / TD3_PyTorch
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10Updated 4 years ago
tianjunz / NovelD
☆40Updated 3 years ago
waterhorse1 / NAC
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28Updated 3 years ago
Cranial-XIX / marl-copa
PyTorch Implementation of COPA for coordinating teams that can dynamically change.
☆21Updated 3 years ago
mila-iqia / SGI
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆54Updated 4 years ago
j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆32Updated 2 years ago
ying-wen / gr2
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆15Updated 2 years ago
sjtu-marl / bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆21Updated 3 years ago
uoe-agents / LIAM
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆36Updated 2 years ago
victorcampos7 / edl
Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"
☆36Updated 5 years ago
secury / optidice
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆15Updated 2 years ago
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum games
☆21Updated last year
PKU-RL / MBOM
☆13Updated 2 years ago