rocanaan / hanabi-ad-hoc-learningLinks
☆6Updated 4 years ago
Alternatives and similar repositories for hanabi-ad-hoc-learning
Users that are interested in hanabi-ad-hoc-learning are comparing it to the libraries listed below
Sorting:
- Implementation of the Off Belief Learning algorithm.☆47Updated 2 years ago
- ☆14Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆16Updated 4 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆10Updated last month
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆12Updated last year
- ☆41Updated 3 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated 2 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆14Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆28Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆13Updated 2 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆20Updated 3 years ago
- ☆18Updated 4 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆36Updated 3 months ago
- The Implementation of "Machine Theory of Mind", ICML 2018☆25Updated 3 years ago
- ☆12Updated 2 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆24Updated 2 years ago
- Code for magnetic mirror descent.☆16Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- ☆31Updated 5 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- PyTorch implementation for all models and environments in the paper "Learning to Ground Multi-Agent Communication with Autoencoders"☆46Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆16Updated 4 years ago
- (AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆12Updated 2 years ago
- ☆48Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning☆30Updated last year
- ☆53Updated last year
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Updated 2 years ago