facebookresearch/hanabi_SAD

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

☆103

Alternatives and similar repositories for hanabi_SAD

Users who are interested in hanabi_SAD are comparing it to the repositories listed below.

facebookresearch / Hanabi_SPARTA
Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it
☆129 · Jul 18, 2023 · Updated 3 years ago
facebookresearch / off-belief-learning
Implementation of the Off Belief Learning algorithm.
☆49 · Aug 18, 2022 · Updated 3 years ago
aronsar / hoad
☆14 · Jun 17, 2022 · Updated 4 years ago
facebookresearch / jps
Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"
☆52 · Nov 14, 2023 · Updated 2 years ago
google-deepmind / hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
☆670 · Feb 14, 2023 · Updated 3 years ago
facebookresearch / rela
Reinforcement Learning Assembly
☆94 · Sep 2, 2021 · Updated 4 years ago
Quuxplusone / Hanabi
Framework for writing bots that play Hanabi.
☆37 · May 16, 2019 · Updated 7 years ago
Stanford-ILIAD / Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆15 · Mar 9, 2021 · Updated 5 years ago
chandar-lab / Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
☆31 · Aug 1, 2021 · Updated 4 years ago
microsoft / strategically_efficient_rl
More efficient exploration for reinforcement learning in two-player, zero-sum game
☆21 · Jul 30, 2024 · Updated last year
info-structures / ais
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆23 · Nov 29, 2025 · Updated 7 months ago
mit-ll / hanabi_AnyPlay
☆14 · Jun 28, 2022 · Updated 4 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago
rosewang2008 / gym-cooking
🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…
☆224 · Apr 25, 2021 · Updated 5 years ago
laurimi / multiagent-prediction-reward
Multi-agent active perception with prediction rewards
☆12 · Nov 13, 2020 · Updated 5 years ago
1310183534 / DouDiZhu
☆13 · Sep 14, 2021 · Updated 4 years ago
YeTianJHU / GSCU
Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)
☆25 · Aug 4, 2022 · Updated 3 years ago
HumanCompatibleAI / human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
☆112 · Apr 17, 2023 · Updated 3 years ago
instadeepai / EGTA-NMARL
Experiments for performing empirical game-theoretic analysis of networked system control for common-pool resource management using multi-…
☆19 · Oct 11, 2020 · Updated 5 years ago
PKU-RL / Literature
☆108 · Feb 10, 2021 · Updated 5 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆34 · Dec 1, 2019 · Updated 6 years ago
j96w / cogail
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
☆52 · Nov 8, 2021 · Updated 4 years ago
allenai / cordial-sync
cordial-sync is a software package that can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…
☆41 · Jan 13, 2021 · Updated 5 years ago
social-dilemma / multiagent
Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas
☆54 · Dec 8, 2022 · Updated 3 years ago
PKU-RL / FOP-DMAC-MACPF
☆14 · Mar 5, 2023 · Updated 3 years ago
philipjball / OffCon3
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25 · Jun 20, 2021 · Updated 5 years ago
yanxue7 / E3T-Overcooked
☆15 · May 4, 2024 · Updated 2 years ago
ssokota / mmd
Code for magnetic mirror descent.
☆20 · Oct 5, 2023 · Updated 2 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
facebookresearch / starcraft_defogger
Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger
☆34 · Aug 30, 2021 · Updated 4 years ago
tencent-ailab / Arena
☆11 · Mar 10, 2021 · Updated 5 years ago
IC3Net / IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
☆233 · Oct 3, 2023 · Updated 2 years ago
hengyuan-hu / jax-vs-pytorch
☆13 · Feb 25, 2025 · Updated last year
uber-research / Evolvability-ES
☆14 · Jun 26, 2019 · Updated 7 years ago
simsimiSION / pymarl-algorithm-extension-via-starcraft
☆13 · Aug 15, 2020 · Updated 5 years ago
oxwhirl / pymarl
Python Multi-Agent Reinforcement Learning framework
☆2,206 · Dec 8, 2022 · Updated 3 years ago
facebookresearch / measuring-emergent-comm
On the pitfalls of measuring emergent communication
☆34 · Mar 12, 2019 · Updated 7 years ago
henry-prior / multimodal-rl
Solving reinforcement learning tasks which require language and vision
☆33 · Apr 4, 2023 · Updated 3 years ago
hengyuan-hu / instruct-rl
☆16 · Feb 23, 2024 · Updated 2 years ago