facebookresearch / Hanabi_SPARTA
Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it
☆129 · Jul 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for Hanabi_SPARTA
Users who are interested in Hanabi_SPARTA are comparing it to the repositories listed below:
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning · ☆103 · Jun 22, 2022 · Updated 3 years ago
- Implementation of the Off Belief Learning algorithm. · ☆49 · Aug 18, 2022 · Updated 3 years ago
- Framework for writing bots that play Hanabi. · ☆37 · May 16, 2019 · Updated 6 years ago
- hanabi_learning_environment is a research platform for Hanabi experiments. · ☆665 · Feb 14, 2023 · Updated 2 years ago
- Reinforcement Learning Assembly · ☆92 · Sep 2, 2021 · Updated 4 years ago
- State of the art Hanabi bots + simulation framework in rust · ☆45 · Nov 17, 2023 · Updated 2 years ago
- ☆10 · Feb 28, 2019 · Updated 6 years ago
- Benchmark Python and Cython code · ☆13 · Jun 13, 2014 · Updated 11 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games · ☆119 · Jul 25, 2024 · Updated last year
- Code for Optimistic Exploration even with a Pessimistic Initialisation · ☆14 · Aug 4, 2020 · Updated 5 years ago
- Collection of game-theoretic algorithms for Poker · ☆30 · Apr 6, 2019 · Updated 6 years ago
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games. · ☆689 · Mar 20, 2024 · Updated last year
- Counterfactual regret minimization algorithm for Kuhn poker · ☆181 · Feb 13, 2019 · Updated 7 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys… · ☆21 · Feb 24, 2023 · Updated 2 years ago
- Scalable Implementation of Deep CFR and Single Deep CFR · ☆314 · May 6, 2020 · Updated 5 years ago
- ☆17 · Dec 19, 2019 · Updated 6 years ago
- Code for magnetic mirror descent. · ☆17 · Oct 5, 2023 · Updated 2 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning · ☆22 · Oct 26, 2018 · Updated 7 years ago
- A gym-esque environment for Super Smash Bros. Melee. · ☆12 · Aug 13, 2021 · Updated 4 years ago
- Rails application that allows humans to play poker matches managed by the Annual Computer Poker Competition's Dealer program in a web GUI… · ☆58 · Dec 2, 2016 · Updated 9 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings. · ☆67 · Oct 3, 2023 · Updated 2 years ago
- An open implementation of Pure CFR applied to ACPC poker games. · ☆207 · Mar 3, 2017 · Updated 8 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". · ☆64 · Sep 6, 2023 · Updated 2 years ago
- Framework for Multi-Agent Deep Reinforcement Learning in Poker · ☆506 · Mar 31, 2023 · Updated 2 years ago
- `Black` for Jupyter notebooks. · ☆19 · Apr 23, 2020 · Updated 5 years ago
- Roll model for trading strategy to C++ or FPGA via Matlab tool · ☆10 · Sep 11, 2014 · Updated 11 years ago
- LaCAM: a quick and scalable multi-agent pathfinding algorithm · ☆21 · Dec 29, 2025 · Updated last month
- A C library for indexing poker hands that respects suit isomorphisms · ☆72 · Aug 1, 2014 · Updated 11 years ago
- OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games. · ☆5,022 · Updated this week
- An implementation of CFR algorithm to solve Kuhn Poker. · ☆13 · Feb 6, 2020 · Updated 6 years ago
- Lightweight and Effective Preference Construction in PIBT for Large-Scale Multi-Agent Pathfinding (SoCS-25) · ☆15 · May 21, 2025 · Updated 8 months ago
- PSYCH 291: Causal Cognition (https://tobiasgerstenberg.github.io/causal_cognition/) · ☆12 · May 23, 2019 · Updated 6 years ago
- Mental state inference from observable behavior · ☆15 · Dec 3, 2021 · Updated 4 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021 · ☆51 · Aug 27, 2022 · Updated 3 years ago
- Find best-response to a fixed policy in multi-agent RL · ☆288 · Apr 1, 2022 · Updated 3 years ago
- 7-card Poker Hand Evaluator in 577 bytes · ☆47 · Jul 1, 2020 · Updated 5 years ago
- An asynchronous RL platform for congestion control in QUIC transport protocol. https://arxiv.org/abs/1910.04054. · ☆157 · Feb 2, 2022 · Updated 4 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix… · ☆12 · Jun 23, 2018 · Updated 7 years ago
- Open-source library for a reinforcement learning research. · ☆54 · Dec 8, 2022 · Updated 3 years ago