facebookresearch / Hanabi_SPARTA
Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it
☆128Updated last year
Alternatives and similar repositories for Hanabi_SPARTA
Users that are interested in Hanabi_SPARTA are comparing it to the libraries listed below
Sorting:
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆101Updated 2 years ago
- impact-driven-exploration☆131Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆78Updated 6 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Updated 4 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆117Updated 9 months ago
- The submission template for the MineRL Competition @ NeurIPS 2021. Clone this to make a new submission!☆92Updated 3 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆248Updated 3 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions☆148Updated 3 years ago
- ☆85Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆148Updated 2 years ago
- A networking protocol for agent-environment communication☆102Updated 3 months ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆202Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆171Updated 2 years ago
- The RL Reliability Metrics library provides a set of metrics for measuring the reliability of reinforcement learning (RL) algorithms, as …☆166Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- ☆300Updated 4 months ago
- A customisable 2D platform for agent-based AI research☆428Updated last year
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit…☆189Updated 3 years ago
- Augmented environments with RL☆104Updated 6 years ago
- ReconChess python implementation☆42Updated 3 years ago
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- Keeping track of RL experiments☆161Updated 2 years ago
- Vectorized interface for reinforcement learning environments☆140Updated 2 years ago
- Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd☆91Updated 2 years ago
- OpenAI Gym wrapper for ViZDoom enviroments☆69Updated 3 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 6 years ago
- Library to compare and evaluate reward functions☆67Updated last year