facebookresearch / Hanabi_SPARTA
Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it
☆128Updated last year
Alternatives and similar repositories for Hanabi_SPARTA:
Users that are interested in Hanabi_SPARTA are comparing it to the libraries listed below
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆98Updated 2 years ago
- impact-driven-exploration☆130Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆113Updated 5 months ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆147Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆74Updated 5 years ago
- Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd☆90Updated last year
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆169Updated last year
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions☆147Updated 3 years ago
- ☆85Updated 4 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆234Updated 2 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Keeping track of RL experiments☆158Updated 2 years ago
- ☆293Updated 3 weeks ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆369Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆92Updated 6 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆245Updated 2 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆198Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 5 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆198Updated 3 years ago
- PAIRED in PyTorch 🔥☆56Updated last year
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- ☆106Updated 5 years ago
- A networking protocol for agent-environment communication☆94Updated 3 months ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆194Updated 3 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆197Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- The submission template for the MineRL Competition @ NeurIPS 2021. Clone this to make a new submission!☆92Updated 3 years ago