Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
☆102Jun 22, 2022Updated 3 years ago
Alternatives and similar repositories for hanabi_SAD
Users that are interested in hanabi_SAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Jul 18, 2023Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- ☆14Jun 17, 2022Updated 3 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆52Nov 14, 2023Updated 2 years ago
- hanabi_learning_environment is a research platform for Hanabi experiments.☆664Feb 14, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reinforcement Learning Assembly☆92Sep 2, 2021Updated 4 years ago
- Framework for writing bots that play Hanabi.☆37May 16, 2019Updated 6 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- A Continual Multi-agent RL testbed based on Hanabi☆32Aug 1, 2021Updated 4 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 4 months ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆222Apr 25, 2021Updated 4 years ago
- ☆14Jun 28, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆13Sep 14, 2021Updated 4 years ago
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Apr 17, 2023Updated 2 years ago
- Experiments for performing empirical game-theoretic analysis of networked system control for common-pool resource management using multi-…☆18Oct 11, 2020Updated 5 years ago
- ☆108Feb 10, 2021Updated 5 years ago
- Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration☆52Nov 8, 2021Updated 4 years ago
- cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…☆41Jan 13, 2021Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago
- ☆14Mar 5, 2023Updated 3 years ago
- ☆14May 4, 2024Updated last year
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger☆33Aug 30, 2021Updated 4 years ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks☆228Oct 3, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Mar 10, 2021Updated 5 years ago
- ☆13Feb 25, 2025Updated last year
- Python Multi-Agent Reinforcement Learning framework☆2,168Dec 8, 2022Updated 3 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 2 years ago
- On the pitfalls of measuring emergent communication☆34Mar 12, 2019Updated 7 years ago
- ☆16Feb 23, 2024Updated 2 years ago