PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆15Mar 9, 2021Updated 5 years ago
Alternatives and similar repositories for Conventions-ModularPolicy
Users that are interested in Conventions-ModularPolicy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆111Apr 17, 2023Updated 2 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 3 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Jun 22, 2022Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- PyTorch implementation for "Training and Inference on Any-Order Autoregressive Models the Right Way", NeurIPS 2022 Oral, TPM 2023 Best Pa…☆16May 31, 2023Updated 2 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- ☆14May 31, 2022Updated 3 years ago
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning☆16Mar 11, 2020Updated 6 years ago
- Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning☆10Nov 14, 2021Updated 4 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12May 10, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Agents to play overcooked ai☆15Jul 3, 2024Updated last year
- PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020☆17Oct 11, 2021Updated 4 years ago
- PyTorch implementation for "HyperSPNs: Compact and Expressive Probabilistic Circuits", NeurIPS 2021☆13Oct 26, 2021Updated 4 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆41Jan 13, 2024Updated 2 years ago
- A Continual Multi-agent RL testbed based on Hanabi☆32Aug 1, 2021Updated 4 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- archived prebuilts☆14Jul 30, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆28Nov 22, 2019Updated 6 years ago
- Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration☆52Nov 8, 2021Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated last year
- ☆27Dec 20, 2021Updated 4 years ago
- Web application where humans can play Overcooked with AI agents.☆60Dec 6, 2022Updated 3 years ago
- ☆89Apr 18, 2024Updated last year
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- ☆25Nov 1, 2022Updated 3 years ago
- ☆14Jun 17, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 3 years ago
- Sandbox environment for generalizable agent research☆27Aug 19, 2022Updated 3 years ago
- ☆58Jun 6, 2023Updated 2 years ago
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- Tutorials on learning and using successor representations.☆54Oct 31, 2019Updated 6 years ago
- Auto^6ML is a jittor library allowing users to achieve machine learning automation.☆26Sep 28, 2024Updated last year
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago