☆47Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for mapr2
Users that are interested in mapr2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- Multi-Agent Determinantal Q-Learning☆43Nov 22, 2022Updated 3 years ago
- ☆32Jun 25, 2018Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆152Apr 13, 2023Updated 2 years ago
- Social Attention for Autonomous Decision-Making in Dense Traffic☆23Oct 30, 2021Updated 4 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆43Jan 29, 2019Updated 7 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- ☆108Feb 10, 2021Updated 5 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆794May 29, 2022Updated 3 years ago
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 11 months ago
- Code for the paper: "MIDAS: Multi-agent Interaction-aware Decision-making with Adaptive Strategies for Urban Autonomous Navigation"☆17Sep 21, 2021Updated 4 years ago
- Repo for reproduction of sequential social dilemmas☆415Mar 6, 2025Updated last year
- ☆14Jun 17, 2022Updated 3 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- Neural relational inference for interacting systems - pytorch☆18Nov 19, 2019Updated 6 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- ☆46Jun 29, 2021Updated 4 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Mar 24, 2023Updated 2 years ago
- This model provides a robust task allocation method by dynamically adjusting Voronoi boundaries to adapt to changes in tasks and environm…☆14Jan 22, 2025Updated last year
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Apr 4, 2021Updated 4 years ago
- Python Multi-Agent Reinforcement Learning framework☆2,168Dec 8, 2022Updated 3 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- ☆19Jul 18, 2021Updated 4 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆169Dec 8, 2022Updated 3 years ago
- A safe and efficient autonomous driving algorithm. Winner of the 2019 DriveML Huawei Autonomous Vehicles Challenge. Built using RLLib and…☆18Jan 24, 2020Updated 6 years ago
- ☆19Aug 8, 2023Updated 2 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 6 years ago
- Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for A…☆376May 20, 2023Updated 2 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆46Jun 22, 2020Updated 5 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago