ml3705454/mapr2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ml3705454/mapr2)

ml3705454 / mapr2

☆48

Alternatives and similar repositories for mapr2

Users that are interested in mapr2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
ying-wen / malib_deprecated
View on GitHub
A Multi-agent Learning Framework
☆62May 10, 2021Updated 5 years ago
QDPP-GitHub / QDPP
View on GitHub
Multi-Agent Determinantal Q-Learning
☆43Nov 22, 2022Updated 3 years ago
ying-wen / gr2
View on GitHub
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆14Dec 8, 2022Updated 3 years ago
alshedivat / lola
View on GitHub
Code release for Learning with Opponent-Learning Awareness and variations.
☆152Apr 13, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
eleurent / social-attention
View on GitHub
Social Attention for Autonomous Decision-Making in Dense Traffic
☆23Oct 30, 2021Updated 4 years ago
Qeneb / SS-MARL
View on GitHub
The implementation of Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System.
☆11Sep 8, 2025Updated 10 months ago
Coac / CommNet-BiCnet
View on GitHub
CommNet and BiCnet implementation in tensorflow
☆55Jul 27, 2018Updated 7 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
adithya-subramanian / Multi_Agent_Soft_Actor_Critic
View on GitHub
A Pytorch Implementation of Multi Agent Soft Actor Critic
☆44Jan 29, 2019Updated 7 years ago
CyberAgentAILab / regularized-bon
View on GitHub
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14Apr 4, 2025Updated last year
aijunbai / markov-game
View on GitHub
Stochastic Markov Games
☆12Oct 5, 2017Updated 8 years ago
matrl-project / matrl
View on GitHub
☆12Jan 30, 2021Updated 5 years ago
PKU-RL / Literature
View on GitHub
☆108Feb 10, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shariqiqbal2810 / MAAC
View on GitHub
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
☆807May 29, 2022Updated 4 years ago
JBLanier / pipeline-psro
View on GitHub
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆57Aug 30, 2024Updated last year
CyberAgentAILab / filtered-dpo
View on GitHub
[EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by …
☆16Nov 27, 2024Updated last year
longtermrisk / marltoolbox
View on GitHub
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
☆32Sep 29, 2022Updated 3 years ago
sherrychen1120 / MIDAS
View on GitHub
Code for the paper: "MIDAS: Multi-agent Interaction-aware Decision-making with Adaptive Strategies for Urban Autonomous Navigation"
☆17Sep 21, 2021Updated 4 years ago
ChunyuanLI / RAS
View on GitHub
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15Jan 21, 2019Updated 7 years ago
iQiyuan / MultiRobot-TaskAlloc-Voronoi
View on GitHub
This model provides a robust task allocation method by dynamically adjusting Voronoi boundaries to adapt to changes in tasks and environm…
☆14Jan 22, 2025Updated last year
eugenevinitsky / sequential_social_dilemma_games
View on GitHub
Repo for reproduction of sequential social dilemmas
☆418Mar 6, 2025Updated last year
aronsar / hoad
View on GitHub
☆14Jun 17, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
agakshat / LOLA-pytorch
View on GitHub
Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
☆19Jul 20, 2018Updated 8 years ago
Stanford-ILIAD / Conventions-ModularPolicy
View on GitHub
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
☆15Mar 9, 2021Updated 5 years ago
tkipf / nri
View on GitHub
Neural relational inference for interacting systems - pytorch
☆19Nov 19, 2019Updated 6 years ago
ApricityZ / TERL
View on GitHub
☆16Nov 6, 2025Updated 8 months ago
XiaoxiaoGuo / atari_uct
View on GitHub
Upper Confidence Tree Planner for ATARI games
☆19Mar 9, 2016Updated 10 years ago
rraileanu / auto-drac
View on GitHub
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆104Mar 24, 2023Updated 3 years ago
PKU-RL / I2C
View on GitHub
☆48Jun 29, 2021Updated 5 years ago
gaflach / usizer
View on GitHub
discrete gate sizing
☆14Nov 23, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
YuhangSong / Arena-BuildingToolkit
View on GitHub
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆84Apr 4, 2021Updated 5 years ago
tianjunz / MADE
View on GitHub
☆19Jul 18, 2021Updated 5 years ago
oxwhirl / pymarl
View on GitHub
Python Multi-Agent Reinforcement Learning framework
☆2,208Dec 8, 2022Updated 3 years ago
PKU-RL / MBOM
View on GitHub
☆13Oct 11, 2022Updated 3 years ago
risk-sensitive-reachability / ToolboxLS
View on GitHub
Clone of Ian M. Mitchell's ToolboxLS repository. (https://bitbucket.org/ian_mitchell/toolboxls)
☆14Jan 18, 2020Updated 6 years ago
PedroCastro / DriveML
View on GitHub
A safe and efficient autonomous driving algorithm. Winner of the 2019 DriveML Huawei Autonomous Vehicles Challenge. Built using RLLib and…
☆18Jan 24, 2020Updated 6 years ago
shoq / cfr
View on GitHub
Monte Carlo Conterfactual Regret Minimization for imperfect information games
☆13Mar 29, 2019Updated 7 years ago