YiqinYang/ICQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YiqinYang/ICQ)

YiqinYang / ICQ

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight https://arxiv.org/abs/2106.03400)

☆76

Alternatives and similar repositories for ICQ

Users that are interested in ICQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ling-pan / OMAR
View on GitHub
☆55Jul 21, 2022Updated 4 years ago
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
View on GitHub
☆120Apr 15, 2023Updated 3 years ago
thu-rllab / CFCQL
View on GitHub
Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
☆42Feb 18, 2025Updated last year
ReinholdM / Papers-of-Offline-RL
View on GitHub
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 4 years ago
apexrl / CoDAIL
View on GitHub
Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>
☆19Jun 17, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
instadeepai / og-marl
View on GitHub
Datasets with baselines for Offline MARL.
☆224Nov 2, 2025Updated 8 months ago
ZhengYinan-AIR / OMIGA
View on GitHub
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…
☆44Mar 3, 2024Updated 2 years ago
google-deepmind / constrained_optidice
View on GitHub
☆10Sep 9, 2022Updated 3 years ago
LAMDA-RL / ODIS
View on GitHub
The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
☆45Oct 31, 2024Updated last year
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
daniellawson9999 / online-decision-transformer
View on GitHub
An unofficial implementation for online decision transformer
☆41Sep 20, 2022Updated 3 years ago
zzq-bot / offline-marl-framework-offpymarl
View on GitHub
Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.
☆35Oct 7, 2024Updated last year
lrhammond / almanac
View on GitHub
Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…
☆10May 5, 2022Updated 4 years ago
Jiwonjeon9603 / MASER
View on GitHub
This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…
☆23Jul 6, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TonghanWang / DOP
View on GitHub
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51Dec 8, 2022Updated 3 years ago
dmksjfl / SEABO
View on GitHub
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12Jan 19, 2024Updated 2 years ago
kennyderek / adap
View on GitHub
Adaptable Agent Populations via a Generative Model of Policies
☆12Oct 14, 2021Updated 4 years ago
bic4907 / Overcooked-AI
View on GitHub
Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method
☆48Sep 11, 2024Updated last year
Theohhhu / UPDeT
View on GitHub
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…
☆139Feb 3, 2021Updated 5 years ago
cyanrain7 / TRPO-in-MARL
View on GitHub
☆227Jun 4, 2023Updated 3 years ago
sisl / DecNashPlanning
View on GitHub
☆16Apr 6, 2022Updated 4 years ago
twitter / diffusion-rl
View on GitHub
☆80Dec 9, 2022Updated 3 years ago
zhxieml / PDT
View on GitHub
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
☆29Jul 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
IouJenLiu / CMAE
View on GitHub
☆50Jul 23, 2021Updated 5 years ago
tinnerhrhe / MTDiff
View on GitHub
☆64Nov 15, 2024Updated last year
oxwhirl / smacv2
View on GitHub
☆322Feb 15, 2024Updated 2 years ago
oxwhirl / facmac
View on GitHub
☆116Oct 25, 2021Updated 4 years ago
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,694Nov 18, 2024Updated last year
zbzhu99 / madiff
View on GitHub
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆111Jun 26, 2025Updated last year
decisionforce / CoPO
View on GitHub
[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".
☆136Jan 29, 2024Updated 2 years ago
QPD-NeurIPS2019 / QPD
View on GitHub
This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).
☆12May 20, 2019Updated 7 years ago
thu-ml / CEP-energy-guided-diffusion
View on GitHub
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction
☆35Nov 3, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sfujim / BCQ
View on GitHub
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆667Apr 6, 2021Updated 5 years ago
catezi / MAPT
View on GitHub
This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…
☆12Apr 9, 2026Updated 3 months ago
oxwhirl / pymarl
View on GitHub
Python Multi-Agent Reinforcement Learning framework
☆2,210Dec 8, 2022Updated 3 years ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
Will-Nie / AutoLinePlotter
View on GitHub
This repo support auto line plot for multi-seed event file from TensorBoard
☆12Jun 23, 2022Updated 4 years ago
yalidu / liir
View on GitHub
Learning Individual Intrinsic Reward in MARL
☆65Dec 8, 2022Updated 3 years ago
zhaoyizhou1123 / mbrcsl
View on GitHub
☆11Nov 18, 2023Updated 2 years ago