yeshenpy/PMIC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yeshenpy/PMIC)

yeshenpy / PMIC

Original PyTorch implementation of PMIC from PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration

☆21

Alternatives and similar repositories for PMIC

Users that are interested in PMIC are comparing it to the libraries listed below

Sorting:

lansinuote / Simple_RLHF_Llama3
View on GitHub
☆33Aug 7, 2024Updated last year
uob-TextAnalytics / text_labs_public
View on GitHub
Lab notebooks for Text Analytics
☆12Updated this week
lrhammond / almanac
View on GitHub
Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…
☆10May 5, 2022Updated 3 years ago
hsbyhub / libxco
View on GitHub
libxco是一个轻量级高性能协程网络库
☆12Jul 10, 2025Updated 7 months ago
ematm0067 / 2023_24
View on GitHub
☆11Jul 16, 2024Updated last year
sejmoonwei / SPGrasp
View on GitHub
Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.
☆19Jan 6, 2026Updated 2 months ago
flowersteam / EAGER
View on GitHub
☆10Oct 11, 2022Updated 3 years ago
urosolia / MOMDP
View on GitHub
solver for discrete Mixed Observable Markov Decision Processes
☆11Oct 30, 2020Updated 5 years ago
yeshenpy / RACE
View on GitHub
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…
☆42Oct 14, 2023Updated 2 years ago
rillhuEspProj / esp32-aliyun-demo
View on GitHub
esp32 aliyun access example
☆11Mar 3, 2022Updated 4 years ago
martius-lab / pink-noise-rl
View on GitHub
☆48Nov 20, 2025Updated 3 months ago
vickipedia6 / Tennis-Deep-Reinforcement-Learning
View on GitHub
Training Multiple agents in the same environment to collaborate and compete with each other
☆12Dec 1, 2019Updated 6 years ago
LlamaTouch / AgentEnv
View on GitHub
An environment for mobile angets to interact with realistic android device or android emulator
☆13Jul 19, 2024Updated last year
anishmadan23 / MAML_Pytorch_RL
View on GitHub
☆10Aug 8, 2021Updated 4 years ago
Tlntin / booking_simulator
View on GitHub
☆11Jan 6, 2024Updated 2 years ago
menggedu / EDL
View on GitHub
Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics
☆12Mar 6, 2025Updated last year
dxzxy12138 / PhysReason
View on GitHub
PhysReason Becnhmark
☆19Jul 8, 2025Updated 7 months ago
FLAIROx / popjym
View on GitHub
POPGym Library in JAX
☆12Apr 15, 2024Updated last year
sddcg-Jam / tencent_algo
View on GitHub
tencent 2019 algo
☆10Jul 2, 2019Updated 6 years ago
kunz07 / Energy-Management-of-HVAC-Systems
View on GitHub
Final Year Project
☆10Jul 6, 2022Updated 3 years ago
clvrai / coordination
View on GitHub
Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)
☆50Jun 22, 2022Updated 3 years ago
ninell-oldenburg / social-contracts
View on GitHub
☆12Mar 12, 2024Updated last year
google-deepmind / constrained_optidice
View on GitHub
☆10Sep 9, 2022Updated 3 years ago
ChengpengLi1003 / Q-learning
View on GitHub
针对最经典的表格型Q learning算法进行了复现，能够支持gym中大多数的离散动作和状态空间的环境，譬如CliffWalking-v0。
☆10Jan 2, 2021Updated 5 years ago
matrl-project / matrl
View on GitHub
☆12Jan 30, 2021Updated 5 years ago
Adithkumarba / Hand-Gesture-Recognition-with-Deep-Learning
View on GitHub
A dynamic hand gesture recognition system using a 3D CNN model
☆13Jul 19, 2020Updated 5 years ago
mgerstgrasser / super
View on GitHub
suPER is a collaborative multi-agent RL algorithm
☆14Jun 11, 2024Updated last year
Steven-Ho / VALOR
View on GitHub
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10Jun 28, 2019Updated 6 years ago
grasp-lyrl / scalableMARL
View on GitHub
Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking (IROS22).
☆11Jul 22, 2022Updated 3 years ago
pickxiguapi / Embodied-FSD
View on GitHub
Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" (ICLR2026)
☆31Updated this week
Sultan91 / HVAC_RL
View on GitHub
Reinforcement learning implementation of HVAC controller
☆12Jun 22, 2018Updated 7 years ago
sygi / vic-tensorflow
View on GitHub
Implementation of Variational Intrinsic Control in tensorflow
☆11Apr 5, 2017Updated 8 years ago
npvoid / OnlineDoubleOracle
View on GitHub
☆11Apr 23, 2021Updated 4 years ago
RL-DLMU / PNC-HDQN
View on GitHub
codes for paper 《Neighborhood Cooperative Multiagent Reinforcement Learning for Adaptive Traffic Signal Control in Epidemic Regions》
☆14Apr 3, 2022Updated 3 years ago
SmartFlow-AI4CFD / SmartFlow
View on GitHub
CFD-solver-agnostic deep reinforcement learning framework for computational fluid dynamics on HPC platforms
☆20Aug 1, 2025Updated 7 months ago
Stanford-ILIAD / Diverse-Conventions
View on GitHub
Exploring techniques to generate diverse conventions in multi-agent settings
☆15Nov 14, 2023Updated 2 years ago
Robolabo / LSTM-HVAC
View on GitHub
LSTM to predict daily HVAC consumption in buildings
☆14Jul 25, 2024Updated last year
luokn / ms-gat
View on GitHub
Learning Multiaspect Traffic Couplings by Multirelational Graph Attention Networks for Traffic Prediction
☆13Oct 7, 2022Updated 3 years ago
ceriottm / ale-notebooks
View on GitHub
Jupyter notebook for an introduction to atomic-scale machine learning class
☆17Nov 14, 2023Updated 2 years ago