Original PyTorch implementation of PMIC from PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
☆22Mar 26, 2024Updated 2 years ago
Alternatives and similar repositories for PMIC
Users that are interested in PMIC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Oct 31, 2024Updated last year
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆22Mar 9, 2026Updated 2 weeks ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆44Oct 14, 2023Updated 2 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- Final Year Project☆10Jul 6, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 3 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020)☆49Jun 22, 2022Updated 3 years ago
- POPGym Library in JAX☆12Apr 15, 2024Updated last year
- solver for discrete Mixed Observable Markov Decision Processes☆11Oct 30, 2020Updated 5 years ago
- (ICML 2024) The official code for Value-Evolutionary-Based Reinforcement Learning☆20Jul 2, 2024Updated last year
- Training Multiple agents in the same environment to collaborate and compete with each other☆12Dec 1, 2019Updated 6 years ago
- A dynamic hand gesture recognition system using a 3D CNN model☆13Jul 19, 2020Updated 5 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆10Oct 26, 2021Updated 4 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for Paper "Effective Multi-agent Reinforcement Learning Control with Relative Entropy Regularization".☆13Sep 27, 2023Updated 2 years ago
- Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting☆21Feb 10, 2025Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- an application to enhance video quality.☆12Jun 16, 2021Updated 4 years ago
- Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.☆13Dec 9, 2024Updated last year
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- libxco是一个轻量级高性能协程网络库☆12Jul 10, 2025Updated 8 months ago
- optimize the HVAC pumps and chillers operation☆12Jan 14, 2019Updated 7 years ago
- #MIMO PID Controller for HVAC System of Buildings☆13Jan 17, 2019Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- suPER is a collaborative multi-agent RL algorithm☆14Jun 11, 2024Updated last year
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 6 years ago
- Radio Frequency based localization (@433MHz) framework for micro underwater robots in confined tanks using software defined radio (SDR)☆13Jan 14, 2020Updated 6 years ago
- LSTM to predict daily HVAC consumption in buildings☆14Jul 25, 2024Updated last year
- ☆13Mar 12, 2024Updated 2 years ago
- ☆11Jan 6, 2024Updated 2 years ago
- Reinforcement learning implementation of HVAC controller☆12Jun 22, 2018Updated 7 years ago
- Lab notebooks for Text Analytics☆14Mar 19, 2026Updated last week
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆39Dec 2, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- 遗传算法优化卷积神经网络(人脸识别分类)☆13Jun 13, 2019Updated 6 years ago
- Actor-Critic Differentiable Model Predictive Control☆79Jan 13, 2026Updated 2 months ago
- ☆10Apr 23, 2021Updated 4 years ago
- Distributed Collaboration of Connected Autonomous Vehicles at Unsignalized Intersections using Parallel Monte Carlo Tree Search☆16May 2, 2022Updated 3 years ago
- Memory Replay with Data Compression (ICLR 2022)☆16Sep 26, 2023Updated 2 years ago
- [CVPR 2025] Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model☆31Jun 26, 2025Updated 9 months ago