This is a minimal example demonstrating how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be implemented in RLlib. It serves as a reference implementation and a starting point for making RLlib more compatible with such architectures.
☆44 · Sep 24, 2023 · Updated 2 years ago
Alternatives and similar repositories for rllib_differentiable_comms
Users that are interested in rllib_differentiable_comms are comparing it to the libraries listed below.
- Example Code for the Conditional Action Trees Paper ☆12 · May 24, 2021 · Updated 5 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning… ☆11 · Jun 3, 2022 · Updated 4 years ago
- Lux AI environment interface for RLlib multi-agents ☆12 · Sep 23, 2021 · Updated 4 years ago
- Heterogeneous Multi-Robot Reinforcement Learning ☆75 · Nov 10, 2025 · Updated 7 months ago
- ☆55 · Jul 21, 2022 · Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 · Mar 6, 2025 · Updated last year
- A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t… ☆14 · Mar 23, 2023 · Updated 3 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆41 · Feb 18, 2025 · Updated last year
- Implementation of the MEPOL algorithm - A policy gradient method for task-agnostic exploration ☆15 · Jul 6, 2023 · Updated 2 years ago
- ☆17 · Jul 27, 2023 · Updated 2 years ago
- BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL alg… ☆633 · Feb 7, 2026 · Updated 4 months ago
- Deep memory and sequence models in JAX ☆32 · Jun 8, 2026 · Updated 3 weeks ago
- [CoRL'20] Learning a Decision Module by Imitating Driver’s Control Behaviors ☆31 · Aug 5, 2022 · Updated 3 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning ☆14 · Aug 5, 2022 · Updated 3 years ago
- Risk-sensitive Inverse Reinforcement Learning ☆11 · Sep 11, 2019 · Updated 6 years ago
- On the pitfalls of measuring emergent communication ☆34 · Mar 12, 2019 · Updated 7 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support ☆724 · Sep 24, 2024 · Updated last year
- Repo for reproduction of sequential social dilemmas ☆417 · Mar 6, 2025 · Updated last year
- Code to reproduce experiments from: ☆10 · Dec 11, 2020 · Updated 5 years ago
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023) ☆18 · Dec 8, 2024 · Updated last year
- ☆10 · May 24, 2021 · Updated 5 years ago
- Husky Simulation and Hardware In the Loop simulation on Isaac SIM with Isaac ROS ☆18 · Dec 19, 2023 · Updated 2 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat. ☆14 · May 15, 2023 · Updated 3 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020) ☆15 · Feb 4, 2020 · Updated 6 years ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020). ☆16 · Oct 14, 2021 · Updated 4 years ago
- Python library for solving reinforcement learning (RL) problems using generative models. ☆11 · Feb 18, 2025 · Updated last year
- [NeurIPS'21] RoMA: Robust Model Adaptation for Offline Model-based Optimization ☆15 · Oct 28, 2021 · Updated 4 years ago
- Benchmark Suite for Interpretable Rule Learning ☆12 · Aug 23, 2020 · Updated 5 years ago
- Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al. ☆13 · Mar 23, 2021 · Updated 5 years ago
- c++ implementation of alphagozero ☆15 · May 29, 2018 · Updated 8 years ago
- ☆15 · Feb 23, 2026 · Updated 4 months ago
- ☆10 · Jul 23, 2020 · Updated 5 years ago
- ☆11 · Jun 6, 2024 · Updated 2 years ago
- reveal-md is a great project. Improve your presentations even more with custom user scripts. Here is the place to find them. ☆15 · Dec 7, 2023 · Updated 2 years ago
- ☆19 · May 8, 2026 · Updated last month
- QGFN: Controllable Greediness with Action Values - Code ☆11 · May 17, 2024 · Updated 2 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning" ☆37 · May 22, 2021 · Updated 5 years ago
- Machine Learning course for MSU ☆13 · Dec 13, 2023 · Updated 2 years ago
- The pytorch implementation of DGN on grid world and Starcraft ☆152 · Dec 11, 2021 · Updated 4 years ago