This minimal example demonstrates how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be implemented in RLlib. It serves as a reference implementation and a starting point for making RLlib more compatible with such architectures.
☆44Sep 24, 2023Updated 2 years ago
Alternatives and similar repositories for rllib_differentiable_comms
Users that are interested in rllib_differentiable_comms are comparing it to the libraries listed below.
- ☆31Apr 25, 2021Updated 5 years ago
- VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of …☆562Feb 8, 2026Updated 2 months ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Lux AI environment interface for RLlib multi-agents☆12Sep 23, 2021Updated 4 years ago
- ☆54Jul 21, 2022Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Mar 6, 2025Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆41Feb 18, 2025Updated last year
- Implementation of the MEPOL algorithm - A policy gradient method for task-agnostic exploration☆15Jul 6, 2023Updated 2 years ago
- Deep memory and sequence models in JAX☆26Apr 23, 2026Updated last week
- BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL alg…☆612Feb 7, 2026Updated 2 months ago
- [CoRL'20] Learning a Decision Module by Imitating Driver’s Control Behaviors☆31Aug 5, 2022Updated 3 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support☆711Sep 24, 2024Updated last year
- Repo for reproduction of sequential social dilemmas☆416Mar 6, 2025Updated last year
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023)☆18Dec 8, 2024Updated last year
- ☆10May 24, 2021Updated 4 years ago
- ☆32Nov 5, 2025Updated 5 months ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆137Feb 3, 2021Updated 5 years ago
- ☆11Oct 25, 2021Updated 4 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- Learning to Ground Multi-Agent Communication with Autoencoders [NeurIPS 2021]☆49Oct 29, 2021Updated 4 years ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).☆16Oct 14, 2021Updated 4 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 4 years ago
- Benchmark Suite for Interpretable Rule Learning☆12Aug 23, 2020Updated 5 years ago
- Qualitative Numeric Planning☆10Dec 10, 2020Updated 5 years ago
- c++ implementation of alphagozero☆15May 29, 2018Updated 7 years ago
- reveal-md is a great project. Improve your presentation even more with custom user scripts. Here is the place to find them.☆15Dec 7, 2023Updated 2 years ago
- QGFN: Controllable Greediness with Action Values - Code☆11May 17, 2024Updated last year
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36May 22, 2021Updated 4 years ago
- Machine Learning course for MSU☆13Dec 13, 2023Updated 2 years ago
- The pytorch implementation of DGN on grid world and Starcraft☆153Dec 11, 2021Updated 4 years ago
- ☆15Apr 18, 2024Updated 2 years ago
- ☆31Jan 16, 2023Updated 3 years ago
- ☆14Nov 2, 2022Updated 3 years ago
- Official Code for Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation (CVPR 2025)☆14Apr 2, 2025Updated last year
- A tool for aggregating and plotting MARL experiment data.☆84Apr 13, 2026Updated 2 weeks ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆21Oct 19, 2025Updated 6 months ago
- Probabilistic logic language for inference, planning and learning in static and dynamic domains☆15Feb 27, 2017Updated 9 years ago