proroklab / rllib_differentiable_comms

This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be realized in RLLib. This example serves as a reference implementation and starting point for making RLLib more compatible with such architectures.
40Updated last year

Related projects

Alternatives and complementary repositories for rllib_differentiable_comms