iassael / learning-to-communicateLinks
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
☆440Updated 6 years ago
Alternatives and similar repositories for learning-to-communicate
Users that are interested in learning-to-communicate are comparing it to the libraries listed below
Sorting:
- Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736☆216Updated 8 years ago
- Implementation of Meta-RL A3C algorithm☆405Updated 8 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆123Updated 9 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆619Updated 2 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆350Updated 6 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆433Updated last year
- Implementation of TRPO and related algorithms☆640Updated 7 years ago
- Basic DQN implementation☆227Updated 7 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆277Updated 7 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆725Updated 6 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆186Updated 8 years ago
- Implementations of deep RL papers and random experimentation☆178Updated 7 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Updated 8 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆215Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 6 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Updated 7 years ago
- Constrained Policy Optimization☆333Updated 8 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆395Updated 7 years ago
- ☆305Updated 2 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch☆358Updated 6 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆312Updated 4 years ago
- implement of prioritized experience replay☆159Updated 7 years ago
- Actor-critic with experience replay☆256Updated 3 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Updated 7 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆263Updated 7 years ago
- RLPy Reinforcement Learning Framework☆254Updated 6 years ago
- Value Iteration Networks☆290Updated 8 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆120Updated 9 years ago
- Evolution Strategies in PyTorch☆355Updated 8 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆658Updated 5 years ago