andrecianflone / dynaq
Exploring the Dyna-Q reinforcement learning algorithm
☆16 Updated 6 years ago
Related projects
Alternatives and complementary repositories for dynaq
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆96 Updated 2 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆114 Updated 2 weeks ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆50 Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-… ☆15 Updated 7 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆40 Updated 2 months ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021) ☆32 Updated 2 years ago
- There will be updates later ☆82 Updated 5 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020 ☆26 Updated last year
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training" ☆18 Updated 2 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆30 Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆52 Updated 4 years ago
- The code for the paper "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021. ☆39 Updated last year
- PyTorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control" ☆26 Updated 3 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning ☆32 Updated 5 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆95 Updated 3 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes ☆55 Updated 4 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control" ☆51 Updated last year
- Using recurrent networks (LSTM) to solve POMDPs ☆35 Updated 6 years ago
- Deep Implicit Coordination Graphs ☆41 Updated 5 months ago
- Implementation of the Option-Critic Architecture ☆36 Updated 5 years ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- Code for a model-based version of Constrained Policy Optimization ☆10 Updated 3 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework ☆37 Updated 5 years ago
- ppo-lstm-parallel ☆42 Updated 5 years ago