mynkpl1998 / Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
☆85 Updated 4 years ago
Alternatives and similar repositories for Recurrent-Deep-Q-Learning:
Users who are interested in Recurrent-Deep-Q-Learning are comparing it to the repositories listed below:
- ☆41 Updated 5 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆117 Updated 4 months ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/… ☆82 Updated 7 years ago
- ☆84 Updated 6 years ago
- ☆75 Updated 5 years ago
- Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039) ☆156 Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆99 Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆39 Updated 4 years ago
- ☆120 Updated 2 years ago
- There will be updates later ☆84 Updated 5 years ago
- Collection of OpenAI parametrized action-space environments. ☆64 Updated 2 weeks ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆56 Updated 2 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control" ☆52 Updated 2 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆94 Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆48 Updated 2 years ago
- ☆48 Updated 3 years ago
- Deep Implicit Coordination Graphs ☆41 Updated 10 months ago
- A TensorFlow implementation of the Option-Critic Architecture ☆72 Updated 7 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆55 Updated 2 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆81 Updated 2 years ago
- Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co… ☆129 Updated 2 years ago
- ☆47 Updated 4 years ago
- Gym-like extensions for POMDP ☆57 Updated 4 years ago
- ☆74 Updated 10 months ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning ☆94 Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated 8 months ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆137 Updated 6 years ago
- ☆44 Updated 2 years ago
- Learning Individual Intrinsic Reward in MARL ☆62 Updated 2 years ago