watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆97Updated 3 years ago
Alternatives and similar repositories for hiro_pytorch:
Users who are interested in hiro_pytorch are comparing it to the libraries listed below.
- ☆47Updated 3 years ago
- There will be updates later☆84Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- ☆91Updated 4 years ago
- Code for the paper "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated 4 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆51Updated last year
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆71Updated last month
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆64Updated 5 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆161Updated 2 years ago
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆86Updated 4 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆69Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- ☆40Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆157Updated 3 months ago
- ☆94Updated 3 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆56Updated 2 years ago
- ☆38Updated 2 years ago
- Constrained Policy Optimization implementation on Safety Gym☆23Updated 3 years ago
- Code for Weighted QMIX☆129Updated 4 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆42Updated 5 months ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆81Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆159Updated 9 months ago
- This is the official implementation of Multi-Agent PPO.☆102Updated 2 years ago
- PyTorch implementation of Constrained Policy Optimization☆51Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆122Updated 6 months ago
- ☆41Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆123Updated this week