vtalpaert / pytorch-feudal-network
Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networks))
☆16Updated 5 years ago
Alternatives and similar repositories for pytorch-feudal-network:
Users that are interested in pytorch-feudal-network are comparing it to the libraries listed below
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆44Updated 3 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 5 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 6 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 2 years ago
- ☆30Updated 5 years ago
- Deep RL agents with PyTorch☆35Updated 3 years ago
- ☆54Updated 11 months ago
- ☆71Updated 7 months ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C), Asynchronous Advantage Option-Critic (A2OC), Proximal Policy Optimization (PPO) a…☆8Updated 6 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆57Updated 4 years ago
- ☆47Updated 3 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 4 years ago
- ☆52Updated 4 years ago
- Model-Based Offline Reinforcement Learning☆48Updated 4 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆31Updated last year
- Code for demonstration example-task in RUDDER blog☆22Updated 4 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- ☆18Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago