lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆117Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for option-critic-pytorch
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆122Updated 6 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆92Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆94Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆154Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆36Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆150Updated 2 weeks ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆326Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆159Updated last month
- There will be updates later☆81Updated 5 years ago
- A collection of offline reinforcement learning algorithms.☆158Updated 5 months ago
- ☆88Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆294Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated 11 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆114Updated last year
- PyTorch implementation of SAC-Discrete.☆284Updated 3 months ago
- Conservative Q Learning on top of SAC☆119Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆82Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆72Updated 10 months ago
- This is the official implementation of Multi-Agent PPO.☆92Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆123Updated 6 months ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆280Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated last month
- PyTorch implementation of GAIL and AIRL based on PPO.☆194Updated 3 years ago
- ☆188Updated last year
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆149Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆81Updated 4 years ago
- Code for Weighted QMIX☆123Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆145Updated 6 months ago
- Gridworld for MARL experiments☆137Updated 3 years ago