lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆122Updated 6 months ago
Alternatives and similar repositories for option-critic-pytorch:
Users who are interested in option-critic-pytorch are comparing it to the repositories listed below.
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆161Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆131Updated 9 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆158Updated 3 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆127Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆71Updated 2 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆76Updated last year
- There will be updates later☆84Updated 5 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆258Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆173Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆99Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- A collection of offline reinforcement learning algorithms.☆168Updated 2 months ago
- ☆191Updated last year
- ☆91Updated 4 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆65Updated last year
- ☆107Updated last year
- ☆121Updated 6 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆134Updated 9 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆207Updated 4 years ago
- Conservative Q Learning on top of SAC☆123Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆56Updated 2 years ago
- ☆73Updated 8 months ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆42Updated 5 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆341Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆169Updated 6 months ago
- Implementations of several sparse-reward algorithms in the Gym Fetch environments☆86Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago