toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic (SAC).
☆94 Updated 4 years ago
Related projects:
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆113 Updated last month
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆87 Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic ☆108 Updated 6 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆146 Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆92 Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆34 Updated 4 years ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆182 Updated 3 years ago
- ☆117 Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆150 Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆118 Updated 4 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆117 Updated 4 months ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆158 Updated last month
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆91 Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆119 Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated last month
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆128 Updated 5 years ago
- ☆39 Updated 2 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆60 Updated 10 months ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆110 Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆92 Updated 5 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆27 Updated 3 years ago
- The implementation of LSTM-TD3. ☆60 Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆139 Updated 5 months ago
- There will be updates later ☆79 Updated 5 years ago
- Collection of OpenAI parametrized action-space environments. ☆55 Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL ☆152 Updated this week
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆265 Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆311 Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆131 Updated last year
- PyTorch implementation of SAC-Discrete. ☆273 Updated last month