HaiyinPiao / pytorch-a2clstm-DRQN
Using recurrent networks (LSTM) to solve POMDPs
☆35Updated 7 years ago
Alternatives and similar repositories for pytorch-a2clstm-DRQN
Users that are interested in pytorch-a2clstm-DRQN are comparing it to the libraries listed below
- ppo-lstm-parallel☆48Updated 6 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆33Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago
- ☆42Updated 6 years ago
- There will be updates later☆87Updated 6 years ago
- Collection of OpenAI parametrized action-space environments.☆68Updated 10 months ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- Solving POMDP using Recurrent networks☆92Updated 5 years ago
- Assignments for CS294-112.☆30Updated 6 years ago
- A PyTorch Implementation of Multi Agent Soft Actor Critic☆43Updated 6 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆90Updated 5 years ago
- PyTorch implementation of intrinsic curiosity module with proximal policy optimization☆55Updated 7 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆109Updated 3 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆149Updated 7 years ago
- ☆49Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆99Updated 5 years ago
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆89Updated 7 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- ☆123Updated 2 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆63Updated 7 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 3 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆41Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆136Updated 11 months ago
- Distributional Soft Actor Critic☆59Updated 5 years ago
- Implementation of the Option-Critic Architecture☆40Updated 7 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆71Updated 2 years ago