ovechkin-dm / ppo-lstm-parallel
ppo-lstm-parallel
☆42Updated 5 years ago
Related projects: ⓘ
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆13Updated 9 months ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated 10 months ago
- There will be updates later☆79Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆87Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆61Updated 3 years ago
- using recurrent networks(LSTM) to solve POMDPs☆33Updated 5 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- A simple RNN meta-learner☆10Updated 5 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- behavior cloning from observation☆34Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆34Updated 4 years ago
- ☆44Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆79Updated 4 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated last month
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆30Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆27Updated 3 years ago
- multi-agent reinforcement learning for competitive environments using pytorch☆12Updated 4 years ago
- ☆45Updated 5 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆61Updated last year
- Advantage weighted Actor Critic for Offline RL☆46Updated 2 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆22Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆131Updated last year
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆36Updated last year
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆34Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆44Updated 3 years ago
- ☆13Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆146Updated last year