ASzot / ppo-pytorch
Proximal policy optimization in PyTorch. Easy to read and understand.
☆48 Updated 4 years ago
Alternatives and similar repositories for ppo-pytorch:
Users who are interested in ppo-pytorch are comparing it to the repositories listed below.
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆135 Updated 6 years ago
- ☆71 Updated 7 months ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 Updated 2 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆100 Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆134 Updated 6 months ago
- Random Network Distillation (RND) algo in PyTorch ☆48 Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 3 years ago
- FEN Code ☆37 Updated 5 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆115 Updated 2 months ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆30 Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆136 Updated last year
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆34 Updated 5 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆64 Updated 5 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆97 Updated 5 years ago
- Soft Actor-Critic with advanced features ☆48 Updated this week
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 Updated 5 years ago
- ☆120 Updated last year
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated 5 years ago
- ☆83 Updated 6 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables ☆71 Updated last year
- Adversarial Imitation Via Variational Inverse Reinforcement Learning ☆95 Updated 5 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning ☆49 Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆44 Updated 2 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆60 Updated 5 years ago
- ☆26 Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆79 Updated 6 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago