Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
☆147 · Jan 12, 2019 · Updated 7 years ago
Alternatives and similar repositories for ppo-pytorch
Users that are interested in ppo-pytorch are comparing it to the libraries listed below.
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization ☆55 · Dec 20, 2018 · Updated 7 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆20 · Dec 17, 2023 · Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆147 · Mar 12, 2023 · Updated 3 years ago
- An implementation of the policy gradient RL algorithm with PyTorch ☆40 · Dec 7, 2020 · Updated 5 years ago
- An implementation of GAIL with PyTorch ☆14 · Mar 11, 2020 · Updated 6 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- PPO with Hindsight Experience Replay (HER) ☆11 · May 8, 2018 · Updated 7 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment. ☆12 · Sep 3, 2019 · Updated 6 years ago
- Random Network Distillation pytorch ☆260 · Mar 4, 2019 · Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆295 · Feb 24, 2021 · Updated 5 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020 ☆56 · Apr 27, 2020 · Updated 5 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆336 · Nov 24, 2021 · Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch ☆2,324 · Jul 9, 2024 · Updated last year
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e… ☆55 · Nov 10, 2025 · Updated 4 months ago
- ☆13 · Jun 1, 2020 · Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor… ☆3,882 · May 29, 2022 · Updated 3 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆51 · Oct 30, 2020 · Updated 5 years ago
- Series of deep reinforcement learning algorithms 🤖 ☆29 · Jun 19, 2021 · Updated 4 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning ☆13 · Aug 16, 2019 · Updated 6 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning ☆67 · Dec 30, 2019 · Updated 6 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2. ☆15 · Dec 8, 2020 · Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE) ☆254 · May 3, 2020 · Updated 5 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat. ☆19 · Sep 14, 2025 · Updated 6 months ago
- PyTorch implementation of SAC-Discrete. ☆314 · Jul 25, 2024 · Updated last year
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆18 · Apr 15, 2022 · Updated 3 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms) ☆10 · Jun 28, 2019 · Updated 6 years ago
- Implementation of algorithms for continuous control (DDPG and NAF). ☆313 · Feb 16, 2021 · Updated 5 years ago
- Hello, I pushed some python environments for Multi Agent Reinforcement Learning. ☆741 · May 23, 2022 · Updated 3 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019 ☆794 · May 29, 2022 · Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆106 · Jun 9, 2020 · Updated 5 years ago
- PyTorch implementation of CommNet ☆37 · Dec 2, 2017 · Updated 8 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆450 · Sep 13, 2018 · Updated 7 years ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL) ☆777 · Dec 22, 2023 · Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆270 · May 20, 2020 · Updated 5 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment ☆89 · Jul 9, 2020 · Updated 5 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer ☆24 · Aug 14, 2019 · Updated 6 years ago
- ☆62 · Jun 22, 2018 · Updated 7 years ago