junkwhinger / PPO_PyTorch
This repo contains a PPO implementation in PyTorch for LunarLander-v2
☆11Updated 4 years ago
Alternatives and similar repositories for PPO_PyTorch
Users that are interested in PPO_PyTorch are comparing it to the libraries listed below
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- A hierarchical reinforcement learning algorithm (based on DDPG)☆10Updated 6 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- ☆184Updated 3 years ago
- Basic reinforcement learning algorithms. Including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆92Updated 4 years ago
- ☆20Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆162Updated 11 months ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more.☆43Updated 4 years ago
- PyTorch implementation of DDPG for continuous control tasks.☆46Updated 5 years ago
- Implementation of Continuous Control RL Algorithms☆11Updated 2 years ago
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)☆21Updated 5 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆141Updated 6 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space.☆151Updated last year
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 7 months ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆289Updated 4 years ago
- Port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs).☆21Updated 4 years ago
- ppo-lstm-parallel☆44Updated 6 years ago
- There will be updates later☆84Updated 6 years ago
- Deep Reinforcement Learning Algorithms implemented with TensorFlow 2.3☆99Updated 2 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0☆12Updated 5 years ago
- A clean and robust PyTorch implementation of PPO on discrete action space☆69Updated last year
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆88Updated 4 years ago
- ☆40Updated 3 years ago
- The official code release for publications in the MARL field from the TJU RL lab.☆79Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆86Updated 4 years ago
- Actor-Critic and OpenAI clipped PPO in the Gym CartPole-v0 and Pendulum-v0 environments☆25Updated 4 years ago