junkwhinger / PPO_PyTorch
This repo contains a PPO implementation in PyTorch for LunarLander-v2
☆11 Updated 5 years ago
Alternatives and similar repositories for PPO_PyTorch
Users who are interested in PPO_PyTorch are comparing it to the libraries listed below
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 Updated 4 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space. ☆156 Updated last year
- Deep recurrent Q learning on CartPole-v1 environment ☆91 Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆288 Updated 4 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces" ☆218 Updated 6 years ago
- Single-file pytorch implementation of hybrid-SAC ☆58 Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆180 Updated last year
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆72 Updated 6 years ago
- ☆184 Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) ☆142 Updated 6 years ago
- OpenAI LunarLander-v2 DeepRL-based solutions (DQN, DuelingDQN, D3QN) ☆41 Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆65 Updated 6 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆99 Updated 5 years ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, etc. ☆44 Updated 4 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data. ☆106 Updated 4 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆87 Updated 4 years ago
- ☆28 Updated 4 years ago
- The code for maddpg using pytorch ☆170 Updated 4 years ago
- There will be updates later ☆84 Updated 6 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆68 Updated 10 months ago
- Nash Q Learning ☆31 Updated 4 years ago
- BipedalWalker & BipedalWalkerHardcore solved by SAC ☆25 Updated last year
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning ☆23 Updated 4 years ago
- Multi-Objective Reinforcement Learning ☆279 Updated 3 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆119 Updated 8 months ago
- use tensorflow to implement the MADDPG(simple_tag) ☆18 Updated 7 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper. ☆435 Updated 2 years ago
- Code for Weighted QMIX ☆139 Updated 4 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework ☆11 Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆110 Updated 4 years ago