junkwhinger / PPO_PyTorch

This repo contains a PPO implementation in PyTorch for LunarLander-v2

☆11

Alternatives and similar repositories for PPO_PyTorch

Users who are interested in PPO_PyTorch are comparing it to the libraries listed below.

Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆51 Updated 4 months ago
XinJingHao / PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
☆156 Updated last year
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆91 Updated last year
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288 Updated 4 years ago
cycraig / MP-DQN
Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"
☆218 Updated 6 years ago
nisheeth-golakiya / hybrid-sac
Single-file pytorch implementation of hybrid-SAC
☆58 Updated 4 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆180 Updated last year
AgrawalAmey / safe-explorer
Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]
☆72 Updated 6 years ago
philtabor / Actor-Critic-Methods-Paper-To-Code
☆184 Updated 3 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142 Updated 6 years ago
DanielPalaio / LunarLander-v2_DeepRL
OpenAI LunarLander-v2 DeepRL-based solutions (DQN, DuelingDQN, D3QN)
☆41 Updated 3 years ago
cyoon1729 / Multi-agent-reinforcement-learning
Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG
☆65 Updated 6 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99 Updated 5 years ago
ZhongZ-Wang / Model-Based-RL
A collection of resources on model-based reinforcement learning, including links to code, papers, slides, and more.
☆44 Updated 4 years ago
namidairo777 / Distributed-MADDPG
Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.
☆106 Updated 4 years ago
JohannesAck / MATD3implementation
Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…
☆87 Updated 4 years ago
parametersharingmadrl / parametersharingmadrl
☆28 Updated 4 years ago
DKuan / MADDPG_torch
The code for maddpg using pytorch
☆170 Updated 4 years ago
Sonkyunghwan / QTRAN
There will be updates later
☆84 Updated 6 years ago
uoe-agents / robotic-warehouse
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
☆68 Updated 10 months ago
tocom242242 / nash_q_learning
Nash Q Learning
☆31 Updated 4 years ago
CoderAT13 / BipedalWalkerHardcore-SAC
BipedalWalker & BipedalWalkerHardcore solved by SAC
☆25 Updated last year
SaminYeasar / Off_Policy_Adversarial_Inverse_Reinforcement_Learning
Implementation of Off Policy Adversarial Inverse Reinforcement Learning
☆23 Updated 4 years ago
RunzheYang / MORL
Multi-Objective Reinforcement Learning
☆279 Updated 3 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119 Updated 8 months ago
wwxFromTju / maddpg-tf
use tensorflow to implement the MADDPG(simple_tag)
☆18 Updated 7 years ago
openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆435 Updated 2 years ago
oxwhirl / wqmix
Code for Weighted QMIX
☆139 Updated 4 years ago
CUN-bjy / gym-td3-keras
Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework
☆11 Updated 4 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆110 Updated 4 years ago