geekyutao / PyTorch-PPOLinks

PyTorch implementation of PPO algorithm

☆22

Alternatives and similar repositories for PyTorch-PPO

Users that are interested in PyTorch-PPO are comparing it to the libraries listed below

Sorting:

Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 4 months ago
AlgTUDelft / WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
☆55Updated last year
XinJingHao / TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
☆90Updated 2 years ago
alirezakazemipour / SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆28Updated last month
tinyzqh / control-of-jump-systems-based-on-reinforcement-learning
Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”
☆24Updated 2 years ago
BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆23Updated 4 years ago
schneimo / ddpg-pytorch
PyTorch implementation of DDPG for continuous control tasks.
☆46Updated 5 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆179Updated last year
TobiasLv / RAD
☆51Updated 3 weeks ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 6 years ago
MyRepositories-hub / Simple-Policy-Optimization
☆64Updated last month
Jonathan-Pearce / transfer_learning_rl
Transfer learning in deep reinforcement learning for continuous control. Implemented DDPG and TD3 algorithms and evaluated ability to ada…
☆16Updated 4 months ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆170Updated 3 years ago
LinghengMeng / LSTM-TD3
The implementation of LSTM-TD3.
☆81Updated 2 years ago
aravindsrinivas / neural-mpc
☆73Updated 4 years ago
Ullar-Kask / TD3-PER
An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer
☆23Updated 5 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50Updated last week
zlr20 / saferl_kit
☆75Updated last year
ZhongZ-Wang / Model-Based-RL
这是一个关于基于模型的强化学习的资料，包括一些代码地址、paper、slide等。
☆43Updated 4 years ago
CoderAT13 / BipedalWalkerHardcore-SAC
BipedalWalker & BipedalWalkerHardcore solved by SAC
☆25Updated last year
dnandha / mopac
Model Predictive Actor-Critic Reinforcement Learning
☆63Updated 3 years ago
kantologist / multiagent-sac
Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.
☆37Updated 4 years ago
yeshenpy / ERL-Re2
This is the official implementation of ERL-Re2.
☆64Updated last year
suneelbelkhale / model-based-meta-rl-for-flight
Codebase for Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads paper. Website: https://sites.google.com/view/met…
☆31Updated 2 years ago
baimingc / delay-aware-MBRL
Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".
☆26Updated 5 years ago
hijkzzz / noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
☆64Updated 2 years ago
akjayant / PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
☆49Updated 2 years ago
LilTwo / DRL-using-PyTorch
PyTorch implementation of Deep Reinforcement Algorithm
☆30Updated 2 years ago
AliBaheri / Safe-Reinforcement-Learning
The aim of this repo is to bring ideas and relevant literature relating to Safe-RL in the context of autonomous vehicles.
☆48Updated 6 years ago