nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

☆2,278

Alternatives and similar repositories for PPO-PyTorch

Users who are interested in PPO-PyTorch compare it with the libraries listed below.

sfujim / TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
☆2,008 Updated 2 years ago
ericyangyu / PPO-for-Beginners
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-par…
☆1,178 Updated last year
Lizhi-sjtu / DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.
☆1,424 Updated 2 years ago
quantumiracle / Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…
☆1,318 Updated 9 months ago
marlbenchmark / on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
☆1,819 Updated last year
vwxyzjn / ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
☆905 Updated last year
ikostrikov / pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…
☆3,870 Updated 3 years ago
sweetice / Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
☆4,530 Updated 2 years ago
oxwhirl / pymarl
Python Multi-Agent Reinforcement Learning framework
☆2,135 Updated 3 years ago
pranz24 / pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
☆927 Updated 5 months ago
Khrylx / PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…
☆1,265 Updated 4 years ago
DLR-RM / rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents include…
☆2,676 Updated last week
starry-sky6688 / MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario…
☆1,698 Updated 3 years ago
rail-berkeley / rlkit
Collection of reinforcement learning algorithms
☆2,839 Updated last year
Farama-Foundation / D4RL
A collection of reference environments for offline reinforcement learning
☆1,626 Updated last year
Curt-Park / rainbow-is-all-you-need
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
☆2,003 Updated 3 months ago
rail-berkeley / softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,386 Updated 2 years ago
openai / maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
☆1,926 Updated last year
dxyang / DQN_pytorch
Vanilla DQN, Double DQN, and Dueling DQN implemented in PyTorch
☆551 Updated 7 years ago
MorvanZhou / pytorch-A3C
Simple A3C implementation with PyTorch + multiprocessing
☆657 Updated 2 years ago
openai / multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
☆2,708 Updated last year
oxwhirl / smac
SMAC: The StarCraft Multi-Agent Challenge
☆1,302 Updated last year
louisnino / RLcode
☆1,034 Updated 2 years ago
ikostrikov / pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆1,308 Updated 6 years ago
HumanCompatibleAI / imitation
Clean PyTorch implementations of imitation and reward learning algorithms
☆1,664 Updated 11 months ago
kzl / decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
☆2,735 Updated last year
seungeunrho / minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
☆3,122 Updated 2 years ago
haarnoja / sac
Soft Actor-Critic
☆1,194 Updated 2 years ago
Replicable-MARL / MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
☆1,242 Updated last year
ghliu / pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
☆624 Updated 7 years ago