andompesta / ppo2
PyTorch implementation of PPO2
☆17Updated 6 years ago
Alternatives and similar repositories for ppo2
Users who are interested in ppo2 are comparing it to the repositories listed below
- A PyTorch tutorial for DRL (Deep Reinforcement Learning)☆224Updated 2 years ago
- MADDPG in Ray/RLlib☆54Updated 6 years ago
- ☆92Updated 5 years ago
- Simple CartPole example written with PyTorch.☆169Updated 6 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆335Updated 4 years ago
- PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.☆421Updated 4 years ago
- ☆173Updated 2 years ago
- Implementations of all kinds of DQN reinforcement learning with PyTorch☆96Updated 4 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆147Updated 7 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆99Updated 5 years ago
- ☆148Updated last year
- Random Network Distillation in PyTorch☆260Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2D environments☆106Updated 6 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆269Updated 5 years ago
- PyTorch implementation of SAC-Discrete.☆314Updated last year
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆130Updated 3 years ago
- DQN to play Atari Pong☆114Updated 7 years ago
- Hierarchical-DQN in PyTorch (not actively maintained)☆73Updated 8 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆100Updated 6 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆139Updated 5 years ago
- PyTorch implementation of reinforcement learning algorithms (Soft Actor Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)☆231Updated 3 years ago
- Multi-agent project (CommNet, BiCNet, MADDPG) in PyTorch for the Multi-Agent Particle Environment☆117Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆294Updated 4 years ago
- An implementation of CommNet☆35Updated 8 years ago
- Implementation of PPO-clip and PPO-penalty on Atari, described by its author as the only open-source implementation of PPO-penalty☆56Updated 7 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆359Updated 6 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆372Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Updated 2 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆107Updated 5 years ago