andompesta / ppo2
Pytorch implementation of PPO2
☆17Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for ppo2
- ☆92Updated 3 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆170Updated last year
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆82Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆133Updated 5 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 5 years ago
- Hierarchical-DQN in pytorch (not actively maintained)☆68Updated 7 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆90Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆296Updated 3 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated last year
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆159Updated 4 years ago
- ☆80Updated 5 months ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆149Updated last year
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- Pytorch implementation of distributed deep reinforcement learning☆74Updated 2 years ago
- ☆190Updated last year
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆129Updated 3 years ago
- The implement of the policy gradient RL algorithm with pytorch☆36Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆120Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆112Updated 3 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- DQN to play Atari Pong☆111Updated 5 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆77Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆328Updated 2 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 5 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆99Updated 3 years ago
- ☆71Updated 5 years ago