junkwhinger / PPO_PyTorch
This repo contains PPO implementation in PyTorch for LunarLander-v2
☆10Updated 4 years ago
Alternatives and similar repositories for PPO_PyTorch:
Users that are interested in PPO_PyTorch are comparing it to the libraries listed below
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆97Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆48Updated 4 years ago
- Code for Weighted QMIX☆126Updated 4 years ago
- There will be updates later☆83Updated 5 years ago
- Single-file pytorch implementation of hybrid-SAC☆54Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆128Updated 7 months ago
- The hierarchy reinforcement learning algorithm(based on DDPG)☆11Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆64Updated 4 months ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆60Updated 2 years ago
- ☆47Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆97Updated 5 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆22Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆14Updated last year
- Generate expert demonstrations; GAIL(Generative Adversarial Imitation Learning); IRL(Inverse Reinforcement Learning)☆33Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆83Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆199Updated 5 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆68Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆57Updated 4 years ago
- The implement of the policy gradient RL algorithm with pytorch☆37Updated 4 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆160Updated 2 years ago
- Implementation for mSAC methods in PyTorch☆40Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆121Updated 11 months ago
- ☆39Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆157Updated 9 months ago
- ☆19Updated last year
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆38Updated 6 years ago
- ☆83Updated 3 years ago