junkwhinger / PPO_PyTorch
This repo contains PPO implementation in PyTorch for LunarLander-v2
☆10Updated 4 years ago
Alternatives and similar repositories for PPO_PyTorch:
Users that are interested in PPO_PyTorch are comparing it to the libraries listed below
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated last week
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆117Updated 4 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆161Updated 10 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆134Updated 8 months ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆69Updated 5 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- OpenAI LunarLander-v2 DeepRL-based solutions (DQN, DuelingDQN, D3QN)☆37Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆69Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆100Updated 3 years ago
- Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment☆24Updated 4 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 weeks ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆66Updated 8 months ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆202Updated 5 years ago
- Nash Q Learning☆30Updated 4 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆101Updated 4 years ago
- ☆183Updated 3 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆45Updated 5 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- Collection of OpenAI parametrized action-space environments.☆62Updated 2 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆286Updated 4 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- There will be updates later☆84Updated 5 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆53Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆137Updated 6 years ago
- Single-file pytorch implementation of hybrid-SAC☆54Updated 3 years ago
- ☆83Updated 3 years ago
- use tensorflow to implement the MADDPG(simple_tag)☆18Updated 7 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆41Updated 4 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆38Updated 6 years ago