geekyutao / PyTorch-PPO
PyTorch implementation of PPO algorithm
☆22Updated 5 years ago
Alternatives and similar repositories for PyTorch-PPO
Users who are interested in PyTorch-PPO are comparing it to the libraries listed below.
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆35Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆62Updated 2 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆62Updated 7 years ago
- PyTorch version of the MPC in model-based reinforcement learning (MBRL), currently only tested in the CartPole-swing-up environment☆90Updated 5 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆208Updated last year
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29Updated 7 months ago
- DSAC; Distributional Soft Actor-Critic☆134Updated 10 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆94Updated 2 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆67Updated 6 years ago
- Collection of OpenAI parametrized action-space environments.☆67Updated 8 months ago
- Transformer-based Multi-Agent Actor-Critic Framework☆46Updated 3 years ago
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆25Updated 2 years ago
- Single-file PyTorch implementation of hybrid-SAC☆62Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC).☆105Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆95Updated last year
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Updated 3 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Updated 4 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆294Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
- ☆80Updated 6 years ago
- A simple example of how to implement vector-based DDPG for MARL tasks using PyTorch and an ML-Agents environment.☆37Updated 6 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space.☆169Updated last year
- Approximate dynamic programming (ADP) and Policy gradient (PG) based sequential optimal experimental design (sOED)☆20Updated 3 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆226Updated last year
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆30Updated 6 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆151Updated last year
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆79Updated 5 years ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more.☆45Updated 5 years ago