geekyutao / PyTorch-PPO
PyTorch implementation of PPO algorithm
☆22Updated 5 years ago
Alternatives and similar repositories for PyTorch-PPO
Users that are interested in PyTorch-PPO are comparing it to the libraries listed below
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆59Updated 2 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆63Updated 7 years ago
- DSAC; Distributional Soft Actor-Critic☆130Updated 6 months ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more.☆45Updated 5 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆28Updated 3 months ago
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆24Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- This repository is the official implementation of Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor N…☆46Updated 4 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆65Updated 6 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆185Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Project on multi agent reinforcement learning applied on patrolling agents☆40Updated 5 years ago
- This is the official implementation of ERL-Re2.☆67Updated last year
- ☆77Updated 6 years ago
- Distributed Heuristic Multi-Agent Path Finding with Communication - ICRA 2021☆72Updated 3 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆35Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆161Updated last year
- PyTorch version of the MPC in model-based reinforcement learning (MBRL), currently only tested in the CartPole-swing-up environment☆88Updated 5 years ago
- A simple and fast 2D RL environment with obstacles to learn navigation.☆21Updated 5 years ago
- A simple example of how to implement vector based DDPG for MARL tasks using PyTorch and a ML-Agents environment.☆37Updated 6 years ago
- Approximate dynamic programming (ADP) and Policy gradient (PG) based sequential optimal experimental design (sOED)☆20Updated 3 years ago
- The implementation of LSTM-TD3.☆85Updated 2 years ago
- The continuous mountain car problem solved with DDPG☆13Updated 5 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆91Updated 2 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆30Updated 2 years ago
- ☆74Updated 5 years ago
- Transformer in RL for decision-making☆100Updated 2 years ago
- Single-file pytorch implementation of hybrid-SAC☆58Updated 4 years ago
- Scalable multi-agent reinforcement learning☆63Updated 7 years ago