geekyutao / PyTorch-PPOLinks
PyTorch implementation of PPO algorithm
☆22Updated 6 years ago
Alternatives and similar repositories for PyTorch-PPO
Users that are interested in PyTorch-PPO are comparing it to the libraries listed below
Sorting:
- [Algorithms] Official implementation of “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆25Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆65Updated 2 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Updated 7 years ago
- ☆80Updated 6 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆35Updated 3 years ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29Updated 9 months ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆67Updated 6 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆30Updated 7 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆96Updated 2 years ago
- The aim of this repo is to bring ideas and relevant literature relating to Safe-RL in the context of autonomous vehicles.☆51Updated 7 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆40Updated 6 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Reinforcement Learning approaches for learning communication in Multi Agent Systems.☆18Updated 7 years ago
- Single-file pytorch implementation of hybrid-SAC☆64Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆220Updated last year
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆137Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 11 months ago
- Transformer-based Multi-Agent Actor-Critic Framework☆45Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆169Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆94Updated 2 years ago
- Approximate dynamic programming (ADP) and Policy gradient (PG) based sequential optimal experimental design (sOED)☆21Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆294Updated 4 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆107Updated 5 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆46Updated 5 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆54Updated 3 months ago
- PyTorch implementation of DDPG for continuous control tasks.☆46Updated 6 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Updated 5 years ago